Graylog in a constant loop of relocating / initializing shards

123dev · March 25, 2020, 4:02pm

Hi,
We are seeing a strange new behavior and are struggling to narrow down the cause.
Graylog version 2.4.6
2 Graylog servers, 4 data nodes
The cluster relocates shards until they are all properly assigned, but a little later there are n unassigned shards and the cluster is yellow again.
It then starts initializing / relocating shards and repairs itself until it is eventually green, and soon after the cycle repeats and we get m unassigned shards.

Obviously this puts additional strain on resources.
We’ve doubled the instance size of each node to make sure there are enough resources available while this is happening, yet we can’t seem to get out of this vicious cycle.

And even though Cerebro shows that the nodes have enough resources, Graylog is very slow and unresponsive.

This is all new; we never had this issue before.

Any ideas on where to start looking, or what the root cause might be, would help greatly.

Thanks.

macko003 · March 26, 2020, 6:56am

Read the Elasticsearch log first.
If the cluster goes yellow, it means some shards (replicas) are missing, so I think you are losing them, and Elasticsearch then initializes new ones rather than relocating existing ones.
Or, if you lose a data node and it comes back empty after Elasticsearch has returned to green (all shards assigned), Elasticsearch may relocate data onto the empty node.
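
To tell those two cases apart, it can help to look at the state of each shard copy and at why the unassigned ones are unassigned. The following is only a rough sketch, assuming Elasticsearch is reachable at `http://localhost:9200` without authentication (adjust for your setup); the helper names `fetch_shards` and `summarize_shards` are made up for this example. It reads the `_cat/shards` API and counts shard copies per state and per `unassigned.reason`:

```python
import json
from collections import Counter
from urllib.request import urlopen

# Assumption: an Elasticsearch node listening here, no authentication.
ES_URL = "http://localhost:9200"


def fetch_shards(es_url=ES_URL):
    """Return one dict per shard copy from the _cat/shards API."""
    url = (es_url + "/_cat/shards?format=json"
           "&h=index,shard,prirep,state,unassigned.reason,node")
    with urlopen(url) as resp:
        return json.load(resp)


def summarize_shards(rows):
    """Count shard copies per state, and per reason for unassigned ones."""
    states = Counter(row["state"] for row in rows)
    reasons = Counter(
        row.get("unassigned.reason") or "unknown"
        for row in rows
        if row["state"] == "UNASSIGNED"
    )
    return states, reasons
```

Calling `summarize_shards(fetch_shards())` during a yellow phase gives the counts. Many `NODE_LEFT` reasons would point at a data node dropping out of the cluster, which should also be visible in the Elasticsearch log on that node.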

jan · March 29, 2020, 10:22am

I can give you two blog posts that might help you:

I assume you have too many shards, or some nodes are hitting the low/high disk watermark.

As @macko003 wrote, check your Elasticsearch log.
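
Both suspicions can be checked quickly from the `_cat/allocation` API, which reports shard count and disk usage per data node. This is a minimal sketch rather than a definitive check: it assumes Elasticsearch at `http://localhost:9200` without authentication and the default disk watermarks (85% low, 90% high), which your cluster settings may override. The function names and the `max_shards_per_node` limit are hypothetical; pick a limit that fits your heap size.

```python
import json
from urllib.request import urlopen

# Elasticsearch defaults for cluster.routing.allocation.disk.watermark.low/high.
# Assumption: your cluster has not overridden them.
LOW_WATERMARK = 85.0
HIGH_WATERMARK = 90.0


def fetch_allocation(es_url="http://localhost:9200"):
    """Return one dict per data node from the _cat/allocation API."""
    with urlopen(es_url + "/_cat/allocation?format=json&bytes=b") as resp:
        return json.load(resp)


def check_nodes(rows, max_shards_per_node):
    """Return human-readable warnings for nodes that look overloaded."""
    warnings = []
    for row in rows:
        if row.get("disk.percent") is None:
            # The UNASSIGNED pseudo-row carries no disk figures; skip it.
            continue
        node = row["node"]
        shards = int(row["shards"])
        disk = float(row["disk.percent"])
        if disk >= HIGH_WATERMARK:
            warnings.append(f"{node}: disk {disk:.0f}% is above the high watermark")
        elif disk >= LOW_WATERMARK:
            warnings.append(f"{node}: disk {disk:.0f}% is above the low watermark")
        if shards > max_shards_per_node:
            warnings.append(f"{node}: {shards} shards exceeds {max_shards_per_node}")
    return warnings
```

For example, `check_nodes(fetch_allocation(), max_shards_per_node=600)` lists the nodes worth a closer look. A node above the low watermark gets no new shards allocated, and above the high watermark Elasticsearch moves shards off it, which could plausibly produce a relocation cycle like the one described.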

Jan

123dev · March 30, 2020, 1:26pm

Thank you both @macko003 and @jan for the follow up.
I believe @jan hit the nail on the head. We had too many shards.
Those documents are awesome, many thanks.

Have a good day and stay safe.

system · April 13, 2020, 1:26pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
