Even now still confused over relationship between RED/YELLOW and graylog

jason · September 15, 2017, 8:55pm

Hi there

Recently I’ve asked several questions about having ES failures and then finding graylog not being happy, and got advise that graylog will continue to push data into ES if it is YELLOW - while it is fixing up unassigned shards

Well yesterday our 4-node ES cluster started reporting OOM errors (set to 16G) and everything died. So I did some google-ing, and as the systems are all 64G, I’ve increased ES_HEAP_SIZE to 24g and restarted the cluster. When it came back up, there were 3500 unassigned shards - so it was state YELLOW. Well graylog is refusing to push data in. The journal is full and I’m losing messages. (ie there are 2000 msg/sec incoming and 0/sec outgoing)

ES is assigning those shards really slowly. It’s been 16 hours and it’s only processed 1000 - there’s still 2400 to go. (btw there are 8 “initializing_shards” too if it matters)

graylog is working - I can do searches - but all I can see is old data - looks like I’m losing all current data

How can I fix this? I thought graylog could push data in under ES state YELLOW? This is GL-2.3.1 and ES-2.4.6

Thanks

jochen · September 16, 2017, 9:43am

Maybe the Graylog journal has been corrupted.

Try deleting (or moving away) the journal files while Graylog is stopped and start Graylog again.

If you have enough resources (i. e. network bandwidth and IOPS), you can increase the bandwidth Elasticsearch is using for shard recovery:

system · September 30, 2017, 9:43am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Graylog cluster, elasticsearch unassigned shards Graylog Central (peer support)	4	2993	May 4, 2021
Graylog Journal Filling Up Graylog Central (peer support)	4	806	June 19, 2017
Graylog CPU spiked to 500% and ES seems slow Graylog Central (peer support)	6	1326	March 18, 2019
Graylog Elasticsearch cluster is yellow since 3 days back Graylog Central (peer support)	10	3155	July 5, 2018
Graylog outgoing traffic to ES increased by 2x in 5min Graylog Central (peer support)	2	847	October 18, 2019

Even now still confused over relationship between RED/YELLOW and graylog

Related topics