Indexing of new messages stops occasionally

claudekenni · May 5, 2017, 3:42pm

I have a Graylog cluster with 2 Graylog servers and 3 Elasticsearch servers.
The problem is that Graylog sometimes just stops indexing new messages into Elasticsearch, although ES is online.
To fix it I usually have to restart the Elasticsearch processes.
Sometimes it works fine for a week or two, and sometimes only for 3 days. I have not found a way to reproduce it.

Where should I look to find the source of this?

jochen · May 5, 2017, 3:49pm

Any warnings or errors in the logs of your Graylog and Elasticsearch nodes (see http://docs.graylog.org/en/2.2/pages/configuration/file_location.html)?

claudekenni · May 10, 2017, 6:58pm

Gotta wait until the error happens again…
It’s not actually reproducible, so it may take a while.

rafaelcarsetimo · May 10, 2017, 7:03pm

Does the stall occur on both GL servers at the same time?

jtkarvo · May 11, 2017, 4:33pm

Also, do your servers have enough memory?

claudekenni · May 12, 2017, 12:52am

Yes, both servers stop processing.

claudekenni · May 12, 2017, 12:54am

The Graylog servers have 4 vCPU/8 GB; the 3 Elasticsearch servers have 8 vCPU/16 GB each.

jtkarvo · May 12, 2017, 7:18am

And how much did you allocate to the Graylog and Elasticsearch JVMs? My guess would be max 8G for Elasticsearch and max 2G or 3G for Graylog.

claudekenni · May 14, 2017, 10:21pm

Graylog 4GB each
ES 8GB each

claudekenni · May 14, 2017, 10:39pm

Just had that error again. For whatever reason there were no log files for Elasticsearch for a few days.
But there are files like this one on all 3 ES servers:
-rw------- 1 elasticsearch elasticsearch 5171027968 May 14 18:24 java_pid23573.hprof
I had to kill the ES processes to restart them.

jtkarvo · May 15, 2017, 5:01am

Try opening the heap dump with the Eclipse Memory Analyzer: http://www.eclipse.org/mat/

jtkarvo · May 15, 2017, 5:05am

By the way, are you using the recommended Java version (Oracle Java 1.8) on the Elasticsearch nodes?

claudekenni · May 15, 2017, 12:54pm

○ → java -version
java version "1.8.0_45"
Java(TM) SE Runtime Environment (build 1.8.0_45-b14)
Java HotSpot(TM) 64-Bit Server VM (build 25.45-b02, mixed mode)

jtkarvo · May 15, 2017, 7:07pm

Update 45 is pretty old; the current one is 131. I don’t know whether a relevant bug has been fixed in between, but updating should at least not make things worse.

To me this sounds like your Elasticsearch crashes and writes heap dumps, so it does not seem to be a Graylog problem. By the way, I guess you are using an Elasticsearch 2.4 series version (compatible with Graylog)? Have you installed any Elasticsearch plugins? Removing them might also help.

claudekenni · May 16, 2017, 12:06am

Only elasticsearch-head.

I will try to update the JRE and see if it helps.

claudekenni · May 16, 2017, 5:44pm

The problem occurred again. There are several messages like the following. They appear on all 3 ES servers:

New used memory 5368893176 [5gb] for data of [source] would be larger than configured breaker: 5112122572 [4.7gb], breaking
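
For anyone reading such a line later: the numbers in it can be pulled apart with a few lines of Python. This is only a sketch. The regular expression is fitted to the single message quoted above, and the 60% figure is the default value of indices.breaker.fielddata.limit in Elasticsearch 2.x, which is an assumption about this cluster, not something the log states.

```python
import re

# Sample line copied from the post above.
SAMPLE = ("New used memory 5368893176 [5gb] for data of [source] would be larger "
          "than configured breaker: 5112122572 [4.7gb], breaking")

# Pattern fitted to that one message; other ES versions may word it differently.
BREAKER_RE = re.compile(
    r"New used memory (?P<used>\d+) \[[^\]]+\] for data of \[(?P<label>[^\]]+)\] "
    r"would be larger than configured breaker: (?P<limit>\d+) \[[^\]]+\]"
)

# Assumption: the breaker is at the ES 2.x default of 60% of the JVM heap.
ASSUMED_BREAKER_FRACTION = 0.60


def parse_breaker_line(line):
    """Return the byte counts from a breaker message, or None if it does not match."""
    match = BREAKER_RE.search(line)
    if match is None:
        return None
    used = int(match.group("used"))
    limit = int(match.group("limit"))
    return {
        "label": match.group("label"),
        "used": used,
        "limit": limit,
        "over_by": used - limit,
        # Heap size the limit would imply if the assumed fraction holds.
        "implied_heap_gib": limit / ASSUMED_BREAKER_FRACTION / 2**30,
    }


if __name__ == "__main__":
    print(parse_breaker_line(SAMPLE))
```

Under that assumption the limit corresponds to a heap of a little under 8 GiB, which fits the 8GB given to each ES node earlier in the thread.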

jtkarvo · May 17, 2017, 6:03pm

You are running out of memory on the Elasticsearch nodes. Look at this article:

https://www.elastic.co/guide/en/elasticsearch/guide/current/_limiting_memory_usage.html
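
As a rough sketch of what tweaking these settings could look like: the fielddata breaker limit (indices.breaker.fielddata.limit) can be changed at runtime through the cluster settings API. The URL http://localhost:9200 and the value 40% below are placeholders for illustration, not recommendations for this cluster. The fielddata cache size (indices.fielddata.cache.size) is a separate, static setting that goes into elasticsearch.yml and is not covered here.

```python
import json
import urllib.request

# Assumption: an ES node reachable without authentication at this address.
ES_URL = "http://localhost:9200"


def breaker_settings_request(limit="40%", base_url=ES_URL):
    """Build (but do not send) a PUT request that sets the fielddata breaker limit."""
    body = {"persistent": {"indices.breaker.fielddata.limit": limit}}
    return urllib.request.Request(
        url=base_url + "/_cluster/settings",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="PUT",
    )


def send(request):
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    req = breaker_settings_request()
    # Only prints what would be sent; call send(req) to actually apply it.
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Lowering the breaker only makes Elasticsearch refuse requests earlier; it does not by itself free memory, so it is worth checking first which fields are filling the fielddata cache.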

claudekenni · May 17, 2017, 7:33pm

Got it. I will see if tweaking these settings helps with the issue.
Thanks for the info.
