There are Graylog nodes on which the garbage collector runs too long. Garbage collection runs should be as short as possible. Please check whether those nodes are healthy. (Node: 89a7651b-46ca-4879-9543-5cbeb84804e6, GC duration: 1369 ms, GC threshold: 1000 ms)
What is the best way to resolve this? I have a single node that is generally doing under 1000 logs/s. The system it sits on has maxed out the RAM that can be allocated to Elasticsearch (32GB), another 32GB allocated to Graylog, and the rest of the RAM left over for the system.
My best guess is that there was an influx of messages coming in from our Filebeat monitor. This isn't going to go away, and will likely happen again in the future. Is there a way to change the alert to a greater number?
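For anyone looking for the threshold itself: it appears to be the gc_warning_threshold setting in graylog.conf (the 1000 ms value the notification compares against). Raising it would look roughly like this - a hedged sketch, since the config path and default can vary by install:

```
# /etc/graylog/server/server.conf  (path may vary by install)
# Raise the GC warning threshold from the default 1s so brief
# collection pauses during message bursts stop triggering the notification.
gc_warning_threshold = 2s
```

Graylog only reads this at startup, so the graylog-server service needs a restart afterwards.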
Your RAM distribution is not ideal: if you increase the heap for a single Java process to 32 GB, the JVM does its own "magic" (it loses compressed object pointers, among other things) and becomes almost unusable because of internal constraints.
On a 64GB RAM machine I would go for 20GB for Graylog, 20GB for Elastic/OpenSearch, and the rest for the OS and the OS's caches. Those are important!
In my experience Graylog works much better if you have multiple machines with up to 32GB of RAM each and 16GB for the application. Graylog and Elastic can easily be separated onto different machines and scaled horizontally.
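As a rough sketch of where those heap sizes are set (assuming a standard DEB/RPM install; your paths and existing flags may differ):

```
# /etc/default/graylog-server  (or /etc/sysconfig/graylog-server on RPM systems)
# 20GB heap for the Graylog JVM; keep whatever other flags were already present
GRAYLOG_SERVER_JAVA_OPTS="-Xms20g -Xmx20g <existing flags>"

# /etc/elasticsearch/jvm.options.d/heap.options  (or edit jvm.options directly)
# 20GB heap for Elasticsearch/OpenSearch; Xms and Xmx should match
-Xms20g
-Xmx20g
```

Both JVMs only pick these up at startup, so restart both services after changing them.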
The machine has 128GB of RAM. Would you still make the same distribution of RAM?
While I'd love to have multiple machines right now, there are a few things prohibiting that. The best I can do is to get this machine working and then expand from there.
This was very helpful to me - I was very occasionally getting the garbage collector error and bumped up the RAM a bit for the JVM. Hopefully it helps, but I really didn't see any performance issues before.
So you're saying to bring it up above 30GB, say to 40GB? Would this possibly cause any issues with Elasticsearch, since Elasticsearch will still be stuck at 32?
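If it helps, one way to check whether a given heap size still gets compressed object pointers (the ~32GB "magic" boundary mentioned above) is to ask the JVM directly - just a diagnostic sketch:

```
# Prints whether compressed oops would be enabled at a 40GB heap.
# Expect "UseCompressedOops = false" here, versus "true" at e.g. -Xmx30g.
java -Xmx40g -XX:+PrintFlagsFinal -version | grep UseCompressedOops
```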