we are running a rather small Graylog server (between 1,5 and 3 million messages a day) and for the second time in about 30 days I got greeted by this message:
Nodes with too long GC pauses
There are Graylog nodes on which the garbage collector runs too long. Garbage collection runs should be as short as possible. Please check whether those nodes are healthy. (Node: *gibberish*, GC duration: *1122 ms*, GC threshold: *1000 ms*)
I checked the java.options for ES and it’s set to 16 GB RAM (both Xms and Xmx) which is half of the physical RAM available. Searching aroud I found this topic: https://community.graylog.org/t/garbage-collection-runs/22868/3 but output_batch_size is already set to 500.
Installed version:
ES: 7.10.2
Graylog: 4.3.8 (will be updated in 2 weeks to latest version)
Java: openjdk 17.0.5
So how can I check that the node is healthy, please? According to the API the status is green.
And is there maybe anything else I can do to fix this warning? Except for getting a faster machine.
Hey Filisimus, the nodes that error message is referring to appears to be your Graylog nodes. Use the default file locations of the Graylog files here to edit the “JVM Settings”. You will see Xms and Xmx arguments within the long string within that file that you can set to your desired amount of heap for your Graylog nodes. After doing this, you need to fully reboot the hosts Graylog is installed on to restart the JVM with the new arguments. This should resolve your issue.