I then have to restart the whole server with “sudo graylog-ctl restart”.
I’m not really sure how to go about troubleshooting the issue. I have double the server ram but that doesn’t seem to have helped much.
When looking at “sudo graylog-ctl tail server” I get a number of
“WARN : org.graylog2.shared.rest.resources.ProxiedResource - Unable to call http://x.x.x.x:12900/system/metrics/multiple on node <22dcf9ff-fc50-4f83-a69e-94259b862541>, caught exception: timeout (class java.net.SocketTimeoutException)”
Any suggestion on how to debug this issue would be great.
you should describe what happens exactly and not only “it crashes” - the more details you deliver what you do, what then happens, what error you have will enable someone to help you.
In the top of this thread I tried to outline as much detail as to what was happening: .
e.g.
Our web interface shows the error (attached in the screenshot)
The message says “we are experiencing problems connecting to the graylog server running on . please verify that the server is healthy and working correctly”
When I run look at the tail of the logs the main warning that occurs several times is: " WARN : org.graylog2.shared.rest.resources.ProxiedResource - Unable to call http://x.x.x.x:12900/system/metrics/multiple on node <22dcf9ff-fc50-4f83-a69e-94259b862541>, caught exception: timeout (class java.net.SocketTimeoutException)"
I also worked out that restarting just the “graylog-server” with sudo graylog-ctl restart graylog-server
brings things back to life.
I don’t really know what other information would help in diagnosing the issue.