HA won't maintain connection


#1

I'm building out a Graylog cluster in AWS with 3 web servers and 3 data nodes, using the Graylog 2.2.2 AMI. The 3 web servers work for a time before reporting “Journal metrics unavailable.” They come in and out of reporting; then, at some point within 2 days, one of the servers will crap out and I have to run “graylog-ctl reconfigure-as-server” to start it back up.

Hosts showing “Journal metrics unavailable” also tend not to receive the logs sent to them. I'm still not sure where to find the right logs to troubleshoot, as the main server log (“graylog-server”) doesn’t show anything telling.

Security groups have been checked and rechecked, but the cause seems to be something else, given how long the cluster works before this behavior starts.


(Jan Doberstein) #2

Hej @razumzhiro

To find the log files, just take a look at the documentation. You should look at the file locations for the Omnibus package.

With only your description and no log files, we can’t assist you. Please give us some more information and log lines, and we might be able to help.


#3

That’s why I ask :smiley: Thank you, I’ll start there.