My instance of Graylog sometimes (about twice a day) stops processing messages. The messages are getting written to the message journal and a restart of Graylog (using systemctl restart graylog-server) fixes the issue temporarily. The logs in the journal get processed and then written to Elasticsearch.
I can see no errors at all in either the Graylog or Elasticsearch log files.
I am a bit stuck on what to do here. Any pointers would be appreciated
I am running Graylog 3.1.4 on 1 server
then on another server I am running elasticsearch 6.8.9
I have some thread dumps from the time that the issue happens. If they would be useful let me know and I shall post them.
you should investigate what is happening during this time? Does your Elasticsearch is having problem? Check what does happen in the system and you will get a trace.