Single Node Instance Keeps Falling Over

GTownson · July 3, 2018, 7:38pm

We have a single node of 8 cores and 32gb that processes on average 1-2k messages per second. The node handles everything really well but then something happens and it just stops message processing. Theres nothing in any of the logs and no other indication as to why this happens.

In some cases the journal seems to be getting corrupted. Graylog won’t process any messages for a few hours and only restarts when I remove the journal. Has anyone come across this before? Are there any know fixes for this?
The node is hosted in Azure and we had to stop the machine and resize the data disk on it, this data disk held the journal. Could the resizing have corrupted it?

Is it just the architecture? (we’re looking to cluster but trying to work out load balancing)

Any other ideas anyone?

regards,

G

system · July 17, 2018, 7:38pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Graylog servers with full journal Graylog Central (peer support)	2	386	October 12, 2020
Message processing - Graylog 3.1 Graylog Central (peer support)	2	430	June 17, 2020
Graylog stops processing messages after log flood Graylog Central (peer support)	3	4053	April 3, 2017
Prevent data lost after crash Graylog Central (peer support)	4	732	February 26, 2021
Journal not Getting processed after ES failure Graylog Central (peer support)	3	2716	October 23, 2017

Single Node Instance Keeps Falling Over

Related topics