What if the master node of graylog server is down in a graylog server cluster? Does alert still work?

Hi all~

We have deployed a graylog server cluster with 2 graylog server nodes(one master node and one slave node), one mongodb and a elasticsearch cluster.

When we shut the master node down, the other graylog node didn’t become a master node automatically and it seems that out alerts are out of work.

When we want to solve this single point failure, we try to start two master nodes at the same time but we found its not appropriate according to the document and the log of the second master node.

Is that normal? What should we do if the master node is down? Does alert stop working after master is down? It seems that he master node may cause a single point failure.

The second question is, what will the alert happen when we deploy more than one master node? Will it run well?

Looking forward your reply.

Thx a lot.

Yes. At least it’s as designed.

Stop the secondary node, set is_master = true, then restart the node.

Yes, alert conditions will only be checked on the Graylog primary node.

Yes, that’s correct.

One of the two nodes will be demoted to secondary (having the same effect as setting is_master = false).

Are there any plans to add support for mechanism to elect Graylog master nodes automatically?
Obviously avoiding split-brain type of problems that happens in Elastic Search?
I suppose making sure that the number of nodes are odd number and minimum 3?

Thanks

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.