Hi, we have an cluster of 2 graylog servers and recently he began to crash suddenly. When that happens, i have to restart the service, but even after that, graylog stays ‘‘unstable’’ showing an screen with this error below:
Looking at the logs, i can see these errors:
`2021-02-09T14:09:56.963-03:00 ERROR [Messages] Failed to index [2] messages. Please check the index error log in your web interface for the reason. Error: One or more of the items in the Bulk request failed, check BulkResult.getItems() for more information.`
2021-02-09T14:10:52.000-03:00 ERROR [Messages] Failed to index [2] messages. Please check the index error log in your web interface for the reason. Error: One or more of the items in the Bulk request failed, check BulkResult.getItems() for more information.
2021-02-09T14:10:52.051-03:00 WARN [ProxiedResource] Unable to call https://graylog.address/system/metrics/multiple on node <81139fd4-1dbc-4ece-a8e0-3d73b72e8c09>
java.net.SocketTimeoutException: connect timed out
2021-02-09T14:10:53.413-03:00 ERROR [LdapUserAuthenticator] Error during LDAP user account sync. Cannot log in user zabbix
I’m pretty new to Graylog and this environment, but i can’t find a solution. Everyday i have to reboot graylog services without finding an solution. Can you please advice?