Problem with time out with node

Danbasik230 · March 21, 2024, 9:51am

Hello everyone. I have a problem: information about one node disappears from time to time. Everything seems to be working correctly and I don't see any errors in the logs, but this error appears when I try to see what's wrong with the node.

Getting plugins on node “xxxxxxx” failed: FetchError: There was an error fetching a resource: Internal Server Error. Additional information: timeout

This error occurs every minute, and sometimes it changes the state of the node. The cluster has 2 Graylog servers and 5 Elasticsearch servers. I checked that every server is reachable from every other server, but I can't find the cause.

OS - Debian 11
Graylog Server - 5.2.4-1
Elasticsearch - 7.10.2

Joel_Duffield · March 21, 2024, 11:06am

This most often happens when the nodes can't reliably communicate with each other. What are your settings for the publish and external URI on both nodes?

Danbasik230 · March 21, 2024, 11:13am

The settings are the same on the slave node and the master node. They communicate via their external IPs.

$ cat /etc/graylog/server/server.conf | grep "http_bind_address"
http_bind_address=0.0.0.0:9000
$ cat /etc/graylog/server/server.conf | grep "http_publish_uri"
# Default: $http_publish_uri
$ cat /etc/graylog/server/server.conf | grep "http_external_uri"
#http_external_uri =
http_external_uri=https://dns-record-for-graylog/
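
Note that a plain grep also matches commented-out lines, which is why the http_publish_uri check above returns only a comment: the setting itself is not set in this file. A minimal sketch of a helper that prints only active assignments follows; the function name active_setting is hypothetical and not part of Graylog.

```shell
# Print only the active (uncommented) assignment of a setting in a config file.
# Usage: active_setting <setting_name> <config_file>
# Example: active_setting http_publish_uri /etc/graylog/server/server.conf
active_setting() {
    # Anchor the pattern at the start of the line so that commented lines
    # such as "#http_external_uri =" or "# Default: ..." are skipped.
    grep -E "^[[:space:]]*$1[[:space:]]*=" "$2"
}
```

If the helper prints nothing for http_publish_uri, the node is running with the default for that setting.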

Danbasik230 · March 21, 2024, 12:04pm

As far as I understand, this is related to a timeout. Which parameter can be increased? Sometimes it just doesn't have time to load the information about the node.
I checked with tcpdump. There are no problems with the network. I think it may be related to performance.

Danbasik230 · March 21, 2024, 1:18pm

The problem is on the master node. When I turn off the slave, the problem disappears.

Danbasik230 · March 21, 2024, 2:34pm

The error was related to the http_bind_address parameter. It was set to 0.0.0.0, and because both hosts have the same Docker subnet address (172.0.0.1), both nodes ended up with the same REST API transport address.
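
For anyone landing here later: when http_bind_address is a wildcard (0.0.0.0) and http_publish_uri is not set, Graylog advertises the first non-loopback IPv4 address of the machine, which can be the Docker bridge address on every host. The exact change made in this thread was not posted, so the following is only a sketch of one possible fix, assuming the node's cluster-facing address is 192.0.2.11 (a placeholder, not a value from this thread). Each node would use its own address.

```
# /etc/graylog/server/server.conf on one node (the address is a placeholder)
http_bind_address = 0.0.0.0:9000
# Advertise an address the other Graylog node can actually reach,
# instead of letting Graylog pick the first non-loopback interface.
http_publish_uri = http://192.0.2.11:9000/
```

Binding http_bind_address to the node's specific IP instead of the wildcard should have the same effect. Either way, each node then gets a distinct transport address.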

system · April 4, 2024, 2:34pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
