Graylog message error

  1. Describe your incident:

The Graylog UI intermittently shows this message:

“Node cannot be found among active nodes”

The node is real and active:

  • Single-node installation

  • The node ID exists in:

    • /etc/graylog/server/node-id

    • MongoDB (db.nodes.find())

  • The node appears as RUNNING in Graylog

  • Searches and dashboards continue working, but the message appears intermittently and affects the dashboard experience.

Additional observations:

  • Initially OpenSearch/DataNode was configured with only 1 GB heap (-Xms1g -Xmx1g) on a VM with 32 GB RAM.

  • I increased heap to 8 GB and performance improved dramatically.

  • Before increasing heap, jcmd sometimes failed with:
    AttachNotSupportedException: target process doesn't respond within 10500ms

  • After increasing heap:

    • OpenSearch cluster health is GREEN

    • No pending tasks

    • No rejected search/write queues

    • Heap usage around 60%

  • Despite this, the UI message still appears randomly.
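For reference, the post-change checks could be scripted roughly as below. This is only a sketch: the function name `summarize_opensearch` is hypothetical, and the input shapes are assumed to match the JSON returned by OpenSearch's `_cluster/health` and `_cat/thread_pool?format=json` endpoints (fetching them from a Data Node, which secures OpenSearch with its own certificates, is left out).

```python
def summarize_opensearch(health, thread_pools):
    """Summarize cluster state from already-fetched JSON.

    health:       dict shaped like the _cluster/health response (assumed)
    thread_pools: list of dicts shaped like _cat/thread_pool?format=json (assumed)
    """
    # _cat APIs return numbers as strings, so convert before comparing.
    rejected = {}
    for pool in thread_pools:
        count = int(pool.get("rejected", 0))
        if count > 0:
            rejected[pool["name"]] = count

    green = health.get("status") == "green"
    pending = int(health.get("number_of_pending_tasks", 0))

    return {
        "green": green,
        "pending_tasks": pending,
        "rejected": rejected,
        # "healthy" here only means: green, nothing pending, nothing rejected.
        "healthy": green and pending == 0 and not rejected,
    }
```

With the values I currently see (GREEN, no pending tasks, no rejections), this kind of check reports the cluster as healthy, which is why I suspect the message comes from somewhere other than OpenSearch load.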

  2. Describe your environment:
  • Graylog 7.1.2

  • Single-node deployment

  • CentOS Stream 9

  • Graylog Data Node (embedded OpenSearch)

  • Local MongoDB

  • 32 GB RAM VM

  • OpenSearch heap currently:

    • -Xms8g

    • -Xmx8g

  3. What steps have you already taken to try and solve the problem?
  • Verified node ID exists and matches:

    • Graylog configuration

    • MongoDB database

  • Verified cluster health:

    • GREEN

    • No pending tasks

    • No rejected thread pool queues

  • Restarted:

    • graylog-server

    • graylog-datanode

  • Increased OpenSearch heap from 1 GB to 8 GB

  • Verified:

    • Node status RUNNING

    • Searches work

    • Dashboards load correctly

  • Cleared browser cache and tested with direct IP access

  4. How can the community help?

I would like help understanding:

  • Why the Graylog UI still randomly reports the node as inactive/not found

  • Whether this is a known Graylog 7.1.x UI/polling issue

  • Whether there are additional logs, APIs, or configuration checks recommended for troubleshooting this intermittent node-detection behavior
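To capture timestamps that can be matched against the server log, I am considering a small poller like the sketch below. It rests on assumptions I have not confirmed: that the UI message is tied to the active-node list, that this list is served at `/api/system/cluster/nodes`, and that the response contains a `nodes` array whose entries carry a `node_id` field. The function names and parameters are hypothetical.

```python
import base64
import json
import time
import urllib.request
from datetime import datetime, timezone


def node_is_listed(payload, node_id):
    """Return True if node_id appears in the (assumed) 'nodes' array."""
    return any(n.get("node_id") == node_id for n in payload.get("nodes", []))


def fetch_nodes(base_url, user, password):
    """GET the node list with HTTP basic auth. The path is an assumption."""
    req = urllib.request.Request(base_url.rstrip("/") + "/api/system/cluster/nodes")
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    req.add_header("Authorization", "Basic " + token)
    req.add_header("Accept", "application/json")
    with urllib.request.urlopen(req, timeout=5) as resp:
        return json.load(resp)


def watch(base_url, user, password, node_id, interval=1.0, rounds=600):
    """Poll repeatedly and print a UTC timestamp whenever the node is absent."""
    for _ in range(rounds):
        stamp = datetime.now(timezone.utc).isoformat()
        try:
            listed = node_is_listed(fetch_nodes(base_url, user, password), node_id)
        except (OSError, ValueError) as exc:
            # URLError/HTTPError are OSError subclasses; bad JSON raises ValueError.
            print(stamp, "request failed:", exc)
        else:
            if not listed:
                print(stamp, "node missing from active list")
        time.sleep(interval)
```

The idea would be to call `watch(...)` with the value from /etc/graylog/server/node-id and compare any printed timestamps with the graylog-server and MongoDB logs from the same moments.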
