Get alerts when unprocessed messages reaches limit

richiexlinares · January 16, 2023, 3:17pm

Newbie here. We’ve been having issues with Graylog going down in some deployments since it was getting too many logs, we solved it (hopefully) by increasing the JVM heap size for both Graylog and Elasticsearch (4G each).

We where at 963,141 unprocessed messages when it went down.

Now we want to set up some kind of alert that can tells us when there are too many unprocessed messages (we have all the infrastructure on AWS) so we can get alerts when Graylog is stuck on those processes since we are not able to know unless we manually check the server.

This is our server.conf

is_master = true
node_id_file = /etc/graylog/server/node-id
bin_dir = /usr/share/graylog-server/bin
data_dir = /var/lib/graylog-server
plugin_dir = /usr/share/graylog-server/plugin
http_bind_address = 0.0.0.0:9000
rotation_strategy = count
elasticsearch_max_docs_per_index = 20000000
elasticsearch_max_number_of_indices = 20
retention_strategy = delete
elasticsearch_shards = 4
elasticsearch_replicas = 0
elasticsearch_index_prefix = graylog
allow_leading_wildcard_searches = false
allow_highlighting = false
elasticsearch_analyzer = standard
output_batch_size = 500
output_flush_interval = 1
output_fault_count_threshold = 5
output_fault_penalty_seconds = 30
processbuffer_processors = 5
outputbuffer_processors = 3
processor_wait_strategy = blocking
ring_size = 65536
inputbuffer_ring_size = 65536
inputbuffer_processors = 2
inputbuffer_wait_strategy = blocking
message_journal_enabled = true
message_journal_dir = /var/lib/graylog-server/journal
lb_recognition_period_seconds = 3
mongodb_uri = mongodb://localhost/graylog
mongodb_max_connections = 1000
mongodb_threads_allowed_to_block_multiplier = 5
proxied_requests_thread_pool_size = 32

Thanks!

ramindia · January 16, 2023, 9:26pm

check this Monitoring :

if you are looking high-volume process, look out for Graylog sizing.

gsmith · January 17, 2023, 1:01am

Hello && welcome @richiexlinares

Adding on to @ramindia suggestion.

Using the Metrics on Graylog node works…

Found here.

Metrics

what I did was enable Prometheus in Graylog config file the Install Grafana. This gave me the ablility to send an alert if something went wrong.

Example:

This also can be done through Zabbix.

Unprocessed message, I belive that would be the Journal.

Example:

system · January 31, 2023, 1:01am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Alert if unprocessed message count is above a certain number Graylog Central (peer support)	3	1136	December 1, 2020
Unprocessed messages are currently in the journal Graylog Central (peer support) sidecar , filebeat-linux , filebeat-windows , nosendlogfblx	18	12417	January 26, 2018
Graylog not processing data Graylog Central (peer support)	13	6950	March 30, 2017
Unprocessed messages is constantly increasing Graylog Central (peer support)	4	4716	June 24, 2020
Unprocessed messages Graylog Central (peer support)	9	2189	January 8, 2018

Get alerts when unprocessed messages reaches limit

Related topics