Graylog version: v3.3.2+ade4779
VM resources: 16 vCPU + 96 GB RAM
One node
Recently the dashboard performance has degraded, so I tried to fine-tune the setup by increasing the Graylog JVM heap to 16 GB and raising the buffer processor settings in the Graylog server.conf:
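For reference, a minimal sketch of the kind of changes this involves, assuming a package install where the heap is set in /etc/default/graylog-server and the buffer processors in server.conf; the values below are illustrative, not the actual ones from this setup:

```
# /etc/default/graylog-server  (RPM installs use /etc/sysconfig/graylog-server)
# Example: raise the Graylog JVM heap to 16 GB
GRAYLOG_SERVER_JAVA_OPTS="-Xms16g -Xmx16g -server"

# /etc/graylog/server/server.conf
# Example buffer processor thread counts; their sum should stay
# comfortably below the number of vCPUs available to Graylog
processbuffer_processors = 8
outputbuffer_processors = 4
inputbuffer_processors = 2
```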
I have tried stopping all inputs except one, disabling all pipelines, and deleting all extractors created shortly before this happened, but it didn't help.
I found out that there are tons of unprocessed messages and that the process and output buffers are completely exhausted. Maybe this is the reason? If it is, can anyone suggest a way to alleviate the pain?
If you have added those 16 cores to a VM that hosts Elasticsearch AND Graylog, it is very likely that both are fighting for the CPUs…
You should limit the number of cores that are available to Elasticsearch and not overcommit the processors on the Graylog side.
The picture you show, with the output buffer filled, is an indicator that Elasticsearch is taking too long to index the messages and to return/free Graylog's output processors, which causes a jam in the processing queue.
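As an illustration of that first suggestion, a minimal sketch of capping how many processors Elasticsearch sizes itself for, assuming a recent Elasticsearch where the elasticsearch.yml setting is node.processors (older releases call it simply processors); the value of 8 is just an example for a 16-vCPU host shared with Graylog:

```
# /etc/elasticsearch/elasticsearch.yml
# Size Elasticsearch's thread pools as if only 8 of the 16 vCPUs
# were available, leaving headroom for Graylog on the same VM.
# (node.processors on ES 7.4+; plain "processors" on older versions.)
node.processors: 8
```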
I have done some research on how to optimize Elasticsearch and adjusted some settings, including memory and the swap file.
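For context, a sketch of the kind of memory and swap adjustments commonly applied here, assuming a systemd service install; the heap size is an example and should stay at no more than half of RAM and below ~31 GB:

```
# /etc/elasticsearch/jvm.options
# Fixed heap size (example: 24 GB, well under half of the 96 GB of RAM)
-Xms24g
-Xmx24g

# /etc/elasticsearch/elasticsearch.yml
# Lock the JVM heap in RAM so it cannot be swapped out
bootstrap.memory_lock: true

# Shell: disable swap on the host (comment out the swap entry in
# /etc/fstab to make this persistent across reboots)
#   sudo swapoff -a

# For the memory lock to take effect under systemd, allow it in a
# service override:  systemctl edit elasticsearch
#   [Service]
#   LimitMEMLOCK=infinity
```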
The issue has seemingly been gone for days now.
But I can't find where to restrict the number of processors for Elasticsearch. I can't find the setting in elasticsearch.yml (I am running it as a service); I can only find examples for running it in Docker or from the command line.