Performance Problems

Hi

We recently installed Graylog and I cannot for the life of me get the logs into Elasticsearch quickly enough, so our unprocessed message count keeps increasing.

We have a CentOS 7 server with 8 CPUs and 64 GB of memory. 20 GB is assigned to the Graylog heap and 20 GB to the Elasticsearch heap.

My server.conf file originally had these settings:
processbuffer_processors = 5
outputbuffer_processors = 3
inputbuffer_processors = 2

I found another post that suggested tinkering with these, but changing the values (and restarting the service) hasn’t helped.

I’m quite new to Graylog, so any support would be great (and please dumb things down for me as much as you can to get me back onto the right track).

Thanks in advance

Tom


Hey @TomWhi

I guess your Elasticsearch and your Graylog are fighting for CPU; that is why it can’t get up to speed.

You would need to lower your ingest volume or add another server with the same specs dedicated to Elasticsearch … or add another that is a little smaller dedicated to Graylog.
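If you do keep both on one box for now, also make sure the buffer processors don’t claim every core. A minimal sketch for an 8-core host, assuming you leave roughly half the cores for Elasticsearch and the OS (adjust to your own load, this is only a starting point):

# leave headroom for Elasticsearch and the OS on a shared 8-core host (assumption)
processbuffer_processors = 3
outputbuffer_processors = 2
inputbuffer_processors = 1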


Thanks for the advice, Jan. Are there any other tweaks I can make?

I ran top and noticed that the Elasticsearch java process wants 233g of virtual memory. Is that normal, or would that be solved with more vCPUs / splitting my Graylog and Elasticsearch out?

PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
3193 graylog 20 0 26.4g 20.6g 9872 S 351.5 32.9 2533:28 java
3189 elastic+ 20 0 233.7g 20.5g 144096 S 255.8 32.7 1633:08 java
3315 mysql 20 0 21.3g 717108 7372 S 16.6 1.1 52:15.20 mysqld
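For comparison, I can also ask Elasticsearch what it thinks it is using for heap and compare that with what top shows (assuming the default REST API on localhost:9200):

curl -s 'http://localhost:9200/_cat/nodes?v&h=name,heap.current,heap.percent,heap.max,ram.current'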

  1. Also check your disk subsystem, whether it has enough I/O for that volume of data to be written. I see MySQL running on the same server; that can consume a lot of I/O if it is under heavy load. How powerful is your disk subsystem? How many disks, how much RAID cache, and do you use local disk/SAN/NAS? (See the iostat/fio sketch after this list.)
  2. If you use a lot of generic extraction rules, try to move them to pipeline rules and separate the types of logs by pipeline, with rules for each specific application/program/device, to lower Graylog’s CPU usage. (See the pipeline rule sketch after this list.)
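A quick way to get a feel for the disk subsystem while messages are flowing (iostat comes from the sysstat package; the fio line is just an illustrative random-write test, and the file path and sizes are placeholders you would adapt to the volume under test):

iostat -x 5    # watch %util and await on the data disk while Graylog is ingesting
fio --name=randwrite --filename=/path/on/the/data/volume/fio-test --rw=randwrite --bs=4k --size=1g --iodepth=16 --ioengine=libaio --direct=1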
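As an illustration of the second point, a per-device pipeline rule might look something like this (only a sketch: the "asa" source match is an assumption, and the CISCOFW106023 grok pattern has to exist in your grok pattern set):

rule "parse cisco asa messages"
when
  has_field("message") && contains(value: to_string($message.source), search: "asa")
then
  # extract the Cisco ASA fields instead of running generic extractors on every message
  set_fields(grok(pattern: "%{CISCOFW106023}", value: to_string($message.message), only_named_captures: true));
end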

Thank you. Our disks are hosted on a SAN, and I think we get good performance out of the disk array, but I will look into it. I must admit it is shared with other workloads, so I’ll run some performance tests on it.

I think we use the Windows and Cisco extractors, but I’ll look at tuning those. We’re also getting bombarded with Windows events, so I need to tune those away and keep just what I need for now.
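For example, I’m thinking of dropping the noisiest event IDs with a pipeline rule along these lines (just a sketch: the event_id field name and the 5156 event ID are assumptions based on how our Windows input names things):

rule "drop noisy windows filtering platform events"
when
  has_field("event_id") && to_long($message.event_id) == 5156
then
  # discard the message before it reaches the output buffer and Elasticsearch
  drop_message();
end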
