Grok optimization

cantipop · May 9, 2017, 5:52am

These are the timings for the grok patterns that I have. These are bad , good?

cantipop · May 9, 2017, 5:53am

and this one:

My entire 4 nodes are processing about 5000 msg/s . If it receive more than 6000 or 7000, it will start filling disk journal.
4 gl-nodes with 8 vCPU each should be able to process more messages?

jan · May 9, 2017, 7:30am

Hej @cantipop

please do not high-jack threads. It looks like you are asking for help to determine if your Grok patterns can be optimized. That is another topic.

That is why I have moved that out of the thread

regards
Jan

jtkarvo · May 9, 2017, 7:42am

hi,

to me, the first screen capture looks grim, but in the second capture the figures don’t look so bad.

I don’t know how to optimize GROK inputs. That is why I have switched most of my GROK patterns to regexes. I got a 10-fold performance improvement from that, and then further 2-3 fold improvement after optimizing my regexes.

The only guess on optimizing GROK from me would be that you need to be sure that ALL log lines have all the fields, as failed matches consume resources quite a lot for nothing.

Mantil · June 2, 2017, 10:35pm

Seems a bit low for that many nodes but I have found that Grok patterns can have a huge impact if not done correctly. We do use some but have learned it is MUCH faster and performance gaining to send the logs preformatted in GELF from the source. Not possible in all instances but a huge benefit where the option is available. NXLOG does a great job of this when configured for it. In general we process around 10-15k per second on just two nodes of around 4 cpus and 16 GB memory right now. Even with those specs We only generally have those nodes hit around 50% utilization most of the time.

Topic		Replies	Views
Optimize graylog nodes Graylog Central (peer support)	2	479	July 7, 2021
Graylog not processing messages with Grok pattern Graylog Central (peer support)	0	651	March 23, 2017
Pipeline performance optimization and metrics Graylog Central (peer support) pipeline-rules , route-to-streampl , grok-patternspl , documentation , architecture	3	223	June 11, 2024
Process buffer gets full with the Grok pattern Extractor Graylog Central (peer support)	2	1299	April 9, 2019
Help to optimize processing Graylog Central (peer support)	4	1794	March 25, 2019

Grok optimization

Related topics