Most efficient way to use pipelines for routing

Hello all,

I’m wondering what the most efficient way to route messages to streams is. I’ve been using a pipeline attached to one input stream; this pipeline has 6 different rules.
Each rule evaluates a certain field and, if the condition matches, routes that message to a certain stream.
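For context, each of those rules is roughly of this shape (the field name "operation", its value, and the stream name below are placeholders, not my actual config):

```
rule "route auth operations"
when
  // only fire when the field exists and matches this rule's value
  has_field("operation") && to_string($message.operation) == "auth"
then
  // route_to_stream is a built-in pipeline function;
  // remove_from_default stops double-indexing into the default stream
  route_to_stream(name: "Auth Stream", remove_from_default: true);
end
```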

Would it be more efficient to have only one routing rule per pipeline? That way, the other rules wouldn’t be evaluated for each message. (Each message comes into a ‘Global’ stream and is then routed to one substream depending on its operation; messages never need to be routed into multiple streams by these pipelines and rules.)

Since only aggregated pipeline processing time metrics are available (not processing time per pipeline), I’m not sure there’s a clear answer to this.

The input stream in question gets ~100 messages per second during normal hours. This will be increasing drastically, so I want to make sure I’m using the best approach to deal with these messages.

Thanks!

Hey @josh.7

Actually, that sounds like a textbook pipeline you described :joy: As with anything, the more it is used, the more you’ll want to keep an eye on resources (CPU, memory, etc.). There are also stages that can be utilized.
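As a rough sketch of what stages look like: rules in the same stage are evaluated together, and the stage’s match setting ("match either" vs "match all") controls whether the message continues to later stages. The pipeline and rule names below are made up for illustration:

```
pipeline "Routing pipeline"
stage 0 match either
  // each rule checks its own field condition and routes on match
  rule "route auth operations";
  rule "route firewall operations";
stage 1 match either
  // stage 1 only runs if at least one stage-0 rule matched
  rule "tag routed messages";
end
```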

Just an idea: if you have different device types, say firewalls, Windows, Linux, switches, etc., it wouldn’t be a bad idea to separate them into their own inputs. You can make index sets for each, which would attach to different streams. The reason I say this is that we started with one stream and the default index set, at 5–10 GB/day. As we expanded, the volume of logs grew to 100–200 GB/day, and it became a pain to reconfigure our GL cluster after that.