Pipeline Processor vs Extractor

nadx969 · July 19, 2018, 12:08pm

Hello
I am trying to determine which method uses fewer resources (CPU, load, memory, I/O) on the Graylog cluster when extracting key-value pairs from a message: a pipeline processor or an extractor.

To provide an example, approximately 20% of the messages coming into an Apache log input contain a URL string with URL-encoded characters that we would like to parse:

GET /uri/rr.php?url=https%3A%2F%2Fwww.exampledomain.com/ar/%2Far%2Fconverter%2Fconvert%2F%3Ffield%3D1%26From%3DFIELD%26To%3DSAR&pageId=blank_convert&fromThisy=fieldExample&toThis=anotherFieldExample HTTP/1.1

With a pipeline processor, this would require at least two rules: the first would use a regex to identify the messages to be processed, and the second would grab certain key-value pairs and place them in new fields.

With an extractor, this would be done on the input following similar logic: only process messages containing a certain string.

What I would like to understand is which method would require fewer resources on Graylog.

thanks

jochen · July 19, 2018, 12:56pm

rule "message-with-url"
when
  contains(to_string($message.message), "%2F")
then
  // process message
end
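
Building on the rule above, here is a minimal sketch of what the processing step might look like for the example request. It assumes the query string can be isolated with a simple regex, that `&` and `=` are the only delimiters, and that the installed Graylog version provides the `urldecode` function. The rule name, the `rr.php?` match string, and the `req_` field prefix are made up for illustration.

```
rule "extract-url-params"
when
  // hypothetical condition: only touch requests to the redirect script
  contains(to_string($message.message), "rr.php?")
then
  // capture everything between "?" and the next whitespace (the query string)
  let m = regex("\\?(\\S+)", to_string($message.message));
  // split the query string into key=value pairs on "&"
  let pairs = key_value(value: to_string(m["0"]), delimiters: "&", kv_delimiters: "=");
  // store each pair as its own field, e.g. req_pageId, req_toThis
  set_fields(fields: pairs, prefix: "req_");
  // overwrite req_url with its decoded form (assumes urldecode is available)
  set_field("req_url", urldecode(to_string(pairs["url"])));
end
```

Splitting on `&` before decoding matters here: the nested `url` value carries its own `%26` and `%3D`, which would be mistaken for delimiters if the whole query string were decoded first.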

nadx969 · July 23, 2018, 4:12pm

Thanks @jochen.

However, I am more interested in understanding which method would be the most useful.
More specifically, which method would use fewer system resources (CPU, memory, I/O, load)? Or is that unknown?

jan · July 24, 2018, 7:43am

@nadx969 that isn’t measured by Graylog, but the processing pipelines are where we will move in the future too.

So the future-safe solution is to use and understand the processing pipelines.

