Based on the documentation in the links below, I'm seriously torn between deploying an Elasticsearch cluster with 3 nodes (each node acting as master/data/ingest) or deploying 6 nodes, where 3 would be master/ingest and 3 would be data only.
Considering also that ES best practices suggest splitting nodes by role, and that those practices are not covered in the Graylog documentation (since it is focused on the Graylog product itself), I would really appreciate any insight or thoughts on which approach would be more appropriate for a daily ingestion of 25 to 50 GB of data.
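Whichever topology we end up with, I figure the first sanity check is confirming which roles each node actually holds once the cluster is up. A minimal sketch with the Python client (the endpoint is just a placeholder, not something from our setup):

```python
# Minimal sketch (elasticsearch-py, ES 7.x): list each node's roles.
# "node.role" is a string of flags, e.g. "dim" = data + ingest + master-eligible.
from elasticsearch import Elasticsearch

es = Elasticsearch("http://es-node-1:9200")  # hypothetical endpoint

for node in es.cat.nodes(format="json", h="name,node.role,master,heap.max"):
    # "master" shows "*" on the currently elected master node
    print(node["name"], node["node.role"], node["master"], node["heap.max"])
```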
How much memory and how many cores does each node have, and how much data do you want to keep in your cluster? As for memory, the practical maximum JVM heap is around 32 GB (to keep compressed object pointers), so a node would max out at about 64 GB of RAM, leaving the other half to the filesystem cache.
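If it helps, a quick way to check the configured heap on every node and spot any that cross the ~32 GB threshold; just a sketch assuming the Python client and a placeholder endpoint:

```python
# Minimal sketch: report each node's configured JVM heap and flag anything
# above the ~32 GB compressed-oops limit. Endpoint is a placeholder.
from elasticsearch import Elasticsearch

es = Elasticsearch("http://es-node-1:9200")  # hypothetical endpoint

stats = es.nodes.stats(metric="jvm")
for node_id, node in stats["nodes"].items():
    heap_max_gb = node["jvm"]["mem"]["heap_max_in_bytes"] / 1024**3
    note = "over ~32 GB" if heap_max_gb > 32 else "ok"
    print(f"{node['name']}: heap_max = {heap_max_gb:.1f} GB ({note})")
```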
Are replicas being used as a failsafe for your data?
You could start with three master/data nodes and work up from there if that is not enough for your requirements. From what I have read, a rule of thumb is that once you get above five or six nodes, a dedicated-master setup can improve things, and for data safety at least two master-only nodes would be the minimum. As I understand it, only one master is actually active at a time; I don't know how Graylog handles two ES masters, which sounds like a split-brain risk to me.
If you have six servers to spend, two master servers and four data servers would be more applicable.
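For what it's worth, ES 7 handles master quorum itself (the old minimum_master_nodes setting is gone), so checking which node got elected and how many nodes the cluster sees is mostly a sanity check. A minimal sketch with the Python client, endpoint again a placeholder:

```python
# Minimal sketch: cluster overview plus the currently elected master.
from elasticsearch import Elasticsearch

es = Elasticsearch("http://es-node-1:9200")  # hypothetical endpoint

health = es.cluster.health()
print("status:", health["status"],
      "| nodes:", health["number_of_nodes"],
      "| data nodes:", health["number_of_data_nodes"])

# Only one master is elected at a time; the rest are merely eligible.
print("elected master:", es.cat.master(format="json"))
```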
I ended up going with 3 master nodes and 3 data nodes, considering that heap size would not surpass 20 GB. We have been running Graylog on Kubernetes for quite a while, and these new considerations came up with the Graylog 4 upgrade along with the move from ES 6 to ES 7.