This is all good and sound, but when I try to use the turnkey Grafana dashboards (this one for example), they do not seem to work (no data is displayed) :
From the troubleshooting I did, it seems the GL metric names used in these dashboards are different to the ones pulled by Prometheus. For example, the dashboards use org_graylog2_journal_append_1_sec_rate while the metric pulled is gl_journal_append_1_sec_rate.
Does anyone have a working Grafana dashboard working with GL 4.1 ?
So that dashboard you linked from Grafana is suuuuuuuuper old and not one maintained by us. The dashboard relies on an outdated prometheus metrics collector plugin, whereas the prometheus endpoint implemented in 4.1 uses different metric names/prefixes.
Let me tweak it a bit more–there are still some old queries in it and I’m not quite happy with how things are displayed. I should have something more readily usable soon
@H2Cyber so I’m getting closer (though there’s going to have to be some future work on some of the metrics). I’m curious about what you’d think would be most useful to see? What I have now is an “at a glance” row at the top, followed by more detailed graphs and metrics.
I have multiple inputs, various extractors for each input, multiple streams, and pipelines attached to each stream. It would be nice to have graphs that help identify bottleneck(s) in these.
Since the Openinsights is a paid service, I don’t think that the dashboard will be made public. However, this is where I’ve gotten so far until there are further improvements made to the existing metrics. Graylog metrics
Is it possible to have a metric that shows the state of the cluster (is it up or not) and a metric that shows whether the VM is a leader or not? please i need Help