Getting the following error message repeat itself several times a second after upgrading from 3.2.2 to 3.2.3. I don’t even know where to begin looking for the source of the issue. Any ideas?
Also mongodb is sitting at almost 100% CPU. Pretty sure that’s not normal.
2020-03-18T17:13:12.796+10:30 ERROR [JobExecutionEngine] Couldn't handle trigger due to a permanent error 5e713508666462083e52b903 - trigger won't be retried
java.lang.IllegalStateException: Couldn't find job definition 5e6afbb5666462083e898539
at org.graylog.scheduler.JobExecutionEngine.lambda$handleTrigger$1(JobExecutionEngine.java:137) ~[graylog.jar:?]
at java.util.Optional.orElseThrow(Optional.java:290) ~[?:1.8.0_242]
at org.graylog.scheduler.JobExecutionEngine.handleTrigger(JobExecutionEngine.java:137) ~[graylog.jar:?]
at org.graylog.scheduler.JobExecutionEngine.lambda$execute$0(JobExecutionEngine.java:119) ~[graylog.jar:?]
at org.graylog.scheduler.worker.JobWorkerPool.lambda$execute$0(JobWorkerPool.java:110) ~[graylog.jar:?]
at com.codahale.metrics.InstrumentedExecutorService$InstrumentedRunnable.run(InstrumentedExecutorService.java:181) [graylog.jar:?]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_242]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_242]
at com.codahale.metrics.InstrumentedThreadFactory$InstrumentedRunnable.run(InstrumentedThreadFactory.java:66) [graylog.jar:?]
at java.lang.Thread.run(Thread.java:748) [?:1.8.0_242]
OK some more information. I ran ‘echo ‘db.scheduler_triggers.find()’ | mongo graylog’ and I get several entries similar to below.
It looks like what has happened is I created an event trigger with an email alert but I didn’t have email smtp settings configured so it wasn’t able to send. I upgraded to 3.2.3 and deleted the event and alert objects but it seems there are some remnants in the database. Does anyone know how I can remove them?
For anyone stumbling into this issue, I figured out how to delete objects from mongodb and just removed everything from the scheduled_triggers collection which has resolved the issue.