Before you post: Your responses to these questions will help the community help you. Please complete this template if you’re asking a support question.
Don’t forget to select tags to help index your topic!
1. Describe your incident:
2. Describe your environment:
-
OS Information:
OpenSearch 2.0.1
Server OS: Ubuntu 22.04 in LXC guest (16 core 64GB); LXC host: Debian Bookworm (16 core 64GB)
Graylog: 5.0.3+a82acb2 (open/community edition) -
Service logs, configurations, and environment variables:
# Path to a custom java executable. By default the java executable of the
# bundled JVM is used.
#JAVA=/usr/bin/java
# Default Java options for heap and garbage collection.
GRAYLOG_SERVER_JAVA_OPTS="-Xms31g -Xmx31g -server -XX:+UseG1GC -XX:-OmitStackTraceInFastThrow"
# Avoid endless loop with some TLSv1.3 implementations.
GRAYLOG_SERVER_JAVA_OPTS="$GRAYLOG_SERVER_JAVA_OPTS -Djdk.tls.acknowledgeCloseNotify=true"
# Fix for log4j CVE-2021-44228
GRAYLOG_SERVER_JAVA_OPTS="$GRAYLOG_SERVER_JAVA_OPTS -Dlog4j2.formatMsgNoLookups=true"
# Pass some extra args to graylog-server. (i.e. "-d" to enable debug mode)
GRAYLOG_SERVER_ARGS=""
# Program that will be used to wrap the graylog-server command. Useful to
# support programs like authbind.
GRAYLOG_COMMAND_WRAPPER=""
Indices:
80 total shards
Outgoing traffic is between 16-18GB, daily
Indexing failures are through the rough mostly because:
a) OpenSearchException[OpenSearch exception [type=mapper_parsing_exception, reason=failed to parse field [ListBaseType] of type [long] in document with id 'c0b9fbc0-c8c5-11ed-895a-00163ef2bcdd'. Preview of field's value: 'GenericList']]; nested: OpenSearchException[OpenSearch exception [type=illegal_argument_exception, reason=For input string: "GenericList"]];
or
b) OpenSearchException[OpenSearch exception [type=illegal_argument_exception, reason=Limit of total fields [1000] has been exceeded]]
3. What steps have you already taken to try and solve the problem?
Just MacGuyver stuff. Panicked heap increases; inceease total field limits to 2000; forced restarts
4. How can the community help?
Basically I am just looking for suggestions on how to improve my situation. Issue a), above, has been commented on before. I just haven’t gotten my hands dirty, yet.
I think what I need is some guidance as to best practice for indices / shard management.
We are a single node, all-in-one (open/mongo/graylog all on one machine). I have access to heavier hitting hardware but I have to think better management would keep this instance tip-top.
All my knowledge with respect to database management has come by way of Graylog over the past year and half so I am still but a child in this area.
Thank you!
Helpful Posting Tips: Tips for Posting Questions that Get Answers [Hold down CTRL and link on link to open tips documents in a separate tab]