Identify source and reason of bad messages

The problem is that Lucene (the search library underlying Elasticsearch) can index only fields with a size less than 32 KB. Larger fields can be stored but they cannot be indexed (and not analyzed).

Also see this Discourse topic and this GitHub issue: