Re: Solr Indexing error

Shawn Heisey Tue, 28 Aug 2018 05:29:19 -0700

On 8/28/2018 6:03 AM, kunhu0...@gmail.com wrote:

possible analysis error: Document contains at least one immense term in
field="content" (whose UTF8 encoding is longer than the max length 32766),


It's telling you exactly what is wrong.

The field named "content" is probably using a field class with noanalysis, or using the Keyword Tokenizer so the whole field gets treatedas a single term. The length of that field for at least one of yourdocuments is longer than 32766 characters. Maybe it's bytes -- a UTF8character can be more than a single byte. Lucene has a limit on termlength, and your input exceeded that length.

If you change the field type for content to something that's analyzed(split into words, basically) then this problem would likely go away.


Thanks,
Shawn

Re: Solr Indexing error

Reply via email to