Hey,
the attachment stuff, is on the top of my head only.
1. The original document is stored as base64 inside of the source. which is
also stored on indexing. The field itself is not stored iirc.
2. You could exclude it from being stored in the source. Using tika as a
preprocessing step is anothe
Hello there! I have some questions about Attachment field, results
highlighting and suggesting.
1. Is the Attachment field storing whole file or extracted text only by
default?
2. If it stores the whole file is there any way to make it store only
extracted text? Or should I extract