Re: Attachment field questions and some more

2014-01-27 Thread Alexander Reelsen
Hey, the attachment stuff, is on the top of my head only. 1. The original document is stored as base64 inside of the source. which is also stored on indexing. The field itself is not stored iirc. 2. You could exclude it from being stored in the source. Using tika as a preprocessing step is anothe

Attachment field questions and some more

2014-01-23 Thread Iv Igi
Hello there! I have some questions about Attachment field, results highlighting and suggesting. 1. Is the Attachment field storing whole file or extracted text only by default? 2. If it stores the whole file is there any way to make it store only extracted text? Or should I extract