Never mind, a little googling answered that question:
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/root-object.html#source-field
"In a search request, you can ask for only certain fields by
specifying the _source parameter in the request body".

That neatly resolves my issue!

It does mean I'm going to have to change my mapping and, probably,
re-index my entire collection.

Thanks for your help
Jordan

On Thu, Jul 31, 2014 at 10:21 PM, Jordan Reiter <jor...@aace.org> wrote:
> Is there any way to do this so it can be stored but I don't get it
> when I pull in the _source record? Even extracted text is going to be
> huge when you're talking about 20-30+ page documents.
>
> On Thu, Jul 31, 2014 at 10:34 AM, David Pilato <da...@pilato.fr> wrote:
>> In your case, it's not. Because you excluded the attachment field.
>>
>> If you are a Java developer, you could easily use Tika directly in your own
>> code and send to elasticsearch only the extracted content and not the binary
>> file.
>> In that case, you could remove mapper attachment plugin.
>>
>> If not, I think you need to send again the full JSON document, including the
>> binary file.
>>
>> --
>> David Pilato | Technical Advocate | Elasticsearch.com
>> @dadoonet | @elasticsearchfr
>>
>>
>> Le 31 juillet 2014 à 16:32:04, Jordan Reiter (jordantheco...@gmail.com) a
>> écrit:
>>
>> So I guess using updates is not a good idea for records with file
>> attachments.
>> --
>> You received this message because you are subscribed to the Google Groups
>> "elasticsearch" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to elasticsearch+unsubscr...@googlegroups.com.
>>
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/elasticsearch/972f1b41-d10b-4ebc-8ac2-c83b80891924%40googlegroups.com.
>> For more options, visit https://groups.google.com/d/optout.
>>
>> --
>> You received this message because you are subscribed to a topic in the
>> Google Groups "elasticsearch" group.
>> To unsubscribe from this topic, visit
>> https://groups.google.com/d/topic/elasticsearch/26HBTz6XKgM/unsubscribe.
>> To unsubscribe from this group and all its topics, send an email to
>> elasticsearch+unsubscr...@googlegroups.com.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/elasticsearch/etPan.53da5401.684a481a.f0d0%40MacBook-Air-de-David.local.
>>
>> For more options, visit https://groups.google.com/d/optout.
>
>
>
> --
> Jordan Reiter
> AACE - Association for the Advancement of Computing in Education
> Email: jor...@aace.org | Website: www.aace.org | +1.267.438.2388



-- 
Jordan Reiter
AACE - Association for the Advancement of Computing in Education
Email: jor...@aace.org | Website: www.aace.org | +1.267.438.2388

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAD4hTsWbKFb2iA9x-ezsz-EiY8j1gH%2BfkMpGv-khQnyUqv%3DqzA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to