You could may be use source exclude: 
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/mapping-source-field.html#include-exclude

Though I think it would be better to extract yourself content using Tika if you 
are using Java and only send what you need to ES.


David

> Le 12 févr. 2015 à 22:39, warpka...@gmail.com a écrit :
> 
> Hi,
> 
> I have some terabytes of documents (pdf, office, etc) stored in some system 
> outside of ES. Suppose I want to make them searchable with ES, however I will 
> never serve the original documents from ES, but from that other system.
> Is it possible to send the documents to ES (e.g. via base64 encoded field and 
> the attachment type mapping), have ES index them and afterwards delete that 
> base64 field so that the "real content" of my documents is not stored in ES 
> (for cost reasons)?
> Queries will then be served by ES but the real document is served by that 
> other system I have.
> 
> Regards,
> Dieter
> -- 
> You received this message because you are subscribed to the Google Groups 
> "elasticsearch" group.
> To unsubscribe from this group and stop receiving emails from it, send an 
> email to elasticsearch+unsubscr...@googlegroups.com.
> To view this discussion on the web visit 
> https://groups.google.com/d/msgid/elasticsearch/d084274d-8d50-4fb7-8357-8d53f5177e1f%40googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/591A15AE-09DC-4D63-B541-483D2788CD15%40pilato.fr.
For more options, visit https://groups.google.com/d/optout.

Reply via email to