If you are a Java developer, you could use Apache Tika. This is what I'm doing here: https://github.com/dadoonet/fsriver/blob/master/src/main/java/fr/pilato/elasticsearch/river/fs/river/FsRiver.java#L689
-- David Pilato | Technical Advocate | Elasticsearch.com @dadoonet | @elasticsearchfr Le 14 mars 2014 à 11:35:26, Sandeep (sandeep.test...@gmail.com) a écrit: How can I upload/index PDF/HTML/XML format documents to Elastic Search. -- View this message in context: http://elasticsearch-users.115913.n3.nabble.com/Upload-index-document-to-Elastic-Search-tp4051542.html Sent from the ElasticSearch Users mailing list archive at Nabble.com. -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/1394542439478-4051542.post%40n3.nabble.com. For more options, visit https://groups.google.com/d/optout. -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.5322deef.2eb141f2.1ccf%40MacBook-Air-de-David.local. For more options, visit https://groups.google.com/d/optout.