On 12/9/2013 9:20 AM, neerajp wrote:
I tried to use ExtractingUpdateProcessor but soon found that it has not been released in Solr 4.5. I am not sure how to use ExtractingRequestHandler for an XML document that has some fields in plain text and some fields in arbitrary binary format. It seems to me that ExtractingRequestHandler is meant for extracting text from a binary input file, but my input document is in XML format, not binary.
ExtractingRequestHandler is a contrib module. It's not included in the Solr application war itself, but it IS in the download. You can find the jars in contrib/extraction/lib in all 4.x versions, including 4.5, 4.5.1, and 4.6.
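If you want to confirm the jars are present in your download, something like the following should do it. This is only a rough sketch: the function name find_extraction_jars and the idea of passing in the unpacked download directory are my own assumptions for illustration, and the only thing taken from the above is the contrib/extraction/lib path.

```python
from pathlib import Path


def find_extraction_jars(solr_dir):
    """Return the sorted jar file names found under contrib/extraction/lib.

    solr_dir is assumed to be the top-level directory of an unpacked
    Solr 4.x download (hypothetical parameter, for illustration only).
    """
    lib_dir = Path(solr_dir) / "contrib" / "extraction" / "lib"
    if not lib_dir.is_dir():
        # No such directory: either the path is wrong or this is not
        # a full Solr download.
        return []
    return sorted(p.name for p in lib_dir.glob("*.jar"))
```

Call it with the path where you unpacked Solr. An empty list means the directory is missing or contains no jars, in which case you are probably not pointing at the full download.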
Thanks, Shawn