Hi,
I posted a question in November last year about indexing content from
multiple binary files into a single Solr document and Jayendra responded
with a simple solution to zip them up and send that single file to Solr.
I understand that the Tika 0.4 JARs supplied with Solr 1.4.1 don't
currently support this: only the file names of the zipped files are
indexed, not their contents.
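For illustration, the zip-and-send approach can be sketched roughly as below. This is a minimal sketch under assumptions, not the exact setup in use: the handler URL is the default from the Solr example configuration, the helper names build_zip and build_extract_request are hypothetical, and the request is only built here, not sent.

```python
import io
import zipfile
from urllib.parse import urlencode
from urllib.request import Request

# Assumed default from the Solr example setup; adjust to the local install.
SOLR_EXTRACT_URL = "http://localhost:8983/solr/update/extract"


def build_zip(files):
    """Pack a {name: bytes} mapping into an in-memory zip archive."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        for name, data in files.items():
            zf.writestr(name, data)
    return buf.getvalue()


def build_extract_request(doc_id, zip_bytes, base_url=SOLR_EXTRACT_URL):
    """Build (but do not send) a POST of the archive to the extract handler."""
    # literal.id sets the unique key of the single resulting Solr document.
    params = urlencode({"literal.id": doc_id, "commit": "true"})
    return Request(
        f"{base_url}?{params}",
        data=zip_bytes,
        headers={"Content-Type": "application/zip"},
        method="POST",
    )
```

Passing the returned request to urllib.request.urlopen would send it to a running Solr instance. Whether the contents of the zipped files end up in the index still depends on the Tika version that Solr loads.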
I've tried downloading and building the latest Tika (0.8) and replacing
the tika-parsers and tika-core JARs in
<solr-root>\contrib\extraction\lib but this still isn't indexing the
file contents, and now doesn't even index the file names!
Is there a version of Tika that works with the released Solr 1.4.1
distribution and does index the contents of the zipped files?
Thanks and kind regards,
Gary