[
https://issues.apache.org/jira/browse/SOLR-2416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jayendra Patil updated SOLR-2416:
---------------------------------
Attachment: SOLR-2416_ExtractingDocumentLoader.patch
Fix attached.
> Solr Cell & DataImport Tika handler broken - fails to index Zip file contents
> -----------------------------------------------------------------------------
>
> Key: SOLR-2416
> URL: https://issues.apache.org/jira/browse/SOLR-2416
> Project: Solr
> Issue Type: Bug
> Components: contrib - DataImportHandler, contrib - Solr Cell (Tika extraction)
> Affects Versions: 4.0
> Reporter: Jayendra Patil
> Attachments: SOLR-2416_ExtractingDocumentLoader.patch
>
>
> Working with the latest Solr trunk code, it seems the Tika handlers for Solr
> Cell (ExtractingDocumentLoader.java) and the Data Import handler
> (TikaEntityProcessor.java) again fail to index zip file contents.
> Only the file names are indexed.
> This issue was addressed late last year but seems to have
> reappeared with the latest code.
> The JIRA issue for the Data Import handler part, with the patch and the test case:
> https://issues.apache.org/jira/browse/SOLR-2332