[ https://issues.apache.org/jira/browse/SOLR-2416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16084606#comment-16084606 ]
Tim Allison edited comment on SOLR-2416 at 7/12/17 8:20 PM:
------------------------------------------------------------

For more fun with embedded docs, see the issue on adding the RecursiveParserWrapper's behavior to Solr -- SOLR-7229. This would create a separate Solr document for each document embedded in the zip (perhaps child documents?).

was (Author: talli...@mitre.org):
For more fun with embedded docs, see the issue on adding the RecursiveParserWrapper's behavior to Solr -- SOLR-7229

> Solr Cell fails to index Zip file contents
> ------------------------------------------
>
>                 Key: SOLR-2416
>                 URL: https://issues.apache.org/jira/browse/SOLR-2416
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - DataImportHandler, contrib - Solr Cell (Tika
> extraction)
>    Affects Versions: 1.4.1
>            Reporter: Jayendra Patil
>             Fix For: 6.0
>
>         Attachments: SOLR-2416_ExtractingDocumentLoader.patch, SOLR-4216.patch
>
>
> Working with the latest Solr Trunk code and seems the Tika handlers for Solr
> Cell (ExtractingDocumentLoader.java) and Data Import handler
> (TikaEntityProcessor.java) fails to index the zip file contents again.
> It just indexes the file names again.
> This issue was addressed some time back, late last year, but seems to have
> reappeared with the latest code.
> Jira for the Data Import handler part with the patch and the testcase -
> https://issues.apache.org/jira/browse/SOLR-2332.
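
To make the "separate Solr document for each document embedded in the zip" idea from the comment above concrete, here is a rough Python sketch. It is not Tika or Solr code and does not use the RecursiveParserWrapper; it only uses the standard-library zipfile module to show the shape of the output: one record for the container plus one record per embedded file, each pointing back at its parent. The function names and the field names (id, parent_id, resource_name, content) are made up for illustration, and treating every entry as UTF-8 text is an assumption standing in for real content extraction.

```python
import io
import zipfile


def zip_to_documents(zip_bytes, parent_id):
    """Return a parent record plus one record per file inside the zip.

    Hypothetical helper: the field names are illustrative and do not
    correspond to any Solr schema or Tika metadata key.
    """
    # The container itself becomes the parent document.
    docs = [{"id": parent_id, "content": ""}]
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue  # directory entries carry no content
            raw = archive.read(info)
            docs.append({
                "id": f"{parent_id}!{info.filename}",
                "parent_id": parent_id,
                "resource_name": info.filename,
                # Assumption: entries are plain text. A real pipeline
                # would hand the bytes to a content extractor instead.
                "content": raw.decode("utf-8", errors="replace"),
            })
    return docs


def build_sample_zip():
    """Build a small in-memory zip with two text files for demonstration."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("a.txt", "alpha")
        archive.writestr("docs/b.txt", "beta")
    return buffer.getvalue()


if __name__ == "__main__":
    for doc in zip_to_documents(build_sample_zip(), "sample.zip"):
        print(doc)
```

Indexing only the parent record with the entry names would correspond to the behaviour reported in the issue (file names indexed, contents missing); the per-entry records are what the comment suggests might map to child documents.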