[jira] [Updated] (SOLR-2332) TikaEntityProcessor retrieves only File Names from Zip extraction
[ https://issues.apache.org/jira/browse/SOLR-2332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-2332: --- Fix Version/s: (was: 4.0) removing fixVersion=4.0 since there is no evidence that anyone is currently working on this issue. (this can certainly be revisited if volunteers step forward) > TikaEntityProcessor retrieves only File Names from Zip extraction > - > > Key: SOLR-2332 > URL: https://issues.apache.org/jira/browse/SOLR-2332 > Project: Solr > Issue Type: Bug > Components: contrib - DataImportHandler >Reporter: Jayendra Patil > Attachments: SOLR-2332.patch, solr-word.zip > > > Extraction of Zip files using TikaEntityProcessor results in only names of > file. > It does not extract the contents of the Files in the Zip -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] Updated: (SOLR-2332) TikaEntityProcessor retrieves only File Names from Zip extraction
[ https://issues.apache.org/jira/browse/SOLR-2332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-2332: --- Affects Version/s: (was: 4.0) Fix Version/s: 3.2 I can't find any docs suggestion how exactly TikaEntityProcessor should be expected to deal with zip files, particularly what to expect if a zip files contains multiple documents. FWIW: TikaEntityProcessor did not exist in Solr 1.4.1, so the behavior currently seen in the 3x branch (and the 3.1rc1 artifacts) is not a regression. > TikaEntityProcessor retrieves only File Names from Zip extraction > - > > Key: SOLR-2332 > URL: https://issues.apache.org/jira/browse/SOLR-2332 > Project: Solr > Issue Type: Bug > Components: contrib - DataImportHandler >Reporter: Jayendra Patil > Fix For: 3.2 > > Attachments: SOLR-2332.patch, solr-word.zip > > > Extraction of Zip files using TikaEntityProcessor results in only names of > file. > It does not extract the contents of the Files in the Zip -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] Updated: (SOLR-2332) TikaEntityProcessor retrieves only File Names from Zip extraction
[ https://issues.apache.org/jira/browse/SOLR-2332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jayendra Patil updated SOLR-2332: - Attachment: solr-word.zip SOLR-2332.patch Attached is the Patch for the fix and Testcase. Also attached is the Test zip file. > TikaEntityProcessor retrieves only File Names from Zip extraction > - > > Key: SOLR-2332 > URL: https://issues.apache.org/jira/browse/SOLR-2332 > Project: Solr > Issue Type: Bug > Components: contrib - DataImportHandler >Affects Versions: 4.0 >Reporter: Jayendra Patil > Attachments: SOLR-2332.patch, solr-word.zip > > > Extraction of Zip files using TikaEntityProcessor results in only names of > file. > It does not extract the contents of the Files in the Zip -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org