[ https://issues.apache.org/jira/browse/SOLR-1902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tommaso Teofili updated SOLR-1902:
----------------------------------

    Attachment: SOLR1902_patch_to_141.txt

> Tika no longer properly extracts content in Solr
> ------------------------------------------------
>
>                 Key: SOLR-1902
>                 URL: https://issues.apache.org/jira/browse/SOLR-1902
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - Solr Cell (Tika extraction)
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>             Fix For: 4.0
>
>         Attachments: SOLR1902_patch_to_141.txt
>
>
> See
> http://www.lucidimagination.com/search/document/2ca3fe953038a54f/problem_with_pdf_upgrading_cell#22360c8261801f24
> It appears that since the upgrade to Tika 0.7, Tika is now selecting an
> EmptyParser when uploading docs, which then outputs an empty XHTML
> representation. Still, it's strange that the tests pass.