[ https://issues.apache.org/jira/browse/CONNECTORS-200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13036784#comment-13036784 ]
Erlend Garåsen commented on CONNECTORS-200:
---
I checked out the latest from trunk and did a test crawl with documents I know
will return a TikaException due to the following Tika bug:
https://issues.apache.org/jira/browse/TIKA-418
The job ended successfully and MCF did not try to fetch the affected documents
over and over again even though TikaExceptions were thrown. In other words, it
seems to work as it should now.
Solr connector should treat TikaException the same as a 400 response
Key: CONNECTORS-200
URL: https://issues.apache.org/jira/browse/CONNECTORS-200
Project: ManifoldCF
Issue Type: Improvement
Components: Lucene/SOLR connector
Affects Versions: ManifoldCF 0.1, ManifoldCF 0.2, ManifoldCF 0.3
Reporter: Karl Wright
Assignee: Karl Wright
Solr connector should treat TikaException the same as a 400 response, which
is to skip the document.
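
A minimal sketch of the requested behavior, for illustration only. It is not
the ManifoldCF Solr connector's actual code: the class name, the Outcome enum,
and the classify method are all hypothetical. It also assumes that a Tika
failure reaches the connector as a non-2xx response whose body mentions
"TikaException"; the issue text does not say how the exception is surfaced.

```java
public class SolrResponseSketch {
    /** What the caller should do with a document after an indexing attempt. */
    enum Outcome { ACCEPTED, SKIP_DOCUMENT, RETRY_LATER }

    /**
     * Hypothetical classifier. Assumption: a Tika parse failure shows up as a
     * non-2xx response whose body contains the text "TikaException".
     */
    static Outcome classify(int httpStatus, String responseBody) {
        if (httpStatus >= 200 && httpStatus < 300) {
            return Outcome.ACCEPTED;
        }
        boolean tikaFailure =
            responseBody != null && responseBody.contains("TikaException");
        // A 400 and a TikaException both mean the document itself is the
        // problem, so retrying would fail again: skip it.
        if (httpStatus == 400 || tikaFailure) {
            return Outcome.SKIP_DOCUMENT;
        }
        // Anything else is treated as possibly transient.
        return Outcome.RETRY_LATER;
    }

    public static void main(String[] args) {
        System.out.println(classify(200, "OK"));
        System.out.println(classify(400, "Bad Request"));
        System.out.println(classify(500, "org.apache.tika.exception.TikaException"));
        System.out.println(classify(503, "Service Unavailable"));
    }
}
```

Skipping rather than retrying is what keeps a job from fetching the same
unparseable document over and over, which is the symptom described in the
comment above.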