Make the Tika facade implement the Parser and Detector interfaces
-
Key: TIKA-710
URL: https://issues.apache.org/jira/browse/TIKA-710
Project: Tika
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/TIKA-710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting updated TIKA-710:
---
Summary: Expose the Parser and Detector instances within the Tika facade
(was: Make the Tika facade im
[
https://issues.apache.org/jira/browse/TIKA-704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101077#comment-13101077
]
Jukka Zitting commented on TIKA-704:
Thanks! I added the test cases in revision 1167052.
[
https://issues.apache.org/jira/browse/TIKA-710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting resolved TIKA-710.
Resolution: Fixed
Done in revision 1167051.
> Expose the Parser and Detector instances within the Ti
[
https://issues.apache.org/jira/browse/TIKA-704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101087#comment-13101087
]
Jukka Zitting commented on TIKA-704:
Hmm, there was still a hidden copy of the Yamaha ma
Word parser doesn't extract optional hyphen correctly
-
Key: TIKA-711
URL: https://issues.apache.org/jira/browse/TIKA-711
Project: Tika
Issue Type: Bug
Components: parser
[
https://issues.apache.org/jira/browse/TIKA-711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless updated TIKA-711:
Attachment: testOptionalHyphen.rtf
testOptionalHyphen.pptx
tes
[
https://issues.apache.org/jira/browse/TIKA-711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101323#comment-13101323
]
Michael McCandless commented on TIKA-711:
-
The WordExtractor seems to receive ASCII
when i want to index video file with nutch 1.3 i get the following error :
*Error parsing: file:///D:/film.avi: failed(2,0): Can't retrieve Tika parser
for
mime-type video/x-msvideo*
(also it is the same error for images file)
and in hadoop log the detail error is:
*parse.ParserFactory - Par