[ https://issues.apache.org/jira/browse/TIKA-1089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hong-Thai Nguyen updated TIKA-1089:
-----------------------------------

    Description:
We are using Tika as our main converter of diverse file formats to text and HTML versions in a search engine.
We've collected some documents (46) which Tika cannot convert: http://www.mediafire.com/?60clr812lerx3gy

  was:
We are using Tika as our main converter of diverse file formats to text and HTML versions in a search engine.
We've collected some documents (46) which Tika cannot convert

> Tika conversion failed on following documents
> ---------------------------------------------
>
>                 Key: TIKA-1089
>                 URL: https://issues.apache.org/jira/browse/TIKA-1089
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.3
>         Environment: windows, api
>            Reporter: Hong-Thai Nguyen
>              Labels: test
>         Attachments: crawler.log
>
>
> We are using Tika as our main converter of diverse file formats to text and HTML
> versions in a search engine.
> We've collected some documents (46) which Tika cannot convert:
> http://www.mediafire.com/?60clr812lerx3gy