Tika --extract fails for DOC
----------------------------

Key: TIKA-873
URL: https://issues.apache.org/jira/browse/TIKA-873
Project: Tika
Issue Type: Bug
Components: general
Affects Versions: 1.0
Environment: Windows 7 + Java v1.6
Reporter: Albert L.
Fix For: 1.2

A file that is embedded in a DOC file doesn't get extracted to disk. To "embed" a file into a DOC, simply drag and drop it into the document when using MS Word 2010. Word will then create an EMF preview of the embedded file.

See this link for an example: http://dl.dropbox.com/u/2490783/embedded.doc
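
To make the report easier to check, here is a minimal reproduction sketch. It runs the Tika command line with --extract against a document and reports which files appeared afterwards. Several things here are assumptions and not taken from the report: the class and method names are hypothetical, the jar path is whatever you pass as the first argument, and the sketch assumes that extracted files land in the process's working directory. It uses ProcessBuilder.inheritIO(), so it needs Java 7 or later.

```java
import java.io.File;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper for reproducing the report; not part of Tika.
class TikaExtractRepro {

    // Builds the command line. The jar path is supplied by the caller (assumption).
    static List<String> buildCommand(String tikaJar, String docPath) {
        return new ArrayList<String>(
                Arrays.asList("java", "-jar", tikaJar, "--extract", docPath));
    }

    // Returns the names of the entries directly inside dir (empty if unreadable).
    static Set<String> listNames(File dir) {
        Set<String> names = new TreeSet<String>();
        String[] entries = dir.list();
        if (entries != null) {
            names.addAll(Arrays.asList(entries));
        }
        return names;
    }

    // Returns the names present in 'after' but not in 'before'.
    static Set<String> newFiles(Set<String> before, Set<String> after) {
        Set<String> added = new TreeSet<String>(after);
        added.removeAll(before);
        return added;
    }

    public static void main(String[] args) throws IOException, InterruptedException {
        if (args.length < 2) {
            System.err.println("usage: TikaExtractRepro <tika-app.jar> <document>");
            return;
        }
        // Assumption: extracted files are written to the working directory.
        File workDir = new File(".").getAbsoluteFile();
        Set<String> before = listNames(workDir);

        ProcessBuilder pb = new ProcessBuilder(buildCommand(args[0], args[1]));
        pb.directory(workDir);
        pb.inheritIO(); // show Tika's own output and errors
        int exitCode = pb.start().waitFor();

        Set<String> added = newFiles(before, listNames(workDir));
        System.out.println("exit code: " + exitCode);
        System.out.println("new files: " + added);
    }
}
```

If the behaviour described above is present, the list of new files would be expected to lack the embedded file itself; an empty list only shows that nothing was written to the working directory, which is why that assumption matters.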