[ https://issues.apache.org/jira/browse/TIKA-872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jukka Zitting updated TIKA-872:
-------------------------------

    Fix Version/s:     (was: 1.3)
       Issue Type: New Feature  (was: Bug)

Our RTF parser doesn't yet support embedded documents, so this would be a new feature. See the RTFParser and TextExtractor classes in o.a.t.parser.rtf inside the tika-parsers components for the place where something like this should be implemented.

> Tika --extract fails for RTF
> ----------------------------
>
>                 Key: TIKA-872
>                 URL: https://issues.apache.org/jira/browse/TIKA-872
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 1.0
>         Environment: Windows 7 with Java v1.6
>            Reporter: Albert L.
>         Attachments: embedded.rtf.zip
>
>
> A file that is embedded in an RTF file doesn't get extracted to disk.
> To "embed" a file into an RTF, simply drag-drop it into an RTF document when
> using MS-Word 2010.  It will then create an EMF of the embedded file's
> preview.
> See attached file "embedded.rtf.zip" for an example input file that fails
> with Tika v1.0.
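
For anyone picking this up: in RTF, an embedded object sits in an \object group, and its bytes are stored as hexadecimal text in the \objdata destination. The sketch below is not Tika code and not how RTFParser or TextExtractor work; it is a standalone illustration of the decoding step that embedded-document support would need at a minimum. The class and method names (RtfObjDataSketch, extractObjData) are hypothetical, and it assumes the \objdata group contains only hex digits and whitespace, with no nested groups or control words.

```java
import java.io.ByteArrayOutputStream;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Apache Tika.
class RtfObjDataSketch {

    /**
     * Returns the decoded bytes of every \objdata destination in the RTF text.
     * Assumption: each \objdata group holds only hex digits and whitespace.
     */
    static List<byte[]> extractObjData(String rtf) {
        List<byte[]> payloads = new ArrayList<>();
        String marker = "\\objdata";
        int pos = rtf.indexOf(marker);
        while (pos >= 0) {
            int i = pos + marker.length();
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            int high = -1; // pending high nibble, or -1 if none
            // Read hex digits until the enclosing group closes.
            while (i < rtf.length() && rtf.charAt(i) != '}') {
                int digit = Character.digit(rtf.charAt(i), 16);
                if (digit >= 0) {
                    if (high < 0) {
                        high = digit;
                    } else {
                        out.write((high << 4) | digit);
                        high = -1;
                    }
                }
                // Anything that is not a hex digit (spaces, line breaks) is skipped.
                i++;
            }
            payloads.add(out.toByteArray());
            pos = rtf.indexOf(marker, i);
        }
        return payloads;
    }

    public static void main(String[] args) {
        // Tiny synthetic input; a real file such as the attached example is far larger.
        String rtf = "{\\rtf1{\\object\\objemb{\\*\\objdata 48 65\n6c6C6f}}}";
        for (byte[] payload : extractObjData(rtf)) {
            System.out.println("decoded " + payload.length + " bytes");
        }
    }
}
```

Note that the decoded bytes are still the OLE wrapper that Word writes around the object, so recovering the original file would need a further unwrapping step that this sketch does not attempt.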