[ https://issues.apache.org/jira/browse/TIKA-526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13178166#comment-13178166 ]
Fabian Lange commented on TIKA-526: ----------------------------------- This has been fixed for 1.1 by Upgading to POI 3.8 beta 5. Perhaps somebody with more power than me can mark this accordingly. > OOXMLParser fails to extract text from within smart tags > -------------------------------------------------------- > > Key: TIKA-526 > URL: https://issues.apache.org/jira/browse/TIKA-526 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 0.7 > Reporter: Geoff Jarrad > Attachments: smarttag-snippet.docx > > > Documents in the .docx format may contain smart-tags (of element type > w:smartTag). Such a smart-tag will surround the tagged text (found in element > w:r). > The OOXMLParser does not extract the text contained within smart-tags. > [Example document to follow] -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira