[ https://issues.apache.org/jira/browse/TIKA-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Reinhard Schwab updated TIKA-1500: ---------------------------------- Attachment: TIKA-1500.patch Patch, which contains the trivial fix. > FeedParser extracts XML markup with BodyContentHandler > ------------------------------------------------------ > > Key: TIKA-1500 > URL: https://issues.apache.org/jira/browse/TIKA-1500 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.6 > Reporter: Reinhard Schwab > Priority: Minor > Fix For: 1.8 > > Attachments: TIKA-1500.patch > > > I am using FeedParser to extract text and links from feeds and have > discovered, that the extracted text contains XML markup. > Usually FeedParser strips markup from text when generating SAX events, > but one line is missing it. > The fix is trivial. I will provide a patch. -- This message was sent by Atlassian JIRA (v6.3.4#6332)