[ https://issues.apache.org/jira/browse/CONNECTORS-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Karl Wright reassigned CONNECTORS-1679: --------------------------------------- Assignee: Julien Massiera > HTML Extractor: output has escaped entities > ------------------------------------------- > > Key: CONNECTORS-1679 > URL: https://issues.apache.org/jira/browse/CONNECTORS-1679 > Project: ManifoldCF > Issue Type: Bug > Components: HTML extractor > Affects Versions: ManifoldCF 2.20 > Reporter: Julien Massiera > Assignee: Julien Massiera > Priority: Major > Fix For: ManifoldCF 2.21 > > Attachments: patch-CONNECTORS-1679.txt > > > The output of the HTML extractor is generated with escaped entities (eg '&' > becomes '& amp ;'), which is not the wanted behavior as we want this > connector to extract text from HTML as it is -- This message was sent by Atlassian Jira (v8.20.1#820001)