[ https://issues.apache.org/jira/browse/TIKA-595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Julien Nioche updated TIKA-595:
-------------------------------
    Attachment: TIKA-595.patch

Any reason why we wouldn't want to have multiple values in the metadata if they are present in the HTML doc?

> HtmlHandler does not support multivalue metadata
> ------------------------------------------------
>
>                 Key: TIKA-595
>                 URL: https://issues.apache.org/jira/browse/TIKA-595
>             Project: Tika
>          Issue Type: Bug
>          Components: metadata, parser
>    Affects Versions: 0.8
>            Reporter: Lutz Pumpenmeier
>            Priority: Minor
>         Attachments: TIKA-595.patch
>
>
> The HtmlHandler uses metadata.set(...), so META tags that occur more than
> once are not handled correctly (DublinCore metadata can be set more than
> once).
> The handler should use metadata.add(...) instead.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
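
To illustrate the difference the report describes, here is a minimal sketch of set-versus-add semantics. It assumes a hypothetical stand-in container (MetadataSketch); it is not Tika's Metadata class and not the attached patch. The "dc.creator" name and the two values are made up for the example.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical stand-in for a metadata container; not Tika's Metadata class. */
public class MetadataSketch {
    private final Map<String, List<String>> values = new LinkedHashMap<>();

    /** Replaces any existing values for the name with a single value. */
    public void set(String name, String value) {
        List<String> single = new ArrayList<>();
        single.add(value);
        values.put(name, single);
    }

    /** Appends a value, keeping those already stored under the name. */
    public void add(String name, String value) {
        values.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
    }

    /** Returns a copy of all values stored under the name (empty if none). */
    public List<String> getValues(String name) {
        return new ArrayList<>(values.getOrDefault(name, new ArrayList<>()));
    }

    public static void main(String[] args) {
        // Illustrative values, as if the HTML held two META tags with the same name.
        String[] creators = {"Alice", "Bob"};

        MetadataSketch withSet = new MetadataSketch();
        MetadataSketch withAdd = new MetadataSketch();
        for (String creator : creators) {
            withSet.set("dc.creator", creator); // each call overwrites the previous value
            withAdd.add("dc.creator", creator); // each call appends to the existing values
        }

        System.out.println("set: " + withSet.getValues("dc.creator"));
        System.out.println("add: " + withAdd.getValues("dc.creator"));
    }
}
```

With set, only the value from the last META tag is retained; with add, every occurrence is kept in document order, which is the behaviour the report asks for.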