[ https://issues.apache.org/jira/browse/TIKA-1438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lewis John McGibbney updated TIKA-1438:
---------------------------------------
    Attachment: TIKA-1438.patch

Patch for trunk, actively validating it right now.

> PhoneExtractingContentHandler to not add individual MD entries for individual phone numbers
> -------------------------------------------------------------------------------------------
>
> Key: TIKA-1438
> URL: https://issues.apache.org/jira/browse/TIKA-1438
> Project: Tika
> Issue Type: Bug
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Priority: Minor
> Fix For: 1.7
>
> Attachments: TIKA-1438.patch
>
>
> Right now we have the PhoneExtractingContentHandler adding phone numbers as
> individual metadata entries. I feel that this is cumbersome.
> An example would be that we have a webpage with phone numbers on it, we then
> have many fields of the same type with different values!
> I propose we reverse this and have one field with multiple values.
> I would fully understand the current behaviour if we wished to augment the
> phone numbers further by associating dialing code, country, carrier, etc.,
> however we are not currently doing this.
> Patch coming for trunk.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
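
For readers following the issue, the difference between the two layouts can be sketched as below. This is only an illustration using plain Java maps, not the attached patch and not Tika's Metadata class. The class name PhoneMetadataSketch, the field name "phonenumbers", and the numbered-key scheme used for the individual entries are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class PhoneMetadataSketch {

    // Hypothetical field name, chosen for illustration only.
    static final String PHONE_FIELD = "phonenumbers";

    // One metadata entry per phone number. The numbered key suffix is an
    // assumption; the issue only says the numbers are stored individually.
    static Map<String, String> individualEntries(List<String> numbers) {
        Map<String, String> md = new LinkedHashMap<>();
        for (int i = 0; i < numbers.size(); i++) {
            md.put(PHONE_FIELD + "." + i, numbers.get(i));
        }
        return md;
    }

    // Proposed layout: a single field that holds every extracted number.
    static Map<String, List<String>> singleMultiValuedField(List<String> numbers) {
        Map<String, List<String>> md = new LinkedHashMap<>();
        for (String number : numbers) {
            md.computeIfAbsent(PHONE_FIELD, k -> new ArrayList<>()).add(number);
        }
        return md;
    }

    public static void main(String[] args) {
        List<String> numbers = Arrays.asList("555-0100", "555-0199");
        System.out.println(individualEntries(numbers));
        System.out.println(singleMultiValuedField(numbers));
    }
}
```

With the single-field layout a consumer looks up one key and iterates its values, instead of having to discover how many separate entries were written.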