[ https://issues.apache.org/jira/browse/TIKA-1232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13893380#comment-13893380 ]
Tim Allison commented on TIKA-1232:
-----------------------------------

Interesting. Thank you, [~johanvanderknijff] and [~anjackson]. I personally like "Extended-Content-Type", but following (http://wiki.apache.org/tika/MetadataRoadmap), is there someone more familiar with Dublin Core and/or XMP who could recommend appropriate tags? Many apologies if either one of those recommends "Extended-Content-Type" :).

> Add PDF version to PDFParser output
> -----------------------------------
>
>                 Key: TIKA-1232
>                 URL: https://issues.apache.org/jira/browse/TIKA-1232
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.5
>         Environment: JDK6
>            Reporter: William Palmer
>            Assignee: Tim Allison
>            Priority: Minor
>         Attachments: pdfversion.patch
>
>
> I'd like to identify the PDF version of files, this is not currently reported
> by the PDFParser although the information is available via PDFBox. I have
> attached a patch that adds the format version to the Metadata object.
> However, I am not familiar enough with the Tika source to know if an
> alternative metadata key should be used, or this new one added.
> Comments welcome.

--
This message was sent by Atlassian JIRA
(v6.1.5#6160)