[ https://issues.apache.org/jira/browse/TIKA-858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Craig Stires updated TIKA-858: ------------------------------ Attachment: 7901V5.pdf Attaching the specification docs for the ANPA formats. [7901V5.pdf] This discusses the start of header for mime-type recognition, as well as the spec for how the rest of the document structure. > Tika add parsing support for ANPA-1312 news wire feeds > ------------------------------------------------------ > > Key: TIKA-858 > URL: https://issues.apache.org/jira/browse/TIKA-858 > Project: Tika > Issue Type: New Feature > Components: mime, parser > Affects Versions: 0.10 > Reporter: Craig Stires > Attachments: 7901V5.pdf, IptcAnpaParser.java, > org.apache.tika.parser.Parser_ANPA.patch, tika-mimetypes_ANPA.patch > > > This submission adds support for ANPA-1312 news wire feeds. > Those feeds are the formats used by AP, AFP, NYT, Reuters in their daily news > wire broadcasts. > This was a pretty significant development effort, so am happy to share back > as a thank you to the TIKA community. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira