[ 
https://issues.apache.org/jira/browse/TIKA-858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Craig Stires updated TIKA-858:
------------------------------

    Attachment: 7901V5.pdf

Attaching the specification document for the ANPA formats [7901V5.pdf].
It covers the start-of-header sequence used for mime-type recognition, as well
as the spec for the rest of the document structure.
                
> Tika add parsing support for ANPA-1312 news wire feeds
> ------------------------------------------------------
>
>                 Key: TIKA-858
>                 URL: https://issues.apache.org/jira/browse/TIKA-858
>             Project: Tika
>          Issue Type: New Feature
>          Components: mime, parser
>    Affects Versions: 0.10
>            Reporter: Craig Stires
>         Attachments: 7901V5.pdf, IptcAnpaParser.java, 
> org.apache.tika.parser.Parser_ANPA.patch, tika-mimetypes_ANPA.patch
>
>
> This submission adds support for ANPA-1312 news wire feeds.
> ANPA-1312 is the format used by AP, AFP, NYT, and Reuters in their daily news 
> wire broadcasts.
> This was a pretty significant development effort, so I am happy to share it 
> back as a thank you to the Tika community.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
