[ 
https://issues.apache.org/jira/browse/TIKA-509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jukka Zitting resolved TIKA-509.
--------------------------------

       Resolution: Fixed
    Fix Version/s: 1.0

Resolving as fixed as discussed above.
                
> Container contents extraction
> -----------------------------
>
>                 Key: TIKA-509
>                 URL: https://issues.apache.org/jira/browse/TIKA-509
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>    Affects Versions: 0.7
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>            Priority: Minor
>             Fix For: 1.0
>
>         Attachments: 0001-TIKA-509-Container-contents-extraction.patch
>
>
> As discussed on the mailing list:
> http://mail-archives.apache.org/mod_mbox/tika-dev/201009.mbox/%3calpine.deb.1.10.1009010000250.5...@urchin.earth.li%3E
> This service will operate in a push mode, using streaming where possible (not 
> all container formats will support that). Users can control recursion, and 
> will be given the chance to process each embeded file in turn. It's up to 
> them if they process a file or skip it.
> It will work similar to the current Parser code, with each container having 
> its own extractor in the parsers package, and the interface defined in the 
> core package. There will be an Auto extractor in the core package, configured 
> with a list of parser extractors just like AutoDetectParser does.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to