[ https://issues.apache.org/jira/browse/TIKA-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jukka Zitting resolved TIKA-232.
--------------------------------
Resolution: Duplicate
Assignee: Jukka Zitting
With TIKA-238 resolved, the former case above is now the default:
Parser parser = new ZipParser();
And the latter case is much simpler:
TikaConfig config = TikaConfig.getDefaultConfig(); // without a delegate parser
Parser parser = new AutoDetectParser(config);
Resolving this as a Duplicate of TIKA-238.
> Scanning of archive files
> -------------------------
>
> Key: TIKA-232
> URL: https://issues.apache.org/jira/browse/TIKA-232
> Project: Tika
> Issue Type: New Feature
> Components: parser
> Affects Versions: 0.3
> Environment: All
> Reporter: Karl Heinz Marbaise
> Assignee: Jukka Zitting
> Priority: Minor
>
> If I parse an archive, all the files inside the archive are extracted along
> with their text. It would be nice to have the option to extract only the
> list of files (directory) of an archive instead of the whole contents.
> This seems applicable only to zip, tar, tar.gz, tar.bz2 and .jar archives.
> Maybe this could be realized through a different call or a runtime
> configuration.
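
For illustration only, the "list of files" behaviour requested above can be sketched with the JDK's own java.util.zip classes. This is not Tika code and does not show how Tika parsers work; the class name ArchiveListing and the method listEntryNames are hypothetical, and the sketch covers only ZIP-based archives (zip, jar), not tar variants.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical helper, not part of Tika: collects the entry names of a
// ZIP stream without parsing the text of the entries themselves.
final class ArchiveListing {
    private ArchiveListing() {}

    static List<String> listEntryNames(InputStream in) throws IOException {
        List<String> names = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(in)) {
            ZipEntry entry;
            // getNextEntry() skips any unread data of the previous entry,
            // so entry contents are never handed to a parser here.
            while ((entry = zip.getNextEntry()) != null) {
                names.add(entry.getName());
            }
        }
        return names;
    }
}
```

Names are returned in the order the entries appear in the stream; directory entries, if the archive stores them, show up with a trailing slash.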