[ https://issues.apache.org/jira/browse/TIKA-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13193124#comment-13193124 ]
Nick Burch commented on TIKA-850:
---------------------------------
Currently, the objects set onto the ParseContext are:
* Detector.class
* DocumentSelector.class
* EmbeddedDocumentExtractor.class
* Locale.class
* MimeConfig.class
* Parser.class
The ones set onto the Metadata for use by parsers are:
* RESOURCE_NAME_KEY (resourceName)
* CONTENT_TYPE (Content-Type)
* PASSWORD (org.apache.pdfbox.tika.password) *PDF Only*
* TIKA_MIME_FILE (tika.mime.file)
* MIME_TYPE_MAGIC (mime.type.magic)
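To illustrate the difference between the two mechanisms listed above, here is a minimal sketch of how a parser might look up a password from either place. It deliberately uses simplified stand-in classes rather than the real Tika ones: SketchMetadata, SketchContext, SketchPasswordProvider and resolvePassword are all hypothetical names made up for this example. Only the key string org.apache.pdfbox.tika.password is taken from the list above. It assumes a type-keyed context is checked first, with the string-keyed metadata entry as a fallback; that ordering is an assumption, not a decided design.

```java
import java.util.HashMap;
import java.util.Map;

// Simplified stand-in for a string-keyed metadata bag (NOT the real Tika class).
class SketchMetadata {
    private final Map<String, String> values = new HashMap<>();
    void set(String key, String value) { values.put(key, value); }
    String get(String key) { return values.get(key); }
}

// Simplified stand-in for a type-keyed context (NOT the real Tika class).
class SketchContext {
    private final Map<Class<?>, Object> objects = new HashMap<>();
    <T> void set(Class<T> key, T value) { objects.put(key, value); }
    // Returns null when nothing was registered for the given type.
    <T> T get(Class<T> key) { return key.cast(objects.get(key)); }
}

// Hypothetical callback that a parser could look up in the context.
interface SketchPasswordProvider {
    String getPassword(SketchMetadata metadata);
}

public class PasswordSketch {
    // Key string as listed above for the PDF parser.
    static final String PDF_PASSWORD_KEY = "org.apache.pdfbox.tika.password";

    // Assumed lookup order: a provider in the context wins,
    // otherwise fall back to the special metadata key.
    static String resolvePassword(SketchMetadata metadata, SketchContext context) {
        SketchPasswordProvider provider = context.get(SketchPasswordProvider.class);
        if (provider != null) {
            return provider.getPassword(metadata);
        }
        return metadata.get(PDF_PASSWORD_KEY);
    }

    public static void main(String[] args) {
        SketchMetadata metadata = new SketchMetadata();
        metadata.set(PDF_PASSWORD_KEY, "from-metadata");
        SketchContext context = new SketchContext();

        // No provider registered yet: the metadata key is used.
        System.out.println(resolvePassword(metadata, context));

        // Register a provider by type: it now takes precedence.
        SketchPasswordProvider provider = m -> "from-context";
        context.set(SketchPasswordProvider.class, provider);
        System.out.println(resolvePassword(metadata, context));
    }
}
```

The point of the sketch is only that a type-keyed lookup is not tied to any one parser's key name, whereas the metadata route depends on a format-specific string key.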
> Consistent way to supply document passwords to parsers
> ------------------------------------------------------
>
> Key: TIKA-850
> URL: https://issues.apache.org/jira/browse/TIKA-850
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.0
> Reporter: Nick Burch
>
> Currently, PDF document passwords are supplied to the parser via a special
> key on the Metadata object, while the Office Parser has a TODO and only
> supports the default password.
> We should update all the parsers that support encrypted documents (currently
> PDF, Office OLE2 and Office OOXML) to receive the password in a consistent way.