Better handling of delegating parser implementations
----------------------------------------------------
Key: TIKA-238
URL: https://issues.apache.org/jira/browse/TIKA-238
Project: Tika
Issue Type: Improvement
Components: config, parser
Reporter: Jukka Zitting
Assignee: Jukka Zitting
Priority: Minor
Fix For: 0.4
Delegating Parser implementations (i.e. ones that call another Parser instance
to process some parts of the input document) like the PackageParser subclasses
currently simply instantiate a new AutoDetectParser() whenever a more specific
delegate parser has not been specified. This is troublesome as it 1) makes
customization more difficult and 2) performs poorly since a new
AutoDetectParser (with a new freshly parsed copy of the default configuration)
is currently instantiated whenever such a delegating parser is used.
Instead of working around the issue with things like keeping a static copy of
the pre-parsed default configuration, I'd like to solve this issue by
introducing a new DelegatingParser abstraction for this and hooking it to the
configuration mechanism.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.