Tim Allison created TIKA-1528:
---------------------------------

             Summary: Add an OverrideDetector that overrides other detectors
                 Key: TIKA-1528
                 URL: https://issues.apache.org/jira/browse/TIKA-1528
             Project: Tika
          Issue Type: Improvement
            Reporter: Tim Allison
            Priority: Minor


While working on TIKA-1511, I found a need to bypass our current detection 
mechanism.  I think that there are other use cases for this.  The idea is that 
a client or a tika-internal call wants to specify the Content-Type for a 
document and bypass the regular mime detection chain.

We currently have the TypeDetector that returns the "Content-Type" as specified 
in the Metadata, but there are two deficiencies in using that class for this 
purpose:
* Content-Type is ambiguous, currently, when it comes into a Parser or 
Detector, it could be used as a hint or as a direction.  I'd like the 
OverrideDetector to use a different metadata key from our usual "Content-Type.
* The ordering of the TypeDetector is based on alphabetic order of its class 
name.  I'd like the OverrideDetector to be run first and then short 
circuit/bypass the other detectors.





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to