[ https://issues.apache.org/jira/browse/TIKA-121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jukka Zitting updated TIKA-121:
-------------------------------
Attachment: AutoDetectParser.patch
The current mime type registry in Tika is tightly integrated with parser
configuration, and for now I'd prefer to avoid coupling it too tightly with
client code.
I assume you're using the incoming Content-Type header to select (either
manually or via AutoDetectParser) which parser to use, so I'd prefer to put the
relevant code there. See the attached patch (AutoDetectParser.patch) for the
required changes to AutoDetectParser.
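
For illustration only, here is a minimal sketch of the kind of cleanup client
code could apply to an incoming Content-Type value before using it to pick a
parser. It assumes the goal is to reduce the header to a bare, lower-case
"type/subtype" by dropping parameters such as charset. The class
ContentTypeCleaner and its clean method are hypothetical names; they are not
part of Tika and are not taken from the attached patch.

```java
import java.util.Locale;

// Hypothetical helper, not a Tika class: normalizes a raw Content-Type value.
final class ContentTypeCleaner {

    private ContentTypeCleaner() {
    }

    /**
     * Returns the bare "type/subtype" in lower case, or null when the
     * input is null or does not look like a media type.
     */
    static String clean(String contentType) {
        if (contentType == null) {
            return null;
        }
        // Drop optional parameters such as "; charset=UTF-8".
        int semicolon = contentType.indexOf(';');
        String base = semicolon >= 0
                ? contentType.substring(0, semicolon)
                : contentType;
        base = base.trim().toLowerCase(Locale.ROOT);

        // Require exactly one slash with a non-empty part on each side.
        int slash = base.indexOf('/');
        if (slash <= 0
                || slash == base.length() - 1
                || base.indexOf('/', slash + 1) >= 0) {
            return null;
        }
        return base;
    }
}
```

A caller would pass the raw header value through clean() and fall back to
content-based detection when the result is null.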
Looking forward, it might be good to factor such generic code into a standalone
media type package, but as long as our current media type code is tightly
coupled with Tika configuration, I'd prefer to avoid MimeType dependencies
outside configuration code.
> MimeType.clean method no longer exists as a capability
> ------------------------------------------------------
>
> Key: TIKA-121
> URL: https://issues.apache.org/jira/browse/TIKA-121
> Project: Tika
> Issue Type: Bug
> Components: mime
> Affects Versions: 0.1-incubating
> Reporter: Chris A. Mattmann
> Assignee: Chris A. Mattmann
> Fix For: 0.2-incubating
>
> Attachments: AutoDetectParser.patch
>
>
> For some reason, in r591743
> (http://svn.apache.org/viewvc?rev=591743&view=rev), the MimeType.clean
> functionality was removed and never replaced. This is a problem because
> Nutch relied on MimeType.clean, and I'm running into its absence while
> trying to upgrade Nutch to tika-0.1-incubating.
> I've been scratching my head trying to determine an appropriate workaround
> for the same capability within the tika-0.1-incubating code, but have yet to
> find one. This functionality needs to be replaced in some form or fashion,
> or, if someone knows of a simple way to achieve the same functionality,
> please let me know.