[ https://issues.apache.org/jira/browse/NUTCH-577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Chris A. Mattmann updated NUTCH-577:
------------------------------------
Due Date: 30/Nov/07 (was: 30/Nov/07)
Fix Version/s: (was: 1.1)
- pushing this out per http://bit.ly/c7tBv9
> Use explicit tika-config.xml file to enable mime magic detection to be turned
> on and off
> ----------------------------------------------------------------------------------------
>
> Key: NUTCH-577
> URL: https://issues.apache.org/jira/browse/NUTCH-577
> Project: Nutch
> Issue Type: Improvement
> Components: mime_type_detector
> Affects Versions: 1.0.0
> Environment: MacBook Pro, Intel Core Duo 2.0 GHz, 2.0 GB RAM, Mac OS
> X 10.4, although the improvement is independent of the environment.
> Reporter: Chris A. Mattmann
> Assignee: Chris A. Mattmann
> Priority: Minor
>
> Currently, Tika (which Nutch trunk uses for its mime type detection) has a
> configuration file called "tika-config.xml" that Nutch leaves unexposed (a
> default one lives in the tika-0.1-dev.jar file). Tika's mime system relies
> on two config files: tika-mimetypes.xml (Nutch has its own version, which
> overrides the one that comes with the Tika jar file) and tika-config.xml
> (which turns magic character detection on or off). We should probably ship
> a Nutch version of tika-config.xml so that Nutch users can turn magic-based
> mime detection on and off. I'll get going on this in the next day or so.
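
For readers unfamiliar with the term, here is a minimal sketch of what turning magic detection on and off means in practice. It is an illustration only, not Tika's or Nutch's code: the function detect_mime, its magic flag, and both lookup tables are hypothetical names invented for this example.

```python
from pathlib import PurePosixPath

# Hypothetical magic-number table: leading bytes -> MIME type.
MAGIC_TABLE = [
    (b"%PDF-", "application/pdf"),
    (b"\x89PNG\r\n\x1a\n", "image/png"),
    (b"GIF8", "image/gif"),
]

# Hypothetical extension table, used when magic detection is off
# or when no magic prefix matches.
EXTENSION_TABLE = {
    ".pdf": "application/pdf",
    ".png": "image/png",
    ".gif": "image/gif",
    ".txt": "text/plain",
}

DEFAULT_TYPE = "application/octet-stream"


def detect_mime(name, data, magic=True):
    """Return a MIME type; inspect the leading bytes only if `magic` is set."""
    if magic:
        for prefix, mime in MAGIC_TABLE:
            if data.startswith(prefix):
                return mime
    # Fall back to the file name's extension.
    suffix = PurePosixPath(name).suffix.lower()
    return EXTENSION_TABLE.get(suffix, DEFAULT_TYPE)
```

With the flag on, a PDF served under a misleading name such as "report.txt" is identified from its leading bytes; with the flag off, the same content is classified by its extension alone. That difference is the behaviour a user-visible configuration switch would control.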
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.