[ https://issues.apache.org/jira/browse/NUTCH-577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Chris A. Mattmann updated NUTCH-577: ------------------------------------ Due Date: 30/Nov/07 (was: 30/Nov/07) Fix Version/s: (was: 1.1) - pushing this out per http://bit.ly/c7tBv9 > Use explicit tika-config.xml file to enable mime magic detection to be turned > on and off > ---------------------------------------------------------------------------------------- > > Key: NUTCH-577 > URL: https://issues.apache.org/jira/browse/NUTCH-577 > Project: Nutch > Issue Type: Improvement > Components: mime_type_detector > Affects Versions: 1.0.0 > Environment: Mac Book Pro Intel Core Duo 2.0 Ghz, 2. 0 GB RAM, Mac OS > X 10.4, although improvement is indep. of env. > Reporter: Chris A. Mattmann > Assignee: Chris A. Mattmann > Priority: Minor > > Currently, there is a configuration file for Tika (which the trunk in Nutch > uses for its mime type detection) called "tika-config.xml" left unexposed (a > default one lives in the tika-0.1-dev.jar file). Tika's mime system has two > config files it relies on: tika-mimetypes.xml (which Nutch has its own > version of, that overrides the version that comes with the tika jar file), > and tika-config.xml (to turn on or off magic char detection). We should > probably have a nutch version of tika-config.xml, so that Nutch users can > employ magic char mime detection. I'll get going on this in the next day or > so. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.