[
https://issues.apache.org/jira/browse/TIKA-309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann resolved TIKA-309.
------------------------------------
Resolution: Fixed
- fixed in r836035:
* was able to correctly identify RDF/OWL mime types using magic by changing
regex pattern for localName in MimeTypes.java (in the case where only the
<ns:localName..... is read, but there is no ">" at the end since we only read N
first bytes of the magic header)
* added unit tests and URLs from this issue for regression
* refactored o.a.tika.mime.MimeDetectionTest to support URLs as InputStreams
(as well as Files)
* took out <match value="<!--" type="string" offset="0"/> for HTML detection
since comments can appear in HTML, XML, etc., and aren't specific to HTML
> Mime type application/rdf+xml not correctly detected
> ----------------------------------------------------
>
> Key: TIKA-309
> URL: https://issues.apache.org/jira/browse/TIKA-309
> Project: Tika
> Issue Type: Bug
> Components: mime
> Affects Versions: 0.5
> Reporter: Yuan-Fang Li
> Assignee: Chris A. Mattmann
> Priority: Minor
> Fix For: 0.5
>
>
> Mime type detector using AutoDetectParser and Metadata returns
> "application/xml" for the URL http://www.w3.org/2002/07/owl#, where it should
> be "application/rdf+xml". The correct mime type is also suggested here:
> http://www.w3.org/TR/owl-ref/#MIMEType.
> P.S., Tika was downloaded from svn and built with Maven last week.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.