[ 
https://issues.apache.org/jira/browse/NUTCH-2606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16648903#comment-16648903
 ] 

ASF GitHub Bot commented on NUTCH-2606:
---------------------------------------

sebastian-nagel opened a new pull request #392: NUTCH-2606 MIME detection is 
wrong for plain-text documents sent as Content-Type "application/msword"
URL: https://github.com/apache/nutch/pull/392
 
 
   - allow text/plain (from MIME magic) to override the type derived from the 
HTTP Content-Type header or the file extension
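
The rule described above can be illustrated with a minimal sketch. It is 
not the code from the pull request: the class MimeOverrideSketch and the 
method resolve are hypothetical names, and the sketch assumes both inputs are 
already normalized MIME type strings without parameters such as charset.

```java
// Hypothetical illustration only; this is not Nutch's or Tika's API.
final class MimeOverrideSketch {

    private MimeOverrideSketch() {}

    /**
     * Picks the effective MIME type.
     *
     * @param declaredType type derived from the HTTP Content-Type header or
     *                     the file extension (may be null or empty)
     * @param magicType    type detected from the content bytes via MIME magic
     *                     (may be null)
     */
    static String resolve(String declaredType, String magicType) {
        // text/plain found by magic wins over whatever was declared.
        if ("text/plain".equals(magicType)) {
            return "text/plain";
        }
        // Otherwise keep the declared type when there is one.
        if (declaredType != null && !declaredType.isEmpty()) {
            return declaredType;
        }
        // Fall back to the magic result, or to a generic binary type.
        return magicType != null ? magicType : "application/octet-stream";
    }

    public static void main(String[] args) {
        // A plain-text file served as application/msword.
        System.out.println(resolve("application/msword", "text/plain"));
        // A real Word document keeps its declared type.
        System.out.println(resolve("application/msword", "application/msword"));
    }
}
```

In this sketch only text/plain is allowed to override the declared type; 
any other magic result is used solely as a fallback when nothing was declared.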



> MIME detection is wrong for plain-text documents sent as Content-Type 
> "application/msword"
> ------------------------------------------------------------------------------------------
>
>                 Key: NUTCH-2606
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2606
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 1.14
>            Reporter: Sebastian Nagel
>            Priority: Minor
>             Fix For: 1.16
>
>
> Nutch tries to parse plain-text documents sent with Content-Type 
> "application/msword" as Word documents. The MIME detection should be fixed 
> so that these are correctly identified as plain-text documents. See 
> NUTCH-2603 and 
> https://www.atnf.csiro.au/computing/software/gipsy/doc/update.doc



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
