[ https://issues.apache.org/jira/browse/TIKA-1971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15287559#comment-15287559 ]

Hudson commented on TIKA-1971:
------------------------------

SUCCESS: Integrated in tika-trunk-jdk1.7 #993 (See [https://builds.apache.org/job/tika-trunk-jdk1.7/993/])
TIKA-1971 - add another magic for rfc822 (tallison: rev e08d00654a77267686a112e0665c324ef041c033)
* tika-parsers/src/test/resources/test-documents/testRFC822_eml
* tika-parsers/src/test/java/org/apache/tika/parser/mail/RFC822ParserTest.java
* tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
* tika-parsers/src/test/resources/test-documents/testRFC822_date_utf8


> Email saved as .eml with no body not detected as rfc822, while same email 
> saved as plain txt is.
> ------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-1971
>                 URL: https://issues.apache.org/jira/browse/TIKA-1971
>             Project: Tika
>          Issue Type: Bug
>          Components: detector
>    Affects Versions: 1.14
>         Environment: Debian Jessie
> Java(TM) SE Runtime Environment (build 1.8.0_91-b14)
> Mac OSX Mail
>            Reporter: Philipp Steinkrueger
>            Priority: Minor
>             Fix For: 2.0, 1.14
>
>         Attachments: Testemail-empty-doesnotwork.eml, 
> Testemail-empty-works.txt
>
>
> I save an email with no body text in two ways:
> (1) by dragging it from Mac Mail, so that an .eml file is created;
> (2) by using 'Save As' in Mac Mail, so that a .txt file is created.
> I then feed each file to Tika Server with the following command:
> curl -T filename http://localhost:9998/detect/stream
> In case (1) the response is text/plain, while in case (2) the response is 
> message/rfc822. This is unexpected, since (1) includes the full raw headers, 
> while (2) includes only a heavily abbreviated header.
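
For anyone who wants to repeat the check without curl, the upload in the report can be sketched in Python. This is only a sketch under stated assumptions: the helper names build_detect_request and detect are hypothetical, the URL is the one quoted in the report, and the explicit application/octet-stream Content-Type is an assumption added here (curl -T sends no Content-Type of its own, whereas urllib would otherwise default to a form-encoded type).

```python
import urllib.request

# URL taken from the report: Tika Server on localhost, port 9998.
DETECT_URL = "http://localhost:9998/detect/stream"


def build_detect_request(payload: bytes, url: str = DETECT_URL) -> urllib.request.Request:
    """Build a PUT request carrying the raw file bytes, as `curl -T` does.

    Hypothetical helper for illustration only. The Content-Type header is an
    assumption: it stops urllib from adding its form-encoded default.
    """
    request = urllib.request.Request(url, data=payload, method="PUT")
    request.add_header("Content-Type", "application/octet-stream")
    return request


def detect(path: str, url: str = DETECT_URL) -> str:
    """Upload the file at `path` and return the response body as text.

    Requires a running Tika Server at `url`.
    """
    with open(path, "rb") as fh:
        request = build_detect_request(fh.read(), url)
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8").strip()
```

With a server running and the two attachments saved locally, calling detect("Testemail-empty-doesnotwork.eml") and detect("Testemail-empty-works.txt") would let the two responses be compared side by side.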



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
