[ https://issues.apache.org/jira/browse/TIKA-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15583175#comment-15583175 ]

Hudson commented on TIKA-2122:
------------------------------

FAILURE: Integrated in Jenkins build tika-2.x-windows #63 (See [https://builds.apache.org/job/tika-2.x-windows/63/])
TIKA-2122: Extract all headers from MSG/RFC822 (tallison: rev 30e03de89fd4b21cb91917c72aec12eede761be3)
* (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/OutlookExtractor.java
* (edit) tika-parser-modules/tika-parser-web-module/pom.xml
* (edit) tika-parser-modules/tika-parser-web-module/src/main/java/org/apache/tika/parser/mail/MailContentHandler.java
* (edit) tika-parser-modules/tika-parser-office-module/pom.xml
* (edit) tika-parser-bundles/tika-parser-office-bundle/pom.xml
* (edit) CHANGES.txt
* (edit) tika-core/src/main/java/org/apache/tika/metadata/Message.java
* (edit) tika-parser-modules/pom.xml
* (edit) tika-core/src/test/java/org/apache/tika/TikaTest.java
* (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/OutlookParserTest.java
* (edit) tika-parser-modules/tika-parser-web-module/src/test/java/org/apache/tika/parser/mail/RFC822ParserTest.java


> Extract all email headers from Outlook .msg files into Metadata
> ---------------------------------------------------------------
>
>                 Key: TIKA-2122
>                 URL: https://issues.apache.org/jira/browse/TIKA-2122
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.13
>            Reporter: Chris Knott
>            Priority: Minor
>             Fix For: 2.0, 1.14
>
>         Attachments: msg_raw_headers.xlsx
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> Currently, most email headers are not added to the Metadata when extracting 
> Outlook .msg files.
> http://svn.apache.org/repos/asf/tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/microsoft/OutlookExtractor.java
> The headers ({{msg.getHeaders()}}) are already looped through as a way to 
> estimate the date.
> All headers should be added to Metadata, using the header name with a 
> prefix such as {{"raw-header:"}}.
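
The request in the description can be illustrated with a minimal sketch. This is not the code from the commit above. It assumes the headers arrive as an array of raw "Name: value" lines, and it uses a plain Map as a stand-in for Tika's Metadata object so that it runs without Tika or POI on the classpath. The class name RawHeaderSketch and the method toMetadata are hypothetical; only the "raw-header:" prefix comes from the issue text.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class RawHeaderSketch {
    // Prefix suggested in the issue description.
    static final String PREFIX = "raw-header:";

    // Hypothetical helper: copies every header line into a multi-valued map
    // (a stand-in for Metadata), keyed by PREFIX + header name.
    static Map<String, List<String>> toMetadata(String[] headerLines) {
        Map<String, List<String>> metadata = new LinkedHashMap<>();
        if (headerLines == null) {
            return metadata;
        }
        String lastKey = null;
        for (String line : headerLines) {
            if (line == null || line.isEmpty()) {
                continue;
            }
            // A line starting with whitespace continues the previous header.
            if (Character.isWhitespace(line.charAt(0)) && lastKey != null) {
                List<String> values = metadata.get(lastKey);
                int last = values.size() - 1;
                values.set(last, values.get(last) + " " + line.trim());
                continue;
            }
            int colon = line.indexOf(':');
            if (colon <= 0) {
                continue; // not a "Name: value" line, skip it
            }
            String name = line.substring(0, colon).trim();
            String value = line.substring(colon + 1).trim();
            if (name.isEmpty()) {
                continue;
            }
            lastKey = PREFIX + name;
            // Repeated headers (e.g. Received) keep all of their values.
            metadata.computeIfAbsent(lastKey, k -> new ArrayList<>()).add(value);
        }
        return metadata;
    }

    public static void main(String[] args) {
        String[] headers = {
            "Subject: Quarterly report",
            "Received: from mail.example.org",
            "\tby mx.example.org",
            "Received: from client.example.org"
        };
        toMetadata(headers).forEach((k, v) -> System.out.println(k + " = " + v));
    }
}
```

Keeping a list of values per key means repeated headers such as Received are not overwritten; in real code the same effect would presumably come from adding values to the Metadata object rather than setting them.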



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
