[ 
https://issues.apache.org/jira/browse/TIKA-1865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15167211#comment-15167211
 ] 

Tim Allison commented on TIKA-1865:
-----------------------------------

Good to hear from you, [~lfcnassif]!

I've only looked at this very briefly, but it looks like POI does not currently 
make the sender email address available.  I think the best next step would be 
to figure out how to modify POI to make this info available.  Any interest in 
looking into this?

I did see that the email address exists _sometimes_ in the header {{From:}}, 
and we could pull it out via regex, but several of our test MSG files clearly 
have the sender email in the bytes but have no headers.


> Save sender email address in Outlook MSG metadata
> -------------------------------------------------
>
>                 Key: TIKA-1865
>                 URL: https://issues.apache.org/jira/browse/TIKA-1865
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.11
>         Environment: Windows 7 x64, jre 1.8.0_60 x64
>            Reporter: Luis Filipe Nassif
>
> Sender email address is lost when extracting metadata from Outlook msg files. 
> Currently only sender name is extracted. That is an important information to 
> be extracted for search engines.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to