[ https://issues.apache.org/jira/browse/TIKA-1865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15890786#comment-15890786 ]
Tim Allison commented on TIKA-1865:
-----------------------------------

[~mcaruanagalizia], I've added quite a few more Metadata keys under Office and Message for the sender, and I've updated the MSG parser. I still need to update the other message parsers.

I'm not thrilled with putting the MAPI-specific metadata items in the Office object (perhaps a separate class to handle them?), and I don't like the divide between MAPI and Message, but there really are some things that are specific to MAPI and don't apply to RFC.

I added individual keys for the components of exchange addresses {{"/o=blah/ou=blah/cn=recipients/cn=actual name"}}.

Let me know what you think.

As a side note, we just switched from Apache's git to GitHub. We haven't re-calibrated Jenkins, so there isn't a nightly build yet. For now, you'll have to grab the source from GitHub and build it yourself.

> Save sender email address in Outlook MSG metadata
> -------------------------------------------------
>
>                 Key: TIKA-1865
>                 URL: https://issues.apache.org/jira/browse/TIKA-1865
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>   Affects Versions: 1.11
>         Environment: Windows 7 x64, jre 1.8.0_60 x64
>            Reporter: Luis Filipe Nassif
>         Attachments: report.xlsx
>
>
> Sender email address is lost when extracting metadata from Outlook msg files.
> Currently only the sender name is extracted. The address is important
> information for search engines to extract.
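
For readers unfamiliar with the exchange address format quoted in the comment, here is a minimal sketch of how such a string could be split into its components. It is not the Tika implementation: the class name {{ExchangeAddressSketch}} and the method {{parse}} are hypothetical, and the sketch assumes that component values never contain a "/" character.

```java
import java.util.AbstractMap.SimpleImmutableEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper for illustration only; not part of Tika.
class ExchangeAddressSketch {

    /**
     * Splits an address such as "/o=blah/ou=blah/cn=recipients/cn=actual name"
     * into ordered (type, value) pairs. A list is used rather than a map
     * because a type such as "cn" can appear more than once.
     * Assumption: values do not contain '/'.
     */
    public static List<Map.Entry<String, String>> parse(String address) {
        List<Map.Entry<String, String>> parts = new ArrayList<>();
        for (String segment : address.split("/")) {
            int eq = segment.indexOf('=');
            if (eq <= 0) {
                // Skips the empty segment before the leading '/' and
                // any segment that lacks a "type=" prefix.
                continue;
            }
            String type = segment.substring(0, eq).trim().toLowerCase(Locale.ROOT);
            String value = segment.substring(eq + 1).trim();
            parts.add(new SimpleImmutableEntry<>(type, value));
        }
        return parts;
    }

    public static void main(String[] args) {
        String address = "/o=blah/ou=blah/cn=recipients/cn=actual name";
        for (Map.Entry<String, String> part : parse(address)) {
            System.out.println(part.getKey() + " -> " + part.getValue());
        }
    }
}
```

Keeping the pairs in order preserves the distinction between the two {{cn}} components, which a plain map keyed by type would collapse.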