[ https://issues.apache.org/jira/browse/TIKA-2694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16558393#comment-16558393 ]
Celpan Valeria commented on TIKA-2694: -------------------------------------- Okay, thank you > "From" headers is not always extracted correctly on msg mails > ------------------------------------------------------------- > > Key: TIKA-2694 > URL: https://issues.apache.org/jira/browse/TIKA-2694 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.17 > Environment: CentOS 7 > Windows 10 > Reporter: Celpan Valeria > Priority: Major > Attachments: Fw Anime User Analysis.msg > > > For some emails we get instead of the email address for "From" field a value > which looks like `/O=SONY/OU=EXCHANGE ADMINISTRATIVE GROUP > (FYDIBOHF23SPDLT)/CN=RECIPIENTS/CN=EBERGER`. > The issue seems to be connected to the library > `org.apache.poi:poi-scratchpad:3.17` as when running > `org.apache.tika.parser.microsoft.OutlookExtractor::OutlookExtractor(DirectoryNode, > ParserContext)` we get `this.msg.mainChunks.allChunks.SenderEmailAddress = > "/O=SONY/OU=EXCHANGE ADMINISTRATIVE GROUP > (FYDIBOHF23SPDLT)/CN=RECIPIENTS/CN=EBERGER"`. > Check attachment to reproduce this defect. -- This message was sent by Atlassian JIRA (v7.6.3#76005)