Tim Allison created TIKA-3698: --------------------------------- Summary: Duplicate subject/description for Outlook msgs Key: TIKA-3698 URL: https://issues.apache.org/jira/browse/TIKA-3698 Project: Tika Issue Type: Task Reporter: Tim Allison
On TIKA-3629, despite our best efforts to simplify and streamline metadata keys, we backed off and continued to include/added back keywords _and_ subject. Another area where we should probably include both includes msg files. POI's msg.getSubject() is going to "dc:title", and msg.getConversationTopic() is going to "dc:description". Along the lines of what we did on TIKA-3629, I propose adding msg.getConversationTopic() also under the key "dc:subject". -- This message was sent by Atlassian Jira (v8.20.1#820001)