[
https://issues.apache.org/jira/browse/TIKA-4381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924631#comment-17924631
]
Tim Allison commented on TIKA-4381:
-----------------------------------
Does anyone have any links/resources for the property ids?
I did what I could here:
[github|https://github.com/apache/tika/blob/TIKA-4381/tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/msg/ExtendedMetadataExtractor.java#L43]
Specifically:
{noformat}
static {
//TODO -- figure out how these differ and how they overlap with other
types
PROPERTIES.put(0x8003, MAPI.APPT_START_TIME);
PROPERTIES.put(0x8005, MAPI.APPT_START_TIME);
PROPERTIES.put(0x8007, MAPI.APPT_START_TIME);
PROPERTIES.put(0x8009, MAPI.APPT_START_TIME);
PROPERTIES.put(0x801b, MAPI.APPT_START_TIME);
PROPERTIES.put(0x8004, MAPI.APPT_END_TIME);
PROPERTIES.put(0x8006, MAPI.APPT_END_TIME);
PROPERTIES.put(0x801c, MAPI.APPT_END_TIME);
PROPERTIES.put(0x8015, MAPI.APPT_END_REPEAT_TIME);
}
{noformat}
I don't see these values here:
[ms-oxprops|https://learn.microsoft.com/en-us/openspecs/exchange_server_protocols/ms-oxprops/f6ab1613-aefe-447d-a49c-18217230b148
]
> Improve extraction of metadata from Appointment/Task msgs
> ---------------------------------------------------------
>
> Key: TIKA-4381
> URL: https://issues.apache.org/jira/browse/TIKA-4381
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
>
> Our metadata extraction on msgs is mostly focused on "NOTE"/regular emails.
> We could do to improve extraction from appointments, tasks and other msg
> types.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)