Hong-Thai Nguyen created TIKA-1223:
--------------------------------------
Summary: Extract thumbnail of OOXML Office files
Key: TIKA-1223
URL: https://issues.apache.org/jira/browse/TIKA-1223
Project: Tika
Issue Type: Improvement
Components: parser
Affects Versions: 1.4
Reporter: Hong-Thai Nguyen
Priority: Minor
Fix For: 1.5
Starting with the Microsoft Office 2007 file formats (OOXML), a thumbnail can be
included in the package. We can extract this embedded thumbnail from OOXML files.
As discussed on the mailing list, we should extract the thumbnail as an
attachment, not as metadata (TIKA-90).
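The embeddedRelationId format is thumbnail_{i}.{extension}, where {i} is the
thumbnail's index and {extension} is the file extension of the thumbnail part.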
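
To illustrate the naming scheme, here is a minimal sketch using only java.util.zip. It is not the Tika parser code. It assumes that the thumbnail part sits under docProps/ with a name starting with "thumbnail" (a common Office layout; strictly, the part is identified by the package's thumbnail relationship) and that {i} starts at 0. The class and method names are hypothetical.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical sketch class, not part of Tika.
class OoxmlThumbnailSketch {

    // Builds an ID of the form thumbnail_{i}.{extension} from a part name.
    static String relationId(int index, String partName) {
        int dot = partName.lastIndexOf('.');
        String ext = dot >= 0 ? partName.substring(dot + 1) : "";
        return "thumbnail_" + index + "." + ext;
    }

    // Scans an OOXML (ZIP) stream and returns thumbnail bytes keyed by relation ID.
    // Assumption: thumbnails are stored as docProps/thumbnail.* entries.
    static Map<String, byte[]> extractThumbnails(InputStream ooxml) throws IOException {
        Map<String, byte[]> result = new LinkedHashMap<>();
        try (ZipInputStream zip = new ZipInputStream(ooxml)) {
            ZipEntry entry;
            int index = 0; // assumption: {i} is zero-based
            while ((entry = zip.getNextEntry()) != null) {
                String name = entry.getName();
                String lower = name.toLowerCase(Locale.ROOT);
                if (entry.isDirectory() || !lower.startsWith("docprops/thumbnail")) {
                    continue;
                }
                // Read the bytes of the current entry only.
                ByteArrayOutputStream out = new ByteArrayOutputStream();
                byte[] buffer = new byte[4096];
                int n;
                while ((n = zip.read(buffer)) != -1) {
                    out.write(buffer, 0, n);
                }
                result.put(relationId(index++, name), out.toByteArray());
            }
        }
        return result;
    }
}
```

A parser would hand each returned entry to its embedded-document handling as an attachment rather than copying it into the metadata.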