Hi Ray & all,

Searching the issue tracker, I found that an issue already exists for this:
https://issues.apache.org/jira/browse/TIKA-90

Maybe now is the time to implement it.

Thanks,
Hong-Thai

-----Original Message-----
From: Ray Gauss II [mailto:ray.ga...@alfresco.com]
Sent: Wednesday, January 8, 2014 11:49
To: dev@tika.apache.org
Subject: Re: Extract thumbnail from openxml office files

Hi Hong-Thai,

It’s certainly worth investigating. Several other formats can have embedded thumbnails as well, so we could implement a generic thumbnail property.

We could probably store it as something like a Base64-encoded string, but we’d likely want to place limits on the size and may need a thumbnail internet media type field as well to assist in decoding.

Unless others feel differently, I would say open a JIRA where we could start discussing the design of such a feature.

Thanks!

Ray

On January 8, 2014 at 5:36:32 AM, Hong-Thai Nguyen (hong-thai.ngu...@polyspot.com) wrote:

> Hi all,
> I want to extract the thumbnail image included in Open XML office files.
> Apparently, we can do it with openxml4j:
> http://openxmldeveloper.org/blog/b/openxmldeveloper/archive/2006/11/21/openxmlandjava.aspx
> The question is: should we include the thumbnail in the default metadata list of the OOXML parsing result?
>
> Thanks
>
> Hong-Thai
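
For illustration only, here is a minimal Java sketch of the idea Ray outlines in the thread: a thumbnail stored as a Base64-encoded string next to a media type field, with a size limit. Everything specific in it is an assumption made for the example: the property names `thumbnail:data` and `thumbnail:mediaType` are hypothetical (they are not existing Tika metadata keys), the size cap is arbitrary, and a plain `java.util.Map` stands in for the real metadata object. It is not a proposal for the actual design.

```java
import java.util.Base64;
import java.util.LinkedHashMap;
import java.util.Map;

public class ThumbnailSketch {
    // Hypothetical property names, chosen for this example only.
    static final String THUMBNAIL_DATA = "thumbnail:data";
    static final String THUMBNAIL_MEDIA_TYPE = "thumbnail:mediaType";

    /**
     * Returns metadata entries for a thumbnail, or an empty map when the
     * image is missing or larger than maxBytes (the size limit).
     */
    static Map<String, String> toMetadata(byte[] imageBytes, String mediaType, int maxBytes) {
        Map<String, String> metadata = new LinkedHashMap<>();
        if (imageBytes == null || imageBytes.length == 0 || imageBytes.length > maxBytes) {
            return metadata; // skip absent or oversized thumbnails
        }
        metadata.put(THUMBNAIL_DATA, Base64.getEncoder().encodeToString(imageBytes));
        metadata.put(THUMBNAIL_MEDIA_TYPE, mediaType);
        return metadata;
    }

    /** Decodes the thumbnail bytes back out of the metadata, or null if absent. */
    static byte[] fromMetadata(Map<String, String> metadata) {
        String encoded = metadata.get(THUMBNAIL_DATA);
        return encoded == null ? null : Base64.getDecoder().decode(encoded);
    }

    public static void main(String[] args) {
        byte[] fakeImage = {1, 2, 3}; // stand-in bytes, not a real image
        Map<String, String> metadata = toMetadata(fakeImage, "image/png", 64 * 1024);
        System.out.println(metadata);
    }
}
```

The media type entry is what lets a consumer decide how to interpret the decoded bytes, which is the reason the thread suggests carrying it alongside the encoded data.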