On Thu, 9 Jan 2014, Hong-Thai Nguyen wrote:
> By searching on issues, I found the issue already created: https://issues.apache.org/jira/browse/TIKA-90
I'm not sure the metadata is the right place to return this. Some formats offer a single small thumbnail, others can offer a small thumbnail for every page, and at least one can include a full-size image of the first page.
Would we not be better off exposing these embedded renderings via the existing embedded resources handling, with some sort of handy way to identify what each one is (e.g. "this is a full-size PNG of page 1", "this is a JPEG thumbnail of page 3")?
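As a very rough sketch of the "handy way to identify what something is" part: each embedded rendering could carry a small descriptor saying what kind it is, which page it covers, and its media type. Everything below (`Kind`, `Rendering`, `describe`, `thumbnailsOnly`) is a hypothetical name made up for illustration and is not part of the Tika API. It only shows the shape of the information, not how it would be wired into the embedded resources handling.

```java
import java.util.ArrayList;
import java.util.List;

public class EmbeddedRenderingSketch {

    /** Hypothetical: the kind of rendering an embedded resource represents. */
    enum Kind { THUMBNAIL, FULL_SIZE }

    /** Hypothetical descriptor attached to one embedded rendering. */
    static final class Rendering {
        final Kind kind;
        final int page;          // 1-based page number the rendering shows
        final String mediaType;  // e.g. "image/png" or "image/jpeg"

        Rendering(Kind kind, int page, String mediaType) {
            this.kind = kind;
            this.page = page;
            this.mediaType = mediaType;
        }

        /** Human-readable summary, e.g. "full-size image/png of page 1". */
        String describe() {
            String size = (kind == Kind.FULL_SIZE) ? "full-size" : "thumbnail";
            return size + " " + mediaType + " of page " + page;
        }
    }

    /** Lets a caller pick out only the thumbnails and skip larger renderings. */
    static List<Rendering> thumbnailsOnly(List<Rendering> all) {
        List<Rendering> result = new ArrayList<>();
        for (Rendering r : all) {
            if (r.kind == Kind.THUMBNAIL) {
                result.add(r);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // The two cases mentioned above: a full-size first page and a page 3 thumbnail.
        List<Rendering> found = new ArrayList<>();
        found.add(new Rendering(Kind.FULL_SIZE, 1, "image/png"));
        found.add(new Rendering(Kind.THUMBNAIL, 3, "image/jpeg"));

        for (Rendering r : found) {
            System.out.println(r.describe());
        }
    }
}
```

With something along these lines, a caller that only wants thumbnails can filter on the kind, while one that wants the first-page image can look for a full-size rendering with page 1, without having to guess from the file name or the media type alone.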
Nick