[ 
https://issues.apache.org/jira/browse/TIKA-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12879515#action_12879515
 ] 

Jukka Zitting commented on TIKA-442:
------------------------------------

I'd go with XMP as much as possible. XMP leverages Exif for image metadata, and 
the most relevant fields are probably:

    tiff:ImageLength
    tiff:ImageWidth
    tiff:SamplesPerPixel
    tiff:BitsPerSample


> Image extractors use inconsistent metadata keys and formats for common 
> features
> -------------------------------------------------------------------------------
>
>                 Key: TIKA-442
>                 URL: https://issues.apache.org/jira/browse/TIKA-442
>             Project: Tika
>          Issue Type: Improvement
>          Components: metadata, parser
>    Affects Versions: 0.7
>            Reporter: Nick Burch
>            Priority: Minor
>
> Currently Tika has a number of parsers for image formats, but the way they 
> return their data is inconsistent. For example:
> Jpeg: "Image Width" = "420 pixels", "Data Precision" = "8 bits"
> Gif: "width" = "420"
> Png: "width" = "420", "IHDR" = ".... bitDepth = 8 ....."
> Bmp: "width" = "420", "BitsPerSample" = "8 8 8"
> I think that the common keys, such as width and height, should be returned in 
> a consistent format of key and value. If someone would like to suggest the 
> namespace for this (maybe under XMDPM), and the short or long form (eg 420 vs 
> 420 pixels), then I'm happy to work up a patch for this

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to