Charles,
Thanks for getting back to me regarding this issue. I hope that the
patch may be considered, given how small it is (1 line). Without this
patch, my project is unable to use Sanselan. However, with this patch,
my project works perfectly with a large number of images.
Unfortunately, I can't recall which image actually provided the hurdle
that required me to investigate this issue, but I will look into it.
Unfortunately, I am actually leaving on vacation today for a week, so I
will not be able to do this until I get back.
Sanselan is very important to me now, and I use it extensively for it's
support for reading and writing IPTC data. Do you have any plans to
improve Sanselan to make use of Java 5 language features such as
enumerations and generics? I certainly would benefit from an API with
these features.
Cheers,
Jonathan Giles
Charles Matthew Chen wrote:
Hi Jonathan,
Sorry for the delay reply. Thanks for looking into this issue and
contributing a patch. This is a known issue - there seems to be a
large amount of variation in how some IPTC data is encoded. The
possible problem with continuing after find an unknown marker in IPTC
data is that subsequent data might be invalid. I'll look into this
issue more when I get a chance. In the meantime, can you please open
an issue in JIRA and contribute (by attaching it to the issue) a
sample image that demonstrates this issue?
https://issues.apache.org/jira/browse/SANSELAN
Thanks,
Matthew
On Thu, Apr 30, 2009 at 5:10 PM, Jonathan Giles <[email protected]> wrote:
Hi all,
I'm using Sanselan to extract IPTC metadata. Recently I was given an image
that was causing Sanselan to return an empty set when retrieving the
metadata, despite the fact that there is clearly iptc metadata in the image.
I trawled through the Sanselan code and found the problem: it's in
IPTCParser.parseIPTCBlock(...), and is the second 'if' statement within the
while loop. In particular, instead of the following:
if (tagMarker != IPTC_RECORD_TAG_MARKER) {
if (verbose) {
System.out.println("Unexpected record tag marker in IPTC data.");
}
return elements;
}
You should do the following:
if (tagMarker != IPTC_RECORD_TAG_MARKER) {
if (verbose) {
System.out.println("Unexpected record tag marker in IPTC data.");
}
continue;
}
In my case, this was tripping up the continued parsing of the image, as it
was occuring early in the parsing process. If you're interested, the verbose
output of an attempt to parse this file once the little patch above is
applied is shown below. In particular, look at the 'Unexpected record tag
marker in IPTC data.' midway down. I don't know what these values are, and I
am slightly concerned that I may be throwing away good data. Do you have any
thoughts?
Cheers,
Jonathan Giles
Output of verbose run after patch was applied:
==============================
blockType 1002 (0x3ea)
blockSize 6143 (0x17ff)
blockType 1005 (0x3ed)
blockSize 16 (0x10)
blockType 1062 (0x426)
blockSize 14 (0xe)
blockType 1037 (0x40d)
blockSize 4 (0x4)
blockType 1049 (0x419)
blockSize 4 (0x4)
blockType 1011 (0x3f3)
blockSize 9 (0x9)
blockType 1034 (0x40a)
blockSize 1 (0x1)
blockType 10000 (0x2710)
blockSize 10 (0xa)
blockType 1013 (0x3f5)
blockSize 72 (0x48)
blockType 1016 (0x3f8)
blockSize 112 (0x70)
blockType 1032 (0x408)
blockSize 16 (0x10)
blockType 1054 (0x41e)
blockSize 4 (0x4)
blockType 1050 (0x41a)
blockSize 837 (0x345)
blockType 1064 (0x428)
blockSize 12 (0xc)
blockType 1044 (0x414)
blockSize 4 (0x4)
blockType 1036 (0x40c)
blockSize 6151 (0x1807)
blockType 1057 (0x421)
blockSize 85 (0x55)
blockType 1030 (0x406)
blockSize 7 (0x7)
blockType 1028 (0x404)
blockSize 1136 (0x470)
tagMarker 28 (0x1c)
recordNumber 1 (0x1)
tagMarker 0 (0x0)
Unexpected record tag marker in IPTC data.
tagMarker 0 (0x0)
Unexpected record tag marker in IPTC data.
tagMarker 2 (0x2)
Unexpected record tag marker in IPTC data.
tagMarker 0 (0x0)
Unexpected record tag marker in IPTC data.
tagMarker 4 (0x4)
Unexpected record tag marker in IPTC data.
tagMarker 28 (0x1c)
recordNumber 1 (0x1)
tagMarker 90 (0x5a)
Unexpected record tag marker in IPTC data.
tagMarker 0 (0x0)
Unexpected record tag marker in IPTC data.
tagMarker 3 (0x3)
Unexpected record tag marker in IPTC data.
tagMarker 27 (0x1b)
Unexpected record tag marker in IPTC data.
tagMarker 37 (0x25)
Unexpected record tag marker in IPTC data.
tagMarker 71 (0x47)
Unexpected record tag marker in IPTC data.
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 0 (0x0)
ignore record version record! 0
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 5 (0x5)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 25 (0x19)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 40 (0x28)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 55 (0x37)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 80 (0x50)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 85 (0x55)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 103 (0x67)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 105 (0x69)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 110 (0x6e)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 120 (0x78)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 122 (0x7a)
tagMarker 28 (0x1c)
recordNumber 2 (0x2)
recordType 22 (0x16)