Hi,

I was looking at implementing link extraction for Excel files, and
found out that the link information is only available at the end of
the file as a special "cell X links to URI Y" record. The parser could
just slap such link records as artificial <a/> tags at the end of the
produced XHTML SAX stream, but for properly associating the <a/> tags
with the correct text would require dropping the streaming feature.

Should we consider dropping the streaming parser, or provide an
alternative parser that reads the whole document to provide better
output? Note that some time ago we dropped the non-streaming parser in
favor of the (then better) streaming parser contributed by Niall. If
we decide to maintain alternative parsers, which one should be the
default?

BR,

Jukka Zitting

Reply via email to