Hi, I was looking at implementing link extraction for Excel files, and found out that the link information is only available at the end of the file as a special "cell X links to URI Y" record. The parser could just slap such link records as artificial <a/> tags at the end of the produced XHTML SAX stream, but for properly associating the <a/> tags with the correct text would require dropping the streaming feature.
Should we consider dropping the streaming parser, or provide an alternative parser that reads the whole document to provide better output? Note that some time ago we dropped the non-streaming parser in favor of the (then better) streaming parser contributed by Niall. If we decide to maintain alternative parsers, which one should be the default? BR, Jukka Zitting