Takumi Fujiwara wrote:
I am using NekoHTML Parser, I think it translates
entity references during parsing, is it possible to
turn OFF that feature (e.g. if it sees in the
text node, leave it as )?
Currently, it reports *all* character content which
means converting to its character equivalent.
However, you can set a feature so that the standard
HTML entity boundaries are reported[1]. This would,
for example, create a DOM entity ref node for "nbsp"
and the others.
Would this help your problem?
[1] http://www.apache.org/~andyc/neko/doc/html/settings.html
"http://cyberneko.org/html/features/scanner/notify-builtin-refs"
--
Andy Clark * [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]