Takumi Fujiwara wrote:
I am using NekoHTML Parser, I think it translates
entity references during parsing, is it possible to
turn OFF that feature (e.g. if it sees   in the
text node, leave it as  )?

Currently, it reports *all* character content which means converting   to its character equivalent. However, you can set a feature so that the standard HTML entity boundaries are reported[1]. This would, for example, create a DOM entity ref node for "nbsp" and the others.

Would this help your problem?

[1] http://www.apache.org/~andyc/neko/doc/html/settings.html
    "http://cyberneko.org/html/features/scanner/notify-builtin-refs";

--
Andy Clark * [EMAIL PROTECTED]


--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to