Re: Entity Reference in NekoHTML parser

Andy Clark 21 Aug 2002 17:13:48 -0000

Takumi Fujiwara wrote:

I am using NekoHTML Parser, I think it translates
entity references during parsing, is it possible to
turn OFF that feature (e.g. if it sees &nbsp; in the
text node, leave it as &nbsp;)?


Currently, it reports *all* character content which
means converting &nbsp; to its character equivalent.
However, you can set a feature so that the standard
HTML entity boundaries are reported[1]. This would,
for example, create a DOM entity ref node for "nbsp"
and the others.

Would this help your problem?

[1] http://www.apache.org/~andyc/neko/doc/html/settings.html
    "http://cyberneko.org/html/features/scanner/notify-builtin-refs";

--
Andy Clark * [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Entity Reference in NekoHTML parser

Reply via email to