* Eric Scheid <[EMAIL PROTECTED]> [2006-03-23 18:05]:
>It's true that XML has only a half dozen or so entities defined,
>meaning most interesting entities from html can't exist in XML
>... unless maybe they are wrapped like in CDATA block like
>above?

No, a CDATA block simply means that characters like <, & and >
stand for themselves.

>I'm getting the data by scraping an html page, so I'm expecting
>it to be acceptable html code, including html entities.

Then decode the entities to a Unicode string and emit the feed as
Unicode. Simplest thing that will work reliably.

Regards,
-- 
Aristotle Pagaltzis // <http://plasmasturm.org/>

Reply via email to