* Eric Scheid <[EMAIL PROTECTED]> [2006-03-23 18:05]: >It's true that XML has only a half dozen or so entities defined, >meaning most interesting entities from html can't exist in XML >... unless maybe they are wrapped like in CDATA block like >above?
No, a CDATA block simply means that characters like <, & and > stand for themselves. >I'm getting the data by scraping an html page, so I'm expecting >it to be acceptable html code, including html entities. Then decode the entities to a Unicode string and emit the feed as Unicode. Simplest thing that will work reliably. Regards, -- Aristotle Pagaltzis // <http://plasmasturm.org/>