Re: atom:name ... text or html?

2006-03-23 Thread A. Pagaltzis
* Eric Scheid <[EMAIL PROTECTED]> [2006-03-23 23:30]: >Oh well, now to track down a list of html entities and their >corresponding unicodes ... That would be in the spec. http://www.w3.org/TR/REC-html40/sgml/entities.html But you shouldn’t have to. Any self-respecting language has a library for

Re: atom:name ... text or html?

2006-03-23 Thread Tim Bray
On Mar 23, 2006, at 2:20 PM, Eric Scheid wrote: Oh well, now to track down a list of html entities and their corresponding unicodes ... http://www.google.com/search?q=xhtml%20entities

Re: atom:name ... text or html?

2006-03-23 Thread Eric Scheid
On 24/3/06 4:42 AM, "A. Pagaltzis" <[EMAIL PROTECTED]> wrote: >> I'm getting the data by scraping an html page, so I'm expecting >> it to be acceptable html code, including html entities. > > Then decode the entities to a Unicode string and emit the feed as > Unicode. Simplest thing that will wo

Re: atom:name ... text or html?

2006-03-23 Thread Tim Bray
On Mar 23, 2006, at 8:16 AM, Eric Scheid wrote: If I have an author with the name "Bertrand Café", is it acceptable to put that into atom:author like this; or should I be using the unicode numeric entity instead? The key point is that the atom:name element, described in RFC4287 3.2

Re: atom:name ... text or html?

2006-03-23 Thread Tim Bray
On Mar 23, 2006, at 8:57 AM, Eric Scheid wrote: On 24/3/06 3:21 AM, "Anne van Kesteren" <[EMAIL PROTECTED]> wrote: Even if it was "HTML" you couldn't really use the entity, could you? I think you have to use a character reference or the actual character instead, yes. It's true

Re: atom:name ... text or html?

2006-03-23 Thread Tim Bray
On Mar 23, 2006, at 8:01 AM, Sylvain Hellegouarch wrote: Seriously though, the atom:name element is described as "a human- readable name", Do you mean that "human-readable" is equivalent to solely English? Because as a French, having accents in names is so natural that I see it as "hu

Re: atom:name ... text or html?

2006-03-23 Thread James Holderness
David Powell wrote: [Hmm, internal DTD subsets completely fail in IE7's feed reader, throwing up a "friendly error message"] If I remember correctly they considered that a feature. Something to do with DTDs being a security risk. I'm not sure if this also meant they were incapable of process

Re: atom:name ... text or html?

2006-03-23 Thread Antone Roundy
On Mar 23, 2006, at 9:48 AM, James Holderness wrote: Hahaha! It's RSS all over again. In the words of Mark Pilgrim: "Here's something that might be HTML. Or maybe not. I can't tell you, and you can't guess." :-) Seriously though, the atom:name element is described as "a human- readable nam

Re: atom:name ... text or html?

2006-03-23 Thread A. Pagaltzis
* Sylvain Hellegouarch <[EMAIL PROTECTED]> [2006-03-23 18:15]: >Do you mean that "human-readable" is equivalent to solely >English? Because as a French, having accents in names is so >natural that I see it as "human readable" too ;) Even as a French, you probably write é, not é. :-) Regards, --

Re: atom:name ... text or html?

2006-03-23 Thread A. Pagaltzis
* Eric Scheid <[EMAIL PROTECTED]> [2006-03-23 18:05]: >It's true that XML has only a half dozen or so entities defined, >meaning most interesting entities from html can't exist in XML >... unless maybe they are wrapped like in CDATA block like >above? No, a CDATA block simply means that character

Re: atom:name ... text or html?

2006-03-23 Thread Stephane Bortzmeyer
On Thu, Mar 23, 2006 at 05:01:03PM +0100, Sylvain Hellegouarch <[EMAIL PROTECTED]> wrote a message of 11 lines which said: > Because as a French, having accents in names is so natural that I > see it as "human readable" too ;) As I wrote and used and tested on my blog, there is no problem in

Re: atom:name ... text or html?

2006-03-23 Thread David Powell
Thursday, March 23, 2006, 4:57:11 PM, you wrote: > On 24/3/06 3:21 AM, "Anne van Kesteren" <[EMAIL PROTECTED]> wrote: >>> >>> >> Even if it was "HTML" you couldn't really use the entity, could you? I think >> you have to use a character reference or the actual character instead, yes. >> > I

Re: atom:name ... text or html?

2006-03-23 Thread Stephane Bortzmeyer
On Fri, Mar 24, 2006 at 03:16:18AM +1100, Eric Scheid <[EMAIL PROTECTED]> wrote a message of 10 lines which said: > or should I be using the unicode numeric entity instead? Or the character itself, in UTF-8 or any other encoding (but UTF-8 is the most widely implemented, so you limit the risk

Re: atom:name ... text or html?

2006-03-23 Thread James Holderness
Sylvain Hellegouarch wrote: Do you mean that "human-readable" is equivalent to solely English? Because as a French, having accents in names is so natural that I see it as "human readable" too ;) No. I mean that the literal sequence of characters "& e a c u t e ;" is not human-readable (or at

Re: atom:name ... text or html?

2006-03-23 Thread Sylvain Hellegouarch
Seriously though, the atom:name element is described as "a human-readable name", Do you mean that "human-readable" is equivalent to solely English? Because as a French, having accents in names is so natural that I see it as "human readable" too ;) - Sylvain

Re: atom:name ... text or html?

2006-03-23 Thread A. Pagaltzis
* Eric Scheid <[EMAIL PROTECTED]> [2006-03-23 17:30]: >If I have an author with the name "Bertrand Café", is it >acceptable to put that into atom:author like this; > > No. That means the author’s name is Bertrand Café (he must have had very cruel parents), not Bertrand Café. >or should I be

Re: atom:name ... text or html?

2006-03-23 Thread Eric Scheid
On 24/3/06 3:21 AM, "Anne van Kesteren" <[EMAIL PROTECTED]> wrote: >> >> > Even if it was "HTML" you couldn't really use the entity, could you? I think > you have to use a character reference or the actual character instead, yes. > It's true that XML has only a half dozen or so entities defin

Re: atom:name ... text or html?

2006-03-23 Thread James Holderness
Hahaha! It's RSS all over again. In the words of Mark Pilgrim: "Here's something that might be HTML. Or maybe not. I can't tell you, and you can't guess." :-) Seriously though, the atom:name element is described as "a human-readable name", so unless your name really is "Betrand Caf&eacture;"

Re: atom:name ... text or html?

2006-03-23 Thread James M Snell
+1 to what Anne says. If I received that Atom author name, I would display it exactly as presented "Bertrand Café" - James Anne van Kesteren wrote: > > Quoting Eric Scheid <[EMAIL PROTECTED]>: >> If I have an author with the name "Bertrand Café", is it acceptable to >> put >> that into atom:au

Re: atom:name ... text or html?

2006-03-23 Thread Anne van Kesteren
Quoting Eric Scheid <[EMAIL PROTECTED]>: If I have an author with the name "Bertrand Café", is it acceptable to put that into atom:author like this; or should I be using the unicode numeric entity instead? Even if it was "HTML" you couldn't really use the entity, could you? I think you

atom:name ... text or html?

2006-03-23 Thread Eric Scheid
If I have an author with the name "Bertrand Café", is it acceptable to put that into atom:author like this; or should I be using the unicode numeric entity instead? e.