* Eric Scheid <[EMAIL PROTECTED]> [2006-03-23 23:30]:
>Oh well, now to track down a list of html entities and their
>corresponding unicodes ...
That would be in the spec.
http://www.w3.org/TR/REC-html40/sgml/entities.html
But you shouldn’t have to. Any self-respecting language has a
library for
On Mar 23, 2006, at 2:20 PM, Eric Scheid wrote:
Oh well, now to track down a list of html entities and
their corresponding unicodes ...
http://www.google.com/search?q=xhtml%20entities
On 24/3/06 4:42 AM, "A. Pagaltzis" <[EMAIL PROTECTED]> wrote:
>> I'm getting the data by scraping an html page, so I'm expecting
>> it to be acceptable html code, including html entities.
>
> Then decode the entities to a Unicode string and emit the feed as
> Unicode. Simplest thing that will wo
On Mar 23, 2006, at 8:16 AM, Eric Scheid wrote:
If I have an author with the name "Bertrand Café", is it acceptable
to put
that into atom:author like this;
or should I be using the unicode numeric entity instead?
The key point is that the atom:name element, described in RFC4287
3.2
On Mar 23, 2006, at 8:57 AM, Eric Scheid wrote:
On 24/3/06 3:21 AM, "Anne van Kesteren" <[EMAIL PROTECTED]>
wrote:
Even if it was "HTML" you couldn't really use the entity, could
you? I think
you have to use a character reference or the actual character
instead, yes.
It's true
On Mar 23, 2006, at 8:01 AM, Sylvain Hellegouarch wrote:
Seriously though, the atom:name element is described as "a human-
readable name",
Do you mean that "human-readable" is equivalent to solely English?
Because as a French, having accents in names is so natural that I
see it as "hu
David Powell wrote:
[Hmm, internal DTD subsets completely fail in IE7's feed reader,
throwing up a "friendly error message"]
If I remember correctly they considered that a feature. Something to do with
DTDs being a security risk. I'm not sure if this also meant they were
incapable of process
On Mar 23, 2006, at 9:48 AM, James Holderness wrote:
Hahaha! It's RSS all over again. In the words of Mark Pilgrim:
"Here's something that might be HTML. Or maybe not. I can't tell
you, and you can't guess." :-)
Seriously though, the atom:name element is described as "a human-
readable nam
* Sylvain Hellegouarch <[EMAIL PROTECTED]> [2006-03-23 18:15]:
>Do you mean that "human-readable" is equivalent to solely
>English? Because as a French, having accents in names is so
>natural that I see it as "human readable" too ;)
Even as a French, you probably write é, not é. :-)
Regards,
--
* Eric Scheid <[EMAIL PROTECTED]> [2006-03-23 18:05]:
>It's true that XML has only a half dozen or so entities defined,
>meaning most interesting entities from html can't exist in XML
>... unless maybe they are wrapped like in CDATA block like
>above?
No, a CDATA block simply means that character
On Thu, Mar 23, 2006 at 05:01:03PM +0100,
Sylvain Hellegouarch <[EMAIL PROTECTED]> wrote
a message of 11 lines which said:
> Because as a French, having accents in names is so natural that I
> see it as "human readable" too ;)
As I wrote and used and tested on my blog, there is no problem in
Thursday, March 23, 2006, 4:57:11 PM, you wrote:
> On 24/3/06 3:21 AM, "Anne van Kesteren" <[EMAIL PROTECTED]> wrote:
>>>
>>>
>> Even if it was "HTML" you couldn't really use the entity, could you? I think
>> you have to use a character reference or the actual character instead, yes.
>>
> I
On Fri, Mar 24, 2006 at 03:16:18AM +1100,
Eric Scheid <[EMAIL PROTECTED]> wrote
a message of 10 lines which said:
> or should I be using the unicode numeric entity instead?
Or the character itself, in UTF-8 or any other encoding (but UTF-8 is
the most widely implemented, so you limit the risk
Sylvain Hellegouarch wrote:
Do you mean that "human-readable" is equivalent to solely English? Because
as a French, having accents in names is so natural that I see it as "human
readable" too ;)
No. I mean that the literal sequence of characters "& e a c u t e ;" is not
human-readable (or at
Seriously though, the atom:name element is described as "a
human-readable name",
Do you mean that "human-readable" is equivalent to solely English?
Because as a French, having accents in names is so natural that I see it
as "human readable" too ;)
- Sylvain
* Eric Scheid <[EMAIL PROTECTED]> [2006-03-23 17:30]:
>If I have an author with the name "Bertrand Café", is it
>acceptable to put that into atom:author like this;
>
>
No. That means the author’s name is Bertrand Café (he must
have had very cruel parents), not Bertrand Café.
>or should I be
On 24/3/06 3:21 AM, "Anne van Kesteren" <[EMAIL PROTECTED]> wrote:
>>
>>
> Even if it was "HTML" you couldn't really use the entity, could you? I think
> you have to use a character reference or the actual character instead, yes.
>
It's true that XML has only a half dozen or so entities defin
Hahaha! It's RSS all over again. In the words of Mark Pilgrim: "Here's
something that might be HTML. Or maybe not. I can't tell you, and you can't
guess." :-)
Seriously though, the atom:name element is described as "a human-readable
name", so unless your name really is "Betrand Caf&eacture;"
+1 to what Anne says. If I received that Atom author name, I would
display it exactly as presented "Bertrand Café"
- James
Anne van Kesteren wrote:
>
> Quoting Eric Scheid <[EMAIL PROTECTED]>:
>> If I have an author with the name "Bertrand Café", is it acceptable to
>> put
>> that into atom:au
Quoting Eric Scheid <[EMAIL PROTECTED]>:
If I have an author with the name "Bertrand Café", is it acceptable to put
that into atom:author like this;
or should I be using the unicode numeric entity instead?
Even if it was "HTML" you couldn't really use the entity, could you? I
think you
If I have an author with the name "Bertrand Café", is it acceptable to put
that into atom:author like this;
or should I be using the unicode numeric entity instead?
e.
21 matches
Mail list logo