Re: Can/Does/Should the FeedValidator catch improperly escaped XHTML?

2006-09-02 Thread Asbjørn Ulsberg


On Fri, 01 Sep 2006 01:19:35 +0200, James Holderness  
[EMAIL PROTECTED] wrote:


Just because a feed appears to contain well-formed xhtml content today,  
that doesn't mean it's going to be well-formed tomorrow. Encouraging  
people to use xhtml when they don't know enough to have made that  
decision themselves is just asking for trouble. Sooner or later they're  
going to end up with broken xml which will be completely unreadable in  
many aggregators.


Well, that completely depends on how we present this information to the  
user. I was not thinking of a simple message saying This is valid XHTML,  
change @type to 'xhtml' ASAP, you newb!, but more along the lines of  
informing the user that the content looks like wellformed XHTML, that he  
can read more about it here and there, why he should try to keep it  
wellformed and why he should not change it if he's not sure of its  
welformedness.


Something like that.


Also, escaped html tends to be better supported by aggregators anyway.


That's today. I would hope and assume that this changes in the future.

--
Asbjørn Ulsberg -=|=-http://virtuelvis.com/quark/
«He's a loathsome offensive brute, yet I can't look away»



Re: Can/Does/Should the FeedValidator catch improperly escaped XHTML?

2006-09-02 Thread M. David Peterson
 That's today. I would hope and assume that this changes in the future.+1On 9/2/06, Asbjørn Ulsberg 
[EMAIL PROTECTED] wrote:On Fri, 01 Sep 2006 01:19:35 +0200, James Holderness
[EMAIL PROTECTED] wrote: Just because a feed appears to contain well-formed xhtml content today, that doesn't mean it's going to be well-formed tomorrow. Encouraging
 people to use xhtml when they don't know enough to have made that decision themselves is just asking for trouble. Sooner or later they're going to end up with broken xml which will be completely unreadable in
 many aggregators.Well, that completely depends on how we present this information to theuser. I was not thinking of a simple message saying This is valid XHTML,change @type to 'xhtml' ASAP, you newb!, but more along the lines of
informing the user that the content looks like wellformed XHTML, that hecan read more about it here and there, why he should try to keep itwellformed and why he should not change it if he's not sure of its
welformedness.Something like that. Also, escaped html tends to be better supported by aggregators anyway.That's today. I would hope and assume that this changes in the future.--Asbjørn Ulsberg -=|=-
http://virtuelvis.com/quark/«He's a loathsome offensive brute, yet I can't look away»-- /M:DM. David Peterson
http://mdavid.name | http://www.oreillynet.com/pub/au/2354


Re: Can/Does/Should the FeedValidator catch improperly escaped XHTML?

2006-08-31 Thread Sam Ruby


M. David Peterson wrote:
via 
http://www.oreillynet.com/xml/blog/2006/08/and_the_winner_of_the_best_ind_1.html#comment-75533


There's one other small problem though: they put XHTML as CDATA in
html text constructs, while they're supposed to contain HTML 4.
And since it's XHTML, they should embed it directly in xhtml
constructs...

Anthony brings out a good point  
http://www.oreillynet.com/xml/blog/2006/08/and_the_winner_of_the_best_ind_1.html#comment-75822 
,


Odd that the validator isn't saying anything about this.

Should it, or is this an edge case that can be difficult, at best, to catch?


At the moment, the HTML content is passed through the following:

http://docs.python.org/lib/module-HTMLParser.html

Note that this parser includes a handle_startendtag method, which is not 
a part of the HTML standard.  Given the rather loose nature of HTML, 
this only tends to catch things like unmatched angle brackets and quotes.


Also, there are a number of tools that attempt to produce well-formed 
XHTML, but don't do so consistently enough to drop the content into an 
Atom feed in such a manner.


- Sam Ruby



Re: Can/Does/Should the FeedValidator catch improperly escaped XHTML?

2006-08-31 Thread M. David Peterson
Thanks for the response/info Sam!On 8/31/06, Sam Ruby [EMAIL PROTECTED] wrote:
Also, there are a number of tools that attempt to produce well-formedXHTML, but don't do so consistently enough to drop the content into anAtom feed in such a manner.I assume this is from an XHTML standpoint (meaning the result isn't a one-to-one HTML  XHTML ~equivalent) and not an well formed XML standpoint? Not that it really matters all that much as if the result isn't a solid representation of the authors original intent, then theres really no other alternative than to escape it and tag it as HTML.
In short, unless the above is incorrect, then both Anthony and the FeedValidator are doing all that they both can and should be doing.Is this a fair statement?-- /M:DM. David Peterson
http://mdavid.name | http://www.oreillynet.com/pub/au/2354


Re: Can/Does/Should the FeedValidator catch improperly escaped XHTML?

2006-08-31 Thread Asbjørn Ulsberg


On Thu, 31 Aug 2006 13:50:56 +0200, Sam Ruby [EMAIL PROTECTED]  
wrote:


Also, there are a number of tools that attempt to produce well-formed  
XHTML, but don't do so consistently enough to drop the content into an  
Atom feed in such a manner.


But is it possible to check the wellformedness of the markup inside  
atom:content and suggest that the authors use 'type=xhtml' instead of  
'type=html' if it is indeed wellformed X(HT)ML? It's a real stretch in  
terms of how far the validator should go, but I can definately see its  
usefulness.


--
Asbjørn Ulsberg -=|=-http://virtuelvis.com/quark/
«He's a loathsome offensive brute, yet I can't look away»



Re: Can/Does/Should the FeedValidator catch improperly escaped XHTML?

2006-08-31 Thread M. David Peterson
+1 - Definitely would be useful as a suggestion.On 8/31/06, Asbjørn Ulsberg [EMAIL PROTECTED] wrote:
On Thu, 31 Aug 2006 13:50:56 +0200, Sam Ruby 
[EMAIL PROTECTED]wrote: Also, there are a number of tools that attempt to produce well-formed XHTML, but don't do so consistently enough to drop the content into an Atom feed in such a manner.
But is it possible to check the wellformedness of the markup insideatom:content and suggest that the authors use 'type=xhtml' instead of'type=html' if it is indeed wellformed X(HT)ML? It's a real stretch in
terms of how far the validator should go, but I can definately see itsusefulness.--Asbjørn Ulsberg -=|=-http://virtuelvis.com/quark/«He's a loathsome offensive brute, yet I can't look away»
-- /M:DM. David Petersonhttp://mdavid.name | http://www.oreillynet.com/pub/au/2354



Re: Can/Does/Should the FeedValidator catch improperly escaped XHTML?

2006-08-31 Thread James Holderness


Asbjørn Ulsberg wrote:
But is it possible to check the wellformedness of the markup inside 
atom:content and suggest that the authors use 'type=xhtml' instead of 
'type=html' if it is indeed wellformed X(HT)ML? It's a real stretch in 
terms of how far the validator should go, but I can definately see its 
usefulness.


-1

Just because a feed appears to contain well-formed xhtml content today, that 
doesn't mean it's going to be well-formed tomorrow. Encouraging people to 
use xhtml when they don't know enough to have made that decision themselves 
is just asking for trouble. Sooner or later they're going to end up with 
broken xml which will be completely unreadable in many aggregators.


Also, escaped html tends to be better supported by aggregators anyway.

Regards
James



Re: Can/Does/Should the FeedValidator catch improperly escaped XHTML?

2006-08-31 Thread A. Pagaltzis

* James Holderness [EMAIL PROTECTED] [2006-09-01 01:30]:
 Encouraging people to use xhtml when they don't know enough to
 have made that decision themselves is just asking for trouble.

+1

Regards,
-- 
Aristotle Pagaltzis // http://plasmasturm.org/



Re: Can/Does/Should the FeedValidator catch improperly escaped XHTML?

2006-08-31 Thread M. David Peterson
Both points well taken.I would like to change my initial vote to -1, though +1 for both James and Aristotle's follow-up.On 8/31/06, A. Pagaltzis
 [EMAIL PROTECTED] wrote:* James Holderness 
[EMAIL PROTECTED] [2006-09-01 01:30]: Encouraging people to use xhtml when they don't know enough to have made that decision themselves is just asking for trouble.
+1Regards,--Aristotle Pagaltzis // http://plasmasturm.org/-- /M:DM. David Peterson
http://mdavid.name | http://www.oreillynet.com/pub/au/2354