MJ Ray wrote:
> Personally, I suspect it's not
> possible to write such a "damage-free" fix, else one of the supporters
> who writes such huge essays on the topic would have done it by now.

Are you still talking about the spaces issue here? Replacing literal spaces
with "%20" does not break standards because literal spaces are illegal in
the first place. IOW, this simple fix makes illegal URLs
standards-compliant. No, I'm not going to add this to the Python parser
since I maintain JPluck, but I don't think a simple substitution is that
hard to do in Python.
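
A minimal sketch of that substitution in Python. The helper name
escape_spaces is hypothetical and is not part of the Plucker parser;
stripping surrounding whitespace first is an added assumption, on the
grounds that it is usually just markup noise around the URL.

```python
def escape_spaces(url):
    """Return url with literal spaces replaced by %20.

    Hypothetical helper for illustration only. Everything except
    spaces is left untouched, so existing escapes such as %20 or %7E
    are not encoded a second time.
    """
    # Assumption: whitespace around the URL is not meant to be part of it.
    url = url.strip()
    # The simple search-and-replace itself.
    return url.replace(" ", "%20")


if __name__ == "__main__":
    print(escape_spaces("http://example.com/my file.html"))
```

Since only the space character is rewritten, a URL that is already legal
passes through unchanged.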

The reality is that most HTML pages on the web are broken in one way or
another. I agree with Tony and favor a pragmatic approach over "pure"
compliance. You have to look at problems on a case-by-case basis. In this
instance a simple search-and-replace will help parse quite a number of
broken pages, with no side effects. A win-win situation as far as I can
see.

Discussing the "standards" issue to death is not going to help. The lines
are drawn anyway.


Regards
-Laurens

_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list
