On Mon, 23 Mar 2009 11:25:19 +0100, Julian Reschke <julian.resc...@gmx.de> wrote:
Anne van Kesteren wrote:
Be careful; depending on what you call "Web content". For instance, I would consider the Atom feed content (RFC4287) as "Web content", but Atom really uses IRIs, and doesn't need workarounds for broken IRIs in content (as far as I can tell).
Are you sure browser implementations of feeds reject non-IRIs in some way? I would expect them to use the same URL handling everywhere.

I wasn't talking of "browser implementations of feeds", but feed readers in general.

Well yes, and a subset of those is browser based. Besides that, most feed readers handle HTML. Do you think they should have two separate URL parsing functions?


Don't leak out workarounds into areas where they aren't needed.
I'm not convinced that having two ways of handling essentially the same thing is good.

It's unavoidable, as the relaxed syntax doesn't work in many cases, for instance, when whitespace acts as a delimiter.

Obviously you would first split on whitepace and then parse the URLs. You can still use the same generic URL handling.


--
Anne van Kesteren
http://annevankesteren.nl/

Reply via email to