On Fri, 2007-05-18 at 14:29 +0200, Jörg Sommer wrote:
> % cat test.html
> <html>
> <body>
> <a href=".//test.html">a link</a>
> </body>
> </html>
> % webcheck test.html
> webcheck: checking site....
> webcheck: file:///tmp/test.html
> webcheck: Warning: falling back to the legacy HTML parser, consider
> installing BeautifulSoup
> webcheck: file:///tmp//test.html
> webcheck: file:///tmp///test.html
[...]
> If this is not a valid URL webcheck should warn about it. But it
> should at least assume that multiple slashs in an URL are the same as
> one slash.

The way I read RFC 3986 (especially sections 3.3 and 6.2), these are all
separate, valid URLs that point to the same resource. Section 6.2.2.3
only mentions the removal of "." and ".." segments in paths, although
section 6.2.3 does leave some room for other normalisation.

-- 
-- arthur - [EMAIL PROTECTED] - http://people.debian.org/~adejong --
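
To illustrate the reading above, here is a minimal Python sketch of the
dot-segment removal described in RFC 3986 section 5.2.4 (which 6.2.2.3
refers to). It is not webcheck's code; the function names
remove_dot_segments and merge are made up for this example. It suggests
why the empty segment in ".//test.html" survives: only "." and ".."
segments are dropped, so each pass over the link adds one more slash.

```python
def remove_dot_segments(path: str) -> str:
    """Sketch of RFC 3986 section 5.2.4; empty segments are left alone."""
    output = []
    while path:
        if path.startswith("../"):
            path = path[3:]
        elif path.startswith("./"):
            path = path[2:]
        elif path.startswith("/./"):
            path = path[2:]              # "/./x" becomes "/x"
        elif path == "/.":
            path = "/"
        elif path.startswith("/../"):
            path = path[3:]              # "/../x" becomes "/x"
            if output:
                output.pop()             # drop the previous segment
        elif path == "/..":
            path = "/"
            if output:
                output.pop()
        elif path in (".", ".."):
            path = ""
        else:
            # Move the first segment (with its leading "/") to the output.
            end = path.find("/", 1)
            if end == -1:
                output.append(path)
                path = ""
            else:
                output.append(path[:end])
                path = path[end:]
    return "".join(output)


def merge(base_path: str, reference: str) -> str:
    """Hypothetical helper: replace the last segment of base_path (5.2.3)."""
    return base_path.rsplit("/", 1)[0] + "/" + reference


first = remove_dot_segments(merge("/tmp/test.html", ".//test.html"))
second = remove_dot_segments(merge(first, ".//test.html"))
print(first)   # /tmp//test.html
print(second)  # /tmp///test.html
```

Collapsing the repeated slashes would be an extra normalisation step on
top of this, of the kind section 6.2.3 leaves room for.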

