Re: Spider - path conflict [../test.htm,www.nic.nl/index.html]

2005-04-01 Thread Skip Montanaro
martijn> I thought I was ready with my own spider... But then there was
martijn> a bug, or in other words a missing part in my code.
martijn> I forget that people do this in website html:
martijn> http://www.nic.nl/monkey.html">is oke
martijn> error
martijn> error

pydoc

Re: Spider - path conflict [../test.htm,www.nic.nl/index.html]

2005-04-01 Thread martijn
urllib.basejoin(), that's what I need :)
Haha, what stupid code I wrote.

Thanks,
GC-Martijn
--
http://mail.python.org/mailman/listinfo/python-list

Re: Spider - path conflict [../test.htm,www.nic.nl/index.html]

2005-04-01 Thread Jeff Epler
I think you want urllib.basejoin().

>>> import urllib
>>> urllib.basejoin("http://www.example.com/test/page.html", "otherpage.html")
'http://www.example.com/test/otherpage.html'
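
For anyone reading this thread on Python 3: urllib.basejoin() was a Python 2 name, and the equivalent call today is urllib.parse.urljoin(). Below is a minimal sketch of how a spider might resolve the hrefs it finds against the page they came from. The helper name resolve_links, the base URL with a /dir/ component, and the sample hrefs are assumptions made up for illustration; they are not taken from martijn's code.

```python
from urllib.parse import urljoin


def resolve_links(page_url, hrefs):
    """Turn each href found on page_url into an absolute URL.

    resolve_links is a hypothetical helper for this example only.
    Absolute hrefs pass through unchanged; relative ones are resolved
    against page_url.
    """
    return [urljoin(page_url, href) for href in hrefs]


# Assumed sample data: a page one directory below the site root.
page = "http://www.nic.nl/dir/index.html"
hrefs = [
    "http://www.nic.nl/monkey.html",  # already absolute
    "../test.htm",                    # one directory up
    "other.html",                     # same directory as the page
]

for absolute in resolve_links(page, hrefs):
    print(absolute)
```

The point of going through urljoin() is that the spider no longer needs its own logic for "../" segments or for telling absolute links from relative ones; it only has to remember which page each href was found on.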

Spider - path conflict [../test.htm,www.nic.nl/index.html]

2005-04-01 Thread martijn
Hi!

I thought I was ready with my own spider... But then there was a bug, or in other words a missing part in my code.

I forgot that people do this in website html:
http://www.nic.nl/monkey.html">is oke
error
error

So now I'm trying to fix my spider, but it keeps failing. I tried something l