MJ Ray wrote:
> If someone can write the code to pick out all the URIs between
> identification and normalisation, and can do it without slowing the
> parser down even more, come on!  Send a patch to the list!

OK then, here's a quick fix. I'm not a Python expert at all. I just spent
about 5 minutes on this.

Try out this link to verify that the existing Python parser chokes on
spaces:

http://go.sosd.com/

Then replace the existing Url.py with the attached copy and try again.

I just added two lines that replace each literal space with "%20". Since
unencoded spaces are illegal in a URL anyway, this should not have any bad
side effects.
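
For anyone who doesn't want to open the attachment, the idea is roughly the
following. This is only a sketch of the kind of change described above, not
the actual contents of the attached Url.py; the function name escape_spaces
is made up for illustration.

```python
# Hypothetical helper; the name is an assumption, not taken from Url.py.
def escape_spaces(url):
    """Return url with every literal space replaced by "%20"."""
    # str.replace substitutes all occurrences, so several spaces are handled.
    return url.replace(" ", "%20")


if __name__ == "__main__":
    # example.com is a placeholder address, not the test link above.
    print(escape_spaces("http://example.com/my page.html"))
```

It only touches literal spaces, so a URL that is already clean passes
through unchanged.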


Regards
-Laurens


Attachment: Url.py
Description: Binary data
