Randy Winch wrote:

> > Indexer doesn't remove leading spaces in urls before processing. Example:
> >
> > <a href=" http://howellfamily.homepage.com/historykentucky.html">
> >
> > That make the following url for spidering:
> >
> > http://www.rootsweb.com/~kygenweb/ 
>http://howellfamily.homepage.com/historykentucky.html
> >
> > Yes..the space is in there :-)
> >
> > I'll try to patch this locally on my copy once I figure out the "correct" place to 
>patch.
> 
> Simple fix that worked for me:
> 
> int UdmParseURL(UDM_URL *url,char *s){
> char *schema,*anchor,*file;
> 
>         for(;*s && isspace(*s);++s); /* Skip leading space */



  Hello!

This is actially a mistake in HTML... Am I mistaken?

What to do with trailing spaces? Should we remove them too?




-- 
Alexander Barkov
IZHCOM, Izhevsk
email:    [EMAIL PROTECTED]      | http://www.izhcom.ru
Phone:    +7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ:      7748759
______________
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]

Reply via email to