Randy Winch wrote:
> > Indexer doesn't remove leading spaces in urls before processing. Example:
> >
> > <a href=" http://howellfamily.homepage.com/historykentucky.html">
> >
> > That makes the following URL for spidering:
> >
> > http://www.rootsweb.com/~kygenweb/ http://howellfamily.homepage.com/historykentucky.html
> >
> > Yes..the space is in there :-)
> >
> > I'll try to patch this locally on my copy once I figure out the "correct" place to patch.
>
> Simple fix that worked for me:
>
> int UdmParseURL(UDM_URL *url,char *s){
> char *schema,*anchor,*file;
>
> for(;*s && isspace((unsigned char)*s);++s); /* Skip leading whitespace */
Hello!
This is actually a mistake in the HTML... Am I mistaken?
What to do with trailing spaces? Should we remove them too?
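
If we do strip both ends, something along these lines might work. This is only a sketch: trim_url is a hypothetical helper, not an existing function in the sources, and it assumes the URL buffer is writable.

```c
#include <ctype.h>
#include <string.h>

/* Hypothetical helper (not part of the existing sources): trims leading
   and trailing whitespace in place and returns a pointer to the first
   non-space character inside the original buffer. */
static char *trim_url(char *s)
{
    char *end;

    /* Skip leading whitespace. The cast keeps isspace() well defined
       for chars with the high bit set. */
    while (*s && isspace((unsigned char)*s))
        ++s;

    /* Walk back from the terminator and cut trailing whitespace. */
    end = s + strlen(s);
    while (end > s && isspace((unsigned char)end[-1]))
        --end;
    *end = '\0';

    return s;
}
```

Calling it as s = trim_url(s); at the top of UdmParseURL would cover both cases, provided the caller does not need the original pointer or the untrimmed contents afterwards.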
--
Alexander Barkov
IZHCOM, Izhevsk
email: [EMAIL PROTECTED] | http://www.izhcom.ru
Phone: +7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ: 7748759