Randy Winch wrote:
> 
> UdmSearch version: 3.0.18
> Platform:          Dual PIII
> OS:                RH 6.2 + 2.2.14-6.1.1 kernel
> Database:          crc
> Statistics:        2-4 gig ndict
> 
> C frontend
> Indexer doesn't remove leading spaces in urls before processing. Example:
> 
> <a href=" http://howellfamily.homepage.com/historykentucky.html">
> 
> That make the following url for spidering:
> 
> http://www.rootsweb.com/~kygenweb/ 
>http://howellfamily.homepage.com/historykentucky.html
> 
> Yes..the space is in there :-)
> 
> I'll try to patch this locally on my copy once I figure out the "correct" place to 
>patch.

Simple fix that worked for me:

int UdmParseURL(UDM_URL *url,char *s){
char *schema,*anchor,*file;

        for(;*s && isspace(*s);++s); /* Skip leading space */

Randy
______________
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]

Reply via email to