Daniel Winter wrote on Tue, 29 October 2002 23:47
> But in the docs from aspseek is written:
> 
> "MaxDocSize bytes
> Sets the maximum document size in bytes, so pages with size more that 
> bytes will not be processed. Default value is 1048576 bytes (1Mb)."
> 
> That is wrong. I would be happy, if it were right.


Documentation is fixed in CVS (will be available in 1.2.11), thanx for the catching.

As for your problem, fixing it requires changing ASPseek's crawler strategy: it should 
first send HEAD request to web server, than check the size and, if allowed, send GET 
request. This can really slow down indexing, so in general it should be avoided. But, 
as ASPseek is free and open source software, you can easily modify the source code to 
achieve what you want, or ask/hire somebody to do it.

--
-- kir

Reply via email to