Matt Sullivan wrote:
Oh, yes it seems, that you are right. Mhh, thats not good for my problem. I want to build a Searchengine for Websites which can be viewed on a mobilephone. The phone for which the search engine should be, can only display sites with max. 10 Kilobytes of Size.On Tue, 29 Oct 2002 at 20:49:03 +0100, Daniel Winter wrote:i have set MaxDocSize to 10240 because i want only index that small sites. But i think that index does ignore the Value MaxDocSize. Why is that so or what do i make wrong?
ASPseek stores the real size of the document as returned by the header 'Content-Length' but will only fetch the specified amount of bytes for indexing purposes.
So, i must say the spider, that all websites with more than 10 Kilobytes should not be indexed in any way (index should not follow links on pages with more than 10 Kilobytes).
Yes, so it is.Here can you see my problem:
go to my Test-Site at http://www.m-find.de and search there for Experiences_None . The first result is an 2,2 MB big Website! MaxDocSize is set to 10240 in aspseek.conf.
Try viewing the cached copy, you should see only the small chunk of the document that is actually stored.
But in the docs from aspseek is written:
"MaxDocSize bytes
Sets the maximum document size in bytes, so pages with size more that bytes will not be processed. Default value is 1048576 bytes (1Mb)."
That is wrong. I would be happy, if it were right.
DanielW
