[Nutch-general] Why Nutch is indexing HTTP 302 pages

2007-06-12 Thread Manoharam Reddy
I find in the search results that lots of HTTP 302 pages have been indexed. This is decreasing the quality of search results. Is there any way to disable indexing such pages? I want only HTTP 200 OK pages to be indexed. -

Re: [Nutch-general] Why Nutch is indexing HTTP 302 pages

2007-06-12 Thread Doğacan Güney
On 6/11/07, Manoharam Reddy [EMAIL PROTECTED] wrote: I find in the search results that lots of HTTP 302 pages have been indexed. This is decreasing the quality of search results. Is there any way to disable indexing such pages? I want only HTTP 200 OK pages to be indexed. If you run fetcher