Maybe I did'nt or maybe I did :-) If you want to filter out certain
sites, what is then the problem with doing it in the query?

My query is wrong however, but adding '-site:' in front of every site
you want to exclude will do the trick. I am sure there is another
smarter way aswell. 

I do not know about any filters like you ask for and must be for someone
else to answer.



-----Opprinnelig melding-----
Fra: Manoharam Reddy [mailto:[EMAIL PROTECTED] 
Sendt: 31. mai 2007 14:56
Til: [EMAIL PROTECTED]
Emne: Re: Any URL filter available for search.jsp?

You didn't get my requirement. I don't want the search to be limited to
a particular site. Instead I want some particular sites to not appear in
the results.

Let's say my crawler has crawled pages from sites 1, 2, 3 .... 500.

But when a user searches for  a term, I don't want the sites 125, 126,
130, 200, 201..250 to appear in the results. Is there a such a filter
(similar to regex-urlfilter.txt) for the search part.

On 5/31/07, Naess, Ronny <[EMAIL PROTECTED]> wrote:
> I guess you could add 'site:"yoursite [AND yourothersite]"' to the 
> query alternativly '-site:"yoursite [AND yourothersite]"' (remove only

> the single quotes). I had exactly the same need but field site was not

> enough, so I had to add custom Fields as separate plugins (both index 
> and search).
>
>
> -----Opprinnelig melding-----
> Fra: Manoharam Reddy [mailto:[EMAIL PROTECTED]
> Sendt: 31. mai 2007 12:42
> Til: [EMAIL PROTECTED]
> Emne: Any URL filter available for search.jsp?
>
> I want to use two filters one for crawling and another for searching 
> through search.jsp.
>
> I am currently using regex-urlfilter.txt for generate, fetch, update 
> cycle. But when a user searches the sites, I want him not to see 
> certain sites in the results that have been crawled.
>
> How can this be achieved?
>
> 
>
>

!DSPAM:465ec5ed21469724523251!


-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to