Pike,
Thanks for your quick response. However I was looking for something sightly
different.
I understand the concept of query filtering, but what I really need is some
sort of "category counting" to refine searches.
For e.g. my documents can have a fieldname - location, which could be any
city in a country. I want to display the documents (and count) that match
the search query for each city, so that the user can then search within the
search results. The name of cities is not known in advance.
An example of something similar is -
http://reviews.cnet.com/4566-6501_7-0.html
I just wanted to know whether anyone has tried doing this using Nutch , and
if so then I would be glad if he could give me some pointers for the same.
Regards,
Puneet
On 6/2/07, Pike <[EMAIL PROTECTED]> wrote:
Hi
> I wanted to do some sort of faceted search with Nutch, but am not able
to
> figure out a clean and elegant solution for this. Could anyone give me
any
> sort of pointers on how to achieve this?
hope this answers your question:
every field that lucene indexes is a sort of facet. you
can search within one specific field by passing "fieldname:value"
as the query. one such field by default is title. searching for
title:test return results that contain "test" in
the "title" field, which was derived from the <title> tag.
you could extend the fields that lucene indexes
by writing plugins. this
http://office.labforculture.org:8180/search/search.jsp?query=dc_subject:aboriginal
returns all the urls we have that contain "aboriginal" in
the <meta name="DC:subject" field (and some variations on it).
if you'd define your own metadata, and write your own plugin
to parse that ..
see http://wiki.apache.org/nutch/WritingPluginExample
$2c,
*pike
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general