Hi

> I wanted to do some sort of faceted search with Nutch, but am not able to
> figure out a clean and elegant solution for this. Could anyone give me any
> sort of pointers on how to achieve this?

hope this answers your question:

every field that lucene indexes is a sort of facet. you
can search within one specific field by passing "fieldname:value"
as the query. one such field by default is title. searching for
title:test return results that contain "test" in
the "title" field, which was derived from the <title> tag.

you could extend the fields that lucene indexes
by writing plugins. this
http://office.labforculture.org:8180/search/search.jsp?query=dc_subject:aboriginal
returns all the urls we have that contain "aboriginal" in
the <meta name="DC:subject" field (and some variations on it).

if you'd define your own metadata, and write your own plugin
to parse that ..

see http://wiki.apache.org/nutch/WritingPluginExample

$2c,
*pike


-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to