Hi

> I wanted to do some sort of faceted search with Nutch, but am not able to
> figure out a clean and elegant solution for this. Could anyone give me any
> sort of pointers on how to achieve this?
Hope this answers your question:

Every field that Lucene indexes is a sort of facet. You can search within one specific field by passing "fieldname:value" as the query. One such field, available by default, is title: searching for title:test returns results that contain "test" in the "title" field, which is derived from the <title> tag.

You can extend the set of fields that Lucene indexes by writing plugins. For example,

http://office.labforculture.org:8180/search/search.jsp?query=dc_subject:aboriginal

returns all the URLs we have that contain "aboriginal" in the <meta name="DC:subject"> field (and some variations on it).

If you define your own metadata and write your own plugin to parse it, you can search on those fields in the same way. See http://wiki.apache.org/nutch/WritingPluginExample

$2c,
*pike

_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
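To make the "fieldname:value" idea concrete, here is a minimal sketch in plain Java that builds such a query and the matching search.jsp URL. It does not use any Nutch or Lucene API; the class name FieldQuery and its two methods are made up for illustration, and it assumes single-term values with no whitespace. Note that it percent-encodes the colon as %3A, which a servlet container decodes back to ":" before the query is parsed.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

// Hypothetical helper, not part of Nutch: builds field-restricted queries.
public class FieldQuery {

    // Returns "field:value", e.g. "title:test".
    // Assumption: both parts are single tokens without whitespace.
    public static String clause(String field, String value) {
        if (field == null || value == null
                || field.isEmpty() || value.isEmpty()
                || field.matches(".*\\s.*") || value.matches(".*\\s.*")) {
            throw new IllegalArgumentException("field and value must be non-empty single tokens");
        }
        return field + ":" + value;
    }

    // Appends the clause as the "query" parameter of a search page URL.
    public static String searchUrl(String baseUrl, String field, String value) {
        try {
            return baseUrl + "?query=" + URLEncoder.encode(clause(field, value), "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is always available on a conforming JVM.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(clause("title", "test")); // prints title:test
        System.out.println(searchUrl(
                "http://office.labforculture.org:8180/search/search.jsp",
                "dc_subject", "aboriginal"));
    }
}
```

A field like dc_subject only returns hits if a plugin has actually added it to the index, as described above; the helper just formats the query.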
