Searching multiple indexes with a single nutch instance
-------------------------------------------------------
Key: NUTCH-480
URL: https://issues.apache.org/jira/browse/NUTCH-480
Project: Nutch
Issue Type: Improvement
Components: searcher, web gui
Affects Versions: 0.8
Environment: Linux and Windows
Reporter: Ravi Chintakunta
Searching across multiple indexes with a single instance of Nutch is a useful
improvement. I had this requirement for my production site, where we wanted to
list the available categories (indexes) as check boxes so that the user could
select any combination of indexes to search. The results page also displays
the number of hits in each index.
To do this:
- Modified web.xml to include the paths to the various search indexes
- Modified Nutch.java to read all the indexes and create IndexReaders
- Modified IndexSearcher.java to handle multiple IndexReaders
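The idea behind these steps can be sketched roughly as follows. This is only a
minimal illustration in plain Java and is not code from the patch: the class
MultiIndexSearchSketch, the method searchSelected, and the in-memory lists
standing in for the real indexes and IndexReaders are all hypothetical. It
shows searching just the indexes the user selected and keeping the hits grouped
by index, so that a per-index hit count is available for the results page.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: none of these names come from Nutch or from the patch.
class MultiIndexSearchSketch {

    /**
     * Searches only the selected indexes and returns the matching documents
     * grouped by index name. The hit count for an index is the list size.
     * Each "index" here is just an in-memory list of document strings.
     */
    static Map<String, List<String>> searchSelected(
            Map<String, List<String>> indexes, List<String> selected, String term) {
        String needle = term.toLowerCase(Locale.ROOT);
        Map<String, List<String>> hitsByIndex = new LinkedHashMap<>();
        for (String name : selected) {
            List<String> docs = indexes.get(name);
            if (docs == null) {
                continue; // unknown index name: skip it
            }
            List<String> hits = new ArrayList<>();
            for (String doc : docs) {
                // Case-insensitive substring match stands in for a real query.
                if (doc.toLowerCase(Locale.ROOT).contains(needle)) {
                    hits.add(doc);
                }
            }
            hitsByIndex.put(name, hits);
        }
        return hitsByIndex;
    }

    public static void main(String[] args) {
        Map<String, List<String>> indexes = new LinkedHashMap<>();
        indexes.put("news", List.of("Nutch release announced", "Weather report"));
        indexes.put("blogs", List.of("Tuning Nutch crawls", "Nutch and Lucene tips"));
        indexes.put("wiki", List.of("Lucene scoring"));

        // The user ticked the "news" and "blogs" check boxes only.
        Map<String, List<String>> result =
                searchSelected(indexes, List.of("news", "blogs"), "nutch");
        for (Map.Entry<String, List<String>> e : result.entrySet()) {
            System.out.println(e.getKey() + ": " + e.getValue().size() + " hit(s)");
        }
    }
}
```

In the actual patch the per-index work is done through IndexReaders inside
IndexSearcher.java; the sketch only mirrors the select, search, and count flow.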
In the attached file you will find the patch to the Nutch 0.8 code base and
also the newly added files:
- SearchServlet - a servlet that is the web interface for search. This is a
simplified version of the JSP pages (without the i18n) and outputs the results
in text, XML or JSON format.
- SearchConstants - an interface for messages and constants
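As a rough illustration of what producing the same results in text, XML or
JSON can look like, here is a small self-contained sketch. It is not the
SearchServlet from the attachment: the class ResultFormatSketch, the format
names "xml" and "json", and the <results>/<hit> element names are assumptions
made for the example only.

```java
import java.util.List;

// Hypothetical sketch: class, format names and element names are assumptions.
class ResultFormatSketch {

    /** Renders hit titles as XML, JSON, or (by default) plain text. */
    static String render(List<String> titles, String format) {
        StringBuilder out = new StringBuilder();
        switch (format) {
            case "xml":
                out.append("<results>");
                for (String t : titles) {
                    out.append("<hit>").append(escapeXml(t)).append("</hit>");
                }
                out.append("</results>");
                break;
            case "json":
                out.append("[");
                for (int i = 0; i < titles.size(); i++) {
                    if (i > 0) {
                        out.append(",");
                    }
                    out.append('"').append(escapeJson(titles.get(i))).append('"');
                }
                out.append("]");
                break;
            default: // plain text, one hit per line
                for (String t : titles) {
                    out.append(t).append('\n');
                }
        }
        return out.toString();
    }

    /** Escapes the characters that are unsafe in XML element content. */
    static String escapeXml(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    /** Escapes quotes, backslashes and control characters for a JSON string. */
    static String escapeJson(String s) {
        StringBuilder b = new StringBuilder();
        for (char c : s.toCharArray()) {
            if (c == '"' || c == '\\') {
                b.append('\\').append(c);
            } else if (c < 0x20) {
                b.append(String.format("\\u%04x", (int) c));
            } else {
                b.append(c);
            }
        }
        return b.toString();
    }

    public static void main(String[] args) {
        List<String> titles = List.of("Nutch & Lucene", "Say \"hi\"");
        System.out.println(render(titles, "xml"));
        System.out.println(render(titles, "json"));
        System.out.print(render(titles, "text"));
    }
}
```

A real servlet would also set the response content type to match the chosen
format before writing the output.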
Please note that the patch also includes the spell-check functionality, aka
"Did you mean?".
_______________________________________________
Nutch-developers mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-developers