[ http://issues.apache.org/jira/browse/NUTCH-79?page=comments#action_12364496 ]
Piotr Kosiorowski commented on NUTCH-79:
----------------------------------------

I think it should work without the changes I suggested in my previous comment; they would simply be useful additions. I have not used it for quite a while, so I will get back to it to make sure it works with the latest code (I hope sooner rather than later), but no promises at the moment.

> Fault tolerant searching.
> -------------------------
>
>          Key: NUTCH-79
>          URL: http://issues.apache.org/jira/browse/NUTCH-79
>      Project: Nutch
>         Type: New Feature
>   Components: searcher
>     Reporter: Piotr Kosiorowski
>  Attachments: patch
>
> I have finally managed to prepare the first version of the fault tolerant searching I
> promised a long time ago.
> It reads the server configuration from the search-groups.txt file (in the startup
> directory or the directory specified by searcher.dir) if no search-servers.txt
> file is present. If search-servers.txt is present, it is read and
> handled as before.
> ---------------------------------------------------
> Format of search-groups.txt:
> * <pre>
> * search.group.count=[int]
> * search.group.name.[i]=[string] (for i=0 to count-1)
> *
> * For each name:
> * [name].part.count=[int] partitionCount
> * [name].part.[i].host=[string] (for i=0 to partitionCount-1)
> * [name].part.[i].port=[int] (for i=0 to partitionCount-1)
> *
> * Example:
> * search.group.count=2
> * search.group.name.0=master
> * search.group.name.1=backup
> *
> * master.part.count=2
> * master.part.0.host=host1
> * master.part.0.port=7777
> * master.part.1.host=host2
> * master.part.1.port=7777
> *
> * backup.part.count=2
> * backup.part.0.host=host3
> * backup.part.0.port=7777
> * backup.part.1.host=host4
> * backup.part.1.port=7777
> * </pre>.
> ------------------------------------------------
> If more than one search group is defined in the configuration file, requests are
> distributed among the groups in round-robin fashion.
> If one of the servers in
> a group fails to respond, the whole group is treated as inactive and removed
> from the pool used to distribute requests. A separate recovery
> thread checks every "searcher.recovery.delay" seconds (default 60) whether an
> inactive group has become alive again and, if so, adds it back to the pool of active
> groups.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira
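
For readers who want to see how the search-groups.txt format and the group failover described in the issue fit together, here is a minimal Python sketch. It is not the code from the attached patch (Nutch itself is written in Java); the names `parse_search_groups` and `GroupPool` are hypothetical, and the sketch assumes one `key=value` pair per line. The recovery step is shown as a single pass that a caller would run periodically, standing in for the recovery thread the issue describes.

```python
def parse_search_groups(text):
    """Parse search-groups.txt content into {group_name: [(host, port), ...]}.

    Assumption: one key=value pair per line; lines without '=' are ignored.
    """
    props = {}
    for raw in text.splitlines():
        key, sep, value = raw.strip().partition("=")
        if sep:
            props[key.strip()] = value.strip()

    groups = {}
    for i in range(int(props["search.group.count"])):
        name = props[f"search.group.name.{i}"]
        part_count = int(props[f"{name}.part.count"])
        groups[name] = [
            (props[f"{name}.part.{p}.host"], int(props[f"{name}.part.{p}.port"]))
            for p in range(part_count)
        ]
    return groups


class GroupPool:
    """Hypothetical pool: round-robin over active groups, with failover."""

    def __init__(self, names):
        self.active = list(names)
        self.inactive = []
        self._next = 0

    def next_group(self):
        """Return the next active group name in round-robin order."""
        if not self.active:
            raise RuntimeError("no active search groups")
        name = self.active[self._next % len(self.active)]
        self._next += 1
        return name

    def mark_failed(self, name):
        """Remove a whole group from the pool after one of its servers fails."""
        if name in self.active:
            self.active.remove(name)
            self.inactive.append(name)

    def recover(self, is_alive):
        """One recovery pass: re-add every inactive group that responds again."""
        for name in list(self.inactive):
            if is_alive(name):
                self.inactive.remove(name)
                self.active.append(name)


# The example configuration from the issue description.
EXAMPLE = """\
search.group.count=2
search.group.name.0=master
search.group.name.1=backup

master.part.count=2
master.part.0.host=host1
master.part.0.port=7777
master.part.1.host=host2
master.part.1.port=7777

backup.part.count=2
backup.part.0.host=host3
backup.part.0.port=7777
backup.part.1.host=host4
backup.part.1.port=7777
"""

groups = parse_search_groups(EXAMPLE)
pool = GroupPool(groups)  # iterating a dict yields the group names in file order
```

In this sketch, `pool.next_group()` alternates between `master` and `backup` until `pool.mark_failed("master")` is called, after which only `backup` is returned. A caller would invoke `pool.recover(...)` with its own liveness check every `searcher.recovery.delay` seconds to bring recovered groups back.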
