[ 
http://issues.apache.org/jira/browse/NUTCH-79?page=comments#action_12364496 ] 

Piotr Kosiorowski commented on NUTCH-79:
----------------------------------------

I think it should work without changes I suggested in previous comment - they 
would be simply useful additions.
I was not using it for quite a while so I would get back to it to make sure it 
works with latest code (I hope sooner than later) - but no promises at the 
moment

> Fault tolerant searching.
> -------------------------
>
>          Key: NUTCH-79
>          URL: http://issues.apache.org/jira/browse/NUTCH-79
>      Project: Nutch
>         Type: New Feature
>   Components: searcher
>     Reporter: Piotr Kosiorowski
>  Attachments: patch
>
> I have finally managed to prepare first version of fault tolerant searching I 
> have promised long time ago. 
> It reads server configuration from search-groups.txt file (in startup 
> directory or directory specified by searcher.dir) if no search-servers.txt 
> file is present. If search-servers.txt  is presentit would be read and 
> handled as previously.
> ---------------------------------------------------
> Format of search-groups.txt:
> * <pre>
>  *  search.group.count=[int] 
>  *  search.group.name.[i]=[string] (for i=0 to count-1)
>  *  
>  *  For each name: 
>  *  [name].part.count=[int] partitionCount 
>  *  [name].part.[i].host=[string] (for i=0 to partitionCount-1)
>  *  [name].part.[i].port=int (for i=0 to partitionCount-1)
>  *  
>  *  Example: 
>  *  search.group.count=2 
>  *  search.group.name.0=master
>  *  search.group.name.1=backup
>  *  
>  *  master.part.count=2 
>  *  master.part.0.host=host1 
>  *  master.part.0.port=7777
>  *  master.part.1.host=host2 
>  *  master.part.1.port=7777
>  *  
>  *  backup.part.count=2 
>  *  backup.part.0.host=host3 
>  *  backup.part.0.port=7777
>  *  backup.part.1.host=host4 
>  *  backup.part.1.port=7777
>  * </pre>.
> ------------------------------------------------
> If more than one search group is defined in configuration file requests are 
> distributed among groups in round-robin fashion. If one of the servers from 
> the group fails to respond the whole group is treated as inactive and removed 
> from the pool used to distributed requests. There is a separate recovery 
> thread that every "searcher.recovery.delay" seconds (default 60) tries to 
> check if inactive became alive and if so adds it back to the pool of active 
> groups.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira



-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to