[ 
http://issues.apache.org/jira/browse/NUTCH-79?page=comments#action_12364496 ] 

Piotr Kosiorowski commented on NUTCH-79:
----------------------------------------

I think it should work without changes I suggested in previous comment - they 
would be simply useful additions.
I was not using it for quite a while so I would get back to it to make sure it 
works with latest code (I hope sooner than later) - but no promises at the 
moment

> Fault tolerant searching.
> -------------------------
>
>          Key: NUTCH-79
>          URL: http://issues.apache.org/jira/browse/NUTCH-79
>      Project: Nutch
>         Type: New Feature
>   Components: searcher
>     Reporter: Piotr Kosiorowski
>  Attachments: patch
>
> I have finally managed to prepare first version of fault tolerant searching I 
> have promised long time ago. 
> It reads server configuration from search-groups.txt file (in startup 
> directory or directory specified by searcher.dir) if no search-servers.txt 
> file is present. If search-servers.txt  is presentit would be read and 
> handled as previously.
> ---------------------------------------------------
> Format of search-groups.txt:
> * <pre>
>  *  search.group.count=[int] 
>  *  search.group.name.[i]=[string] (for i=0 to count-1)
>  *  
>  *  For each name: 
>  *  [name].part.count=[int] partitionCount 
>  *  [name].part.[i].host=[string] (for i=0 to partitionCount-1)
>  *  [name].part.[i].port=int (for i=0 to partitionCount-1)
>  *  
>  *  Example: 
>  *  search.group.count=2 
>  *  search.group.name.0=master
>  *  search.group.name.1=backup
>  *  
>  *  master.part.count=2 
>  *  master.part.0.host=host1 
>  *  master.part.0.port=7777
>  *  master.part.1.host=host2 
>  *  master.part.1.port=7777
>  *  
>  *  backup.part.count=2 
>  *  backup.part.0.host=host3 
>  *  backup.part.0.port=7777
>  *  backup.part.1.host=host4 
>  *  backup.part.1.port=7777
>  * </pre>.
> ------------------------------------------------
> If more than one search group is defined in configuration file requests are 
> distributed among groups in round-robin fashion. If one of the servers from 
> the group fails to respond the whole group is treated as inactive and removed 
> from the pool used to distributed requests. There is a separate recovery 
> thread that every "searcher.recovery.delay" seconds (default 60) tries to 
> check if inactive became alive and if so adds it back to the pool of active 
> groups.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira

Reply via email to