What you mean seprate by name only here.
I have below urls can you please tell how many queues  will be here if queue 
mode is byhost.

http://www.apple.com/ipad/ <http://www.apple.com/ipad/>
http://www.apple.com/iphone/ <http://www.apple.com/iphone/>
http://itunes.apple.com

Thanks

> On Feb 24, 2016, at 12:52 PM, Markus Jelsma <markus.jel...@openindex.io> 
> wrote:
> 
> Hello Manish - byHost in fetcher|generate.queu.mode means queue/separate by 
> name only. Generator nor fetcher use IP address information for queuing 
> purposes. I am not sure what you mean by working with a load balancer. A 
> hostname resolves to one or more IP's, possibly any casted addresses as well. 
> As far as i know/remember, a single IP is used during the fetch, without any 
> DNS round robin, but this might be different between protocol plugins.
> 
> Do you have a concrete problem to solve?
> 
> Markus 
> 
> -----Original message-----
>> From:Manish Verma <m_ve...@apple.com>
>> Sent: Wednesday 24th February 2016 21:45
>> To: user@nutch.apache.org
>> Subject: How does fetcher.queue.mode seprates  url for queues when it is set 
>> byhost
>> 
>> Hi,
>> 
>> I am little bot confused over how  fetcher.queue.mode property identifies 
>> the urls. 
>> How does it work when the value is given “byhost”, does it identify urls by 
>> IP ? , how does it work with load balancer.
>> I know it creates queue based on host but what does mean by host here ?
>> 
>> Is there any other property which have impact on this.
>> 
>> Thanks 

Reply via email to