Hi Andrzej

Really cooool idea:)
I still remembered we added DNSCache in our log analyzer.
My vote +1

/Jason

On 4/29/05, Andrzej Bialecki <[EMAIL PROTECTED]> wrote:
> Michael Nebel wrote:
> > Hi Jack,
> >
> > I simply don't know, if there's a better alterative to
> > "InetAddress.getLocalHost()". But since nutch is optimized for
> > distribution and scaling, you can crawl from different servers (beside:
> > this works great :-). So I would expect the hostname to be some part of
> > any unique id.
> 
> Let me pipe in with a useful tip: if you use Nutch for Internet
> crawling, I would highly recommend you to install on the same segment a
> dedicated caching DNS server (e.g. the one from djbdns package), and
> configure all Nutch machines to use that server, especially the crawlers.
> 
> In my experience this saves a lot of outside traffic (only the first
> lookup results in external traffic), speeds up the crawling (the caching
> DNS is much quicker and bandwidth-conscious than normal DNS servers),
> and prevents some timeouts (when the connection is too clogged, or
> remote DNS server just collapsed under the load :-) ).
> 
> --
> Best regards,
> Andrzej Bialecki
>  ___. ___ ___ ___ _ _   __________________________________
> [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
> ___|||__||  \|  ||  |  Embedded Unix, System Integration
> http://www.sigram.com  Contact: info at sigram dot com
> 
>


-------------------------------------------------------
SF.Net email is sponsored by: Tell us your software development plans!
Take this survey and enter to win a one-year sub to SourceForge.net
Plus IDC's 2005 look-ahead and a copy of this survey
Click here to start!  http://www.idcswdc.com/cgi-bin/survey?id5hix
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to