Hi Andrzej,

Real bottleneck of Nutch is RegexURLNormalizer, it is still synchronized 
singleton (shared by multiple threads). And similar synchronized plugins which 
should be probably refactored to Nutch core...

-Fuad


> Most of
> the time the politeness limits (max rate of requests per host) are the
> bottleneck.


Reply via email to