Hi Dogacan,

   Fetcher2 gives a better performance when the number of hosts per task is
more than the number of threads that the task can use. In this case, fetcher
might block on some hosts, whereas fetcher2 will use that idle time in
crawling some other host.

   It could be that the number of hosts per task is not significantly higher
than the number of threads per task. In that case, ideally you should see a
similar performance from fetcher2 and fetcher (assuming same url list and
network bandwidth).

  Also, as Andrzej suggested - it would be good to have some more debugging
info.

Regards,

-vishal.

-----Original Message-----
From: Dogacan Güney [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, May 23, 2007 8:21 PM
To: [EMAIL PROTECTED]
Subject: Re: [Nutch-general] Fetcher2 slowness?

On 5/23/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
> So what was Fetcher2's performance like when its number of threads was the
same as that of Fetcher?

It is still slower. I tried giving Fetcher2 more threads,it is still
worse than Fetcher but a bit better than fewer-threaded
Fetcher2(Fetcher finished in 1 hour, Fetcher2 in about 2.5). Though I
have performed other tests where their performance is similar(and I
have no idea why). I am trying to find the cause of problem, but so
far, had no luck.

>
> Otis
>

[snip]

-- 
Dogacan Güney


-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to