I get it now! Thanks a lot!
I was running my crawl command with fetcher.parse set to true, which was
causing the problem.
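
For anyone checking the numbers in the quoted reply below (20-40 records per
second from a single mapper), here is a minimal Python sketch of the
arithmetic. It assumes the fetch rate is sustained for a full 24 hours, which
a real crawl may not manage, and the function name is only illustrative.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def records_per_day(records_per_second):
    """Records fetched in 24 hours at a constant (assumed) fetch rate."""
    return records_per_second * SECONDS_PER_DAY


# The two ends of the range mentioned in the quoted reply.
for rate in (20, 40):
    print(f"{rate} records/s -> {records_per_day(rate):,} records/day")
```

So roughly 1.7 to 3.5 million records per day, which matches the "few million
records in a day or so" estimate.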

On Thu, Oct 18, 2012 at 5:53 PM, Markus Jelsma-2 [via Lucene] <
ml-node+s472066n4014609...@n3.nabble.com> wrote:

> You would have to check the generator code to make sure. But why would you
> want to distribute the queue for a single domain to multiple mappers? A
> single local running mapper without parsing on a low-end machine can easily
> fetch 20-40 records per second from the same domain (if it allows you to do
> it). At that speed you can easily fetch a few million records in a day
> or so.
>
> -----Original message-----
>
> > From: shri_s_ram <[hidden email]>
> > Sent: Thu 18-Oct-2012 23:11
> > To: [hidden email]
> > Subject: RE: Nutch generate fetch lists for a single domain (but with
> > multiple urls) crawl
> >
> > Thanks.. But I thought there would be a way around it..
> > Is it possible even to have multiple fetch lists generated (for this
> > problem) at all by tweaking some parameters?
> >
> > [I am thinking of something like partition.url.mode - byRandom]
> >
> >
> >
> > --
> > View this message in context:
> http://lucene.472066.n3.nabble.com/Nutch-generate-fetch-lists-for-a-single-domain-but-with-multiple-urls-crawl-tp4014573p4014582.html
>
> > Sent from the Nutch - User mailing list archive at Nabble.com.
> >




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Nutch-generate-fetch-lists-for-a-single-domain-but-with-multiple-urls-crawl-tp4014573p4014626.html
Sent from the Nutch - User mailing list archive at Nabble.com.
