every 6000th page from
> dmoz. It is not unusual to let Nutch run over night, as it needs
> to fetch a lot of sites.
>
> Kind regards,
> Olaf
>
>
> On Mon, 28 Feb 2005 14:04:35 -0800, sub paul <[EMAIL PROTECTED]> wrote:
> > Hi Olaf,
> >
> > Thanks for
nternet.
>
> Depending on your data, start with a much smaller depth.
>
> Kind regards,
> Olaf
>
>
> On Mon, 28 Feb 2005 08:20:22 -0800, sub paul <[EMAIL PROTECTED]> wrote:
> > Hello All,
> >
> > I was running an intranet crawl and It seems like
Hello All,
I was running an intranet crawl and It seems like it did not finish, properly.
It is a pretty default setup, but crawl's depth was 15, and I had
turned on queries by commenting out
# skip URLs containing certain characters as probable queries, etc.
[EMAIL PROTECTED]
other than bunch o