e.apache.org ; [EMAIL PROTECTED]
> *Sent:* Monday, July 28, 2008 2:43 PM
> *Subject:* Re: nutch fetched but no indexed
>
> Hi,
>
> Thank you for wuqi's help.
>
> I check it under luke and can not find it.
>
> Now I import the
the
> segement file..
>
>
>
> - Original Message -
> From: "宫照" <[EMAIL PROTECTED]>
> To: ; <[EMAIL PROTECTED]>
> Sent: Friday, July 25, 2008 9:53 AM
> Subject: Re: nutch fetched but no indexed
>
>
> > Hi Patrick,
> >
> >
check the status of this page in
crawldb,if it is db_fetched, then try to check wheter it exist in the segement
file..
- Original Message -
From: "宫照" <[EMAIL PROTECTED]>
To: ; <[EMAIL PROTECTED]>
Sent: Friday, July 25, 2008 9:53 AM
Subject: Re: nutch fetch
)|...
>
>
>
>
> That's the only thing I can think of at first glance.
>
> Patrick
> -Original Message-
> From: 宫照 [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, July 23, 2008 11:27 PM
> To: nutch-user@lucene.apache.org
> Subject: nutch fetched but n
ECTED]
Sent: Wednesday, July 23, 2008 11:27 PM
To: nutch-user@lucene.apache.org
Subject: nutch fetched but no indexed
Hi everybody,
I face a problem when using nutch. I use nuth to crawl in intranet. It works
well before. But recently, I add some urls to crawl. These urls ara
different with normal
Hi everybody,
I face a problem when using nutch. I use nuth to crawl in intranet. It works
well before. But recently, I add some urls to crawl. These urls ara
different with normal .The new urls like this:
http://compass.mydomain.com/go/247460034
there are many folders or documents under this url