Re: nutch fetched but no indexed

2008-07-29 Thread 宫照
e.apache.org ; [EMAIL PROTECTED] > *Sent:* Monday, July 28, 2008 2:43 PM > *Subject:* Re: nutch fetched but no indexed > > Hi, > > Thank you for wuqi's help. > > I check it under luke and can not find it. > > Now I import the

Re: nutch fetched but no indexed

2008-07-27 Thread 宫照
the > segement file.. > > > > - Original Message - > From: "宫照" <[EMAIL PROTECTED]> > To: ; <[EMAIL PROTECTED]> > Sent: Friday, July 25, 2008 9:53 AM > Subject: Re: nutch fetched but no indexed > > > > Hi Patrick, > > > >

Re: nutch fetched but no indexed

2008-07-24 Thread wuqi
check the status of this page in crawldb,if it is db_fetched, then try to check wheter it exist in the segement file.. - Original Message - From: "宫照" <[EMAIL PROTECTED]> To: ; <[EMAIL PROTECTED]> Sent: Friday, July 25, 2008 9:53 AM Subject: Re: nutch fetch

Re: nutch fetched but no indexed

2008-07-24 Thread 宫照
)|... > > > > > That's the only thing I can think of at first glance. > > Patrick > -Original Message- > From: 宫照 [mailto:[EMAIL PROTECTED] > Sent: Wednesday, July 23, 2008 11:27 PM > To: nutch-user@lucene.apache.org > Subject: nutch fetched but n

RE: nutch fetched but no indexed

2008-07-24 Thread Patrick Markiewicz
ECTED] Sent: Wednesday, July 23, 2008 11:27 PM To: nutch-user@lucene.apache.org Subject: nutch fetched but no indexed Hi everybody, I face a problem when using nutch. I use nuth to crawl in intranet. It works well before. But recently, I add some urls to crawl. These urls ara different with normal

nutch fetched but no indexed

2008-07-23 Thread 宫照
Hi everybody, I face a problem when using nutch. I use nuth to crawl in intranet. It works well before. But recently, I add some urls to crawl. These urls ara different with normal .The new urls like this: http://compass.mydomain.com/go/247460034 there are many folders or documents under this url