send me the log of the crawling if possible. for sure there are some clues
on it

2009/4/1 陈琛 <[email protected]>

> yes, the depth is 10 and topN is 2000...
>
>  So strange....the other urls it is normal..but the 4 urls..
>
>
>
> 2009/4/1 Alejandro Gonzalez <[email protected]>
>
> > seems strange. have u tried to start a crawl just with these 4 seed
> pages?
> >
> > Are you setting the topN parameter?
> >
> >
> > 2009/4/1 陈琛 <[email protected]>
> >
> > >
> > > thanks,i have Collection of urls Only these four can not search a
> subset
> > > of their pages
> > >
> > > the urls and crawl-urlfilter like Attachment
> > >
> > >
> > > 2009/4/1 Alejandro Gonzalez <[email protected]>
> > >
> > > it's your crawl-urlfilter ok? are u sure it's fetching them properly?
> > maybe
> > >> it's not getting the content of the pages and so it cannot extract
> links
> > >> for
> > >> fetch in the next level (i suppose you have set the crawl depth just
> for
> > >> the
> > >> seeds level).
> > >>
> > >> So or your filters are skipping the seeds (i suppose it's not the case
> > >> cause
> > >> you say that urls arrive to Fetcher), or the fetching it's not going
> ok
> > >> (network issues?). take a look on that
> > >>
> > >> 2009/4/1 陈琛 <[email protected]>
> > >>
> > >> > HI,all
> > >> >       I have four urls, like this:
> > >> >       http://www.lao-indochina.com
> > >> >       http://www.nuol.edu.la
> > >> >       http://www.corninc.com.la
> > >> >       http://www.vientianecollege.laopdr.com
> > >> >
> > >> > only fetch the HomePage why? Sub-page is not fetch。。。
> > >> >
> > >>
> > >
> > >
> >
>

Reply via email to