> 08/02/20 15:38:09 WARN crawl.Generator: Generator: 0 records selected
> for fetching, exiting ...
> 08/02/20 15:38:09 INFO crawl.Crawl: Stopping at depth=0 - no more URLs to 
> fetch.
> 08/02/20 15:38:09 WARN crawl.Crawl: No URLs to fetch - check your seed
> list and URL filters.
> 
> I've inserted code at Generator.java:424, which says:
> if (readers == null || readers.length == 0 || !readers[0].next(new
> FloatWritable())) {
>    LOG.warn("Generator: 0 records selected for fetching, exiting ...");
> 
> essentially at the decision point to see which of the conditions
> triggered the 0 records selected message, and the "readers" object is
> perfectly fine, but the SequenceFileOutputFormat is reporting there
> are no values (I suppose of URL scores) at all to be retrieved,
> causing the generator to stop.

There is a problem with the Generator.  There was a change committed
after 0.9 was released.  I implemented this change and it fixed my
problem:

http://www.mail-archive.com/[EMAIL PROTECTED]/msg01991.html

JohnM

-- 
john mendenhall
[EMAIL PROTECTED]
surf utopia
internet services

Reply via email to