When I run Nutch 0.9 on Hadoop, the mapper in Generator.Selector
assigns the same output key, 1.0, to every crawl datum entry that it
emits as a value. I don't understand why it does that instead of
assigning different keys to different entries. In my case, the number
of reduce tasks is set to 3, and each of them processes crawl datum
entries that all have the key 1.0. This seems very odd. Can someone
please explain why the mapper assigns the same key to every entry?
Thanks,
Shirley