Simple answer is billions, perhaps tens to hundreds of billions of
records, as it leverages Hadoop. Yahoo is currently using Hadoop to
create its web index. But as Otis pointed out, Hadoop is parallel
processing and as such is completely dependent on amount of hardware.
Dennis
Polsnet wrote:
Nutch 1.0 largest number of data can support? (File size or number of
records)