Simple answer is billions, perhaps tens to hundreds of billions of records, as it leverages Hadoop. Yahoo is currently using Hadoop to create its web index. But as Otis pointed out, Hadoop is parallel processing and as such is completely dependent on amount of hardware.

Dennis

Polsnet wrote:
Nutch 1.0 largest number of data can support? (File size or number of
records)

Reply via email to