I know the index size is very dependent on the content being indexed,
but running on a Unix-based machine without a file size limit, in the
best-case scenario, what is the largest number of documents that can be
indexed?
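I've seen mentions throughout the list of millions of documents: 8
million, 20 million, etc. But can Lucene potentially handle billions of
documents and still search through them efficiently?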
Given adequate hardware, it can. Take a look at nutch.org. Nutch uses
Lucene at its core.
Otis
Chris Fraschetti wrote:
I've seen mentions throughout the list of millions of documents: 8
million, 20 million, etc. But can Lucene potentially handle billions of
documents and still search through them efficiently?
Lucene can currently handle up to 2^31 documents in a single index. To
a
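
To put the 2^31 figure quoted above in perspective, here is a rough arithmetic sketch in Python (not Lucene code). The name `indexes_needed` is hypothetical, and the idea of spreading a collection over several separate indexes is an assumption made only for illustration. It says nothing about search speed, only how many indexes a given document count would need at a minimum if each index tops out at 2^31 documents.

```python
# Per-index ceiling as quoted in the thread: 2^31 documents.
MAX_DOCS_PER_INDEX = 2 ** 31


def indexes_needed(total_docs: int, per_index_limit: int = MAX_DOCS_PER_INDEX) -> int:
    """Return the minimum number of separate indexes needed to hold total_docs.

    Hypothetical helper for illustration only; it does plain ceiling division
    and does not model hardware, index size on disk, or query performance.
    """
    if total_docs < 0:
        raise ValueError("total_docs must be non-negative")
    if per_index_limit <= 0:
        raise ValueError("per_index_limit must be positive")
    # Ceiling division using integers only, so large counts stay exact.
    return -(-total_docs // per_index_limit)


if __name__ == "__main__":
    for docs in (8_000_000, 20_000_000, 10_000_000_000):
        print(f"{docs:>14,} documents -> {indexes_needed(docs)} index(es)")
```

By this count, the 8 million and 20 million document collections mentioned on the list fit in one index with plenty of room, while a 10 billion document collection would need at least five.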