On Jan 26, 2007, at 2:30 PM, Otis Gospodnetic wrote:
It really all depends....right Erik?
Ha! Looks like I've earned a tag line around here, eh?! :)
On the hardware you are using, complexity of queries, query
concurrency, query latency you are willing to live with, the size
of the index, etc. A few million sounds small even for average/
cheap hw. I have several multi-million document indices that are
constantly hammered over on Simpy.com and we use Lucene at
Technorati to index "the blogosphere", so you can imagine those
numbers. To handle that much data things needs to be heavily
distributed, of course.
Admittedly I've not run indexes anywhere close to the numbers folks
have already mentioned on this thread already. I'm about to build
my largest index to date, at ~3.7M documents.
Erik
Otis
----- Original Message ----
From: Bill Taylor <[EMAIL PROTECTED]>
To: [email protected]
Sent: Friday, January 26, 2007 12:45:43 AM
Subject: How many documents in the biggest Lucene index to date?
I have used Lucene to index a small collection - only a few hundred
documents. I have a potential client who wants to index a collection
which will start at about a million documents and could easily grow
to two million.
Has anyone used Lucene with an index that large?
Thank you very much.
Bill Taylor
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]