On Jan 26, 2007, at 2:30 PM, Otis Gospodnetic wrote:
It really all depends....right Erik?
Ha! Looks like I've earned a tag line around here, eh?! :)
On the hardware you are using, complexity of queries, query concurrency, query latency you are willing to live with, the size of the index, etc. A few million sounds small even for average/ cheap hw. I have several multi-million document indices that are constantly hammered over on Simpy.com and we use Lucene at Technorati to index "the blogosphere", so you can imagine those numbers. To handle that much data things needs to be heavily distributed, of course.
Admittedly I've not run indexes anywhere close to the numbers folks have already mentioned on this thread already. I'm about to build my largest index to date, at ~3.7M documents.
Erik
Otis ----- Original Message ---- From: Bill Taylor <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Friday, January 26, 2007 12:45:43 AM Subject: How many documents in the biggest Lucene index to date? I have used Lucene to index a small collection - only a few hundred documents. I have a potential client who wants to index a collection which will start at about a million documents and could easily grow to two million. Has anyone used Lucene with an index that large? Thank you very much. Bill Taylor --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]