Hello,
 I have a question about the scalability of the Lucene.
 I'm a lucene beginner and I would like to use it to index several
 million measurements (400 Millions). A measurement has a type,
 owner, id, precision and data.
 As an experiment, I tried to insert 5M values into a lucene index using
 compound index with merge factor 100,000.
 For searching I have two problems :

 1. The search time for simple queries such as precision:\+0002 is really
  high (4-10 seconds). I want to know if this search time is normal
considering the amount of data I inserted to the lucene (5 Million
values)?
 If not, how can I improve it. I'm sure I can improve it by upgrading
 my current box (1G memory and 3.2 Ghz CPU with 2 MB cache).
 I'm looking for software/configuration solutions ?

 2. The search gives TooManyClauses exception when I'm searching for a
 data item with the queries similar to the one below :

 precision:\+0002 AND data:\+0.85*

 I guess this a bug ?!

Thanks for your help,
Ali Salehi



**************************************************************
Ali Salehi, LSIR - Distributed Information Systems Laboratory
EPFL-IC-IIF-LSIR, Bâtiment BC, Station 14, CH-1015 Lausanne, Switzerland.
http://lsirwww.epfl.ch/
email: [EMAIL PROTECTED]
Tel: +41-21-6936656 Fax: +41-21-6938115


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to