Re: Optimize for large index size

2008-01-18 Thread vivek sar
Thanks Michael for the feedback. Couple more questions, 1) Doesn't Lucene do some sort of optimization internally based on mergefactor, i.e, if the number of segments grow over the mergefactor number Lucene would automatically merge them into one segment - is this different than optimization? Does

Re: index update problems with Linux

2008-01-18 Thread 游泳池的鱼
it seems that close reader not release write lock, test lock release before you add document 2008/1/18, Kevin Dewi <[EMAIL PROTECTED]>: > > Hello, > > I have a problem with this code (updating a lucene index by delete and > adding documents): > > > IndexReader reader = IndexReader.open(directory)

Lucene Performance

2008-01-18 Thread Thibaut Britz
Hi, We have an index of about 9 gigabytes here at work, where a few queries take a very long time to succeed. What I noticed is, that we have a large of number of multiple value fields (50). How does lucene scale with queries going over a large amount of fields? Is it better to use a keyword for

Re: Highlighting with wildcards?

2008-01-18 Thread John Byrne
I think the way to do this is to run the 'rewrite()' method on the wilcard query; this turns it into a boolean collection of term queries, with a term for each match for the wildcard. That way, you're just highlighting a normal term query. I think that would also work for fuzzy queries. Hope th

Highlighting with wildcards?

2008-01-18 Thread Michael Prichard
I have highlighter working out of the box but now I need to highlight wildcards (if they are there). I would imagine someone has done this on this group :) Any suggestions? Thanks! Michael - To unsubscribe, e-mail: [EMAIL

log4j error

2008-01-18 Thread testn
Since I upgraded to Lucene 2.3, I started to see some error message coming from log4j via Lucene. Has any one ever experienced this? Is this classloading issue? javax.ejb.EJBException: EJB Exception: : java.lang.IllegalStateException: Current state = FLUSHED, new state = CODING at java.ni

delete a document from indexwriter

2008-01-18 Thread Cam Bazz
Hello, How do I delete a specific document from an indexwriter? I understand there is deleteDocuments(term) which deletes all the documents matching the term. But what if I want to delete a document that has more then one term in specific. I can search the document with a boolean query, and then g

Re: Update of Lucene in Action ?

2008-01-18 Thread Erik Hatcher
On Jan 18, 2008, at 7:16 AM, thrgroovyboy wrote: Is the book "Lucene In Action" updated ? Or is it the same version based on lucene 1.4 ? The first, and currently only, edition is based on Lucene 1.4.3, and all code works with Lucene 1.9 as well. Lucene 2.0+ changed some API, but it is ea

Update of Lucene in Action ?

2008-01-18 Thread thrgroovyboy
Hi, Is the book "Lucene In Action" updated ? Or is it the same version based on lucene 1.4 ? Thanks a lot -- View this message in context: http://www.nabble.com/Update-of-Lucene-in-Action---tp14949157p14949157.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. ---

index update problems with Linux

2008-01-18 Thread Kevin Dewi
Hello, I have a problem with this code (updating a lucene index by delete and adding documents): IndexReader reader = IndexReader.open(directory); while (i.hasNext()) { reader.deleteDocuments(i.next()); } reader.close(); ... IndexWriter writer = new IndexWriter(directory,

Re: Optimize for large index size

2008-01-18 Thread Michael McCandless
vivek sar wrote: Hi, We are using Lucene 2.2. We have an index of size 70G (within 3-4 days) and growing. We run optimize pretty frequently (once every hour - due to large number of index updates every min - can be up to 100K new documents every min). I have seen every now and then the optim

Optimize for large index size

2008-01-18 Thread vivek sar
Hi, We are using Lucene 2.2. We have an index of size 70G (within 3-4 days) and growing. We run optimize pretty frequently (once every hour - due to large number of index updates every min - can be up to 100K new documents every min). I have seen every now and then the optimize takes 3-4 hours t