RE: Checking for duplicates inside index

2006-05-22 Thread Eugene Tuan
I have created a method that can delete duplicate docs. Basically, during indexing, a doc is associated with an id (a term field defined by you.) that is indexed. Then, call the method to delete duplicates whenever you update index. I haven't contributed back to Lucene community yet because our

RE: Size + memory restrictions

2006-02-14 Thread Eugene Tuan
Yes. We have the same problem. It is mainly because TermInforReader.java that takes memory space to keep *.tii. Eugene -Original Message- From: Leon Chaddock [mailto:[EMAIL PROTECTED] Sent: Tuesday, February 14, 2006 8:43 AM To: java-user@lucene.apache.org Subject: Size + memory restr