indexing multiple pages and proximity search

2008-04-18 Thread Chandan Tamrakar
Hi, I have a document, and each page of this document is extracted into a separate text file. For example, the document abc.doc has abc_page1.txt, abc_page2.txt ... abc_pageN.txt. Is it possible to index them and still retain Lucene proximity search, given that technically it is a single document?
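
One way to picture what the question asks for is a rough plain-Java sketch that does not use Lucene at all: if token positions keep counting from one page file to the next, a term at the end of abc_page1.txt and a term at the start of abc_page2.txt still count as close together. The class name PageProximity and its methods are hypothetical and exist only for this illustration. In Lucene itself, adding each page as another value of the same field on one Document is, as far as I know, the usual way to get a comparable effect, but that should be checked against the version in use.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, NOT part of Lucene: shows how running token
// positions across pages keep proximity matching intact.
class PageProximity {

    // Map each lower-cased term to the positions where it occurs.
    // Positions are numbered continuously across all pages, so page
    // boundaries do not reset the counter.
    static Map<String, List<Integer>> positions(List<String> pages) {
        Map<String, List<Integer>> index = new HashMap<>();
        int pos = 0;
        for (String page : pages) {
            for (String token : page.trim().split("\\s+")) {
                if (token.isEmpty()) {
                    continue; // skip the empty token produced by a blank page
                }
                index.computeIfAbsent(token.toLowerCase(), k -> new ArrayList<>()).add(pos);
                pos++;
            }
        }
        return index;
    }

    // True if some occurrence of a and some occurrence of b are at most
    // maxDistance positions apart, regardless of which page each is on.
    static boolean near(List<String> pages, String a, String b, int maxDistance) {
        Map<String, List<Integer>> index = positions(pages);
        List<Integer> pa = index.getOrDefault(a.toLowerCase(), Collections.emptyList());
        List<Integer> pb = index.getOrDefault(b.toLowerCase(), Collections.emptyList());
        for (int x : pa) {
            for (int y : pb) {
                if (Math.abs(x - y) <= maxDistance) {
                    return true;
                }
            }
        }
        return false;
    }
}
```

With the pages "the quick brown" and "fox jumps over", calling PageProximity.near(pages, "brown", "fox", 1) matches across the page boundary, which is the behaviour the question wants to keep.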

batch indexing

2007-04-29 Thread Chandan Tamrakar
I am trying to index a huge set of documents in batches. The batch size is a parameter of the application, say X docs; that is, it will hold X docs in RAM before I flush to the file system using the IndexWriter.addIndexes(Directory[]) method. My question is: do I need to set mergeFactor?

RE: batch indexing

2007-04-29 Thread Chandan Tamrakar
for the reader. Best, Erick. On 4/29/07, Chandan Tamrakar <[EMAIL PROTECTED]> wrote: > I am trying to index a huge documents on batches. Batch size is parameterized to the application say X docs, that means it will hold X no. of Docs in the RAM before I

drawback addindexes method

2007-05-03 Thread Chandan Tamrakar
I found that IndexWriter.addIndexes(Directory[]) always calls the optimize method twice. I am indexing documents in batches, i.e. I call this method when X documents are buffered in RAM using RAMDirectory. So as the index size grows, the optimize method will only increase my indexing time C

RE: MergeFactor advice wanted

2007-05-03 Thread Chandan Tamrakar
What if we are using the addIndexes(RAMDirectory) method? It calls the optimize function inside the method itself. Is there any solution to this? -Original Message- From: Mark Miller [mailto:[EMAIL PROTECTED] Sent: Thursday, May 03, 2007 4:03 PM To: java-user@lucene.apache.org Subject: Re: MergeFac

RE: Searching with too many clauses + Out of Memory

2007-08-01 Thread Chandan Tamrakar
What heap size are you allocating for your app? -Original Message- From: Harini Raghavan [mailto:[EMAIL PROTECTED] Sent: Wednesday, August 01, 2007 2:29 PM To: java-user@lucene.apache.org Subject: Searching with too many clauses + Out of Memory Hi Everyone, I am using Compass 1.