indexing multiple pages and proximity search

2008-04-18 Thread Chandan Tamrakar
Hi, I have a document and each page of this document is extracted into single text files For ex, document abc.doc have abc_page1.txt , abc_page2.txt ... abc_pageN.txt , is it possible to index them and still retain the Lucene proximity search because technically it is a single document

Re: indexing multiple pages and proximity search

2008-04-18 Thread Erick Erickson
I think they'd have to be in the same document. So you could read all the pages and add it to, say, a field named "text" and add the doc and all the pages at once in a single document. Be aware that by default Lucene only index the first 10,000 terms, but that can be changed with IndexWriter.setMa