Re: TermEnum - previous() method ?

2007-07-20 Thread muraalee
er all of the terms to allow a previous though. Depending on > your RAM and index size, this could be an option. > > I do wonder why you want a previous though? It sounds like you might be > better off heading down a different path... > > - Mark > > > > mura

TermEnum - previous() method ?

2007-07-18 Thread muraalee
Hi All, I searched in this forum for anybody looking for need for previous() method in TermEnum. I found only this link http://www.nabble.com/How-to-navigate-through-indexed-terms-tf28148.html#a189225 Would it be possible to implement previous() method ? I know i am asking for quick solution here

Dictionary Type Lookup

2007-07-18 Thread muraalee
Hi, I am trying to model a Dictionary Type Search in Lucene. My approach was this - Load the dictionary file ( words & their meanings ) and index each dictionary term and associated meaning as a Lucene Document. - Use IndexReader's term method to peek at the index and get the TermEnum. TermEnum'

Lucene RAM Directory doesn't work for Index Size > 8 GB

2007-07-09 Thread muraalee
Hi, We are facing a strange problem with RAMDirectory for indices greater than 8 GB. We have indexed around 6.5 million lucene documents and the index size is around 8 GB. Below is the contents of Index Directory. 2236964197 _1x.fdt 51811488 _1x.fdx 293 _1x.fnm 2234929832 _1x.f

Scaling Lucene to 500 million+ documents - preferred architecture

2007-07-07 Thread muraalee
Hi Everybody, We are building a search infrastructure using lucene to scale upto 500 million document with search < 500 ms. Here is my rough math on the size of content & index : Total Documents = 500 million documents Size / Document = 10k / document Index Size / Million = 2 GB / million documen

Browse Functionality

2006-06-05 Thread muraalee
Hi , I am looking at exposing a "Browse" like functionality using lucene. Assume this scenario, a user Browses a Field ( Untokenized one like Book Title ) and he sees a list of "terms" [Book titles ] and their associated doc Hits ( Number of articles / documents this term appears in ). I believe t