distinct field values

2008-10-14 Thread Akanksha Baid
I have indexed multiple documents - each of them have 3 fields ( id, tag , text). Is there an easy way to determine the set of tags for a given query without iterating through all the hits? For example if I have 100 documents in my index and my set of tag = {A, B, C}. Query Q on the text field

Re: distinct field values

2008-10-14 Thread Akanksha Baid
here belong to everybody, the opinions to me. The distinction is yours to draw On Tue, Oct 14, 2008 at 12:53 PM, Akanksha Baid [EMAIL PROTECTED] wrote: I have indexed multiple documents - each of them have 3 fields ( id, tag , text). Is there an easy way to determine the set of tags

frequent phrases

2007-08-09 Thread Akanksha Baid
I was wondering if there is a search based method to find the top-k frequent phrases in a set of documents.( I do not have a particular phrase in mind so PhraseQuery can probably be ruled out). I have implemented something that works using termvectors and termpositions but the performance is not

Re: TermFreqVector

2007-07-19 Thread Akanksha Baid
hits.id() should work. karl wettin wrote: 19 jul 2007 kl. 22.58 skrev Kevin Chen: doc = hits.doc(0); TermFreqVector vector = reader.getTermFreqVector(docId, field); How do I get docId? If you use Hits, it is hits.doc()

lucene version?

2007-07-18 Thread Akanksha Baid
Is there a way to test as to which version of Lucene was used to build an index? -Akanksha - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: MoreLikeThis

2007-07-18 Thread Akanksha Baid
Right , I was making a silly mistake there. I have it working now. Thanks for the reply. yu wrote: You can put lucene-queries-2.2.0.jar on your class path or your Eclipse project build path. That's all you need. Jay Akanksha Baid wrote: I am using Lucene 2.1.0 and want to use MoreLikeThis

spannearquery help

2007-06-29 Thread Akanksha Baid
I have two strings - String1 contains multiple words String2 contains just 1 word I need to search my index to find hits where String1 and String2 occur within a distance slop = d of each other. Order is important. Also, ideally I would like to do a fuzzy search on String1. Is there some way