Extract terms not by reader, but by documents

2007-09-04 Thread Rafael Rossini
Hi all, In some custom highlighting, I often write a code like this: Set matchedTerms = new HashSet(); query.rewrite(reader).extractTerms(matchedTerms); With this code the Term Set gets populated by the matched query in your whole index. Is it possible to this with a docume

Re: Extract terms not by reader, but by documents

2007-09-04 Thread Grant Ingersoll
Not sure if I am understanding what you are trying to do. I think you are trying to find out which terms occurred in a particular document, correct? I also am not sure about your first example. My understanding of extractTerms is that it just gives you back the set of all terms that occ

Re: Extract terms not by reader, but by documents

2007-09-05 Thread Rafael Rossini
Thank´s for the reply Grant, let me try to explain exactly what I´d like to do. Take the 2 docs: Doc1: "Microsoft is a nice software company, and Xbox seems to be a nice product too." Doc2: "Nintendo and Sony have been in the game industry for a long time, but now, Microsoft is trying to enter wit

Re: Extract terms not by reader, but by documents

2007-09-05 Thread Karl Wettin
Rafael, are you looking for IndexReader.getTermFreqVector? -- karl 5 sep 2007 kl. 16.48 skrev Rafael Rossini: Thank´s for the reply Grant, let me try to explain exactly what I´d like to do. Take the 2 docs: Doc1: "Microsoft is a nice software company, and Xbox seems to be a nice product

Re: Extract terms not by reader, but by documents

2007-09-05 Thread Grant Ingersoll
On Sep 5, 2007, at 10:48 AM, Rafael Rossini wrote: Thank´s for the reply Grant, let me try to explain exactly what I´d like to do. Take the 2 docs: Doc1: "Microsoft is a nice software company, and Xbox seems to be a nice product too." Doc2: "Nintendo and Sony have been in the game industr

Re: Extract terms not by reader, but by documents

2007-09-06 Thread Rafael Rossini
Karl, I´m aware of IndexReader.getTermFreqVector, with this I can get all terms of a document, but I want all terms of a document that matched a query. Grant, >Yes, I think I understand. You want to know what terms from your >query matched in a given document. Yep, that´s what I want. In the co

Re: Extract terms not by reader, but by documents

2007-09-06 Thread Grant Ingersoll
On Sep 6, 2007, at 1:32 PM, Rafael Rossini wrote: Karl, I´m aware of IndexReader.getTermFreqVector, with this I can get all terms of a document, but I want all terms of a document that matched a query. Grant, Yes, I think I understand. You want to know what terms from your query matched i

Re: Extract terms not by reader, but by documents

2007-09-06 Thread Mike Klaas
On 6-Sep-07, at 11:48 AM, Grant Ingersoll wrote: On Sep 6, 2007, at 1:32 PM, Rafael Rossini wrote: Karl, I´m aware of IndexReader.getTermFreqVector, with this I can get all terms of a document, but I want all terms of a document that matched a query. Grant, Yes, I think I understand.