Re: Getting position increments directly from the the index

2013-05-23 Thread Jack Krupansky
k Krupansky
-Original Message-
From: Michael McCandless
Sent: Thursday, May 23, 2013 10:39 AM
To: Lucene Users
Subject: Re: Getting position increments directly from the the index
On Thu, May 23, 2013 at 9:54 AM, Igor Shalyminov wrote:
But, just to clarify, is there a way to get, let

Re: Getting position increments directly from the the index

2013-05-23 Thread Michael McCandless
On Thu, May 23, 2013 at 9:54 AM, Igor Shalyminov wrote:
> But, just to clarify, is there a way to get, let's say, a vector of position
> increments directly from the index, without re-parsing document contents?
Term vectors (as Jack suggested) are one option, but they are very heavy (slows down

Re: Getting position increments directly from the the index

2013-05-23 Thread Jack Krupansky
ssage-
From: Michael McCandless
Sent: Thursday, May 23, 2013 6:28 AM
To: Lucene Users
Subject: Re: Getting position increments directly from the the index
Do you actually index the sentence boundary as a token? If so, you could just get the totalTermFreq of that token?
Mike M

Re: Getting position increments directly from the the index

2013-05-23 Thread Igor Shalyminov
upansky
>
> -Original Message-
> From: Michael McCandless
> Sent: Thursday, May 23, 2013 6:28 AM
> To: Lucene Users
> Subject: Re: Getting position increments directly from the the index
>
> Do you actually index the sentence boundary as a token? If so, you
> could j

Re: Getting position increments directly from the the index

2013-05-23 Thread Jack Krupansky
-Original Message-
From: Michael McCandless
Sent: Thursday, May 23, 2013 6:28 AM
To: Lucene Users
Subject: Re: Getting position increments directly from the the index
Do you actually index the sentence boundary as a token? If so, you could just get the totalTermFreq of that token?
Mike

Re: Getting position increments directly from the the index

2013-05-23 Thread Michael McCandless
Do you actually index the sentence boundary as a token? If so, you could just get the totalTermFreq of that token?
Mike McCandless
http://blog.mikemccandless.com
On Wed, May 22, 2013 at 10:11 AM, Igor Shalyminov wrote:
> Hello!
>
> I'm storing sentence bounds in the index as position increme
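To make the difference between the two approaches in this thread concrete, here is a minimal sketch in plain Python. It does not call Lucene: the documents are modelled as simple lists, and the marker token `<s>` and all function names are assumptions made up for illustration. It only shows why a boundary token with an aggregated total is a single lookup, while counting increments of 1000 means walking every document.

```python
from collections import Counter

SENTENCE_GAP = 1000  # position increment used in the thread to mark a sentence bound
BOUNDARY = "<s>"     # hypothetical marker token, not a Lucene built-in


def count_by_increments(docs_increments):
    """Count sentence bounds by scanning every document's position increments."""
    return sum(
        1
        for increments in docs_increments
        for inc in increments
        if inc == SENTENCE_GAP
    )


def build_term_totals(docs_tokens):
    """Accumulate total occurrences per term across all documents.

    This plays the role of an index-wide per-term total, in the spirit of
    the totalTermFreq statistic mentioned above.
    """
    totals = Counter()
    for tokens in docs_tokens:
        totals.update(tokens)
    return totals


if __name__ == "__main__":
    # Two toy documents, once as increments and once with explicit boundary tokens.
    increments = [[1, 1, SENTENCE_GAP, 1], [1, SENTENCE_GAP, 1, SENTENCE_GAP, 1]]
    tokens = [["a", "b", BOUNDARY, "c"], ["d", BOUNDARY, "e", BOUNDARY, "f"]]

    print(count_by_increments(increments))      # full scan over all documents
    print(build_term_totals(tokens)[BOUNDARY])  # single lookup once totals exist
```

Both calls report the same number of sentence bounds for these toy inputs. The second one only needs the aggregated totals, which is the point of indexing the boundary as a token.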

Getting position increments directly from the the index

2013-05-22 Thread Igor Shalyminov
Hello! I'm storing sentence bounds in the index as position increments of 1000. I want to get the total number of sentences in the index, i.e. the number of "1000" increment values. Can I do that some other way rather than just loading each document and extracting position increments with a cus