Take a look at the Term Vectors Component:
http://wiki.apache.org/solr/TermVectorComponent

-- Jack Krupansky

-----Original Message----- From: Igor Shalyminov
Sent: Thursday, May 23, 2013 9:54 AM
To: java-user@lucene.apache.org
Subject: Re: Getting position increments directly from the index

Thanks, Mike and Jack!

Those are really good options.
But, just to clarify, is there a way to get, let's say, a vector of position increments directly from the index, without re-parsing document contents?

--
Best Regards,
Igor

23.05.2013, 16:13, "Jack Krupansky" <j...@basetechnology.com>:
It might be nice to inquire as to the largest position for a field in a
document. Is that information kept anywhere? Not that I know of, although I
suppose it can be calculated at runtime by running through all the terms of
the field. Then he could just divide by 1000.

-- Jack Krupansky
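
A rough illustration of the calculation described above. The index API exposes
absolute positions rather than increments, so increments have to be recovered
as differences between consecutive positions. This plain-Java sketch assumes
the positions of one field in one document have already been collected into an
int[] (for example by running through all the terms of the field); it calls no
Lucene API, and the class and method names are hypothetical. It further assumes
that positions start from -1 before the first token and that a sentence bound
is any increment of at least 1000.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Lucene: works on positions already read from the index.
final class SentenceBoundaries {

    /** Recovers position increments from the absolute positions of one field in one document. */
    static int[] toIncrements(int[] positions) {
        int[] sorted = positions.clone();
        Arrays.sort(sorted);               // positions gathered term by term arrive unordered
        int[] increments = new int[sorted.length];
        int previous = -1;                 // assumption: a first token with increment 1 lands on position 0
        for (int i = 0; i < sorted.length; i++) {
            increments[i] = sorted[i] - previous;
            previous = sorted[i];
        }
        return increments;
    }

    /** Counts increments that are at least {@code gap}, i.e. the sentence bounds. */
    static int countBoundaries(int[] positions, int gap) {
        int count = 0;
        for (int increment : toIncrements(positions)) {
            if (increment >= gap) {
                count++;
            }
        }
        return count;
    }

    /** The "largest position divided by 1000" shortcut; only an estimate. */
    static int estimateFromMaxPosition(int[] positions, int gap) {
        int max = -1;
        for (int position : positions) {
            max = Math.max(max, position);
        }
        return max < 0 ? 0 : max / gap;
    }
}
```

The exact count and the shortcut agree only while the ordinary one-step
increments of a document add up to less than 1000; for longer documents the
division overcounts, so the difference-based count is the safer of the two.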

-----Original Message-----
From: Michael McCandless
Sent: Thursday, May 23, 2013 6:28 AM
To: Lucene Users
Subject: Re: Getting position increments directly from the index

Do you actually index the sentence boundary as a token?  If so, you
could just get the totalTermFreq of that token?

Mike McCandless

http://blog.mikemccandless.com

On Wed, May 22, 2013 at 10:11 AM, Igor Shalyminov
<ishalymi...@yandex-team.ru> wrote:

 Hello!

 I'm storing sentence bounds in the index as position increments of 1000.
 I want to get the total number of sentences in the index, i.e. the number
 of "1000" increment values.
 Can I do that some other way rather than just loading each document and
 extracting position increments with a custom Analyzer?

 --
 Best Regards,
 Igor Shalyminov

 ---------------------------------------------------------------------
 To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
 For additional commands, e-mail: java-user-h...@lucene.apache.org
