Take a look at the Term Vectors Component:
http://wiki.apache.org/solr/TermVectorComponent
-- Jack Krupansky
-----Original Message-----
From: Igor Shalyminov
Sent: Thursday, May 23, 2013 9:54 AM
To: java-user@lucene.apache.org
Subject: Re: Getting position increments directly from the the index
Thanks, Mike and Jack!
Those are really good options.
But, just to clarify, is there a way to get, let's say, a vector of position
increments directly from the index, without re-parsing document contents?
--
Best Regards,
Igor
23.05.2013, 16:13, "Jack Krupansky" <j...@basetechnology.com>:
It might be nice to inquire as to the largest position for a field in a
document. Is that information kept anywhere? Not that I know of, although
I
suppose it can be calculated at runtime by running though all the terms of
the field. Then he could just divide by 1000.
-- Jack Krupansky
-----Original Message-----
From: Michael McCandless
Sent: Thursday, May 23, 2013 6:28 AM
To: Lucene Users
Subject: Re: Getting position increments directly from the the index
Do you actually index the sentence boundary as a token? If so, you
could just get the totalTermFreq of that token?
Mike McCandless
http://blog.mikemccandless.com
On Wed, May 22, 2013 at 10:11 AM, Igor Shalyminov
<ishalymi...@yandex-team.ru> wrote:
Hello!
I'm storing sentence bounds in the index as position increments of 1000.
I want to get the total number of sentences in the index, i. e. the
number
of "1000" increment values.
Can I do that some other way rather than just loading each document and
extracting position increments with a custom Analyzer?
--
Best Regards,
Igor Shalyminov
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org