Re: Getting list of committed documents

2016-11-13 Thread lukes
Thanks, Mike. Yes, I saw the changelist you mentioned. Unfortunately, I can't
upgrade to 6.2 because of stack limitations :(

Regards.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Getting-list-of-committed-documents-tp4305258p4305728.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.

-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org



Re: Faceting : what are the limitations of Taxonomy (Separate index and hierarchical facets) and SortedSetDocValuesFacetField ( flat facets and no sidecar index) ?

2016-11-13 Thread Chitra R
 Hey, thank you so much for the fast response. I agree that the NRT refresh
is a somewhat costly operation, and that this is the major pitfall if we use
doc values faceting.


 While indexing a SortedSetDocValuesFacetField, it stores the path and
dimension of the given field internally. So can we achieve hierarchical
facets using DrillDownQuery? I assume the purpose of storing the path and
dimension is to support hierarchical facets. If so (i.e., if we can achieve
a hierarchy with SSDVFF), what is the need to move to taxonomy facets?
 Or have I missed something?


 What is the real purpose of storing the path and dimension in an
SSDVFF?
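
For readers following the flat-versus-hierarchical distinction raised here, a toy sketch in plain Java may help. It does not use Lucene and makes no claim about how Lucene stores facets internally: the class FacetPathSketch, its methods, and the '/' separator are all hypothetical, chosen only for illustration. It shows that a flat count treats each full value as one bucket, while a hierarchical count also rolls every value up into its ancestor prefixes.

```java
import java.util.Map;
import java.util.TreeMap;

/** Toy model only: not Lucene code and not Lucene's storage format. */
class FacetPathSketch {
    // Assumed separator, picked for readability in this sketch.
    static final char DELIM = '/';

    /** Joins a dimension and at least one path component into one string. */
    static String encode(String dim, String... path) {
        return dim + DELIM + String.join(String.valueOf(DELIM), path);
    }

    /** Flat counting: each full encoded value is its own bucket. */
    static Map<String, Integer> flatCounts(String[] encoded) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String value : encoded) {
            counts.merge(value, 1, Integer::sum);
        }
        return counts;
    }

    /** Hierarchical counting: every ancestor prefix is a bucket too. */
    static Map<String, Integer> hierarchicalCounts(String[] encoded) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String value : encoded) {
            int idx = value.indexOf(DELIM);
            while (idx != -1) {
                // Count the prefix that ends just before this separator.
                counts.merge(value.substring(0, idx), 1, Integer::sum);
                idx = value.indexOf(DELIM, idx + 1);
            }
            counts.merge(value, 1, Integer::sum); // the full path itself
        }
        return counts;
    }

    public static void main(String[] args) {
        String[] docs = {
            encode("Date", "2016", "11"),
            encode("Date", "2016", "10"),
            encode("Date", "2015", "12")
        };
        System.out.println(flatCounts(docs));
        System.out.println(hierarchicalCounts(docs));
    }
}
```

In this model the flat view has three buckets of one document each, while the hierarchical view additionally reports 3 for "Date" and 2 for "Date/2016". Whether SSDV facets can offer that roll-up is exactly the limitation discussed in the quoted reply below.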


Kindly post your suggestions.

Regards,
Chitra



On Sat, Nov 12, 2016 at 4:03 AM, Michael McCandless <
luc...@mikemccandless.com> wrote:

> On Fri, Nov 11, 2016 at 5:21 AM, Chitra R  wrote:
>
> > i) Hope, when opening SortedSetDocValuesReaderState, we are
> > calculating ordinals (this will be used to calculate facet count) for
> > doc values field and this only made the state instance somewhat costly.
> > Am I right or any other reason behind that?
>
> That's correct.  It adds some latency to an NRT refresh, and some heap
> used to hold the ordinal mappings.
>
> >  ii) During indexing, we are providing facet ordinals in each doc
> > and I think it will be useful in search side, to calculate facet counts
> > only for matching docs.  otherwise, it carries any other benefits?
>
> Well, compared to the taxonomy facets, SSDV facets don't require a
> separate index.
>
> But they add latency/heap usage, and they cannot do hierarchical
> facets yet (though this could be fixed if someone just built it).
>
> >  iii) Is SortedSetDocValuesReaderState thread-safe (ie) multiple
> > threads can call this method concurrently?
>
> Yes.
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>


Re: Getting list of committed documents

2016-11-13 Thread Michael McCandless
Hi lukes,

Sorry, this was a recent change in Lucene:
https://issues.apache.org/jira/browse/LUCENE-7302

You need to upgrade to at least 6.2 to see it.

And the long value that is returned is just an incrementing number,
incremented for every op (add, update, delete) that changes the index.
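
As a rough illustration of the "incrementing number per op" idea, here is a toy model in plain Java. It is not Lucene code: the class SeqNoSketch and all of its methods are hypothetical, and the rule that a commit covers every operation whose number is less than or equal to the commit's number is an assumption of this sketch, so check it against the IndexWriter Javadoc for your version.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Toy model only: mimics one incrementing number per index-changing op. */
class SeqNoSketch {
    private long lastSeqNo = 0; // last sequence number handed out
    // doc id -> sequence number of the latest op that touched it
    private final Map<String, Long> latestOp = new LinkedHashMap<>();

    synchronized long add(String id)    { return record(id); }
    synchronized long update(String id) { return record(id); }
    synchronized long delete(String id) { return record(id); }

    // Every add, update, or delete bumps the same counter.
    private long record(String id) {
        latestOp.put(id, ++lastSeqNo);
        return lastSeqNo;
    }

    /** Assumption of this sketch: commit reports the last op it includes. */
    synchronized long commit() {
        return lastSeqNo;
    }

    /** Ids whose latest op has a number <= the given commit number. */
    synchronized List<String> coveredBy(long commitSeqNo) {
        List<String> ids = new ArrayList<>();
        for (Map.Entry<String, Long> e : latestOp.entrySet()) {
            if (e.getValue() <= commitSeqNo) {
                ids.add(e.getKey());
            }
        }
        return ids;
    }

    public static void main(String[] args) {
        SeqNoSketch writer = new SeqNoSketch();
        writer.add("doc-1");
        writer.update("doc-1");
        writer.add("doc-2");
        long committed = writer.commit();
        writer.delete("doc-3"); // happens after the commit
        System.out.println(committed);                   // prints 3
        System.out.println(writer.coveredBy(committed)); // prints [doc-1, doc-2]
    }
}
```

The number is cumulative across all operations rather than a per-commit document count, so a caller who remembers the number returned for each operation can compare it with the commit's number to tell which changes a given commit covers.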

Mike McCandless

http://blog.mikemccandless.com


On Sat, Nov 12, 2016 at 3:40 PM, lukes  wrote:
> Hi Michael,
>
>   Thanks for the reply. Regarding IW (IndexWriter) returning a long sequence
> number, I looked at the signature of commit and it seems to be void. Can you
> please point me in the right direction? I am using Lucene 5.5.2. Also, is this
> number an aggregate of deletes, updates and new documents? Is the count
> cumulative over time, or is it the number of documents that made it into
> that commit only? Once you point me to it, I can look into it in more detail.
>
> Thanks a lot.
>
> Regards.
