[ 
https://issues.apache.org/jira/browse/LUCENE-7994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16207865#comment-16207865
 ] 

Michael McCandless commented on LUCENE-7994:
--------------------------------------------

bq. Michael McCandless Issues like LUCENE-7525 would benefit from a int-int 
hashmap - is it possible for it to be added to something more common rather 
than the facets module?

Hmm, I'm not sure how we (facet module) could share this utility class with the 
analysis module?  Maybe the analysis module should depend on HPPC, or poach the 
specific hash map implementation it needs?  Though, maybe that issue should 
instead use a perfect hash?

> Use int/int hash map for int taxonomy facet counts
> --------------------------------------------------
>
>                 Key: LUCENE-7994
>                 URL: https://issues.apache.org/jira/browse/LUCENE-7994
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>             Fix For: master (8.0), 7.2
>
>         Attachments: LUCENE-7994.patch
>
>
> Int taxonomy facets today always count into a dense {{int[]}}, which is 
> wasteful in cases where the number of unique facet labels is high and the 
> size of the current result set is small.
> I factored the native hash map from LUCENE-7927 and use a simple heuristic 
> (customizable by the user by subclassing) to decide up front whether to count 
> sparse or dense.  I also made loading of the large children and siblings 
> {{int[]}} lazy, so that they are only instantiated if you really need them.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to