[ https://issues.apache.org/jira/browse/LUCENE-10062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17451314#comment-17451314 ]

Greg Miller commented on LUCENE-10062:
--------------------------------------

Just as a follow-up, nightly benchmarks are showing the impact of this change. 
For example:
 # [https://home.apache.org/~mikemccand/lucenebench/BrowseDayOfYearTaxoFacets.html]
 # [https://home.apache.org/~mikemccand/lucenebench/BrowseMonthTaxoFacets.html]
 # [https://home.apache.org/~mikemccand/lucenebench/BrowseDateTaxoFacets.html]

 

> Explore using SORTED_NUMERIC doc values to encode taxonomy ordinals for 
> faceting
> --------------------------------------------------------------------------------
>
>                 Key: LUCENE-10062
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10062
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: modules/facet
>            Reporter: Greg Miller
>            Assignee: Greg Miller
>            Priority: Minor
>             Fix For: 9.0, 10.0 (main)
>
>          Time Spent: 15h 20m
>  Remaining Estimate: 0h
>
> We currently encode taxonomy ordinals using varint-style packing in a binary 
> doc values field. I suspect there have been a number of improvements to 
> SortedNumericDocValues since taxonomy faceting was first introduced, and I 
> plan to explore replacing the custom binary format we have today with a 
> SORTED_NUMERIC doc values field instead.
> I'll report benchmark results and index size impact here.
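
For readers unfamiliar with the kind of encoding the description refers to, here is a minimal, self-contained sketch of varint-style packing of ordinals into a byte array. It is an illustration only and is not taken from the Lucene source: the class name VarintOrdinals and its methods are hypothetical, and the delta step (storing each ordinal as the difference from the previous one) is an assumption about how such a format would typically be made compact.

```java
import java.io.ByteArrayOutputStream;
import java.util.Arrays;

// Hypothetical illustration; not Lucene's actual implementation.
class VarintOrdinals {

    /** Packs ascending, non-negative ordinals as deltas, 7 payload bits per byte. */
    static byte[] encode(int[] sortedOrdinals) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        int previous = 0;
        for (int ordinal : sortedOrdinals) {
            int delta = ordinal - previous; // assumption: input is sorted ascending
            // Emit the low 7 bits with the high bit set while more bits remain.
            while ((delta & ~0x7F) != 0) {
                out.write((delta & 0x7F) | 0x80);
                delta >>>= 7;
            }
            out.write(delta); // final byte has the high bit clear
            previous = ordinal;
        }
        return out.toByteArray();
    }

    /** Reverses encode(): reads varints and accumulates the deltas. */
    static int[] decode(byte[] bytes) {
        int[] ordinals = new int[bytes.length]; // each value takes at least one byte
        int count = 0;
        int previous = 0;
        int value = 0;
        int shift = 0;
        for (byte b : bytes) {
            value |= (b & 0x7F) << shift;
            if ((b & 0x80) != 0) {
                shift += 7; // continuation bit set: more bytes follow
            } else {
                previous += value; // undo the delta step
                ordinals[count++] = previous;
                value = 0;
                shift = 0;
            }
        }
        return Arrays.copyOf(ordinals, count);
    }

    public static void main(String[] args) {
        int[] ordinals = {3, 7, 207, 100000};
        byte[] packed = encode(ordinals);
        System.out.println(packed.length + " bytes: " + Arrays.toString(decode(packed)));
    }
}
```

With a SORTED_NUMERIC field, by contrast, each ordinal would be indexed as its own numeric value (for example through SortedNumericDocValuesField), leaving compression to the doc values codec rather than to custom packing code like the above.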



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
