[ https://issues.apache.org/jira/browse/LUCENE-10062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17451314#comment-17451314 ]

Greg Miller commented on LUCENE-10062:
--------------------------------------

Just as a follow-up, nightly benchmarks are showing the impact of this change. For example:
# [https://home.apache.org/~mikemccand/lucenebench/BrowseDayOfYearTaxoFacets.html]
# [https://home.apache.org/~mikemccand/lucenebench/BrowseMonthTaxoFacets.html]
# [https://home.apache.org/~mikemccand/lucenebench/BrowseDateTaxoFacets.html]

> Explore using SORTED_NUMERIC doc values to encode taxonomy ordinals for faceting
> --------------------------------------------------------------------------------
>
>                 Key: LUCENE-10062
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10062
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: modules/facet
>            Reporter: Greg Miller
>            Assignee: Greg Miller
>            Priority: Minor
>             Fix For: 9.0, 10.0 (main)
>
>          Time Spent: 15h 20m
>  Remaining Estimate: 0h
>
> We currently encode taxonomy ordinals using varint-style packing in a binary
> doc values field. I suspect there have been a number of improvements to
> SortedNumericDocValues since taxonomy faceting was first introduced, and I
> plan to explore replacing the custom binary format we have today with a
> SORTED_NUMERIC type dv field instead.
> I'll report benchmark results and index size impact here.
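
For readers unfamiliar with the "varint style packing" mentioned in the issue description, here is a minimal sketch of the general technique: sort and de-duplicate a document's ordinals, then write the gap between consecutive ordinals as a variable-length integer (7 payload bits per byte, high bit set when more bytes follow). The class and method names (OrdinalVarintSketch, encode, decode) are hypothetical and use only the JDK. This illustrates the idea under the assumption of non-negative ordinals; it is not taken from the Lucene source and may differ from the exact byte layout of the facet module.

```java
import java.io.ByteArrayOutputStream;
import java.util.Arrays;

// Hypothetical illustration only; not Lucene's actual encoder.
final class OrdinalVarintSketch {

    /** Sorts and de-duplicates the ordinals, then writes each gap as a varint. */
    public static byte[] encode(int[] ordinals) {
        int[] sorted = Arrays.stream(ordinals).sorted().distinct().toArray();
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        int previous = 0;
        for (int ord : sorted) {
            int delta = ord - previous; // gap from the previous ordinal
            while ((delta & ~0x7F) != 0) {
                out.write((delta & 0x7F) | 0x80); // low 7 bits, continuation bit set
                delta >>>= 7;
            }
            out.write(delta); // final byte, continuation bit clear
            previous = ord;
        }
        return out.toByteArray();
    }

    /** Reverses encode(): reads varint gaps and accumulates them into ordinals. */
    public static int[] decode(byte[] bytes) {
        int[] result = new int[bytes.length]; // each ordinal uses at least one byte
        int count = 0;
        int previous = 0;
        int pos = 0;
        while (pos < bytes.length) {
            int delta = 0;
            int shift = 0;
            byte b;
            do {
                b = bytes[pos++];
                delta |= (b & 0x7F) << shift;
                shift += 7;
            } while ((b & 0x80) != 0);
            previous += delta;
            result[count++] = previous;
        }
        return Arrays.copyOf(result, count);
    }

    public static void main(String[] args) {
        byte[] packed = encode(new int[] {300, 5, 5, 1});
        // Gaps are 1, 4 and 295; 295 needs two bytes, so 4 bytes in total.
        System.out.println(packed.length + " bytes");
        System.out.println(Arrays.toString(decode(packed))); // [1, 5, 300]
    }
}
```

A consumer of such a field has to run this decode loop for every document it visits. A SORTED_NUMERIC doc values field instead stores the values as numbers and leaves the on-disk encoding to the doc values codec, which is the replacement the issue proposes to explore.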