[ https://issues.apache.org/jira/browse/LUCENE-9969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17419185#comment-17419185 ]
Greg Miller commented on LUCENE-9969: ------------------------------------- [~zhai7631] I don't think there's any good reason for doing this other than historical/legacy. It looks like that [code|https://github.com/apache/lucene/blob/main/lucene/facet/src/java/org/apache/lucene/facet/taxonomy/directory/TaxonomyIndexArrays.java#L127] dates back to 2012, and I suspect index format options were quite different then. +1 to experimenting with {{NumericDocValues}} for this purpose, especially since they should be dense, with no need to encode the docs-with-values, etc. Want to spin off a separate issue to explore this? > DirectoryTaxonomyReader.taxoArray占用内存较大导致系统OOM宕机 > ------------------------------------------------ > > Key: LUCENE-9969 > URL: https://issues.apache.org/jira/browse/LUCENE-9969 > Project: Lucene - Core > Issue Type: Improvement > Components: modules/facet > Affects Versions: 6.6.2 > Reporter: FengFeng Cheng > Priority: Trivial > Attachments: image-2021-05-24-13-43-43-289.png > > Time Spent: 1h 10m > Remaining Estimate: 0h > > 首先数据量很大,jvm内存为90G,但是TaxonomyIndexArrays几乎占走了一半 > !image-2021-05-24-13-43-43-289.png! > 请问对于TaxonomyReader是否有更好的使用方式或者其他的优化? -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org