[ https://issues.apache.org/jira/browse/HUDI-686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062370#comment-17062370 ]
Vinoth Chandar commented on HUDI-686:
-------------------------------------

I have an implementation here: https://github.com/vinothchandar/incubator-hudi/tree/hudi-686-bloomindex-v2
It seems to work functionally (tested by making the unit tests hit this implementation).

It still needs more perf testing and a productionized implementation. Locally, I am still seeing performance that is about the same as the current BloomIndex. Running on a cluster under load may yield different results.

cc [~shivnarayan] [~vbalaji]

> Implement BloomIndexV2 that does not depend on memory caching
> -------------------------------------------------------------
>
>                 Key: HUDI-686
>                 URL: https://issues.apache.org/jira/browse/HUDI-686
>             Project: Apache Hudi (incubating)
>          Issue Type: Improvement
>          Components: Index, Performance
>            Reporter: Vinoth Chandar
>            Assignee: Vinoth Chandar
>            Priority: Major
>             Fix For: 0.6.0
>
>
> The main goal here is to provide a much simpler index, without advanced
> optimizations like auto-tuned parallelism or skew handling, but with a better
> out-of-the-box experience for small workloads.