Hi Hudi Community, Hudi has several indices to help lookup records. The most commonly used one is the BloomFilter based index. This index today works by loading the bloom filter from all the data files of interested partitions. This is a time consuming operation. Better would be if can leverage the metadata table infrastructure of the Hudi tables. That is, if all the bloom filters can be loaded directly from a single metadata table partition, it would greatly speed up the entire record key lookup process.
Let me know your thoughts on this high level idea. Planning to start a RFC on this and I can share more details on the design and implementation. Regards, Manoj