sivabalan narayanan created HUDI-2703:
-----------------------------------------

             Summary: [RFC-37] Metadata based bloom index
                 Key: HUDI-2703
                 URL: https://issues.apache.org/jira/browse/HUDI-2703
             Project: Apache Hudi
          Issue Type: Improvement
            Reporter: sivabalan narayanan
             Fix For: 0.10.0


Hudi has indices to assit in tagging incoming records. Most commonly used one 
is Bloom index. This involves looking up (loading) bloom from data files which 
could be time consuming and could have throttling impact in cloud stores like 
S3. So, proposing this RFC to add bloom as a special partition in metadata 
table and implement an index based on that. 

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to