parisni commented on PR #8657:
URL: https://github.com/apache/hudi/pull/8657#issuecomment-1538449686

   If compatible with hudi bucketing, we could provide multiple configuration 
for the bucketing up to the user to select the one they dlike. I can see 
several aspect that vary such :
   - hashing
   - file naming
   - file numbering
   - file sorting
   
   As for file numbering I guess simple bucket could support any but consistent 
hashing would only be supported by hive3/spark3 since they allow more than one 
file per bucket
   
   On May 8, 2023 11:00:03 AM UTC, Danny Chan ***@***.***> wrote:
   >> * the rfc statement about support of hive bucketing 
https://cwiki.apache.org/confluence/display/HUDI/RFC+-+29%3A+Hash+Index
   >
   >Thanks for the detailed analysis, so what the actions that we can do to 
make the Hive bucket table take effect on Hive/Presto? Is it as easy as 
switching to a different hashing algorithm?
   >
   >-- 
   >Reply to this email directly or view it on GitHub:
   >https://github.com/apache/hudi/pull/8657#issuecomment-1538176946
   >You are receiving this because you authored the thread.
   >
   >Message ID: ***@***.***>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to