parisni commented on PR #8657: URL: https://github.com/apache/hudi/pull/8657#issuecomment-1538449686
If this is compatible with Hudi bucketing, we could provide multiple bucketing configurations and let users select the one they like. I can see several aspects that vary, such as:
- hashing
- file naming
- file numbering
- file sorting

As for file numbering, I guess simple bucket could support any scheme, but consistent hashing would only be supported by Hive 3/Spark 3, since they allow more than one file per bucket.

On May 8, 2023 11:00:03 AM UTC, Danny Chan ***@***.***> wrote:
>> * the rfc statement about support of hive bucketing https://cwiki.apache.org/confluence/display/HUDI/RFC+-+29%3A+Hash+Index
>
>Thanks for the detailed analysis, so what are the actions that we can do to make the Hive bucket table take effect on Hive/Presto? Is it as easy as switching to a different hashing algorithm?
>
>--
>Reply to this email directly or view it on GitHub:
>https://github.com/apache/hudi/pull/8657#issuecomment-1538176946