[ https://issues.apache.org/jira/browse/HUDI-1074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17253841#comment-17253841 ]
satish commented on HUDI-1074: ------------------------------ Yes, #2263 provides sorting based on bulk insert as well. I think we can close this one as duplicate unless you have other use cases where this strategy doesnt work. > implement merge-sort based clustering strategy > ---------------------------------------------- > > Key: HUDI-1074 > URL: https://issues.apache.org/jira/browse/HUDI-1074 > Project: Apache Hudi > Issue Type: Sub-task > Reporter: satish > Assignee: liwei > Priority: Major > > implement a merge-sort based clustering algorithm. Example: i) sort all small > files by specified column(s) ii) merge N small files into M larger files by > respecting sort order (M < N) -- This message was sent by Atlassian Jira (v8.3.4#803005)