satish created HUDI-1074:
----------------------------

             Summary: implement merge-sort based clustering strategy
                 Key: HUDI-1074
                 URL: https://issues.apache.org/jira/browse/HUDI-1074
             Project: Apache Hudi
          Issue Type: Sub-task
            Reporter: satish

Implement a merge-sort based clustering algorithm. Example:
  i) Sort all small files by the specified column(s).
  ii) Merge the N sorted small files into M larger files (M < N), preserving the sort order.
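
The two steps in the example can be sketched as a k-way merge. This is only an illustration of the idea under stated assumptions, not Hudi's implementation: files are modeled as in-memory lists of dict records, and the function name `cluster_merge_sort` and the parameter `target_records_per_file` are hypothetical names introduced here.

```python
import heapq
from operator import itemgetter
from typing import Dict, List, Sequence

Record = Dict[str, object]


def cluster_merge_sort(
    small_files: Sequence[List[Record]],
    sort_columns: Sequence[str],
    target_records_per_file: int,
) -> List[List[Record]]:
    """Merge small files into larger ones that are globally sorted.

    Hypothetical sketch: each "file" is a list of records held in memory.
    """
    if not sort_columns:
        raise ValueError("at least one sort column is required")
    if target_records_per_file <= 0:
        raise ValueError("target_records_per_file must be positive")

    # One column yields a scalar key, several columns yield a tuple key.
    key = itemgetter(*sort_columns)

    # Step i: sort every small file by the specified column(s).
    sorted_runs = [sorted(records, key=key) for records in small_files]

    # Step ii: lazily k-way merge the sorted runs, then cut the merged
    # stream into larger output files of the target size.
    larger_files: List[List[Record]] = []
    current: List[Record] = []
    for record in heapq.merge(*sorted_runs, key=key):
        current.append(record)
        if len(current) == target_records_per_file:
            larger_files.append(current)
            current = []
    if current:
        larger_files.append(current)
    return larger_files
```

For instance, four small files holding ids `[5, 1]`, `[4, 2]`, `[3]` and `[6]`, merged with `sort_columns=["id"]` and `target_records_per_file=3`, come out as two files holding ids `[1, 2, 3]` and `[4, 5, 6]`. Note that M < N only holds when the target size is larger than the average small-file size; a real strategy would more likely size outputs in bytes rather than record counts.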