Alexey Kudinkin created HUDI-4862: ------------------------------------- Summary: Add an option leverage RDDCustomColumnsSortPartitioner in bulk-insert Key: HUDI-4862 URL: https://issues.apache.org/jira/browse/HUDI-4862 Project: Apache Hudi Issue Type: Improvement Reporter: Alexey Kudinkin Assignee: Alexey Kudinkin
Currently, to sort incoming batch using DeltaStreamer you'll have to resort to using SqlQueryBasedTransformer. This however should not be necessary provided that RDDCustomColumnsSortPartitioner does already exist and could be leveraged exactly for that. This come up during discussions w/ [~abmodi] -- This message was sent by Atlassian Jira (v8.20.10#820010)