[ https://issues.apache.org/jira/browse/HUDI-3895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Alexey Kudinkin updated HUDI-3895: ---------------------------------- Status: Patch Available (was: In Progress) > Make sure Hudi relations do proper file-split packing (on par w/ Spark) > ----------------------------------------------------------------------- > > Key: HUDI-3895 > URL: https://issues.apache.org/jira/browse/HUDI-3895 > Project: Apache Hudi > Issue Type: Bug > Reporter: Alexey Kudinkin > Assignee: Alexey Kudinkin > Priority: Blocker > Fix For: 0.11.0 > > > While investigating on HUDI-3891, it was discovered that upon introduction of > Hudi's own Spark's Relation implementations, file-split packing algorithm was > inadvertently subverted: > Spark algorithm does greedy packing which relies on the list of file-splits > being ordered by the file size (descending in order). -- This message was sent by Atlassian Jira (v8.20.1#820001)