[ https://issues.apache.org/jira/browse/HUDI-7489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ethan Guo updated HUDI-7489: ---------------------------- Fix Version/s: 1.0.0 > Row writer clustering collects write statuses on the driver > ----------------------------------------------------------- > > Key: HUDI-7489 > URL: https://issues.apache.org/jira/browse/HUDI-7489 > Project: Apache Hudi > Issue Type: Bug > Components: clustering, spark > Reporter: Jonathan Vexler > Assignee: Jonathan Vexler > Priority: Critical > Labels: pull-request-available > Fix For: 0.15.0, 1.0.0 > > > Row writer clustering collects write statuses on the driver which can be a > very large amount of data. We Immediately parallelize the data again, so we > can avoid this collect. -- This message was sent by Atlassian Jira (v8.20.10#820010)