[ https://issues.apache.org/jira/browse/SPARK-3671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ankur Dave resolved SPARK-3671. ------------------------------- Resolution: Fixed Issue resolved by https://github.com/amplab/spark-indexedrdd/pull/4 > Batch multiput updates within partitions > ---------------------------------------- > > Key: SPARK-3671 > URL: https://issues.apache.org/jira/browse/SPARK-3671 > Project: Spark > Issue Type: Sub-task > Components: Spark Core > Reporter: Ankur Dave > Assignee: Ankur Dave > > Multiputs that touch the same 32-element leaf node multiple times will > currently allocate and throw away a 32-element array per updated element. > Instead, we should batch together all updates to the same leaf node and apply > them in one step. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org