[ https://issues.apache.org/jira/browse/SPARK-53791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
duanshilong updated SPARK-53791:
--------------------------------
Description:
For example, during an INSERT OVERWRITE DIRECTORY operation, each file rename
triggers an RPC request, so the rename phase can be time-consuming when there
are many files.
Running the renames on multiple threads instead of serially can reduce job
execution time.
was: Make rename operation multi-threaded.
Summary: Make the rename operations multi-threaded. (was: Make)
> Make the rename operations multi-threaded.
> ------------------------------------------
>
> Key: SPARK-53791
> URL: https://issues.apache.org/jira/browse/SPARK-53791
> Project: Spark
> Issue Type: Improvement
> Components: Spark Core, SQL
> Affects Versions: 4.0.0
> Reporter: duanshilong
> Priority: Major
>
> For example, during an INSERT OVERWRITE DIRECTORY operation, each file rename
> triggers an RPC request, so the rename phase can be time-consuming when there
> are many files.
> Running the renames on multiple threads instead of serially can reduce job
> execution time.
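
The idea in the description can be illustrated with a minimal sketch. This is
not Spark's implementation and not the proposed patch: it uses the local
filesystem and os.rename as a stand-in for the per-file rename RPC, and the
names rename_all, demo, and max_workers are hypothetical, chosen only for this
example. It shows the serial loop being replaced by a thread pool, with
failures still surfacing to the caller.

```python
import os
import tempfile
from concurrent.futures import ThreadPoolExecutor


def rename_all(pairs, max_workers=8):
    """Rename every (src, dst) pair on a thread pool and return the dst paths.

    Hypothetical helper: os.rename stands in for the remote rename call.
    """
    def rename_one(pair):
        src, dst = pair
        os.rename(src, dst)
        return dst

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # list() drains the result iterator, so an exception raised by any
        # rename is re-raised here instead of being silently dropped.
        return list(pool.map(rename_one, pairs))


def demo(num_files=20):
    """Create files in a staging directory and move them to a target directory."""
    with tempfile.TemporaryDirectory() as root:
        staging = os.path.join(root, "staging")
        target = os.path.join(root, "target")
        os.mkdir(staging)
        os.mkdir(target)

        pairs = []
        for i in range(num_files):
            name = f"part-{i:05d}"
            src = os.path.join(staging, name)
            with open(src, "w") as f:
                f.write("x")
            pairs.append((src, os.path.join(target, name)))

        rename_all(pairs)
        return sorted(os.listdir(target)), os.listdir(staging)
```

Threads suit this workload because each rename spends its time waiting on I/O
(an RPC in the case described above) rather than on CPU. The pool size is an
assumption here; in practice it would need to be bounded so that the remote
service is not flooded with concurrent requests.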