[jira] [Updated] (HUDI-7906) improve the parallelism deduce in rdd write

Danny Chen (Jira) Fri, 21 Jun 2024 21:31:05 -0700


     [ 
https://issues.apache.org/jira/browse/HUDI-7906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Danny Chen updated HUDI-7906:
-----------------------------
    Fix Version/s: 0.16.0
                   1.0.0

> improve the parallelism deduce in rdd write
> -------------------------------------------
>
>                 Key: HUDI-7906
>                 URL: https://issues.apache.org/jira/browse/HUDI-7906
>             Project: Apache Hudi
>          Issue Type: Improvement
>            Reporter: KnightChess
>            Assignee: KnightChess
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.16.0, 1.0.0
>
>
> as [https://github.com/apache/hudi/issues/11274] and 
> [https://github.com/apache/hudi/pull/11463] describe, there has two case 
> question.
>  # if the rdd is input rdd without shuffle, the partitiion number is too 
> bigger or too small
>  # user need can not control it easy
>  ## in some case user can set `spark.default.parallelism` change it.
>  ## in some case user can not change because hard-code
>  ## and in spark, the better way is use `spark.default.parallelism` or 
> `spark.sql.shuffle.partitions` can control it, other is advanced in hudi.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (HUDI-7906) improve the parallelism deduce in rdd write

Reply via email to