[ https://issues.apache.org/jira/browse/SPARK-39602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17562326#comment-17562326 ]
Hyukjin Kwon commented on SPARK-39602: -------------------------------------- Why don't you decrease the number of partitions? > Invoking .repartition(100000) in a unit test causes the unit test to take >20 > minutes. > -------------------------------------------------------------------------------------- > > Key: SPARK-39602 > URL: https://issues.apache.org/jira/browse/SPARK-39602 > Project: Spark > Issue Type: Bug > Components: Spark Core > Affects Versions: 3.3.0 > Reporter: Tanin Na Nakorn > Priority: Major > > Here's a proof of concept: > {code} > val result = spark > .createDataset(List("test")) > .rdd > .repartition(100000) > .map { _ => > "test" > } > .collect() > .toList > > println(result) > {code} > This code takes a very long time in unit test. > We aim to test for correctness in unit test... not testing the repartition. > Is there a way to make it faster? (e.g. disable partition in test) -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org