This is correct. It's true by default so that AQE doesn't have performance
regression. If you run a benchmark, larger parallelism usually means better
performance. However, it's recommended to set it to false, so that AQE can
give better resource utilization, which is good for a busy Spark cluster.

On Fri, Sep 3, 2021 at 7:33 PM Jacek Laskowski <ja...@japila.pl> wrote:

> Hi,
>
> Found this new spark.sql.adaptive.coalescePartitions.parallelismFirst
> config property [1] with the default value `true` but the descriptions says
> the opposite:
>
> > It's recommended to set this config to false
>
> Is this OK and I'm misreading it?
>
> [1]
> https://github.com/apache/spark/blob/54cca7f82ecf23e062bb4f6d68697abec2dbcc5b/sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala#L519-L530
>
> Pozdrawiam,
> Jacek Laskowski
> ----
> https://about.me/JacekLaskowski
> "The Internals Of" Online Books <https://books.japila.pl/>
> Follow me on https://twitter.com/jaceklaskowski
>
> <https://twitter.com/jaceklaskowski>
>

Reply via email to