Hi All,

The function *mapPartitions *in RDD.scala
<https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/rdd/RDD.scala>
takes
a boolean parameter *preservesPartitioning. *It seems if that parameter is
passed as *false*, the passed function f will operate on the data only
once, whereas if it's passed as *true *the function will operate on each
partition of the data.

In my case, whatever boolean value I pass, *f* operates on each partition
of data.

Any help, regarding why I am getting this unexpected behaviour?

Reply via email to