On 24/05/2020 11:27, Antonin Delpeuch (lists) wrote:
> With this formulation, zipWithIndex would be a special case of
> mapWithState (so it could be refactored to be expressed as such).

Forget about this part: it obviously should not be refactored that way. zipWithIndex can compute the size of each partition independently, so expressing it through mapWithState would be inefficient.

Antonin
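
To illustrate the point, here is a rough sketch in plain Python (no Spark involved), where an RDD is modelled as a list of partitions. The names `zip_with_index` and `map_with_state` are hypothetical stand-ins, and the sketch assumes that mapWithState would thread a single state through the elements in order, which is only one reading of the proposal.

```python
from itertools import accumulate


def zip_with_index(partitions):
    """Model of the two-pass approach: sizes first, then per-partition indexing."""
    # Pass 1: each partition's size depends only on that partition,
    # so on a cluster these counts could be computed in parallel.
    sizes = [len(p) for p in partitions]
    # The starting index of a partition is the total size of the earlier ones.
    offsets = [0] + list(accumulate(sizes))[:-1]
    # Pass 2: each partition numbers its own elements from its offset,
    # again without looking at any other partition's elements.
    return [
        [(x, off + i) for i, x in enumerate(p)]
        for p, off in zip(partitions, offsets)
    ]


def map_with_state(partitions, initial_state, step):
    """Hypothetical sequential model: step(state, element) -> (new_state, output)."""
    state = initial_state
    result = []
    for p in partitions:
        mapped = []
        for x in p:
            # The state leaving one partition feeds the next one, so a
            # partition cannot start before the previous one has finished.
            state, y = step(state, x)
            mapped.append(y)
        result.append(mapped)
    return result


partitions = [["a", "b"], ["c"], ["d", "e"]]
indexed = zip_with_index(partitions)
via_state = map_with_state(partitions, 0, lambda s, x: (s + 1, (x, s)))
```

Both calls give the same indexed elements, but in this model only the first lets the partitions be processed independently once the sizes are known.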