Hi,

just a quick question about calling persist with one of the _2 storage levels
(for example MEMORY_ONLY_2). Is the 2x replication only useful for fault
tolerance, or will it also speed up jobs by avoiding network transfers when
I'm doing joins or other shuffle operations?

Thanks

