Fang, Mike Tue, 04 Aug 2015 22:49:16 -0700
Hi, does anyone know how I can control the number of reducers for an operation such as groupBy on a DataFrame? I can set spark.sql.shuffle.partitions in SQL, but I am not sure how to do the same with the df.groupBy("XX") API.
Thanks, Mike