rdd.coalesce(1) will coalesce the RDD into a single partition and give only one output file. coalesce(2) will give two, and so on.

On Jan 23, 2015 4:58 AM, "Sean Owen" <so...@cloudera.com> wrote:
> One output file is produced per partition. If you want fewer, use
> coalesce() before saving the RDD.
>
> On Thu, Jan 22, 2015 at 10:46 PM, Kane Kim <kane.ist...@gmail.com> wrote:
> > How can I reduce the number of output files? Is there a parameter to
> > saveAsTextFile?
> >
> > Thanks.
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
> > For additional commands, e-mail: user-h...@spark.apache.org
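
For anyone finding this thread later, here is a minimal PySpark sketch of the advice above: coalesce first, then save. The helper name save_with_fewer_files, the demo() function, the app name, and the output path are all made up for illustration; only coalesce(), saveAsTextFile(), parallelize(), and SparkContext come from Spark itself. The demo assumes a local Spark installation and that the output directory does not exist yet.

```python
def save_with_fewer_files(rdd, path, num_files=1):
    """Save `rdd` as text using at most `num_files` part files.

    saveAsTextFile writes one part file per partition, so the RDD is
    coalesced first. Without shuffle=True, coalesce can only lower the
    partition count, never raise it.
    """
    if num_files < 1:
        raise ValueError("num_files must be at least 1")
    merged = rdd.coalesce(num_files)
    merged.saveAsTextFile(path)
    return merged


def demo():
    # pyspark is imported here so the helper above has no hard dependency on it.
    from pyspark import SparkContext

    sc = SparkContext("local[2]", "coalesce-demo")  # hypothetical app name
    try:
        # 8 partitions would mean 8 part files if saved directly.
        rdd = sc.parallelize(range(100), 8)
        # Hypothetical output path; Spark fails if it already exists.
        save_with_fewer_files(rdd, "/tmp/coalesce-demo-out", 2)
    finally:
        sc.stop()
```

Calling demo() should leave two part files in the output directory instead of eight. Keep in mind that coalesce(1) funnels all the data through a single task, so it is probably only a good idea when the output is small.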