Rdd.coalesce(1) will coalesce RDD and give only one output file.
coalesce(2) will give 2 wise versa.
On Jan 23, 2015 4:58 AM, "Sean Owen" <so...@cloudera.com> wrote:

> One output file is produced per partition. If you want fewer, use
> coalesce() before saving the RDD.
>
> On Thu, Jan 22, 2015 at 10:46 PM, Kane Kim <kane.ist...@gmail.com> wrote:
> > How I can reduce number of output files? Is there a parameter to
> saveAsTextFile?
> >
> > Thanks.
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
> > For additional commands, e-mail: user-h...@spark.apache.org
> >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
> For additional commands, e-mail: user-h...@spark.apache.org
>
>

Reply via email to