Re: Shuffle to HDFS

Larry Liu Sun, 25 Jan 2015 23:04:34 -0800

Hi,Jerry

Thanks for your reply.

The reason I have this question is that in Hadoop, mapper intermediate
output (shuffle) will be stored in HDFS. I think the default location for
spark is /tmp I think.

Larry

On Sun, Jan 25, 2015 at 9:44 PM, Shao, Saisai <saisai.s...@intel.com> wrote:

>  Hi Larry,
>
>
>
> I don’t think current Spark’s shuffle can support HDFS as a shuffle
> output. Anyway, is there any specific reason to spill shuffle data to HDFS
> or NFS, this will severely increase the shuffle time.
>
>
>
> Thanks
>
> Jerry
>
>
>
> *From:* Larry Liu [mailto:larryli...@gmail.com]
> *Sent:* Sunday, January 25, 2015 4:45 PM
> *To:* u...@spark.incubator.apache.org
> *Subject:* Shuffle to HDFS
>
>
>
> How to change shuffle output to HDFS or NFS?
>

Re: Shuffle to HDFS

Reply via email to