Re: wordcount job slow while input from NFS mount

Larry Liu Wed, 17 Dec 2014 13:32:13 -0800

Thanks, Matei.

I will give it a try.


Larry

On Wed, Dec 17, 2014 at 1:01 PM, Matei Zaharia <matei.zaha...@gmail.com>
wrote:
>
> I see, you may have something else configured weirdly then. You should
> look at CPU and disk utilization while your Spark job is reading from NFS
> and, if you see high CPU use, run jstack to see where the process is
> spending time. Also make sure Spark's local work directories
> (spark.local.dir) are not on NFS. They shouldn't be, though; by default
> they are under /tmp.
>
> Matei
>
> On Dec 17, 2014, at 11:56 AM, Larry Liu <larryli...@gmail.com> wrote:
>
> Hi, Matei
>
> Thanks for your response.
>
> I copied the file (1 GB) from NFS and it took 10 seconds. The NFS mount
> is in a LAN environment, and the NFS server runs on the same server that
> Spark runs on. So basically I mount the NFS export on the same bare-metal
> machine.
>
> Larry
>
> On Wed, Dec 17, 2014 at 11:42 AM, Matei Zaharia <matei.zaha...@gmail.com>
> wrote:
>>
>> The problem is very likely NFS, not Spark. What kind of network is it
>> mounted over? You can also test the performance of your NFS by copying a
>> file from it to a local disk or to /dev/null and seeing how many bytes per
>> second it can copy.
>>
>> Matei
>>
>> > On Dec 17, 2014, at 9:38 AM, Larryliu <larryli...@gmail.com> wrote:
>> >
>> > A word count job on a text file of about 1 GB takes 1 hour when the
>> > input is read from an NFS mount. The same job takes 30 seconds when the
>> > input is read from the local file system.
>> >
>> > Is there any tuning required for input from an NFS mount?
>> >
>> > Thanks
>> >
>> > Larry
>> >
>> >
>> >
>> > --
>> > View this message in context:
>> > http://apache-spark-user-list.1001560.n3.nabble.com/wordcount-job-slow-while-input-from-NFS-mount-tp20747.html
>> > Sent from the Apache Spark User List mailing list archive at Nabble.com.
>> >
>>
>>
>
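
Matei's suggestion to copy a file from the NFS mount to /dev/null and look at bytes per second can be sketched in Python as follows. This is only an illustration, not something posted in the thread: the function name `measure_read_throughput` and the 1 MiB chunk size are assumptions. For reference, Larry's figure of 1 GB in 10 seconds works out to roughly 100 MB/s.

```python
import time


def measure_read_throughput(path, chunk_size=1024 * 1024):
    """Read `path` sequentially, discard the data, and report the rate.

    Returns (bytes_read, seconds, bytes_per_second). This is the same idea
    as copying the file to /dev/null: only the read side is measured.
    """
    total = 0
    start = time.perf_counter()
    with open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:
                break
            total += len(chunk)
    elapsed = time.perf_counter() - start
    # Guard against a zero interval on very small files.
    rate = total / elapsed if elapsed > 0 else float("inf")
    return total, elapsed, rate


if __name__ == "__main__":
    import sys

    # Usage: python nfs_read_test.py <file on the NFS mount>
    total, elapsed, rate = measure_read_throughput(sys.argv[1])
    print(f"{total} bytes in {elapsed:.2f} s = {rate / 1e6:.1f} MB/s")
```

A second run on the same file may be served from the operating system's page cache rather than from NFS, so the first run on a freshly mounted file is likely the more meaningful number.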

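Matei's other check, that Spark's local work directories (spark.local.dir) are not on NFS, could be done on Linux by looking up the directory in /proc/mounts. The sketch below assumes a Linux host and the usual /proc/mounts layout (device, mount point, filesystem type, ...). The helper names `parse_mounts` and `fs_type_for` are made up for illustration, and /tmp is used only because the thread mentions it.

```python
import os

# Filesystem type names that /proc/mounts typically uses for NFS mounts.
NFS_TYPES = {"nfs", "nfs4"}


def parse_mounts(mounts_text):
    """Return a list of (mount_point, fs_type) from /proc/mounts-style text."""
    entries = []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) >= 3:
            entries.append((fields[1], fields[2]))
    return entries


def fs_type_for(path, mounts):
    """Return the fs type of the longest mount point that contains `path`.

    Symlinks are not resolved here, and mount points containing spaces
    (escaped as \\040 in /proc/mounts) are not handled.
    """
    path = os.path.abspath(path)
    best_point, best_type = "", None
    for mount_point, fs_type in mounts:
        prefix = mount_point.rstrip("/") + "/"
        if (path + "/").startswith(prefix):
            # ">=" lets a later mount on the same point shadow an earlier one.
            if len(mount_point) >= len(best_point):
                best_point, best_type = mount_point, fs_type
    return best_type


if __name__ == "__main__":
    with open("/proc/mounts") as f:
        mounts = parse_mounts(f.read())
    # Assumed spark.local.dir value; replace with the directories you configured.
    for local_dir in ["/tmp"]:
        fs_type = fs_type_for(local_dir, mounts)
        warning = " (on NFS!)" if fs_type in NFS_TYPES else ""
        print(f"{local_dir}: {fs_type}{warning}")
```

Matching on the longest mount point matters because a path such as /mnt/nfs/data is contained in both / and /mnt/nfs, and only the deeper mount determines where the data actually lives.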