Re: application failed on large dataset

2015-09-18 Thread 周千昊
- >> From: qhz...@apache.org >> Date: Wed, 16 Sep 2015 13:52:25 + >> >> Subject: Re: application failed on large dataset >> To: java8...@hotmail.com; user@spark.apache.org >> >> Hi, >> I have switched 'spark.shuffle.blockTransferServ

RE: application failed on large dataset

2015-09-16 Thread java8964
Can you try "nio" instead of "netty"? Set "spark.shuffle.blockTransferService" to "nio" and give it a try. Yong From: z.qian...@gmail.com Date: Wed, 16 Sep 2015 03:21:02 + Subject: Re: application failed on large dataset To: java8...@hotmail.com
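
A minimal sketch of how the suggestion above might be applied at submit time, assuming a Spark 1.x release in which "spark.shuffle.blockTransferService" still accepts "nio". The helper name build_submit_command and the application file my_job.py are hypothetical; only the property name and value come from the thread.

```python
def build_submit_command(app, conf):
    """Build a spark-submit argument list with one --conf flag per setting."""
    cmd = ["spark-submit"]
    for key, value in sorted(conf.items()):
        cmd += ["--conf", f"{key}={value}"]
    cmd.append(app)
    return cmd


# Property and value as suggested in the thread; my_job.py is a placeholder.
shuffle_conf = {"spark.shuffle.blockTransferService": "nio"}
command = build_submit_command("my_job.py", shuffle_conf)
print(" ".join(command))
```

The same property could also be placed in spark-defaults.conf or set on a SparkConf object; the command-line form is shown only because it is the easiest to try once and revert.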

Re: application failed on large dataset

2015-09-16 Thread 周千昊
"nio", instead of "netty". > > set "spark.shuffle.blockTransferService", to "nio" and give it a try. > > Yong > > -- > From: z.qian...@gmail.com > Date: Wed, 16 Sep 2015 03:21:02 + > > Su

RE: application failed on large dataset

2015-09-16 Thread java8964
This sounds like a memory issue. Have you enabled GC output? When this is happening, are your executors doing full GC? How long does each full GC take? Yong From: qhz...@apache.org Date: Wed, 16 Sep 2015 13:52:25 + Subject: Re: application failed on large dataset To: java8...@hotmail.com; user
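
One possible way to turn on the GC output asked about above is to pass GC logging flags to the executor JVMs through "spark.executor.extraJavaOptions". This is a sketch under assumptions: the flags are the usual HotSpot options for Java 7/8 and may differ on other JVMs, and the helper name gc_logging_args is hypothetical.

```python
import shlex

# Common HotSpot (Java 7/8) GC logging flags; an assumption, adjust for other JVMs.
GC_FLAGS = ["-verbose:gc", "-XX:+PrintGCDetails", "-XX:+PrintGCTimeStamps"]


def gc_logging_args(extra_flags=()):
    """Return spark-submit arguments that pass GC logging flags to executors."""
    java_opts = " ".join(GC_FLAGS + list(extra_flags))
    return ["--conf", f"spark.executor.extraJavaOptions={java_opts}"]


args = gc_logging_args()
# shlex.quote keeps the space-separated JVM options together as one shell word.
print(" ".join(shlex.quote(a) for a in args))
```

With these flags set, full collections and their pause times should appear in each executor's stdout log, which is where the questions about full GC frequency and duration can be checked.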

Re: application failed on large dataset

2015-09-16 Thread 周千昊
ening, are your executors > doing full gc? How long is the full gc? > > Yong > > -- > From: qhz...@apache.org > Date: Wed, 16 Sep 2015 13:52:25 + > > Subject: Re: application failed on large dataset > To: java8...@hotmail.com; user@

Re: application failed on large dataset

2015-09-15 Thread 周千昊
> From: qhz...@apache.org > Date: Tue, 15 Sep 2015 15:02:28 + > Subject: Re: application failed on large dataset > To: user@spark.apache.org > > > Has anyone run into the same problem? > 周千昊 <qhz...@apache.org> wrote on Mon, Sep 14, 2015 at 9:07 PM: > > Hi

Re: application failed on large dataset

2015-09-15 Thread 周千昊
Has anyone run into the same problem? 周千昊 wrote on Mon, Sep 14, 2015 at 9:07 PM: > Hi, community > I am facing a strange problem: > all executors stop responding, and then all of them fail with > ExecutorLostFailure. > When I look into the YARN logs, they are full of such

RE: application failed on large dataset

2015-09-15 Thread java8964
2015 15:02:28 + Subject: Re: application failed on large dataset To: user@spark.apache.org Has anyone run into the same problem? 周千昊 <qhz...@apache.org> wrote on Mon, Sep 14, 2015 at 9:07 PM: Hi, community I am facing a strange problem: all executors stop responding, and then all of them

application failed on large dataset

2015-09-14 Thread 周千昊
Hi, community, I am facing a strange problem: all executors stop responding, and then all of them fail with ExecutorLostFailure. When I look into the YARN logs, they are full of exceptions like this one: 15/09/14 04:35:33 ERROR shuffle.RetryingBlockFetcher: Exception while beginning