Hi Xiangrui,
(2014/07/16 15:05), Xiangrui Meng wrote:
> I don't remember writing that, but thanks for bringing this issue up!
> There are two important settings to check: 1) driver memory (you can
> see it from the executor tab), 2) number of partitions (try to use a
> small number of partitions). I put up two PRs to fix the problem:
For the driver memory, I used 16GB/24GB and that was enough for the
execution (full GC did not happen). I checked it with the jmap and top
commands.
BTW, I noticed that the required driver memory was oddly proportional
to the number of tasks/executors. When I used 8GB for the driver
memory, I got an OOM during task serialization. This could be a memory
leak in the task serialization that should be addressed in the future.
Each task is about 24MB and the number of tasks/executors is 280.
The size of each task result was about 120MB or so.
> 1) use broadcast in task closure:
> https://github.com/apache/spark/pull/1427
Does this PR reduce the required driver memory?
Is there a big difference between explicitly broadcasting the feature
weights and implicitly serializing them into each task closure?
> 2) use treeAggregate to get the result:
> https://github.com/apache/spark/pull/1110
treeAggregate would certainly reduce the aggregation time and the
driver's required memory. I will test it.
However, the problem I am facing now is an Akka connection issue that
appears during GC or under heavy load. So I think the problem would
still lurk even if treeAggregate reduces the consumed memory.
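To illustrate the idea (this is a plain-Python sketch of multi-level aggregation, not Spark's implementation; the function name `tree_aggregate` and the `fan_in` parameter are mine), the point is that per-partition results are combined level by level with bounded fan-in, so the final driver-side merge sees only a handful of partial results instead of one ~120MB result per each of the 280 tasks:

```python
from functools import reduce

def tree_aggregate(partitions, seq_op, comb_op, zero, fan_in=2):
    """Aggregate each partition, then merge the partial results in
    levels of bounded fan-in, so the last (driver-side) merge only
    touches a few values rather than one per partition."""
    # Per-partition aggregation (would run on executors in Spark).
    level = [reduce(seq_op, part, zero) for part in partitions]
    # Intermediate merge levels (would also run on executors).
    while len(level) > fan_in:
        level = [reduce(comb_op, level[i:i + fan_in])
                 for i in range(0, len(level), fan_in)]
    # Final small merge (the only part the driver would do).
    return reduce(comb_op, level)

parts = [[1, 2], [3, 4], [5, 6], [7, 8]]
print(tree_aggregate(parts, lambda a, b: a + b, lambda a, b: a + b, 0))  # 36
```

With a plain aggregate, the driver would merge all partition results itself; with the tree variant it merges at most `fan_in` of them, which is why the driver memory requirement drops.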
Best,
Makoto