[ https://issues.apache.org/jira/browse/SPARK-6056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14340858#comment-14340858 ]

Aaron Davidson commented on SPARK-6056:
---------------------------------------

Actually, that line just calls newDirectBuf, but the implementation of 
newDirectBuf will only actually create a new direct byte buffer if doing so is 
"cheap" (see 
[here|https://github.com/netty/netty/blob/netty-4.0.23.Final/transport/src/main/java/io/netty/channel/nio/AbstractNioChannel.java#L397])
 -- i.e., if the current pool allows direct bufs or if a threadlocal one is 
already available.
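
Roughly, the decision there looks like this (a paraphrased sketch in Scala against the Netty 4.x API, not the actual source; `newDirectBufSketch` is just an illustrative name):

```scala
import io.netty.buffer.{ByteBuf, ByteBufAllocator, ByteBufUtil}

// Only copy the heap buffer into a direct one when it is "cheap" to do so.
def newDirectBufSketch(alloc: ByteBufAllocator, buf: ByteBuf): ByteBuf = {
  if (alloc.isDirectBufferPooled) {
    // Cheap path 1: the allocator pools direct buffers, so copy into one.
    val direct = alloc.directBuffer(buf.readableBytes)
    direct.writeBytes(buf)
    buf.release()
    direct
  } else {
    // Cheap path 2: reuse a pre-allocated threadlocal direct buffer, if present.
    val threadLocal = ByteBufUtil.threadLocalDirectBuffer()
    if (threadLocal != null) {
      threadLocal.writeBytes(buf)
      buf.release()
      threadLocal
    } else {
      // Neither is cheap: keep the heap buffer as-is.
      buf
    }
  }
}
```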

In our case, when preferDirectBufs is set to false, we set 
DEFAULT_NUM_DIRECT_ARENA to 0:
https://github.com/netty/netty/blob/netty-4.0.23.Final/buffer/src/main/java/io/netty/buffer/PooledByteBufAllocator.java#L76

When nDirectArena == 0, directArenas is set to null in PooledByteBufAllocator, 
which means that isDirectBufferPooled should return false.
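
That is easy to check directly against a pooled allocator (a minimal sketch; the arena counts and the page size / max order of 8192 / 11 are just Netty's defaults, used here for illustration):

```scala
import io.netty.buffer.PooledByteBufAllocator

// preferDirect = false, nHeapArena = 1, nDirectArena = 0, pageSize = 8192, maxOrder = 11
val onHeapOnly = new PooledByteBufAllocator(false, 1, 0, 8192, 11)
assert(!onHeapOnly.isDirectBufferPooled) // zero direct arenas => not direct-buffer-pooled
```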

So if everything is hooked up as it should be, this path should not do any 
direct allocation. It's possible, though, that there's a bug somewhere in the 
path between nDirectArenas and newDirectBuf that causes this not to be the 
case.

> Unlimit offHeap memory use cause RM killing the container
> ---------------------------------------------------------
>
>                 Key: SPARK-6056
>                 URL: https://issues.apache.org/jira/browse/SPARK-6056
>             Project: Spark
>          Issue Type: Bug
>          Components: Shuffle, Spark Core
>    Affects Versions: 1.2.1
>            Reporter: SaintBacchus
>
> No matter whether we set `preferDirectBufs` or limit the number of threads, 
> Spark cannot limit its use of off-heap memory.
> At line 269 of the class 'AbstractNioByteChannel' in netty-4.0.23.Final, 
> Netty allocates an off-heap buffer of the same size as the on-heap one.
> So however many buffers you want to transfer, the same amount of off-heap 
> memory will be allocated.
> But once the allocated memory reaches the overhead memory capacity set in 
> YARN, the executor will be killed.
> I wrote some simple code to test it:
> ```scala
> // Run in spark-shell; sc is the active SparkContext.
> import org.apache.spark.SparkEnv
> import org.apache.spark.storage.RDDBlockId
>
> // Cache eleven 10 MB byte arrays across 10 partitions.
> val bufferRdd = sc.makeRDD(0 to 10, 10).map(x => new Array[Byte](10 * 1024 * 1024)).persist()
> bufferRdd.count()
>
> // Fetch the first cached partition through the block manager and read it back.
> val part = bufferRdd.partitions(0)
> val blockMgr = SparkEnv.get.blockManager
> val blockOption = blockMgr.get(RDDBlockId(bufferRdd.id, part.index))
> val resultIt = blockOption.get.data.asInstanceOf[Iterator[Array[Byte]]]
> val len = resultIt.map(_.length).sum
> ```
> If multiple threads fetch len like this concurrently, the physical memory 
> will soon exceed the limit set by spark.yarn.executor.memoryOverhead.
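>
> A minimal sketch of that multi-threaded read, assuming the same `blockMgr`, `bufferRdd` and `part` values as in the snippet above (the thread count of 8 is arbitrary):
> ```scala
> // Each thread re-fetches the cached partition; every remote fetch goes through
> // the Netty transfer path discussed above, so off-heap usage grows with the
> // number of concurrent readers.
> val threads = (1 to 8).map { _ =>
>   new Thread(new Runnable {
>     override def run(): Unit = {
>       val it = blockMgr.get(RDDBlockId(bufferRdd.id, part.index))
>         .get.data.asInstanceOf[Iterator[Array[Byte]]]
>       println(it.map(_.length).sum)
>     }
>   })
> }
> threads.foreach(_.start())
> threads.foreach(_.join())
> ```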


