Hello,

I have a question about the way LinuxResourceCalculatorPlugin calculates the memory consumed by a process tree (it does so via the ProcfsBasedProcessTree class). When we enable disk caching in Apache Spark jobs running on a YARN cluster, the node manager starts killing containers while they read the cached data, with "Container is running beyond memory limits ...".

The reason is that even when smaps parsing is enabled (yarn.nodemanager.container-monitor.procfs-tree.smaps-based-rss.enabled), ProcfsBasedProcessTree counts mmapped read-only pages as memory consumed by the process tree, and Spark uses FileChannel.map(MapMode.READ_ONLY) to read the cached data. The JVM then appears to consume *a lot* more memory than the configured heap size (and this cannot really be controlled), but in my opinion that memory is not really consumed by the process: the kernel can reclaim those pages whenever it needs to.

My question is: is there any explicit reason why "Private_Clean" pages are counted as consumed by the process tree? I patched ProcfsBasedProcessTree not to count them, but I don't know whether that is the "correct" solution.
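To make the scenario concrete, here is a minimal, self-contained sketch (this is not the actual ProcfsBasedProcessTree code; the class name and file path are just illustrative) that maps a file read-only the way Spark reads cached blocks and then sums /proc/self/smaps twice, once counting Private_Clean pages and once ignoring them:

import java.io.IOException;
import java.nio.MappedByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardOpenOption;
import java.util.List;

// Illustration only -- not the real ProcfsBasedProcessTree code.
public class SmapsPrivateCleanDemo {

    public static void main(String[] args) throws IOException {
        Path file = Paths.get(args.length > 0 ? args[0] : "/tmp/cached-block.bin");
        if (!Files.exists(file)) {
            // Create a dummy "cached block" so the demo is self-contained.
            Files.write(file, new byte[16 * 1024 * 1024]);
        }

        try (FileChannel ch = FileChannel.open(file, StandardOpenOption.READ)) {
            // The same kind of mapping Spark creates when reading disk-cached data.
            MappedByteBuffer buf = ch.map(FileChannel.MapMode.READ_ONLY, 0, ch.size());
            // Touch every page so the mapping becomes resident and shows up in smaps.
            for (int i = 0; i < buf.limit(); i += 4096) {
                buf.get(i);
            }

            System.out.println("counting Private_Clean : " + sumPrivateKb(true) + " kB");
            System.out.println("ignoring Private_Clean : " + sumPrivateKb(false) + " kB");
        }
    }

    // Sums the private page counters from /proc/self/smaps; when
    // countPrivateClean is false, Private_Clean pages (e.g. resident pages
    // of a read-only file mapping) are left out of the total.
    private static long sumPrivateKb(boolean countPrivateClean) throws IOException {
        long totalKb = 0;
        List<String> lines = Files.readAllLines(Paths.get("/proc/self/smaps"));
        for (String line : lines) {
            if (line.startsWith("Private_Dirty:")
                    || (countPrivateClean && line.startsWith("Private_Clean:"))) {
                String[] fields = line.split("\\s+");
                totalKb += Long.parseLong(fields[1]);   // the value is in kB
            }
        }
        return totalKb;
    }
}

The difference between the two totals should be dominated by the resident pages of the mapped file (assuming no other process has it mapped), which is the same effect that pushes our containers over their limit; the second total is essentially what my patched ProcfsBasedProcessTree reports.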

Thanks for any opinions,
 cheers,
 Jan


