Re: Java Process Memory Leak

2014-03-17 Thread Young Han
Interesting find.. It looks that bit was added recently ( https://reviews.apache.org/r/17644/diff/3/) and so was not part of Giraph 1.0.0 as far as I can tell. Also, if anyone cares, a clunky (Ubuntu) workaround I'm using is: kill $(ps aux | grep "[j]obcache/job_[0-9]\{12\}_[0-9]\{4\}/" | awk '{pr

Re: Java Process Memory Leak

2014-03-17 Thread Craig Muchinsky
I just noticed a similar problem myself. I did a thread dump and found similar netty client threads lingering. After poking around the source a bit, I'm wondering if the problem is related to this bit of code I found in the NettyClient.stop() method: workerGroup.shutdownGracefully()

Re: clustering coefficient (counting triangles) in giraph.

2014-03-17 Thread Suijian Zhou
Hi, Paven and Kaushik, Great! Yes, this is what I need. In the meantime, could you share your implementation with me? Thanks a lot! Best Regards, Suijian 2014-03-17 14:38 GMT-05:00 Pavan Kumar A : > If what you need is > http://en.wikipedia.org/wiki/Clustering_coefficient#Local_clusterin

Re: Java Process Memory Leak

2014-03-17 Thread Young Han
Oh, I see. I did jstack on a cluster of machines and a single machine... I'm not quite sure how to interpret the output. My best guess is that there might be a deadlock---there's just a bunch of Netty threads waiting. The links to the jstack dumps: http://pastebin.com/0cLuaF07 (PageRank, singl

RE: clustering coefficient (counting triangles) in giraph.

2014-03-17 Thread Pavan Kumar A
If what you need is http://en.wikipedia.org/wiki/Clustering_coefficient#Local_clustering_coefficientthen I implemented it in Giraph, will submit a patch soon Date: Mon, 17 Mar 2014 15:33:07 -0400 Subject: Re: clustering coefficient (counting triangles) in giraph. From: kaushikpatn...@gmail.com T

Re: clustering coefficient (counting triangles) in giraph.

2014-03-17 Thread Kaushik Patnaik
Check out this paper on implementing triangle counting in a BSP model by Prof David Bader from Georgia Tech. http://www.cc.gatech.edu/~bader/papers/GraphBSPonXMT-MTAAP2013.pdf I implemented a similar version in Apache Giraph, and it worked pretty well. You have to "switch on" the write to disk op

Re: clustering coefficient (counting triangles) in giraph.

2014-03-17 Thread Guy Bayes
ha that sounds familiar I never did get around to writing that... if you do it should update the comment chain in this thread http://www.vertica.com/2011/09/21/counting-triangles/ On Mon, Mar 17, 2014 at 12:17 PM, Suijian Zhou wrote: > Hi, Experts, > Does anybody know if there are examples o

clustering coefficient (counting triangles) in giraph.

2014-03-17 Thread Suijian Zhou
Hi, Experts, Does anybody know if there are examples of implementation in giraph for clustering coefficient (counting triangles)? Thanks! Best Regards, Suijian

Re: Java Process Memory Leak

2014-03-17 Thread Avery Ching
Hi Young, Our Hadoop instance (Corona) kills processes after they finish executing so we don't see this. You might want to do a jstack to see where it's hung up on and figure out the issue. Thanks Avery On 3/17/14, 7:56 AM, Young Han wrote: Hi all, With Giraph 1.0.0, I've noticed an issu

Java Process Memory Leak

2014-03-17 Thread Young Han
Hi all, With Giraph 1.0.0, I've noticed an issue where the Java process corresponding to the job loiters around indefinitely even after the job completes (successfully). The process consumes memory but not CPU time. This happens on both a single machine and clusters of machines (in which case ever