Try dumping a histogram of memory usage from a running JVM and see where the memory is going. I can't think of anything in particular that changed...
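For example, something along these lines against one of the map task JVMs (a minimal sketch; it assumes the standard JDK tools are on the node's path, and <pid> is just a placeholder for the task's process id):

  jps -lv                        # list running JVMs to find the child map task's pid
  jmap -histo <pid> | head -30   # top classes by instance count and bytes
  jmap -histo:live <pid>         # same, but forces a full GC first so only live objects are counted

Comparing the top entries between the old and new builds should show which classes are holding the extra memory.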

On 8/28/13 4:39 PM, Jeff Peters wrote:

I am tasked with updating our ancient (circa 7/10/2012) Giraph to giraph-release-1.0.0-RC3. Most jobs run fine, but our largest job now runs out of memory using the same AWS Elastic MapReduce configuration we have always used. I have never tried to configure either Giraph or the AWS Hadoop.

We build for Hadoop 1.0.2 because that's closest to the 1.0.3 AWS provides us. The 8 x m2.4xlarge cluster we use seems to provide 8*14=112 map tasks, each with a 2GB heap. Our code is completely unchanged except as required to adapt to the new Giraph APIs, and our vertex, edge, and message data are completely unchanged. On smaller jobs that work, the aggregate heap-usage high-water mark seems about the same as before, but the "committed heap" seems to run higher.

I can't even make it work on a cluster of 12. In that case one map task seems to end up with nearly twice as many messages as most of the others, so it runs out of memory anyway, and it only takes one failed task to fail the job. Am I missing something here? Should I be configuring the new Giraph in some way I didn't need to with the old one?
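For reference, a rough sketch of the knobs involved on a stock Hadoop 1.0.x setup (the property names are the standard Hadoop 1.x ones; the values are only illustrative, matching what the job above reports, not taken from an actual EMR bootstrap):

  mapred.tasktracker.map.tasks.maximum=14   # map slots per node (8 nodes x 14 = 112 tasks)
  mapred.child.java.opts=-Xmx2048m          # per-task JVM heap

And if the growth turns out to be message-related, Giraph 1.0 has out-of-core message settings that can be passed as custom arguments to GiraphRunner (option names as I recall them; worth verifying against GiraphConstants in the release being used):

  -ca giraph.useOutOfCoreMessages=true
  -ca giraph.maxMessagesInMemory=1000000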

