RE: Resources or advice on minimising memory usage in Giraph/Hadoop code ?

2012-06-07 Thread David Garcia
ever reason, it may not be necessary to instantiate them all. From: Claudio Martella [claudio.marte...@gmail.com] Sent: Thursday, June 07, 2012 1:35 AM To: user@giraph.apache.org Subject: Re: Resources or advice on minimising memory usage in Giraph/Hadoop cod

Re: Resources or advice on minimising memory usage in Giraph/Hadoop code ?

2012-06-07 Thread André Kelpe
Hi! One interesting JVM option I learned about lately is -XX:+UseCompressedStrings, which will use a byte[] for all strings that are fully defined in ASCII. Given that you are working with URIs, I assume that this is true for most of your strings, so I would give it a shot. For more info on JVM
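
To make the saving concrete, here is a minimal sketch in plain Java (no Giraph or Hadoop classes) that compares the character payload of an ASCII-only URI stored as two-byte chars against the same text stored as UTF-8 bytes. The class name, method names, and the example URI are hypothetical, and object headers and array overhead are ignored. Whether -XX:+UseCompressedStrings is available depends on the JVM version, so treat this as an illustration of the idea rather than a measurement of any particular JVM.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Giraph or Hadoop.
final class AsciiStringFootprint {

    /** Payload size when every character is stored as a two-byte UTF-16 code unit. */
    static int utf16PayloadBytes(String s) {
        return s.length() * 2;
    }

    /** Payload size when the same text is stored as UTF-8 (one byte per ASCII character). */
    static int utf8PayloadBytes(String s) {
        return s.getBytes(StandardCharsets.UTF_8).length;
    }

    public static void main(String[] args) {
        String uri = "http://example.org/resource/42"; // made-up example URI
        System.out.println("two-byte chars: " + utf16PayloadBytes(uri) + " bytes");
        System.out.println("UTF-8 bytes:    " + utf8PayloadBytes(uri) + " bytes");
    }
}
```

For ASCII-only text the byte representation is half the size of the two-byte representation, which is where the saving for URI-heavy data would come from.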

Re: Resources or advice on minimising memory usage in Giraph/Hadoop code ?

2012-06-06 Thread Claudio Martella
Sent from my HTC Inspire™ 4G on AT&T > > - Reply message - > From: "Avery Ching" <ach...@apache.org> > To: "user@giraph.apache.org" <user@giraph.apache.org>

Re: Resources or advice on minimising memory usage in Giraph/Hadoop code ?

2012-06-06 Thread David Garcia
pache.org" Subject: Resources or advice on minimising memory usage in Giraph/Hadoop code ? Date: Wed, Jun 6, 2012 10:33 pm No article or book, but here are a few tips. 1) Use aggregators! This can drastically reduce the amount of memory used by combining messages on the server side

Re: Resources or advice on minimising memory usage in Giraph/Hadoop code ?

2012-06-06 Thread Avery Ching
No article or book, but here are a few tips. 1) Use aggregators! This can drastically reduce the amount of memory used by combining messages on the server side. 2) -Dmapred.child.java.opts="-Xss128k" or some other value (should affect the RPC threads or netty threads) 3) You'll want to minimi
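
The idea behind tip 1 can be sketched in plain Java. This is not the Giraph API: the class and method names below are hypothetical, and the sketch assumes the messages can be merged with a sum. Instead of buffering every message sent to a vertex, the buffer keeps one combined value per destination.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration of combining messages; not a Giraph class.
final class SumCombinerSketch {

    // One pending value per destination vertex instead of a list of messages.
    private final Map<String, Double> pending = new HashMap<>();

    /** Merges a new message into the value already buffered for the target vertex. */
    void send(String targetVertexId, double value) {
        pending.merge(targetVertexId, value, Double::sum);
    }

    /** Number of values currently held in memory. */
    int bufferedValues() {
        return pending.size();
    }

    /** Combined value for a vertex, or 0.0 if nothing was sent to it. */
    double valueFor(String vertexId) {
        return pending.getOrDefault(vertexId, 0.0);
    }

    public static void main(String[] args) {
        SumCombinerSketch buffer = new SumCombinerSketch();
        for (int i = 0; i < 1000; i++) {
            buffer.send("v" + (i % 3), 1.0); // 1000 sends spread over 3 vertices
        }
        System.out.println(buffer.bufferedValues() + " buffered values for 1000 sends");
    }
}
```

The buffer grows with the number of destination vertices rather than with the number of messages, which is the memory effect the tip describes. It only applies when the vertex computation needs the combined value and not the individual messages.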

Resources or advice on minimising memory usage in Giraph/Hadoop code ?

2012-06-05 Thread Benjamin Heitmann
Hello, can somebody recommend a web page, article or book on minimising the memory usage of Giraph/Hadoop code? I am looking for non-obvious advice on what *not* to do, and for best practices on what to do inside of Hadoop... E.g. is it preferable to use Java Strings or Hadoop Text Writables