Re: [Ganglia-developers] Ganglia Reference Architecture

Adam Compton Fri, 12 Sep 2014 11:15:13 -0700

On 9/12/14 8:25 AM, daniel.j.marr...@us.hsbc.com wrote:

Greetings all and apologies if I have directed this to the incorrectlist but I thought it might be best to begin with those closest to thesource, so to speak.
I work for a fairly large international bank and we are currentlyevaluating options for collecting and visualizing performance relatedstatistics for the entirety of our UNIX/Linux estate (with thepossibility of including Windows at some point). Naturally I took tothe internet and came across Ganglia as one of the (widely used)possible options. I then spent some time looking through reports ofissues, etc. and have some questions/concerns regarding how best toorganize my infrastructure should I decide to recommend Ganglia as thesolution.
In preparation I thought it best to do some discovery around the sizeof our estate and any details our end users (system administrators,performance engineers, etc) would say needed to be included in themetric set. To that end I would say that we have approximately 26Kservers today and, given rough extrapolation, could easily wind up inthe neighborhood of 4.5M total metrics within the total system. Ourexpectation is to extend the base set of metrics to include any numberof middleware related measurements which is the primary reason for thesignificant number of metrics. We will also be using unicast ..unless, of course, a compelling enough case can be made for thealternative.
My initial instincts are to subdivide the Ganglia infrastructure bymajor data-center with each one represented by a single grid. Iimagine I would need 6-12 clusters (possibly more) per grid and willdefinitely be looking to use rrdcached. I do not know if that will beenough segregation to allow gmetad to perform as required. Several ofmy larger (more influential) end users have indicated a need for somefairly tight resolutions (15s for 4hrs for a number of high valuemetrics).
I guess my initial question is this ... has anyone done anything likethis, at this scale, with any success and - if so - would it bepossible to get some additional information (scrubbed diagram, etc)regarding how it is best done? I've been searching the net and keepcoming back to a single image showing a hierarchy of gmetad and somefairly interesting descriptions of other implementations but nothingthat actually makes it clear to me.

Hi Daniel,

If you haven't already seen it, I would recommend checking out theGanglia O'Reilly book(http://shop.oreilly.com/product/0636920025573.do). It has several casestudies from large and complex organizations, complete with scaleinformation and some benchmarking, that might help aid your planning.


- Adam Compton

------------------------------------------------------------------------------
Want excitement?
Manually upgrade your production database.
When you want reliability, choose Perforce
Perforce version control. Predictably reliable.
http://pubads.g.doubleclick.net/gampad/clk?id=157508191&iu=/4140/ostg.clktrk

_______________________________________________
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers

Re: [Ganglia-developers] Ganglia Reference Architecture

Reply via email to