I've written more than my share of machine.c's, folks, and in doing so I've noticed that I do the same thing over and over again.

Namely, I almost always chuck the "I'll go query the appropriate subsystem, discard all the data I don't need for this metric, and return the result" method in favor of a method I call, "I'll call a function that gathers all the appropriate data and loads them into a struct, then return whatever's in that struct as my metric."

Most of the reason I did this is that I realized that some metrics are related. It's no good to gather each of your related memory stats fifteen seconds apart - you've got to grab them all at once, otherwise the mem_* values don't add up to mem_total and the words "margin of error" start to become meaningful.
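To make that concrete, here's a minimal sketch of the "populate a struct, then serve metrics from it" pattern. Every name in it (mem_snapshot, refresh_mem, SNAPSHOT_MAX_AGE, the mem_*_func signatures) is hypothetical, and the numbers are obviously fake - the point is that one refresh fills every field at the same instant, so related metrics stay mutually consistent:

```c
#include <time.h>

#define SNAPSHOT_MAX_AGE 1  /* seconds a snapshot stays valid */

struct mem_snapshot {
    unsigned long total;    /* KB */
    unsigned long free;     /* KB */
    unsigned long shared;   /* KB */
    unsigned long buffers;  /* KB */
    time_t taken;           /* when this snapshot was read */
};

static struct mem_snapshot snap;

/* One query fills every field, so mem_free, mem_shared, etc. all
 * describe the same instant and add up to mem_total. */
static void refresh_mem(void)
{
    time_t now = time(NULL);
    if (snap.taken != 0 && now - snap.taken < SNAPSHOT_MAX_AGE)
        return;  /* recent enough: reuse the coherent snapshot */

    /* the platform-specific read goes here; fake numbers for the sketch */
    snap.total   = 1024;
    snap.free    = 256;
    snap.shared  = 128;
    snap.buffers = 640;
    snap.taken   = now;
}

unsigned long mem_total_func(void) { refresh_mem(); return snap.total; }
unsigned long mem_free_func(void)  { refresh_mem(); return snap.free; }
```

Each per-metric function just triggers a (possibly cached) refresh and reads its field, so fifteen seconds can't sneak in between mem_free and mem_total.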

This is especially noticeable with the CPU stats, which had been giving me quite a headache until about 10 minutes ago, when I laid my vengeance upon them*.

So to expand upon one of my caffeinated asides earlier this month, at some point (3.0, sooner, I don't know) we should take a very good look at the machines directory and then drain as much non-platform-specific code as possible out of it and stick it in a utility library.

I am painfully aware that there are about 75 different ways of kicking a running kernel and getting it to spit out metrics. However, many of the raw data structures that come out are similar across platforms, and you end up doing the same calculations on them everywhere. Chances are you will want to convert the amount of free memory from pages into kilobytes. Chances are you will want to do a whole lot of voodoo on CPU ticks in order to get percentages. [insert exceedingly obvious music reference here]
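Those two calculations are exactly the kind of thing a utility library could own. A sketch of what I mean - the helper names and signatures here are made up, not anything currently in the tree:

```c
/* pages -> kilobytes, given the platform's page size in bytes */
static unsigned long pages_to_kb(unsigned long pages, unsigned long page_size)
{
    return pages * (page_size / 1024);
}

/* The CPU-tick voodoo: given two samples of per-state tick counters,
 * return the percentage of elapsed time spent in one state. */
static double cpu_state_pct(const unsigned long prev[],
                            const unsigned long cur[],
                            int nstates, int state)
{
    unsigned long total = 0;
    int i;

    for (i = 0; i < nstates; i++)
        total += cur[i] - prev[i];

    if (total == 0)
        return 0.0;  /* no ticks elapsed between samples */

    return 100.0 * (double)(cur[state] - prev[state]) / (double)total;
}
```

Every platform's machine.c would just hand over raw tick arrays and page counts; the division, deltas, and zero-total edge case live in one place instead of five.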

I would especially like to move the responsibility of implementing metrics away from the machine.c files - in other words, if mtu_func isn't implemented, it should return a zero instead of bringing the build down in a doomed fiery ball of flaming, spherical doom. If machine.c becomes a data collector that populates a struct, and the data processing is done elsewhere, we gain a uniformity of metrics that we don't see so much right now. I am taking it on good faith that the kernel CPU percentages reported in Linux procfs are approximately similar to the figures I'm getting from Solaris, IRIX and Tru64, but I have no real way of guaranteeing this since the data collection and massaging operations are all slightly different.
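One way to get the "unimplemented means zero, not a build failure" behavior is a dispatch table with a default stub, something like the sketch below. All of these names (metric_entry, register_metric, get_metric, real_cpu_user) are hypothetical, just to show the shape of the idea:

```c
#include <string.h>

typedef double (*metric_func)(void);

/* Default handler: any metric machine.c doesn't provide returns 0 */
static double unimplemented_metric(void) { return 0.0; }

struct metric_entry {
    const char *name;
    metric_func func;
};

/* Every known metric starts out pointing at the stub */
static struct metric_entry metrics[] = {
    { "cpu_user", unimplemented_metric },
    { "mtu",      unimplemented_metric },
};

/* machine.c registers only what its platform can actually collect */
void register_metric(const char *name, metric_func f)
{
    size_t i;
    for (i = 0; i < sizeof metrics / sizeof metrics[0]; i++)
        if (strcmp(metrics[i].name, name) == 0)
            metrics[i].func = f;
}

double get_metric(const char *name)
{
    size_t i;
    for (i = 0; i < sizeof metrics / sizeof metrics[0]; i++)
        if (strcmp(metrics[i].name, name) == 0)
            return metrics[i].func();
    return 0.0;  /* unknown metric: zero, not a link error */
}

/* example of a platform actually implementing one */
static double real_cpu_user(void) { return 12.5; }
```

A platform that never calls register_metric("mtu", ...) still links and still answers queries - it just answers zero.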

Plus, new metrics can be implemented (by anyone) without aforesaid fiery doom. Although we still have this XDR metric number thing. That's kind of a bummer. If only there was a way to build the XDR metric hash on-the-fly on a per-cluster basis. Is that a totally absurd concept? There's no "master node" on a cluster so either the nodes would have to democratically assign a metric number to the new XDR or they'd have to each maintain a separate list and we're back to a string representation of the metric in the XDR.

OK, so how about we steal a bit of DHCP? When a "non-standard" (defined in the new metric code of course :) ) metric first appears on a vanilla network (through gmetric, a monitoring core upgrade on one box, etc.), the metric is sent out with a special, distinctive XDR metric value (0, -1, 313378649, 0xDEADBEEF, etc.). The oldest node with a current heartbeat value is expected to assign the new index value for the metric. If it isn't heard from in 15 seconds, the next oldest node responds, etc. Or the "master node" if such a system is implemented. Or each node decides on its own. Or each node multicasts an "election packet" containing its next available metric value. On receiving a heartbeat metric from a new node, the first node to hear it (or the master node) sends an ACK/"here comes some config data"/"I'll get it!" packet over the multicast channel and then starts sending packets with the metric hash info for that cluster.
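On the wire, the DHCP-ish scheme above might look something like this. To be clear, everything here is a guess at how the proposal could be encoded - the packet fields, the sentinel value, and the staggered-election helper are all invented for illustration:

```c
#include <stdint.h>

/* "please assign me an index" sentinel, per the distinctive values
 * suggested above (0, -1, 0xDEADBEEF, ...) */
#define METRIC_INDEX_UNASSIGNED 0xDEADBEEFu

/* first announcement of a non-standard metric on the multicast channel */
struct metric_announce {
    uint32_t index;     /* METRIC_INDEX_UNASSIGNED on first send */
    char     name[32];  /* string name, used until an index exists */
};

/* Staggered election: the oldest node (age_rank 0) answers at once;
 * the i-th oldest waits i * 15 seconds before stepping in, so in the
 * common case exactly one node assigns the index. */
unsigned election_delay_secs(unsigned age_rank)
{
    return age_rank * 15;
}
```

The nice property is that the string name only travels until an index is assigned; after that the XDR stays compact, and a new node joining the cluster just listens for the hash-info packets instead of shipping with a hardcoded metric table.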

This deliberately blurs or erases the line between internal and external metrics. Which might be useful. It also makes it possible to auto-configure new nodes (assuming you trust the multicast channel :) ).

Anyway, just stuff I was thinking about while I was debugging the Tru64 monitoring core. :O!





* - Know why my percentages were off? Because CPU_STATES enumerates from zero with a meaningful last value, and the percentage-calculating code (taken straight from top, but top uses the same CPU_STATES value!) says:
for (i = 0; i < CPU_STATES; i++) ... DOH!  Works nicely with a <= in there...
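For anyone who wants to see the trap spelled out, here's a hypothetical reconstruction (the enumerator names are made up, but the shape matches the bug described above): on a platform where the enum's last value is itself a real state, CPU_STATES names an index, not a count, and the textbook loop bound silently drops the last state.

```c
/* CPU_STATES is the index of the LAST state here, not the count */
enum {
    CPU_USER = 0,
    CPU_NICE,
    CPU_SYS,
    CPU_WAIT,
    CPU_IDLE,
    CPU_STATES = CPU_IDLE  /* meaningful last value: 4, but 5 states exist */
};

unsigned long sum_ticks(const unsigned long ticks[])
{
    unsigned long total = 0;
    int i;

    /* <= because CPU_STATES itself is a valid state index; with the
     * usual < the idle ticks vanish and every percentage drifts */
    for (i = 0; i <= CPU_STATES; i++)
        total += ticks[i];

    return total;
}
```

With `<` the loop sums only four of the five states, so the denominator in every percentage is short by the idle ticks - exactly the kind of quietly-wrong number that makes CPU stats look haunted.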

