Hi,

Chukwa uses pig script to analyze the data.  Hence, the analytics is
entirely up to the developer and researcher.  For topN, sorting, it
can be written easily with piglatin.  Take a look of
https://issues.apache.org/jira/browse/CHUKWA-575.  This script is used
to aggregate large node metrics into a cluster summary number.  It
should help you to calculate histogram and load distribution.

For rrd type of down sampling, we need to introduce a Pig UDF which
calculates d(metric)/dt.

regards,
Eric

On Sun, Jan 16, 2011 at 7:17 AM, ZHOU Qi <[email protected]> wrote:
> Hi Guys,
>
> I got used to using ganglia liked software for monitoring and trouble
> shooting cluster with about 100 machines. But with the growth of
> scale, I found out it became more difficult to identify the abnormal
> metrics, machines or the bottle-net part of the current system.
>
> Up to now, we considered to add some features for rrd viewing, such as
> getting the topN, sorting the machine by its metrics, or grouping the
> metrics to find its distribution. We have no more experience on chukwa
> before and I am wondering that is there any templates for metrics
> processing from chukwa (such as sorting, histogram, machine/rack group
> distribution) ?
>
> If you have better idea for viewing these metrics. Would you mind
> introducing it?
>

Reply via email to