Environment variables for the cluster name and IP address are being added in https://issues.apache.org/jira/browse/WHIRR-325. Does that help? You could extend that approach to add a node ID too.
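
As a rough sketch of how the archive name could be assembled once such variables are available: the variable names CLUSTER_NAME and NODE_ID below are placeholders assumed for illustration, not necessarily what WHIRR-325 defines, and the helper class itself is hypothetical rather than part of Whirr.

```java
import java.util.Map;

// Hypothetical helper (not part of Whirr): builds an archive name of the form
// [bench]-[cluster]-[node]-metrics.tar.gz from environment variables.
final class MetricsArchiveName {
    // Assumed variable names, used only for illustration.
    static final String CLUSTER_VAR = "CLUSTER_NAME";
    static final String NODE_VAR = "NODE_ID";

    // Joins the three parts into the requested file name pattern.
    static String build(String bench, String cluster, String node) {
        return String.format("%s-%s-%s-metrics.tar.gz", bench, cluster, node);
    }

    // Reads cluster and node from the given environment map, falling back to
    // placeholder values when a variable is not set.
    static String fromEnv(String bench, Map<String, String> env) {
        String cluster = env.getOrDefault(CLUSTER_VAR, "unknown-cluster");
        String node = env.getOrDefault(NODE_VAR, "unknown-node");
        return build(bench, cluster, node);
    }

    public static void main(String[] args) {
        // Prints the name derived from the environment of the current process.
        System.out.println(fromEnv("bench", System.getenv()));
    }
}
```

The same composition could just as well be done directly in the shell statement that creates the tarball; the point is only that the node needs both values in its environment when the statement runs.
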
There's also some benchmark code in org.apache.whirr.service.hadoop.integration.benchmark, which might be useful for what you are doing.

Cheers,
Tom

On Mon, Oct 10, 2011 at 10:44 AM, David Alves <[email protected]> wrote:
> Hi All
>
> I'm about to finish writing a small benchmark framework built on top
> of whirr (very simple, uploads data to the cloud, prepares client and server
> clusters, runs clients against servers and uploads results to the cloud
> before destroying all the clusters).
> In the final stage I need to extract the metrics and log output of
> every node in the cluster and put them in cloud storage (e.g., s3) but I'd
> like to package them with the cluster name and node name such as:
> [bench]-[cluster]-[node]-metrics.tar.gz.
> I was implementing the packaging and uploading of these files as
> statements, but I'm having trouble finding the [cluster] and [node] ids.
> Are these env variables or something I can get when I run the ssh
> commands? Or is there some way of knowing against which node a statement is
> being executed (if I call a statement with
> clusterController.runScriptOnNodesMatching(… stmt))?
>
> Cheers
> -david
