Re: 3D Cluster Performance Visualization

Steve Loughran Fri, 25 Sep 2009 08:09:56 -0700

Brian Bockelman wrote:

;) Unfortunately, I'm going to go out on a limb and guess that we don'twant to add OpenGL to the dependency list for the namenode... The vizapplication actually doesn't depend on the namenode, it uses the datanodes.
Here's the source:
svn://t2.unl.edu/brian/HadoopViz/trunk
The server portion is a bit hardcoded to our site (simply a pythonserver); the client application is pretty cross-platform. I actuallycompile and display the application on my Mac.
Here's how it works:

1) Client issues read() request
2) Datanode services it.  Logs it with log4j
3) One of the log4j appenders is syslog pointing at a separate server
4) Separate log server recieves UDP packets; one packet per read()
5) Log server parses packets and decides whether they are within thecluster or going to the internet- Currently a Pentium 4 throw-away machine; handles up to 4-5k packetsper second before it starts dropping6) Each client opens a TCP stream to the server and receives thetransfer type, source, and dest, then renders appropriately
It's pretty danged close to real-time; the time the client issues theread() request to seeing something plotted is on the order of 1 second.
I'd really like to see this on a big (Yahoo, Facebook, any takers?)cluster.
Brian

Ok, so this is really an example of a datacentre back-end for Log4J,pushing out UDP packets to something else in the datacentre. A niceside-line to the classic hadoop management displays. Add something aboutjobs executing and you are laughing. Do it all in Java3D and you evenhave cross platformness

Re: 3D Cluster Performance Visualization

Reply via email to