Hi all, I have some money available to replace the infrastructure nodes of one of my company's grid engine clusters and I wanted a sanity check before I order anything new.
Initially we contacted the company we originally bought the cluster from, and they quoted us for a combined login/storage/master node with loads of everything and a hefty price tag. I have an aversion to combining login nodes with storage and master nodes. We already have that setup on one of the clusters, and a user being able to crash the entire cluster seems a bad thing to me. It has happened often enough.

I read Rayson's blog post about scaling Grid Engine to 10k nodes at http://blogs.scalablelogic.com/2012/11/running-10000-node-grid-engine-cluster.html and it seems that 4 cores and 1 GB of memory is more than enough to run a Grid Engine master. Given that I'd be lucky to have 100 nodes to a master, can anybody see a reason to spec a high-powered master node? I look at my existing master nodes with 8+ cores and 24+ GB of memory, and in Ganglia all I see is acres of green from memory being used as cache and buffers. It seems rather a waste.

The other thing I was curious about is what kind of spec seems reasonable to you for a login node. My one cluster with separate login nodes gives them similar specs to the master nodes (8 cores, 24 GB of memory), and that seems wasted too. I can see an argument for these nodes being more than just a low-end box, especially if anybody is trying to do some kind of visualization on them, but I've never had complaints about them being under-powered.

Any thoughts you might have are appreciated.

Thanks,
Biggles
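
P.S. In case anyone wants to check the "mostly cache and buffers" observation on their own master without Ganglia, here is a rough Python sketch that reads /proc/meminfo on Linux. The function names are my own, and "application memory" here is the simple total minus free minus buffers minus cached figure, which is an assumption about how to count it and may not match Ganglia's accounting exactly.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of integer kB values."""
    values = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        if not sep:
            continue  # skip anything that is not "Key: value"
        fields = rest.split()
        if fields and fields[0].isdigit():
            values[key.strip()] = int(fields[0])
    return values


def memory_breakdown(values):
    """Split total memory into free, cache/buffers, and application use.

    Assumption: application use = MemTotal - MemFree - Buffers - Cached.
    """
    total = values["MemTotal"]
    free = values["MemFree"]
    cache = values.get("Buffers", 0) + values.get("Cached", 0)
    app = total - free - cache
    return {
        "total_kb": total,
        "free_kb": free,
        "cache_kb": cache,
        "app_kb": app,
        "app_fraction": app / total,
    }


def local_breakdown(path="/proc/meminfo"):
    """Convenience wrapper for the machine this runs on (Linux only)."""
    with open(path) as fh:
        return memory_breakdown(parse_meminfo(fh.read()))
```

Calling local_breakdown() on the master and looking at app_fraction gives a quick idea of how much of the installed memory the daemons actually need, as opposed to what the kernel is using opportunistically for cache.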
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users
