Hi all,

I have some money available to replace the infrastructure nodes of one of
my company's grid engine clusters and I wanted a sanity check before I
order anything new.

Initially we contacted the company we originally bought the cluster from
and they quoted us for a combined login/storage/master node with loads of
everything and a hefty price tag. I feel an aversion to combining login
nodes with storage and master nodes - we already have that on one of the
clusters and a user being able to crash the entire cluster seems a bad
thing to me and it happened often enough.

I read Rayson's blog post about scaling grid engine to 10k nodes at
http://blogs.scalablelogic.com/2012/11/running-10000-node-grid-engine-cluster.html
and it seems that 4 cores and 1 GB of memory is more than enough to run a
grid engine master. Given that I'd be lucky to have 100 nodes to a master,
can anybody see a reason to spec a high powered master node? I look at my
existing master nodes with 8+ cores and 24+ GB of memory and in Ganglia all
I see is acres of green from memory being used as cache and buffers. It
seems rather a waste.

The other thing I was curious about is what kind of spec seems reasonable
to you for a login node. My one cluster with separate login nodes has
similar specs to the master nodes - 8 cores, 24 GB memory and it seems
wasted. I can see an argument for these nodes to be more than just a low
end box, especially if anybody is trying to do some kind of visualization
on them, but I've never had complaints about them being under-powered yet.

Any thoughts you might have are appreciated.

Thanks
Biggles
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to