Hi Guys, A newbie here needs an expert opinion regarding Linux HPC.
In my current company we have a Linux(Redhat) cluster implementation, say 100 nodes per cluster. I notice that on the problematic cluster, some nodes are low end server say 2GB memory while the other nodes have 4GB memory. This past few weeks I noticed that user problem keeps on growing and base on my investigation, the leftover jobs is always on the compute nodes which are "low end". We manage to stop/kill/restart the jobs but I know that this is only a temporary solution and I wanted a permanent one. 1. I am suspecting that this might be a hardware related problem but I am not 100% sure. I want to get opinion/suggestion first from HPC guru before I make my move to approach the management and raise my case that hardware upgrade is needed. 2. Or can this problem be attributed to the cluster missconfiguration? Thanks in advance. -- Mike Calizo Registered Linux User # 365113 _________________________________________________ Even the longest journey has to start with a small first-step
_________________________________________________ Philippine Linux Users' Group (PLUG) Mailing List plug@lists.linux.org.ph (#PLUG @ irc.free.net.ph) Read the Guidelines: http://linux.org.ph/lists Searchable Archives: http://archives.free.net.ph