stephen
Rajiv wrote:
Dear All,
I have configured 128-node OSCAR in my company. Since all nodes are kept in the same room some of the nodes fail often. I have 10 nodes dedicated to failure recovery. I have two queries in this situation.
1. Is there any opensource cluster management software that keeps track of the hardware details the systems in the cluster so that when one node fails the other node (dedicated for failure recovery) gets waked up automatically shutting down the failed node.
2. Can heart-beat connection be established in OSCAR itself or HA-OSCAR is required. I want to have 10 nodes dedicated for failure recovery. These 10 nodes do not form part of the normal operation of the cluster. When nodes fails these nodes takes control of the failed node shutting down the later and joins the cluster through cluster management software.
Regards,
Rajiv
-- ------------------------------------------------------------------------ Stephen L. Scott, Ph.D. voice: 865-574-3144 Oak Ridge National Laboratory fax: 865-576-5491 P. O. Box 2008, Bldg. 5600, MS-6016 [EMAIL PROTECTED] Oak Ridge, TN 37831-6016 http://www.csm.ornl.gov/~sscott/ ------------------------------------------------------------------------
------------------------------------------------------- This SF.Net email is sponsored by the new InstallShield X.
From Windows to Linux, servers to mobile, InstallShield X is the
one installation-authoring solution that does it all. Learn more and evaluate today! http://www.installshield.com/Dev2Dev/0504 _______________________________________________ Oscar-devel mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/oscar-devel
