Could you also give me an lsxcatd -a on the Management Node. and a tabdump site. Are you using Service Nodes? Also, is this your first install of xCAT and first trying to install nodes? Did you upgrade from a previous release? Give me the output of rpm -qa | grep xCAT on the MN and if you have Service Nodes do the same there.
Thanks! Lissa K. Valletta 2-3/T12 Poughkeepsie, NY 12601 (tie 293) 433-3102 From: Antoine Tabary <[email protected]> To: xCAT Users Mailing list <[email protected]> Date: 11/16/2011 04:12 AM Subject: Re: [xcat-user] Node keeps reinstalling Hello, Some more information. 1/ I was wrong when I said postscripts were not run, they are run. 2/ During install nodestat keeps saying "installing prep" which confirms that for one reason or another the node does not communicate its status to xcatd Here is /tmp/mypostscript on n105 : n105:~ # cat /tmp/mypostscript BLADEMAXP='64' export BLADEMAXP DOMAIN='dsi.upmc.fr' export DOMAIN FSPTIMEOUT='0' export FSPTIMEOUT INSTALLDIR='/install' export INSTALLDIR IPMIMAXP='64' export IPMIMAXP IPMIRETRIES='3' export IPMIRETRIES IPMITIMEOUT='2' export IPMITIMEOUT CONSOLEONDEMAND='yes' export CONSOLEONDEMAND SITEMASTER=172.29.101.2 export SITEMASTER MASTER=172.29.101.2 export MASTER FORWARDERS='134.157.1.23,134.157.0.29' export FORWARDERS NAMESERVERS='172.29.101.2' export NAMESERVERS MAXSSH='8' export MAXSSH PPCMAXP='64' export PPCMAXP PPCRETRY='3' export PPCRETRY PPCTIMEOUT='0' export PPCTIMEOUT SHAREDTFTP='1' export SHAREDTFTP SNSYNCFILEDIR='/var/xcat/syncfiles' export SNSYNCFILEDIR TFTPDIR='/tftpboot' export TFTPDIR XCATDPORT='3001' export XCATDPORT XCATIPORT='3002' export XCATIPORT XCATCONFDIR='/etc/xcat' export XCATCONFDIR TIMEZONE='Europe/Paris' export TIMEZONE USENMAPFROMMN='no' export USENMAPFROMMN ENABLEASMI='no' export ENABLEASMI DB2INSTALLLOC='/mntdb2' export DB2INSTALLLOC DATABASELOC='/var/lib' export DATABASELOC SSHBETWEENNODES='ALLGROUPS' export SSHBETWEENNODES DNSHANDLER='ddns' export DNSHANDLER VSFTP='y' export VSFTP CLEANUPXCATPOST='no' export CLEANUPXCATPOST ENABLESSHBETWEENNODES=YES export ENABLESSHBETWEENNODES NODE=n105 export NODE NFSSERVER=172.29.101.2 export NFSSERVER PRIMARYNIC=eth0 export PRIMARYNIC OSVER=sles11.1 export OSVER ARCH=x86_64 export ARCH PROFILE=compute export PROFILE PATH=`dirname $0`:$PATH export PATH NODESETSTATE='boot' export NODESETSTATE UPDATENODE=0 export UPDATENODE NTYPE=compute export NTYPE MACADDRESS='E4:1F:13:4D:35:88' export MACADDRESS MONSERVER=chou-mgmt export MONSERVER MONMASTER=172.29.101.2 export MONMASTER OSPKGS='@base,@x11,xntp,rsync' export OSPKGS # postscripts-start-here syslog remoteshell syncfiles fsmnt gpfsinst # postscripts-end-here n105:~ # And here is lsdef (I have done a "nodeset n105 boot" while n105 was installing rpms to stop n105 from always reinstalling): chou-mgmt:~ # lsdef n105 Object name: n105 arch=x86_64 bmc=n105-bmc bmcport=0 currchain=boot currstate=boot groups=ipminocons,frame1,compute,all initrd=xcat/sles11.1/x86_64/initrd kcmdline=autoyast=http://172.29.101.2/install/autoinst/n105 install=http://172.29.101.2/install/sles11.1/x86_64/1 netdevice=eth0 kernel=xcat/sles11.1/x86_64/linux mac=E4:1F:13:4D:35:88 mgt=ipmi netboot=xnba nfsserver=172.29.101.2 os=sles11.1 postbootscripts=otherpkgs,dhcpno postscripts=syslog,remoteshell,syncfiles,fsmnt,gpfsinst primarynic=eth0 profile=compute provmethod=install status=booted statustime=11-16-2011 09:44:29 switch=switch2 switchport=5 chou-mgmt:~ # Thank you for your help. Best regards, Antoine 17 Avenue De L'europe (Embedded Tabary image moved to file: pic32189.gif) Certified Bois Colombes Cedex, HPC I/T 92275 Specialist 0233AA France ITS e-mail: [email protected] From: Lissa Valletta <[email protected]> To: xCAT Users Mailing list <[email protected]> Cc: xCAT Users Mailing list <[email protected]> Date: 15/11/2011 19:58 Subject: Re: [xcat-user] Node keeps reinstalling Could you give me the entire contents of /tmp/mypostscript file from the node? Also give lsdef <nodename> Lissa K. Valletta 2-3/T12 Poughkeepsie, NY 12601 (tie 293) 433-3102 From: Antoine Tabary <[email protected]> To: xCAT Users Mailing list <[email protected]> Date: 11/15/2011 11:10 AM Subject: Re: [xcat-user] Node keeps reinstalling Hello, The MASTER variable is OK, but the block : # subroutine used to run postscripts ... # subroutine end is missing Any idea ? Thank you for your help. Best regards, Antoine 17 Avenue De L'europe (Embedded Tabary image moved to file: pic38825.gif) Certified Bois Colombes Cedex, HPC I/T 92275 Specialist 0233AA France ITS e-mail: [email protected] From: Lissa Valletta <[email protected]> To: xCAT Users Mailing list <[email protected]> Cc: xCAT Users Mailing list <[email protected]> Date: 15/11/2011 13:39 Subject: Re: [xcat-user] Node keeps reinstalling Make sure that the site.master attribute is an ip address of hostname that will be resolvable during the install. The usual problem is the node cannot contact the MN to tell it, it is finished. On the node you can look at the /tmp/mypostscript file. It should look something like the following. The last line is important, because that is the we are finished line. But it cannot be sent to the MN unless $MASTER is an address that is resolvable on the node at that point. If this file is empty, it is a pretty good sign that the node could not contact the MN after install. # subroutine used to run postscripts . . . } # subroutine end BLADEMAXP='64' export BLADEMAXP DOMAIN='cluster.com' export DOMAIN FSPTIMEOUT='0' export FSPTIMEOUT . . MASTER=10.16.0.103 export MASTER . # postscripts-start-here run_ps remoteshell run_ps syncfiles run_ps syslog run_ps new_set run_ps setbootfromnet # postscripts-end-here # postbootscripts-start-here run_ps otherpkgs # postbootscripts-end-here updateflag.awk $MASTER 3002 "installstatus booted" Lissa K. Valletta 2-3/T12 Poughkeepsie, NY 12601 (tie 293) 433-3102 From: Antoine Tabary <[email protected]> To: xCAT Users Mailing list <[email protected]> Date: 11/15/2011 05:45 AM Subject: [xcat-user] Node keeps reinstalling Hello, We are tryoing to install an iDataplex cluster with SLES11 nodes. The /tftpboot/xcat/xnba/nodes/n001 is not reset during installation so the node keeps reinstalling itself. unless we type "nodeset n101 boot" before it reboots. Any suggestion on what we have not configured ? It does not run postscripts nor postbootscripts either. We are running xcat 2.6.8 with xcat-dep-201111041626 Thank you for your help. Best regards, Antoine 17 Avenue De L'europe (Embedded Tabary image moved to file: pic40165.gif) Certified Bois Colombes Cedex, HPC I/T 92275 Specialist 0233AA France ITS e-mail: [email protected] Sauf indication contraire ci-dessus:/ Unless stated otherwise above: Compagnie IBM France Siège Social : 17 avenue de l'Europe, 92275 Bois-Colombes Cedex RCS Nanterre 552 118 465 Forme Sociale : S.A.S. Capital Social : 639.291.962.10 € SIREN/SIRET : 552 118 465 03644 - Code NAF 6202A ------------------------------------------------------------------------------ RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 _______________________________________________ xCAT-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/xcat-user [attachment "pic40165.gif" deleted by Antoine Tabary/France/IBM] ------------------------------------------------------------------------------ RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 _______________________________________________ xCAT-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/xcat-user Sauf indication contraire ci-dessus:/ Unless stated otherwise above: Compagnie IBM France Siège Social : 17 avenue de l'Europe, 92275 Bois-Colombes Cedex RCS Nanterre 552 118 465 Forme Sociale : S.A.S. Capital Social : 639.291.962.10 € SIREN/SIRET : 552 118 465 03644 - Code NAF 6202A ------------------------------------------------------------------------------ RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 _______________________________________________ xCAT-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/xcat-user [attachment "pic38825.gif" deleted by Antoine Tabary/France/IBM] ------------------------------------------------------------------------------ RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1_______________________________________________ xCAT-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/xcat-user Sauf indication contraire ci-dessus:/ Unless stated otherwise above: Compagnie IBM France Siège Social : 17 avenue de l'Europe, 92275 Bois-Colombes Cedex RCS Nanterre 552 118 465 Forme Sociale : S.A.S. Capital Social : 639.291.962.10 € SIREN/SIRET : 552 118 465 03644 - Code NAF 6202A ------------------------------------------------------------------------------ RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 _______________________________________________ xCAT-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/xcat-user
<<attachment: pic32189.gif>>
------------------------------------------------------------------------------ RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1
_______________________________________________ xCAT-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/xcat-user
