My first installation was xCAT 2.8. The problem was already present on xCAT 2.8 in all the scenarios I tried.
So I updated it to xCAT 2.8.1 to check whether that would solve the issue. (I did not do a fresh install of xCAT 2.8.1; I just updated from xCAT 2.8.)

On Thu, Jun 13, 2013 at 2:35 PM, Lissa Valletta <[email protected]> wrote:

> Is this your first installation or did you just upgrade to xcat2.8.1 and
> this was working before?
>
> The use of the postscript templates is documented here, including how to
> use precreatemypostscripts. *You do not need to do this, it is optional.*
>
> https://sourceforge.net/apps/mediawiki/xcat/index.php?title=Postscripts_and_Prescripts
>
> Your custom postscript "sab-basec6-setup" should be put in the
> /install/postscripts directory as always, and you modify the postscripts
> table, postscripts attribute, to say when you want it to run. If it is
> still in the /tftpboot/mypostscripts directory, remove it.
>
> As far as the dsa_key is concerned, it may be that the dsa_key on the
> Management Node has become corrupted. I would suggest running
>     xcatconfig -k
> to regenerate the root ssh keys, then run
>     updatenode scl-p03-mn -V -k
> (you will be prompted for the password). This will run remoteshell and
> you can check to see if it runs cleanly.
> On the node make sure
>
> I would then set site.precreatemypostscripts=0
> Then run your install again
>
> nodeset scl-p03-mn osimage=centos6.4-x86_64-install-all
> rsetboot scl-p03-mn net
> rpower scl-p03-mn boot
>
> It is true the wget will not find the file, but it will then run another
> command from the node (getpostscript.awk) that will create the file and
> then can download it.
> The wget and the getpostscript.awk will not work if the node cannot
> contact the Management node by the ip address defined for it in the site
> table master attribute. I am assuming you are not using service nodes.
>
> Make sure SELinux is not enabled on the MN also.
>
> https://sourceforge.net/apps/mediawiki/xcat/index.php?title=XCAT_iDataPlex_Cluster_Quick_Start#.5BRH.5D_Ensure_that_SELinux_is_Disabled
>
> Lissa K. Valletta
> 8-3/B10
> Poughkeepsie, NY 12601
> (tie 293) 433-3102
>
> From: Lohit Valleru <[email protected]>
> To: [email protected],
> Date: 06/13/2013 01:35 PM
> Subject: [xcat-user] getcredentials ssh_dsa_hostkey hangs - node does not
> deploy
> ------------------------------
>
> With xCAT Version 2.8.1 (svn r16213, built Tue May 7 22:55:07 EDT 2013).
>
> Name        : xCAT                            Relocations: /opt/xcat
> Version     : 2.8.1                           Vendor: IBM Corp.
> Release     : snap201305072259                Build Date: Tue 07 May 2013 10:59:39 PM EDT
> Install Date: Tue 11 Jun 2013 11:52:18 PM EDT Build Host: c97p3l4vp01.ppd.pok.ibm.com
> Group       : Applications/System             Source RPM: xCAT-2.8.1-snap201305072259.src.rpm
> Size        : 0                               License: EPL
> Signature   : DSA/SHA1, Tue 07 May 2013 10:59:48 PM EDT, Key ID 20e475a8da736c68
> Packager    : IBM Corp.
> Summary     : Meta-package for a common, default xCAT setup
> Description :
>
> perl-xCAT.noarch                    4:2.8.1-snap201305072255
> xCAT.x86_64                         2.8.1-snap201305072259
> xCAT-client.noarch                  4:2.8.1-snap201305072255
> xCAT-genesis-base-x86_64.noarch
> xCAT-genesis-scripts-x86_64.noarch
> xCAT-server.noarch                  4:2.8.1-snap201305072255
>
> Please find the procedure as below :
>
> *1. First Scenario*
>
> During Normal Installation of a Stateful Node, with following commands.
>
> nodeset scl-p03-mn osimage=centos6.4-x86_64-install-all
> rsetboot scl-p03-mn net
> rpower scl-p03-mn boot
>
> Installation of OS completes fine, but default xCAT postscripts -
> syslog,remoteshell,syncfiles - do not run.
> Neither do my custom postscripts run.
>
> /var/log/httpd/access_log shows:
>
> - - [10/Jun/2013:12:00:42 -0400] "GET /tftpboot/mypostscripts/mypostscript.scl-p03-mn.tmp HTTP/1.0" 404 328 "-" "Wget/1.12 (linux-gnu)"
> - - [10/Jun/2013:12:01:15 -0400] "GET /tftpboot/mypostscripts/mypostscript.scl-p03-mn.tmp HTTP/1.0" 404 328 "-" "Wget/1.12 (linux-gnu)"
> - - [10/Jun/2013:12:01:46 -0400] "GET /tftpboot/mypostscripts/mypostscript.scl-p03-mn.tmp HTTP/1.0" 404 328 "-" "Wget/1.12 (linux-gnu)"
> - - [10/Jun/2013:12:02:21 -0400] "GET /tftpboot/mypostscripts/mypostscript.scl-p03-mn.tmp HTTP/1.0" 404 328 "-" "Wget/1.12 (linux-gnu)"
> - - [10/Jun/2013:12:02:58 -0400] "GET /tftpboot/mypostscripts/mypostscript.scl-p03-mn.tmp HTTP/1.0" 404 328 "-" "Wget/1.12 (linux-gnu)"
> - - [10/Jun/2013:12:03:29 -0400] "GET /tftpboot/mypostscripts/mypostscript.scl-p03-mn.tmp HTTP/1.0" 404 328 "-" "Wget/1.12 (linux-gnu)"
>
> It goes into a loop searching for mypostscript.scl-p03-mn or
> mypostscript.scl-p03-mn.tmp, and after 10 retries it gives up and boots
> without running the xCAT default postscripts or my custom postscript,
> "sab-basec6-setup".
>
> So when I change my custom postscript name "sab-basec6-setup" to
> "mypostscript.scl-p03-mn" and copy it into /tftpboot/mypostscripts/,
> xCAT detects the script during postinstall and runs it, as I can find the
> log after reboot and the respective changes.
> However, xCAT still doesn't run its default xCAT postscripts -
> syslog,remoteshell,syncfiles.
>
> *Second Scenario :*
>
> I researched this topic and found that enabling
> precreatemypostscripts=1 might solve this problem.
>
> After this step:
>
> If I run: nodeset scl-p03-mn osimage=centos6.4-x86_64-install-all
>
> I see that the xCAT default mypostscript gets created inside /tftpboot ->
> mypostscript.scl-p03-mn
>
> Now when I run the below commands:
>
> nodeset scl-p03-mn osimage=centos6.4-x86_64-install-all
> rsetboot scl-p03-mn net
> rpower scl-p03-mn boot
>
> Installation of OS completes fine, but it hangs when running postscripts.
>
> /var/log/messages shows:
>
> Jun 12 12:07:15 no kernel: imklog 5.8.10, log source = /proc/kmsg started.
> Jun 12 12:07:15 no rsyslogd: [origin software="rsyslogd" swVersion="5.8.10" x-pid="12708" x-info="http://www.rsyslog.com"] start
> Jun 12 12:07:15 no xCAT: Install: syslog setup
> Jun 12 12:07:15 no xcat: Install: setup /etc/ssh/sshd_config
> Jun 12 12:07:15 no xcat: Install: setup root .ssh
> Jun 12 12:07:37 no xCAT: Before ssh_dsa_hostkey is checked for error
> Jun 12 12:07:37 no xCAT: ssh_dsa_hostkey
>
> After some research, I found that it never comes out of the infinite loop
> inside the "remoteshell" script.
>
> while [ -z "$MYCONT" ]; do
>     let SLI=$RANDOM%10
>     let SLI=SLI+10
>     sleep $SLI
>     getcredentials.awk ssh_dsa_hostkey | grep -v '<'|sed -e 's/&lt;/</' -e 's/&gt;/>/' -e 's/&amp;/&/' -e 's/&quot;/"/' -e "s/&apos;/'/" > /etc/ssh/ssh_host_dsa_key
>     MYCONT=`cat /etc/ssh/ssh_host_dsa_key`
> done
>
> I believe getcredentials.awk ssh_dsa_hostkey is not working as expected.
>
> After abruptly cancelling the installation and rebooting the node, I find that
> /tmp/ssh_dsa_hostkey is blank (which is the one getcredentials.awk is
> trying to read).
>
> Also, I observed that the .ssh directory is not created inside root, though
> the code has passed that stage in remoteshell.
>
> if [ -d /xcatpost/_ssh ]
> then
>     logger -p local4.info -t xcat "Install: setup root .ssh"
>     cd /xcatpost/_ssh
>     mkdir -p /root/.ssh
>     cp -f * /root/.ssh
>     cd - >/dev/null
>     chmod 700 /root/.ssh
>     chmod 600 /root/.ssh/*
> fi
>
> I see that /var/log/messages mentions "Install: setup root .ssh", but
>     mkdir -p /root/.ssh
>     cp -f * /root/.ssh
> does not seem to be working.
>
> *Third Scenario :*
>
> I have also tried rinstall, giving custom definitions with
> provmethod=install.
>
> rinstall -o centos6.4 -p all -a x86_64 scl-p03-mn
>
> This did not make any difference.
>
> If /tftpboot/mypostscripts/mypostscript.scl-p03-mn exists (where
> mypostscript.scl-p03-mn is the default template created by xCAT), then it
> goes into an infinite loop.
>
> -------------------------------------------------------
>
> tcpdump shows communication between xCAT port 3001 and the node, but I
> could not make any sense of the dump.
>
> May I please know how we can correct this issue, and what is the correct
> procedure to run postscripts?
>
> Are we supposed to enable precreatemypostscripts=1 to run default
> postscripts? Or does it not need to be enabled, and nodeset is supposed to
> create mypostscript inside /tftpboot/mypostscripts?
>
> Other than the default xCAT postscripts, all I do is add my custom
> postscripts to the node definition, but they do not run, along with the
> default postscripts.
>
> Thanks,
>
> Lohit Valleru
>
> ------------------------------------------------------------------------------
> This SF.net email is sponsored by Windows:
>
> Build for Windows Store.
>
> http://p.sf.net/sfu/windows-dev2dev
> _______________________________________________
> xCAT-user mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/xcat-user
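
The remoteshell loop quoted in the second scenario has no exit condition, so an empty reply from getcredentials.awk keeps the node waiting forever. While debugging, a bounded variant can make the failure visible instead of hanging. The following is only a minimal sketch under stated assumptions: the function name fetch_with_retry and its arguments (output file, maximum tries, sleep seconds) are hypothetical and not part of xCAT, and the grep/sed filtering of the real script is left out for brevity.

```shell
#!/bin/bash
# Sketch only: fetch_with_retry is a hypothetical helper, not an xCAT function.
# Usage: fetch_with_retry OUTFILE MAX_TRIES SLEEP_SECS COMMAND [ARGS...]
# Runs COMMAND until it writes non-empty output to OUTFILE or MAX_TRIES is reached.
fetch_with_retry() {
    local outfile=$1 max_tries=$2 sleep_secs=$3
    shift 3
    local try=1
    while [ "$try" -le "$max_tries" ]; do
        # Keep the result only if the command succeeded and wrote something.
        if "$@" > "$outfile" && [ -s "$outfile" ]; then
            return 0
        fi
        echo "attempt $try/$max_tries returned no data" >&2
        try=$((try + 1))
        if [ "$try" -le "$max_tries" ]; then
            sleep "$sleep_secs"
        fi
    done
    return 1
}

# Demonstration with a stand-in command in place of getcredentials.awk.
demo_file=$(mktemp)
if fetch_with_retry "$demo_file" 3 0 printf 'dummy-key\n'; then
    echo "fetched: $(cat "$demo_file")"
else
    echo "gave up after 3 attempts" >&2
fi
rm -f "$demo_file"
```

On a node, the call might look like "fetch_with_retry /etc/ssh/ssh_host_dsa_key 10 15 getcredentials.awk ssh_dsa_hostkey", where the limits 10 and 15 are arbitrary example values. A non-zero return then points at the credential request itself rather than at the postscript.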
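
The repeated 404 entries in access_log from the first scenario can be summarised per requested file, which makes it easier to see which node is polling for a mypostscript file that was never generated. This is a rough sketch, assuming the log lines have the request and status laid out as in the excerpt quoted above; the function name count_mypostscript_404 is hypothetical.

```shell
#!/bin/bash
# Sketch only: count_mypostscript_404 is a hypothetical helper.
# Reads Apache access-log lines on stdin and prints "<count> <path>"
# for every mypostscript.* request that was answered with 404.
count_mypostscript_404() {
    awk '
        match($0, /"GET [^ ]*\/mypostscript\.[^ ]* HTTP\/[0-9.]+" 404 /) {
            # The matched text is: "GET <path> HTTP/x.y" 404
            req = substr($0, RSTART, RLENGTH)
            split(req, parts, " ")
            missing[parts[2]]++
        }
        END {
            for (p in missing) print missing[p], p
        }
    '
}

# Example invocation against the log file named in the thread:
# count_mypostscript_404 < /var/log/httpd/access_log
```

Matching on the request string rather than on fixed field positions keeps the sketch working even when the client address column is missing, as it is in the excerpt.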
