Thanks Lissa!

We have 3 service nodes, and plenty of RAM and CPU on them and the MN. It will be a very rare occasion, if ever, that all of the compute nodes need to be booted at once.

I tested out your commands, and am getting strange results now. While it is updating the mypostscript file on the service node that handles the compute node I am using for testing, it seems that it is not using the mypostscript file from the service node when running the otherpkgs postscript with updatenode. This is opposite what I was seeing last time, so I am not sure what is going on. Is otherpkgs treated differently than the other postscripts?

To make sure that it's actually reading the mypostscript file from the management node instead of the service node, I manually edited the OTHERPKGDIR variable contents to have a "45" at the end, then ran "updatenode c100n1 -P otherpkgs". It is indeed reading it from the management node:


[root@master v5.4.8-dl-compute]# updatenode c100n1 -P otherpkgs
c100n1: Tue Jan 21 09:18:39 CST 2014 Running postscript: otherpkgs
c100n1: NFSSERVER=master
c100n1: OTHERPKGDIR=master/install/post/otherpkgs/centos5.4-kernel_three-seven-one/x86_64/v5.4.8-dl-compute45


I also edited the OTHERPKGDIR within the osimage to have a "2" at the end to make sure it's getting pushed out properly to the service nodes, and it is:

[root@master v5.4.8-dl-compute]# lsdef -t osimage v5.4.8-dl-compute
Object name: v5.4.8-dl-compute
<snip>
otherpkgdir=/install/post/otherpkgs/centos5.4-kernel_three-seven-one/x86_64/v5.4.8-dl-compute2


And on the service node:
[root@servicenode01 mypostscripts]# grep OTHERPKGDIR mypostscript.c100n1
OTHERPKGDIR='/install/post/otherpkgs/centos5.4-kernel_three-seven-one/x86_64/v5.4.8-dl-compute2'


Here's an lsdef of the node in question:
[root@master v5.4.8-dl-compute]#
[root@master v5.4.8-dl-compute]# lsdef c100n1
Object name: c100n1
    arch=x86_64
    chain=osimage
    cons=cyclades
    currchain=osimage
    currstate=netboot centos5.4-kernel_three-seven-one-x86_64-v5.4.8-dl-compute
    groups=compute,c100,all
    initrd=xcat/osimage/v5.4.8-dl-compute/initrd-stateless.gz
    kcmdline=imgurl=http://master:80//install/netboot/centos5.4-kernel_three-seven-one/x86_64/v5.4.8-dl-compute/rootimg.sfs XCAT=master:3001 NODE=c100n1 FC=0 ifname=eth0:00:E0:81:32:71:36 netdev=eth0  console=tty0 console=ttyS1,9600
    kernel=xcat/osimage/v5.4.8-dl-compute/kernel
    mac=00:E0:81:32:71:36
    netboot=xnba
    os=centos5.4-kernel_three-seven-one
    postscripts=remoteshell
    power=rackable
    primarynic=eth0
    profile="">     provmethod=v5.4.8-dl-compute
    serialport=1
    serialspeed=9600
    servicenode=servicenode01
    status=booted
    statustime=01-20-2014 17:26:34
    termport=1
    termserver=c100t1
    updatestatus=failed
    updatestatustime=01-21-2014 09:18:47
    xcatmaster=master



Any ideas why it's utilizing the mypostscript file from the headnode instead of the servicenode when running otherpkgs?





On 1/21/2014 7:51 AM, Lissa Valletta wrote:

How many service nodes do you have?   I hope  you have enough  memory, cpus etc on the Management Node and Service nodes to handle 6000 nodes or 500-1000 nodes from each service node.

We also have some hints and tips for Large Cluster tuning.
https://sourceforge.net/apps/mediawiki/xcat/index.php?title=Hints_and_Tips_for_Large_Scale_Clusters

updatenode to the node should  update the mypostscript file when you have precreatemypostscripts=1.  It sounds like something is happening that that does not get downloaded to the nodes.

Try this set the attribute: chtab key=precreatemypostscripts site.value=1
Run an simple updatenode to one of your nodes.

updatenode compute-03 -P mytest

Look in on the MN
/tftpboot/mypostscripts
You should see a file mypostscript.compute-03  where compute-03 is your nodename
It should change (date) each time you run updatenode or nodeset.   That is the file that is put on the node and  modified  during updatenode and should contain the latest changes.  It will be modified and put in /xcatpost/mypostscript

You do have to have /tftpboot mounted on the Service nodes from the Management Node for updatenode,  if you are updating a compute node attached to a service node.  updatenode is actually running on the service node in this case.






Lissa K. Valletta
8-3/B10
Poughkeepsie, NY 12601
(tie 293) 433-3102



Inactive hide details for Russell Jones ---01/20/2014
          09:36:26 PM---Thanks! It is working fine now that I figured
          out what the Russell Jones ---01/20/2014 09:36:26 PM---Thanks! It is working fine now that I figured out what the issue is. We will

From: Russell Jones <russell-l...@jonesmail.me>
To: xcat-user@lists.sourceforge.net,
Date: 01/20/2014 09:36 PM
Subject: Re: [xcat-user] xCAT ignoring my otherpkgdir setting





Thanks!

It is working fine now that I figured out what the issue is. We will gain a lot better performance with site.precreatemypostscripts enabled, so we wish to keep it on. This is a cluster with ~6000 nodes.


On 1/20/2014 7:32 PM, Xiao Peng Wang wrote:

    The question turns to whether to use the site.precreatemypostscripts. This site attribute is used to improve the performance of postscript mechanism in a big cluster.

    If site.precreatpostescripts is enabled, the mypostscript for the node will be generated by nodeset command on xCAT MN. When booting of node, the created mypostscript will be copied to compute node directly for using. The advantage is it can save the computing resource of MN during the node booting process, the shortage is you must run nodeset for each changing which related to postscripts.


    If site.precreatescripts is disabled, nodeset command won't do anything for mypostscirpt. When booting of node, it will try to require the xCAT MN to generate the mypostscript at that time.


    So for your issue, you can try to disable the site.precreatemypostscripts and to see whether it works well.



    Thanks
    Best Regards
    ----------------------------------------------------------------------
    Wang Xiaopeng (王晓朋)
    IBM China System Technology Laboratory
    Tel: 86-10-82453455
    Email:
    w...@cn.ibm.com
    Address: 28,ZhongGuanCun Software Park,No.8 Dong Bei Wang West Road, Haidian District Beijing P.R.China 100193


    Inactive hide details for Russell Jones ---2014/01/18
          03:40:35---I figured out what's going on - we are using the
          site.precreaRussell Jones ---2014/01/18 03:40:35---I figured out what's going on - we are using the  site.precreatemypostscripts=1 option. Even though

    From:
    Russell Jones <russell-l...@jonesmail.me>
    To:
    xcat-user@lists.sourceforge.net,
    Date:
    2014/01/18 03:40
    Subject:
    Re: [xcat-user] xCAT ignoring my otherpkgdir setting



    I figured out what's going on - we are using the site.precreatemypostscripts=1 option. Even though the man page on the site table states that the mypostscript file for the node should be updated when an updatenode is ran as all as a nodeset, this is not happening. I have to do a nodeset on the node to have that file updated.

    There's also a gap here in that since the node utilizes it's local /xcatpost/mypostscript for pulling in variables that the otherpkgs script needs, specifically $OTHERPKGDIR and $OTHERPKGS1, you need to reboot your node or manually push the new copy of this file to the nodes for your changes to OTHERPKGDIR in the osimage to be seen by your node. In my opinion, enhancements to updatenode need to be made so that:

    A) updatenode rewrites the mypostscript file to capture any changes you may have made to your osimage definitions

    and

    B) updatenode pushes this new mypostscript file to the node so that updatenode will be aware of any osimage definition changes.



    On 1/17/2014 12:40 PM, Russell Jones wrote:
      Well, genimage seems to pick up my change just fine - it's just updatenode that doesn't. Here's the temp yum.conf that is created during otherpkgs time with genimage:

      [otherpkgs1]
      name=otherpkgs1
      baseurl=
      file:///install/post/otherpkgs/centos5.4-new-kernel/x86_64/v5.4.8-dl-compute//.
      gpgpcheck=0

      [otherpkgs2]
      name=otherpkgs2
      baseurl=
      file:///install/post/otherpkgs/centos5.4-new-kernel/x86_64/v5.4.8-dl-compute//dell
      gpgpcheck=0


      We are wanting to separate out the otherpkgs based on image name. Being able to configure otherpkgdir is a good feature that seems to work fine during genimage time - I would highly encourage you guys continue with this feature and push it to the otherpkgs postscript.



      On 1/17/2014 12:25 PM, Lissa Valletta wrote:

      .


      Take a look at this documentation ,   the otherpkgdir always ends with the architecture directory and then the directories under that contain the rpms.   I have never seen anything different so this may be a limitation in our code.

      In other words in your case it should be
      otherpkgdir=/install/post/otherpkgs/centos5.4-new-kernel/x86_64

      https://sourceforge.net/apps/mediawiki/xcat/index.php?title=XCAT_iDataPlex_Cluster_Quick_Start#Installing_Additional_Packages_Using_an_Otherpkgs_Pkglist





      Lissa K. Valletta
      8-3/B10
      Poughkeepsie, NY 12601
      (tie 293) 433-3102




      Inactive hide details for Russell Jones ---01/17/2014
            12:42:10 PM---Hi all, I am manually setting my otherpkgdir
            for an osimageRussell Jones ---01/17/2014 12:42:10 PM---Hi all, I am manually setting my otherpkgdir for an osimage that a node belongs

      From:
      Russell Jones <russell-l...@jonesmail.me>
      To:
      xCAT Users Mailing list <xcat-user@lists.sourceforge.net>,
      Date:
      01/17/2014 12:42 PM
      Subject:
      [xcat-user] xCAT ignoring my otherpkgdir setting




      Hi all,

      I am manually setting my otherpkgdir for an osimage that a node belongs to. However when running "updatenode $node -S", the setting is being ignored. The problem is bolded below.

      Example is below, how do I get this thing to look at the right path?


      [root@master v5.4.8-dl-compute]# chdef -t osimage v5.4.8-dl-compute otherpkgdir=/install/post/otherpkgs/centos5.4-new-kernel/x86_64/v5.4.8-dl-compute/
      1 object definitions have been created or modified.

      [root@master v5.4.8-dl-compute]# lsdef -t osimage v5.4.8-dl-compute
      Object name: v5.4.8-dl-compute
       exlist=/install/custom/netboot/centos/v5.4.8-dl-compute.exlist
       groups=all
       imagetype=linux
       kerneldir=/install/netboot/centos5.4-new-kernel/x86_64/v5.4.8-dl-compute
       netdrivers=libphy,tg3,bnx2,e1000,dca,8021q,igb,forcedeth,squashfs,aufs,loop
       nodebootif=eth0
       osarch=x86_64
       osdistroname=centos5.4-new-kernel-x86_64
       osname=Linux
       osvers=centos5.4-new-kernel
       otherpkgdir=/install/post/otherpkgs/centos5.4-new-kernel/x86_64/v5.4.8-dl-compute/
       otherpkglist=/install/custom/netboot/centos/v5.4.8-dl-compute.otherpkgs.pkglist
       pkgdir=/install/centos5.4-new-kernel/x86_64/
       pkglist=/install/custom/netboot/centos/v5.4.8-dl-compute.pkglist
       postinstall=/install/custom/netboot/centos/v5.4.8-dl-compute.postinstall
       postscripts=remoteshell,localrepo5,hardeths,postage.sh
       profile="">  provmethod=netboot
       rootimgdir=/install/netboot/centos5.4-new-kernel/x86_64/v5.4.8-dl-compute


      [root@master v5.4.8-dl-compute]# lsdef c100n4
      Object name: c100n4
      <snip>
       profile="">  provmethod=v5.4.8-dl-compute


      [root@master v5.4.8-dl-compute]# updatenode c100n4 -S
      Performing software maintenance operations. This could take a while, if there are packages to install.

      c100n4: Fri Jan 17 11:32:04 CST 2014 Running postscript: ospkgs
      c100n4: Postscript: ospkgs exited with code 0
      c100n4: Fri Jan 17 11:32:57 CST 2014 Running postscript: otherpkgs
      c100n4: NFSSERVER=master

      c100n4: OTHERPKGDIR=master/install/post/otherpkgs/centos5.4-new-kernel/x86_64

      c100n4: Loaded plugins: changelog, fastestmirror, keys, list-data, tmprepo, verify
      c100n4: Cleaning up Everything
      c100n4: Loaded plugins: changelog, fastestmirror, keys, list-data, tmprepo, verify
      c100n4: Cleaning up Everything
      c100n4: Cleaning up list of fastest mirrors
      c100n4:  yum -y upgrade
      c100n4: Loaded plugins: changelog, fastestmirror, keys, list-data, tmprepo, verify
      c100n4: Loading mirror speeds from cached hostfile
      c100n4:
      http://master/install/post/otherpkgs/centos5.4-new-kernel/x86_64/dell/repodata/repomd.xml: [Errno 14] HTTP Error 404: Not Found
      c100n4: Trying other mirror.
      c100n4: Error: Cannot retrieve repository metadata (repomd.xml) for repository: xcat-otherpkgs0. Please verify its path and try again
      <snip>
      c100n4: Postscript: otherpkgs exited with code 34
      c100n4: Running of Software Maintenance has completed.

      ------------------------------------------------------------------------------
      CenturyLink Cloud: The Leader in Enterprise Cloud Services.
      Learn Why More Businesses Are Choosing CenturyLink Cloud For
      Critical Workloads, Development Environments & Everything In Between.
      Get a Quote or Start a Free Trial Today.

      http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk_______________________________________________
      xCAT-user mailing list

      xCAT-user@lists.sourceforge.net
      https://lists.sourceforge.net/lists/listinfo/xcat-user



      ------------------------------------------------------------------------------
      CenturyLink Cloud: The Leader in Enterprise Cloud Services.
      Learn Why More Businesses Are Choosing CenturyLink Cloud For
      Critical Workloads, Development Environments & Everything In Between.
      Get a Quote or Start a Free Trial Today.

      http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk


      _______________________________________________
      xCAT-user mailing list

      xCAT-user@lists.sourceforge.net
      https://lists.sourceforge.net/lists/listinfo/xcat-user




      ------------------------------------------------------------------------------
      CenturyLink Cloud: The Leader in Enterprise Cloud Services.
      Learn Why More Businesses Are Choosing CenturyLink Cloud For
      Critical Workloads, Development Environments & Everything In Between.
      Get a Quote or Start a Free Trial Today.

      http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk


      _______________________________________________
      xCAT-user mailing list

      xCAT-user@lists.sourceforge.net
      https://lists.sourceforge.net/lists/listinfo/xcat-user
    ------------------------------------------------------------------------------
    CenturyLink Cloud: The Leader in Enterprise Cloud Services.
    Learn Why More Businesses Are Choosing CenturyLink Cloud For
    Critical Workloads, Development Environments & Everything In Between.
    Get a Quote or Start a Free Trial Today.

    http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk_______________________________________________
    xCAT-user mailing list

    xCAT-user@lists.sourceforge.net
    https://lists.sourceforge.net/lists/listinfo/xcat-user



    ------------------------------------------------------------------------------
    CenturyLink Cloud: The Leader in Enterprise Cloud Services.
    Learn Why More Businesses Are Choosing CenturyLink Cloud For
    Critical Workloads, Development Environments & Everything In Between.
    Get a Quote or Start a Free Trial Today.
    http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk


    _______________________________________________
    xCAT-user mailing list
    xCAT-user@lists.sourceforge.net
    https://lists.sourceforge.net/lists/listinfo/xcat-user
------------------------------------------------------------------------------
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments & Everything In Between.
Get a Quote or Start a Free Trial Today.
http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user



------------------------------------------------------------------------------
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments & Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk


_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


------------------------------------------------------------------------------
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments & Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to