Hi,

So I created a formula to add a node. This part work on the exec node. I
created a tar file, source settings.{c,sh} and copied sgeexecd to
/etc/init.d/

However, now, I am trying to add the exec node on the server side. My exec
node will also be a submit host; Here is what I think need to be done (in a
script or though a formula):

- Create a generic sge template for the host
- Create a SGE template for the host group
- Add the host to the host group template
- Modify the host template to match the exec node to add to the server
- Add the node to the sge master through the host template
  -> qconf -Ae $exechostname host.tmpl
- Add the node to the host group:
  -> qconf -Mhgrp @lhosts lhosts.tmpl
- Add the node to the list of submit hosts:
  -> qconf -as $exechostname

Do you think I forget something (except to modify the firewall) ?

Best,

Remy

2016-06-28 12:35 GMT+02:00 William Hay <[email protected]>:

> On Tue, Jun 28, 2016 at 11:06:44AM +0200, R??my Dernat wrote:
> >    Hi,
> >    I am using SGE for some years now. I install my nodes through rocks
> >    cluster with images. However, since Rocks is based on a old version of
> >    CentOS, which is older than the last fedora or RedHat, we think about
> >    moving to a debian base cluster, because we don't need certified stuff
> >    (eg. for IB or nvidia or for some other software/reason).
> >    I will configure my new nodes using the couple FAI/salt.
> >    I have already many salt formulas/recipes.
> >    I would like to manage my future cluster, and obviously my job
> scheduler,
> >    SGE, with salt.
> >    Concerning SGE, I think we will move to the last version of SoGE.
> >    I know that many of you already did it with puppet. I saw this
> repository
> >    on github:
> >    https://github.com/AAFC-MBB/sge-puppet.git
> >    I take a look into this puppet formula, and it seems that there is a
> lot
> >    to configure, not only SGE (ssh keys, iptables, nfs...). BTW, it is
> doing
> >    an update on the exec node, which is *not wanted* here.
> >    On many configuration file, on my master, I have this warning:
> >    ```
> >    # Version: 2011.11p1
> >    #
> >    # DO NOT MODIFY THIS FILE MANUALLY!
> >    #
> >    ```
> >    Indeed, I have OGS/GE 2011.11p1, but how SGE store is configuration
> datas
> >    ? Everything is in configuration files ? If so, why this warning ?
> >    Any idea or best practice to look at ?
> They're not just config files but grid-engine's record of information
> about the object in question.
> They exist to help grid-engine keep track of things over a qmaster
> restart.  The config is
> just part of the information in them.
>
> Take a look at the files for exechosts they contain load values which
> aren't things you configure
> but reports from various load sensors.  If you try writing to this file
> while the qmaster is up
> your changes might get overwritten by gridengine updating a load report or
> the file might get corrupted
> if you try to modify it in place.  There is no documented procedure for
> changing these files
> and then telling a running gridengine to reread them.  Instead you use
> qconf to request a change then
> the qmaster updates the files.
>
> You would also be taking on the burden of keeping cluster queues
> hostgroups and qinstances
> consistent which will be checked automatically if you just use qconf.
>
> William
>
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to