Hello Frank

I was able to get the driver installed and working with the second option :)

Thank you !

Best Regards

Antoine Huette


-------- Message d'origine --------
De : "Heckes Frank (CI/OSB4)" <frank.hec...@de.bosch.com>
Date : 16/11/2018 14:29 (GMT+01:00)
À : xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Objet : Re: [xcat-user] nvidia driver on stateless cluster

Hello,

I suppose you mean a node equipped with NVIDIA GPU(s).

There’s one option I currently use to install the driver in image of a 
rhel/centos node.

On a node with the kernel-devel RPM of the target node installed (might be MN 
or a build host of sorts),
run the downloaded driver:

./NVIDIA-Linux-x86_64-390.87.run --add-this-kernel

The node don’t have to be the target node.
This will create a self extracting file customized with the kernel running on 
your target node. ./NVIDIA-Linux-x86_64-390.87-custom.run .
In case the kernel isn’t running on the ‘build’ node you  can specify the 
kernel version and src dir via command-line
Options (see –advanced-options output)

Now can start this version from a postscript. The file might be in a network FS 
share or inside the image and deleted afterwards by running:

NVIDIA-Linux-x86_64-390.87-custom.run –x; ./nvidia-install -s

You need to blacklist the noveau in the diskless boot before.

There’s another possibility to use dkms with the nvidia installer. You’d need 
to chroot (and bind /dev/, /proc/, sys) manually and run the installer with 
–dkms option.

Mit freundlichen Grüßen / Best regards

Frank Heckes

CI Operations - Server Services Sun Solaris, Linux (CI/OSB4)
frank.hec...@de.bosch.com<mailto:frank.hec...@de.bosch.com>


Von: Huette, Antoine <antoine.hue...@bechtle.com>
Gesendet: Freitag, 16. November 2018 12:36
An: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Betreff: [xcat-user] nvidia driver on stateless cluster

Hello,

On a stateless CentOS 7.5 cluster with Quadro GPUs, I need to install the 
Nvidia driver. I’m using the runfile downloaded from the Nvidia website.
What is the suggested procedure ? Is it better to install the driver in the 
osimage, or should I make the installer run when the nodes start ?

The problem I see with the first option is the fact that the driver checks if a 
GPU is present in the system, so I’m not sure if this method can work.

The problem with the second method is that, after trying it, it’s very 
difficult to have a working X server with a Gnome desktop. The driver installer 
needs the node to be in runlevel 3 (multi-user.target) but once it is 
installed, I need to switch to runlevel 5 (graphical.target) which almost never 
works. So far the only way I’ve found is by installing the driver manually on a 
freshly booted node, run nvidia-xconfig to fill the Xorg.conf file, and then 
restarting the gnome services.

Any help on this subject would be much appreciated ! 😊


Best regards,

Antoine Huette
HPC Engineer
antoine.hue...@bechtle.com<mailto:antoine.hue...@bechtle.com> | 
03.67.07.97.37/07.72.31.82.12 |  bechtle.fr<http://www.bechtle.fr/> |

[Bechtle][http://www.bechtle-contenu.fr/signature/facebook.png]<https://www.facebook.com/Bechtle-France-428229243877854/>
   [http://www.bechtle-contenu.fr/signature/twitter.png] 
<https://twitter.com/BechtleFrance>    
[http://www.bechtle-contenu.fr/signature/linkedin.png] 
<https://www.linkedin.com/company/bechtle-france>    
[http://www.bechtle-contenu.fr/signature/youtube.png] 
<https://www.youtube.com/user/BechtleAG>    [Résultat de recherche d'images 
pour "instagram logo"] <https://www.instagram.com/bechtlefrance/?hl=fr>

_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to