Hi everyone,

thanks for the pointers.   I decided to go back to the very beginning and did a 
clean reinstall of xcat:
Version 2.14.1 (git commit 70d6e7f93cc9714a127c22df2e7ca53d4996a34c, built Fri 
Jun  1 03:00:53 EDT 2018)

then I walked through the documentation - 
https://xcat-docs.readthedocs.io/en/stable - and it works slighly better now.  
I’m no longer getting udev errors, but I’m still getting journald errors:
code killed, status 6/ABRT
on restart ‘/run/log/journal/<string>/system.journal corrupted or uncleanly 
shut down.

which looks like it might be a space/memory issue?

In any case, even just after boot, I have the same problem where I can’t ssh to 
the node or rcons, or even get a console prompt on the drac card (it’s a dell 
C6420).  It’s pingable at the correct ip address.

As per the email below, I checked the image for pkglist, exlist, and postinall:

Object name: SL7.4-statelite-v1
    exlist=/opt/xcat/share/xcat/netboot/rh/compute.rhels7.x86_64.exlist
    imagetype=linux
    osarch=x86_64
    osdistroname=SL7.4-x86_64
    osname=Linux
    osvers=SL7.4
    otherpkgdir=/install/post/otherpkgs/SL7.4/x86_64
    permission=755
    pkgdir=/install/SL7.4/x86_64
    pkglist=/opt/xcat/share/xcat/netboot/rh/compute.rhels7.x86_64.pkglist
    
postinstall=/opt/xcat/share/xcat/netboot/rh/compute.rhels7.x86_64.postinstall
    profile=compute
    provmethod=statelite
    rootimgdir=/install/netboot/SL7.4/x86_64/compute

I had a brief moment where I thought it might be an selinux problem, but in the 
rootimg selinux is disabled in /etc/selinux/config ...
the litefile is standard, but I’m thinking that I might change /var and /run to 
persistent to see if I can some extra insight into what’s happening on the node.
#image,file,options,comments,disable
"ALL","/etc/adjtime","tmpfs",,
"ALL","/etc/securetty","tmpfs",,
"ALL","/etc/lvm/","tmpfs",,
"ALL","/etc/ntp.conf","tmpfs",,
"ALL","/etc/rsyslog.conf","tmpfs",,
"ALL","/etc/rsyslog.conf.XCATORIG","tmpfs",,
"ALL","/etc/udev/","tmpfs",,
"ALL","/etc/ntp.conf.predhclient","tmpfs",,
"ALL","/etc/resolv.conf","tmpfs",,
"ALL","/etc/yp.conf","tmpfs",,
"ALL","/etc/resolv.conf.predhclient","tmpfs",,
"ALL","/etc/sysconfig/","tmpfs",,
"ALL","/etc/ssh/","tmpfs",,
"ALL","/etc/inittab","tmpfs",,
"ALL","/tmp/","tmpfs",,
"ALL","/var/","tmpfs",,
"ALL","/opt/xcat/","tmpfs",,
"ALL","/xcatpost/","tmpfs",,
"ALL","/etc/systemd/system/multi-user.target.wants/","tmpfs",,
"ALL","/root/.ssh/","tmpfs",,
"ALL","/etc/rc3.d/","tmpfs",,
"ALL","/etc/rc2.d/","tmpfs",,
"ALL","/etc/rc4.d/","tmpfs",,
"ALL","/etc/rc5.d/","tmpfs",,

I’m booting with rd.debug and rd.break=cleanup, but I don’t get a shell – I 
think because the root image *is* mounting.

As I said, thanks for the thoughts, and I just wanted to make sure that people 
know that I appreciate the input,

Best,

Jeff Berry




From: Yuan Y Bai [mailto:by...@cn.ibm.com]
Sent: 12 June 2018 10:01
To: xcat-user@lists.sourceforge.net
Cc: xcat-user@lists.sourceforge.net
Subject: Re: [xcat-user] SciLinux 7.4 statelite problems

Hi Jeff,

Could you check your osimage definition about exlist, pkglist and postinstall?
We do not formal ship compute.SL7.pkglist, we user the same files for rhels7. 
so could you try to use the rhels7 related files for your osimage?

Here I give you an example for osimage, you can find the right arch files under 
/opt/xcat/share/xcat/netboot/rh/:
]# lsdef -t osimage rhels7.4-x86_64-statelite-compute -i 
exlist,pkglist,postinstall
Object name: rhels7.4-x86_64-statelite-compute
    exlist=/opt/xcat/share/xcat/netboot/rh/compute.rhels7.x86_64.exlist
    pkglist=/opt/xcat/share/xcat/netboot/rh/compute.rhels7.x86_64.pkglist
    
postinstall=/opt/xcat/share/xcat/netboot/rh/compute.rhels7.x86_64.postinstall


"Failing to install mlx_en", I got the same message when there is no mlx in my 
system.


Best Regards
--------------------------------------------------
Yuan Bai (白媛)

CSTL HPC System Management Development
Tel:86-10-82451401
E-mail: by...@cn.ibm.com<mailto:by...@cn.ibm.com>
Address: IBM ZGC Campus. Ring Building 28,
ZhongGuanCun Software Park,No.8 Dong Bei Wang West Road, Haidian District,
Beijing P.R.China 100193

IBM环宇大厦
北京市海淀区东北旺西路8号,中关村软件园28号楼
邮编:100193


----- Original message -----
From: Jeff Berry 
<jeff.be...@mrc-cbu.cam.ac.uk<mailto:jeff.be...@mrc-cbu.cam.ac.uk>>
To: xCAT Users Mailing list 
<xcat-user@lists.sourceforge.net<mailto:xcat-user@lists.sourceforge.net>>
Cc:
Subject: [xcat-user] SciLinux 7.4 statelite problems
Date: Tue, Jun 12, 2018 4:25 PM


Good morning all,



I’m still wrestling with a SciLinux 7.4 statelite deployment with xcat 2.13.11. 
   The dracut hooks don’t seem to be working properly, which is both making it 
difficult to debug and also probably symptomatic of a larger problem.   Running 
genimage, a few things have caught my eye.



The package list is looking for busybox-anaconda, which doesn’t seem to exist 
for SciLinux 7.  A bit of poking seems to suggest that it is deprecated, but 
it’s not clear to me what a suitable replacement might be.  Is there a 
preferred solution/workaround?



The dracut install also is throwing a couple of errors.  Failing to install 
mlx_en is, I think, benign.  I am also getting this error: “dracut-install: 
ERROR: installing '/etc/udev/udev.conf'”  which seems like it might be more 
significant, especially in light of my dracut problems.  However, I don’t know 
what might be causing this problem, nor how to fix it.



Any insight will be latched upon to with unseemly haste,



Jeff Berry

MRC-CBSU, Cambridge
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net<mailto:xCAT-user@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/xcat-user


------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to