Yes, that is correct. Below is the output :
[root@hpc3-14-03 ~]# ip a
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group
default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
valid_lft forever preferred_lft forever
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: eno1: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default
qlen 1000
link/ether 08:f1:ea:9e:c7:60 brd ff:ff:ff:ff:ff:ff
3: eno2: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default
qlen 1000
link/ether 08:f1:ea:9e:c7:61 brd ff:ff:ff:ff:ff:ff
4: eno3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP group
default qlen 1000
link/ether 08:f1:ea:e4:35:52 brd ff:ff:ff:ff:ff:ff
inet 10.240.58.16/23 brd 10.240.59.255 scope global eno3
valid_lft forever preferred_lft forever
inet6 fe80::af1:eaff:fee4:3552/64 scope link
valid_lft forever preferred_lft forever
....
What I have noticed is that dhcpd.leases file does not have an entry with
an "fixed address" entry like the following:
host hpc3-gpu-16-03 {
dynamic;
hardware ethernet 20:67:7c:10:ba:86;
uid 20:67:7c:10:ba:86;
fixed-address 10.240.58.61;
supersede server.ddns-hostname = "hpc3-gpu-16-03";
supersede host-name = "hpc3-gpu-16-03";
if option user-class-identifier = "xNBA" and option
client-architecture
= 00:00 {
supersede server.filename =
"http://
${next-server}:80/tftpboot/xcat/xnba/nodes/hpc3-gpu-16-03";
} elsif option client-architecture = 00:00 {
supersede server.filename = "xcat/xnba.kpxe";
} else {
supersede server.filename = "";
}
}
I am assuming the lease file got messed up somehow. What are your thoughts
on reconstructing the file (programmatically) and using a modified file?
Or is there another way from within xcat to add entries in the dhcpd leases
file?
thanks.
On Wed, Sep 1, 2021 at 9:20 AM Russell Jones <[email protected]> wrote:
> Is the mac correct for the node?
>
> On Wed, Sep 1, 2021 at 11:02 AM Imam Toufique <[email protected]> wrote:
>
>> Hi,
>>
>> Need your helpful thoughts here with a problem we have, please.
>>
>> We have nodes that were provisioned with xcat, they are running, OS is
>> working and installed. The boot order is set to PXE first, SSD 2nd.
>>
>> Several days ago, when I rebooted one of the nodes, it went straight to
>> PXE discovery mode - attempting for an install. This is a node that is
>> built, it should have exited the PXE boot mode and boot off the disk, but
>> it never did.
>>
>> I am not sure what's going on, it looks like xcat has lost the status of
>> the node, whether it is installed or not ( need provisioning?)
>>
>> Here is the 'lsdef' output of the node:
>>
>> ```
>> [root@mn] lsdef -t node hpc3-14-03
>> Object name: hpc3-14-03
>> arch=x86_64
>> cpucount=40
>> cputype=Intel(R) Xeon(R) Gold 6148 CPU @ 2.40GHz
>> currchain=boot
>> currstate=boot
>> disksize=sda:224GB,sdb:224GB
>> groups=centos78
>> ip=10.240.58.16
>> mac=08:f1:ea:e4:35:52
>> memory=193122MB
>> mtm=HPE:ProLiant XL170r Gen10
>> netboot=xnba
>> nichostnamesuffixes.ib0=-ib0
>> nichostnamesuffixes.ipmi=-ipmi
>> nicips.ib0=10.240.60.16
>> nicips.ipmi=10.240.62.16
>> os=centos7.7
>> postbootscripts=otherpkgs,hpc3-postscripts/hpc3postbootscript
>>
>> postscripts=syslog,remoteshell,syncfiles,setupntp,hpc3-postscripts/hpc3postscript.1,confignetwork
>> -s
>> profile=compute
>> provmethod=centos7.8-x86_64-install-compute
>> serial=2M294204L9
>> status=booted
>> statustime=06-21-2021 16:41:45
>> supportedarchs=x86,x86_64
>>
>> ```
>>
>> ```
>> [root@mn]# nodediscoverls |grep 14-03
>> 38363730-3535-324D-3239-343230344C39 hpc3-14-03 manual
>> HPE:ProLiant XL170r Gen10 2M294204L9
>> ```
>>
>> ```
>> [root@mn]# lsdef -t network compute_net_1
>>
>> Object name: compute_net_1
>>
>> domain=local
>>
>> dynamicrange=10.240.58.221-10.240.58.240
>>
>> gateway=10.240.58.1
>>
>> mask=255.255.254.0
>> mgtifname=eno1
>> mtu=1500
>> nameservers=10.240.58.4,8.8.8.8,128.200.192.202
>> net=10.240.58.0
>> staticrange=10.240.58.4-10.240.59.220
>> tftpserver=<xcatmaster>
>> ```
>>
>> Any idea what might be going on here? Why an already setup/installed
>> node is going back to discovery ( and wanting to be installed) mode?
>>
>> Can someone please shed some light?
>>
>> thanks a lot!
>>
>>
>> _______________________________________________
>> xCAT-user mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/xcat-user
>>
> _______________________________________________
> xCAT-user mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/xcat-user
>
--
Regards,
*Imam Toufique*
*213-700-5485*
_______________________________________________
xCAT-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/xcat-user