SOPORTE,
I think this is what is happening:
During xcatprobe, a temporary node definition, xcatmntest, is created with
ip=serverip and mac=aa:aa:aa:aa:aa:aa.
The "makedhcp xcatmntest" command is then run to update the DHCP
configuration file. After that, "makedhcp -q xcatmntest" is run to
query the xcatmntest node and verify that the same IP and MAC are
returned.
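The comparison step can be sketched as a small shell helper. This is only an
illustration, not the probe's actual code: the function name check_dhcp_reply
is made up, and the reply format it parses ("<node>: ip-address = <ip>,
hardware-address = <mac>") is an assumption that does not appear in this
thread.

```shell
#!/bin/sh
# Hypothetical helper: compare one "makedhcp -q" reply line against the
# IP and MAC that were defined for the node.
# ASSUMED reply format (not shown in this thread):
#   <node>: ip-address = <ip>, hardware-address = <mac>
check_dhcp_reply() {
    reply=$1
    want_ip=$2
    want_mac=$(printf '%s' "$3" | tr '[:upper:]' '[:lower:]')

    # An empty reply is the case described above: nothing came back.
    if [ -z "$reply" ]; then
        echo "empty reply"
        return 1
    fi

    # Pull the two fields out of the reply line.
    got_ip=$(printf '%s\n' "$reply" |
        sed -n 's/.*ip-address = \([0-9.]*\).*/\1/p')
    got_mac=$(printf '%s\n' "$reply" |
        sed -n 's/.*hardware-address = \([0-9A-Fa-f:]*\).*/\1/p' |
        tr '[:upper:]' '[:lower:]')

    if [ "$got_ip" = "$want_ip" ] && [ "$got_mac" = "$want_mac" ]; then
        echo "match"
        return 0
    fi
    echo "mismatch: got ip=$got_ip mac=$got_mac"
    return 1
}
```

On the management node it could be called as
check_dhcp_reply "$(makedhcp -q xcatmntest)" 10.50.1.7 aa:aa:aa:aa:aa:aa
while the temporary definition exists.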
For some reason, on your system "makedhcp -q xcatmntest" does not return
anything, so the generic "DHCP server reply is wrong" message is
reported.
You can run "makedhcp -q" against some of your real nodes and verify that
the correct IP and MAC are returned for them. If so, you can ignore
this xcatprobe failure.
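To check several real nodes in one pass, a loop like the following could be
used. It is a sketch only: the function name and the QUERY_CMD override are
made up for illustration, and the node names in the usage line are
placeholders for your own node names.

```shell
#!/bin/sh
# Hypothetical helper: print every node for which the DHCP query returns
# nothing. QUERY_CMD defaults to "makedhcp -q"; it can be overridden to
# dry-run the loop on a machine without xCAT installed.
find_nodes_without_dhcp_entry() {
    for node in "$@"; do
        # Left unquoted on purpose so "makedhcp -q" splits into
        # command and flag.
        reply=$(${QUERY_CMD:-makedhcp -q} "$node" 2>/dev/null)
        if [ -z "$reply" ]; then
            echo "$node"
        fi
    done
}
```

For example: find_nodes_without_dhcp_entry cn01 cn02
If nothing is printed, every listed node returned a reply; the IP and MAC in
each reply still need to be compared with the node definitions.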
Mark Gurevich
Poughkeepsie Development Lab
HPC Software Development - xCAT
"If we knew what it was we were doing, it would not be called research,
would it?"
--Albert Einstein
From: "SOPORTE MODEMAT via xCAT-user"
<[email protected]>
To: "xCAT Users Mailing list" <[email protected]>
Cc: "SOPORTE MODEMAT" <[email protected]>
Date: 09/24/2021 04:42 PM
Subject: [EXTERNAL] Re: [xcat-user] xcat dhcp issue
Hi Mark
I did what you said and the output is shown below:
xcatprobe xcatmn -i eno5
[mn]: Checking all xCAT daemons are running...
[ OK ]
[mn]: Checking xcatd can receive command request...
[ OK ]
[mn]: Checking 'site' table is configured...
[ OK ]
[mn]: Checking provision network is configured...
[ OK ]
[mn]: Checking 'passwd' table is configured...
[ OK ]
[mn]: Checking important directories(installdir,tftpdir) are configured...
[ OK ]
[mn]: Checking SELinux is disabled...
[ OK ]
[mn]: Checking HTTP service is configured...
[ OK ]
[mn]: Checking TFTP service is configured...
[ OK ]
[mn]: Checking DNS service is configured...
[ OK ]
[mn]: Checking DHCP service is configured...
[FAIL]
[mn]: Simulate a node xcatmntest<ip=10.50.1.7 mac=aa:aa:aa:aa:aa:aa> to do
dhcp test
[mn]: DHCP server reply is wrong
[mn]: Run 'makedhcp -n' if it has not been ran before.
[mn]: Checking NTP service is configured...
[ OK ]
[mn]: Checking rsyslog service is configured...
[ OK ]
[mn]: Checking firewall is disabled...
[ OK ]
[mn]: Checking minimum disk space for xCAT ['/tmp' needs 1GB;'/var' needs
1GB;'/install' needs 10GB]... [ OK ]
[mn]: Checking Linux ulimits configuration...
[ OK ]
[mn]: Checking network kernel parameter configuration...
[ OK ]
[mn]: Checking site table attribute... xcatmaxconnections=
[INFO]
[mn]: Checking site table attribute... xcatmaxbatchconnections=
[INFO]
[mn]: Checking xCAT daemon attributes configuration...
[ OK ]
[mn]: Checking xCAT log is stored in /var/log/xcat/cluster.log...
[ OK ]
[mn]: Checking xCAT management node IP: <10.50.1.7> is configured to
static... [ OK ]
[mn]: Checking dhcpd.leases file is less than 100M...
[ OK ]
[mn]: Checking DB packages installation...
[ OK ]
=================================== SUMMARY
====================================
[MN]: Checking on MN...
[FAIL]
Checking DHCP service is configured...
[FAIL]
Simulate a node xcatmntest<ip=10.50.1.7 mac=aa:aa:aa:aa:aa:aa> to
do dhcp test
DHCP server reply is wrong
Run 'makedhcp -n' if it has not been ran before.
Kind regards
Soporte
From: Mark Gurevich <[email protected]>
Sent: Friday, September 24, 2021 15:30
To: xCAT Users Mailing list <[email protected]>
Subject: Re: [xcat-user] xcat dhcp issue
Ok, the "-V" flag did not do what I expected.
Edit the file /opt/xcat/probe/subcmds/xcatmn and, around line 19, change
"my $verbose = 0;" from 0 to 1. Then rerun "xcatprobe xcatmn -i eno5"
without the "-V" flag.
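If you would rather script the change than edit by hand, something like this
should work. It is a sketch: the function name is made up, and it assumes the
line reads exactly "my $verbose = 0;" as quoted above. It keeps a copy of the
original file with an .orig suffix so the change can be reverted.

```shell
#!/bin/sh
# Hypothetical helper: flip "my $verbose = 0;" to "my $verbose = 1;" in
# the given file, keeping the original as <file>.orig.
enable_probe_verbose() {
    file=$1
    if [ ! -f "$file" ]; then
        echo "no such file: $file" >&2
        return 1
    fi
    # Back up first, then rewrite in place so the file mode is preserved.
    cp -p "$file" "$file.orig" || return 1
    sed 's/my \$verbose = 0;/my $verbose = 1;/' "$file.orig" > "$file"
}
```

Usage: enable_probe_verbose /opt/xcat/probe/subcmds/xcatmn
To revert, copy the .orig file back over the edited one.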
Mark Gurevich
Poughkeepsie Development Lab
HPC Software Development - xCAT
From: "SOPORTE MODEMAT via xCAT-user" <[email protected]>
To: "xCAT Users Mailing list" <[email protected]>
Cc: "SOPORTE MODEMAT" <[email protected]>
Date: 09/24/2021 04:24 PM
Subject: [EXTERNAL] Re: [xcat-user] xcat dhcp issue
Hi Mark
The xCAT version that I have installed is 2.16.2.
Right now I just have a master node and two compute nodes for testing.
Kind regards.
Soporte
From: Mark Gurevich <[email protected]>
Sent: Friday, September 24, 2021 12:58
To: xCAT Users Mailing list <[email protected]>
Subject: Re: [xcat-user] xcat dhcp issue
SOPORTE,
Which version of xCAT are you running?
Are you running in a hierarchical environment with service nodes?
Mark Gurevich
Poughkeepsie Development Lab
HPC Software Development - xCAT
From: "SOPORTE MODEMAT via xCAT-user" <[email protected]>
To: "xCAT Users Mailing list" <[email protected]>
Cc: "SOPORTE MODEMAT" <[email protected]>
Date: 09/23/2021 08:40 PM
Subject: [EXTERNAL] [xcat-user] xcat dhcp issue
Hi.
Thank you for your reply. I ran the command below, but I do not know the
reason why the error occurs:
xcatprobe -V xcatmn -i eno5 | more
=================================== SUMMARY
====================================
[MN]: Checking on MN...
[FAIL]
Checking DHCP service is configured...
[FAIL]
DHCP server reply is wrong
Run 'makedhcp -n' if it has not been ran before.
I have also tried:
/opt/xcat/probe//subcmds/detect_dhcpd -V -i eno5
[debug] :Send dhcp discover from: NIC = eno5, IP = 10.50.1.7, MAC =
b4:7a:f1:21:ef:d0
[debug] :Current operating system is rh
[debug] :The duration of capturing DHCP package is 10 second(s)
[info] :Start to detect DHCP, please wait 10 seconds
[debug] :The id of process which is used to capture the packet by tcpdump
is 612199
[debug] :Kill process 612199 used to capture the packet by 'tcpdump'
[debug] :Dump all the information captured during last 10 seconds
[debug] :The server found: mac = , clientip = , serverip = , offer = 0
[debug] :The server found: mac = b4:7a:f1:21:ef:d0, clientip = , serverip
= , offer = 0
[debug] :The server found: mac = b4:7a:f1:21:ef:d0, clientip = , serverip
= , offer = 0
[debug] :The server found: mac = b4:7a:f1:21:ef:d0, clientip = , serverip
= , offer = 0
[debug] :The server found: mac = b4:7a:f1:21:ef:d0, clientip = , serverip
= , offer = 0
[info] :++++++++++++++++++++++++++++++++++
[info] :There are 0 servers replied to dhcp discover.
[info] :++++++++++++++++++++++++++++++++++
systemctl status dhcpd
● dhcpd.service - DHCPv4 Server Daemon
Loaded: loaded (/usr/lib/systemd/system/dhcpd.service; enabled; vendor
preset: disabled)
Active: active (running) since Thu 2021-09-23 12:52:31 -05; 14min ago
Docs: man:dhcpd(8)
man:dhcpd.conf(5)
Main PID: 609816 (dhcpd)
Status: "Dispatching packets..."
Tasks: 1 (limit: 1646703)
Memory: 8.1M
CGroup: /system.slice/dhcpd.service
└─609816 /usr/sbin/dhcpd -f -cf /etc/dhcp/dhcpd.conf -user dhcpd
-group dhcpd --no-pid eno5 ib0
Sep 23 13:01:24 quindemaster01 dhcpd[609816]: DHCPDISCOVER from
b4:7a:f1:21:ef:d0 via eno5: network eno5: no free leases
Sep 23 13:01:26 quindemaster01 dhcpd[609816]: DHCPDISCOVER from
b4:7a:f1:21:ef:d0 via eno5: network eno5: no free leases
Sep 23 13:01:28 quindemaster01 dhcpd[609816]: DHCPDISCOVER from
b4:7a:f1:21:ef:d0 via eno5: network eno5: no free leases
Sep 23 13:01:30 quindemaster01 dhcpd[609816]: DHCPDISCOVER from
b4:7a:f1:21:ef:d0 via eno5: network eno5: no free leases
Sep 23 13:02:10 quindemaster01 dhcpd[609816]: DHCPDISCOVER from
b4:7a:f1:21:ef:d0 via eno5: network eno5: no free leases
Sep 23 13:02:12 quindemaster01 dhcpd[609816]: DHCPDISCOVER from
b4:7a:f1:21:ef:d0 via eno5: network eno5: no free leases
Sep 23 13:02:14 quindemaster01 dhcpd[609816]: DHCPDISCOVER from
b4:7a:f1:21:ef:d0 via eno5: network eno5: no free leases
Sep 23 13:02:16 quindemaster01 dhcpd[609816]: DHCPDISCOVER from
b4:7a:f1:21:ef:d0 via eno5: network eno5: no free leases
Sep 23 13:02:18 quindemaster01 dhcpd[609816]: DHCPDISCOVER from
b4:7a:f1:21:ef:d0 via eno5: network eno5: no free leases
Sep 23 13:02:20 quindemaster01 dhcpd[609816]: DHCPDISCOVER from
b4:7a:f1:21:ef:d0 via eno5: network eno5: no free leases
How can I fix that?
Soporte
From: Mark Gurevich <[email protected]>
Sent: Thursday, September 23, 2021 8:01
To: xCAT Users Mailing list <[email protected]>
Subject: Re: [xcat-user] xCAT 2.16.2 new xNBA issue
SOPORTE,
Is this problem related to the "xCAT 2.16.2 new xNBA issue" thread?
If not, can you start a new thread?
Try to rerun the xcatprobe with verbose flag:
xcatprobe -V xcatmn -i eno5
Mark Gurevich
Poughkeepsie Development Lab
HPC Software Development - xCAT
From: "SOPORTE MODEMAT via xCAT-user" <[email protected]>
To: "xCAT Users Mailing list" <[email protected]>
Cc: "SOPORTE MODEMAT" <[email protected]>
Date: 09/22/2021 07:41 PM
Subject: [EXTERNAL] Re: [xcat-user] xCAT 2.16.2 new xNBA issue
Hello guys.
I would like your help debugging the error that I get every time I
run the command shown below. The error is related to the DHCP
service:
xcatprobe xcatmn -i eno5
[mn]: Checking all xCAT daemons are running...
[ OK ]
[mn]: Checking xcatd can receive command request...
[ OK ]
[mn]: Checking 'site' table is configured...
[ OK ]
[mn]: Checking provision network is configured...
[ OK ]
[mn]: Checking 'passwd' table is configured...
[ OK ]
[mn]: Checking important directories(installdir,tftpdir) are configured...
[ OK ]
[mn]: Checking SELinux is disabled...
[ OK ]
[mn]: Checking HTTP service is configured...
[ OK ]
[mn]: Checking TFTP service is configured...
[ OK ]
[mn]: Checking DNS service is configured...
[ OK ]
[mn]: Checking DHCP service is configured...
[FAIL]
[mn]: DHCP server reply is wrong
[mn]: Run 'makedhcp -n' if it has not been ran before.
[mn]: Checking NTP service is configured...
[ OK ]
[mn]: Checking rsyslog service is configured...
[ OK ]
[mn]: Checking firewall is disabled...
[ OK ]
[mn]: Checking minimum disk space for xCAT ['/install' needs 10GB;'/var'
needs 1GB;'/tmp' needs 1GB]... [ OK ]
[mn]: Checking Linux ulimits configuration...
[ OK ]
[mn]: Checking network kernel parameter configuration...
[ OK ]
[mn]: Checking xCAT daemon attributes configuration...
[ OK ]
[mn]: Checking xCAT log is stored in /var/log/xcat/cluster.log...
[ OK ]
[mn]: Checking xCAT management node IP: <10.50.1.7> is configured to
static... [ OK ]
[mn]: Checking dhcpd.leases file is less than 100M...
[ OK ]
[mn]: Checking DB packages installation...
[ OK ]
=================================== SUMMARY
====================================
[MN]: Checking on MN...
[FAIL]
Checking DHCP service is configured...
[FAIL]
DHCP server reply is wrong
Run 'makedhcp -n' if it has not been ran before.
I have also run the command 'makedhcp -n', but the error still persists.
The DHCP configuration is in the attached file.
[attachment "dhcp.conf" deleted by Mark Gurevich/Poughkeepsie/IBM]
_______________________________________________
xCAT-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/xcat-user