Public bug reported:
Observed on akis, blanka, cortez, and hidon. This occurs while NVIDIA
fabric-manager is installed and active, as it binds to TCP port 16000.
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] invoked with './stress-ng
-v -t 5 --sigurg 4 --sigurg-ops 3000 --ignite-cpu --syslog --verbose --verify
--oomable' by user 0 'root'
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] stress-ng 0.18.06
g9ea345f5dfda
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] system: Linux akis
6.8.0-1022-nvidia #25-Ubuntu SMP PREEMPT_DYNAMIC Tue Jan 28 05:14:01 UTC 2025
x86_64, gcc 13.3.0, glibc 2.39, little endian
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] RAM total: 1.5T, RAM free:
1.5T, swap free: 9.0G
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] temporary file path:
'/home/ubuntu/autotest/client/tmp/ubuntu_stress_smoke_test/src/stress-ng',
filesystem type: ext2 (214458870 blocks available)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPUs have 5 idle states:
C0, C1, C1E, C6, POLL
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 96 processors online, 96
processors configured
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] setting to a 5 secs run per
stressor
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPU data cache: L1: 32K,
L2: 1024K, L3: 33792K
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] cache allocate: shared
cache buffer size: 67584K (LLC size x 2 NUMA nodes)
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] dispatching hogs: 4 sigurg
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] starting stressors
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 4 stressors started
18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] started
(instance 0 on CPU 57)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] started
(instance 1 on CPU 11)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] started
(instance 2 on CPU 84)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] started
(instance 3 on CPU 60)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: process [222796]
using socket port 16000
18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: process [222797]
using socket port 16001
18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: process [222798]
using socket port 16002
18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: process [222799]
using socket port 16003
18:35:14 DEBUG| [stdout] stress-ng: fail: [222796] sigurg: bind failed on port
16000, errno=98 (Address already in use)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] exited
(instance 0 on CPU 57)
18:35:14 DEBUG| [stdout] stress-ng: error: [222795] sigurg: [222796] terminated
with an error, exit status=2 (stressor failed)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222796] terminated
(stressor failed)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] exited
(instance 1 on CPU 11)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222797] terminated
(success)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] exited
(instance 2 on CPU 84)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222798] terminated
(success)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] exited
(instance 3 on CPU 60)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222799] terminated
(success)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] metrics-check: all stressor
metrics validated and sane
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] skipped: 0
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] passed: 3: sigurg (3)
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] failed: 1: sigurg (1)
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] metrics untrustworthy: 0
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] unsuccessful run completed
in 0 secs
18:35:14 DEBUG| [stdout]
18:35:14 DEBUG| [stdout]
18:35:14 DEBUG| [stdout]
18:35:14 DEBUG| [stdout] Summary:
18:35:14 DEBUG| [stdout] Stressors run: 1
18:35:14 DEBUG| [stdout] Skipped: 0,
18:35:14 DEBUG| [stdout] Failed: 1, sigurg
18:35:14 DEBUG| [stdout] Oopsed: 0,
18:35:14 DEBUG| [stdout] Oomed: 0,
18:35:14 DEBUG| [stdout] Passed: 0,
18:35:14 DEBUG| [stdout] Badret: 0,
18:35:14 DEBUG| [stdout]
18:35:14 DEBUG| [stdout] Tests took 0 seconds to run
** Affects: ubuntu-kernel-tests
Importance: Undecided
Assignee: Jacob Martin (jacobmartin)
Status: New
** Tags: amd64 sru-20250113 ubuntu-stress-smoke-test
** Summary changed:
- ubuntu_stress_smoke_test sigurg failed to bind port 16000 with NVIDIA fabric
manager installed
+ ubuntu_stress_smoke_test sigurg failed to bind port 16000 with NVIDIA fabric
manager active
--
You received this bug notification because you are a member of Canonical
Platform QA Team, which is subscribed to ubuntu-kernel-tests.
https://bugs.launchpad.net/bugs/2097652
Title:
ubuntu_stress_smoke_test sigurg failed to bind port 16000 with NVIDIA
fabric manager active
Status in ubuntu-kernel-tests:
New
Bug description:
Observed on akis, blanka, cortez, and hidon. This occurs while NVIDIA
fabric-manager is installed and active, as it binds to TCP port 16000.
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] invoked with './stress-ng
-v -t 5 --sigurg 4 --sigurg-ops 3000 --ignite-cpu --syslog --verbose --verify
--oomable' by user 0 'root'
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] stress-ng 0.18.06
g9ea345f5dfda
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] system: Linux akis
6.8.0-1022-nvidia #25-Ubuntu SMP PREEMPT_DYNAMIC Tue Jan 28 05:14:01 UTC 2025
x86_64, gcc 13.3.0, glibc 2.39, little endian
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] RAM total: 1.5T, RAM
free: 1.5T, swap free: 9.0G
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] temporary file path:
'/home/ubuntu/autotest/client/tmp/ubuntu_stress_smoke_test/src/stress-ng',
filesystem type: ext2 (214458870 blocks available)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPUs have 5 idle states:
C0, C1, C1E, C6, POLL
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 96 processors online, 96
processors configured
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] setting to a 5 secs run
per stressor
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPU data cache: L1: 32K,
L2: 1024K, L3: 33792K
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] cache allocate: shared
cache buffer size: 67584K (LLC size x 2 NUMA nodes)
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] dispatching hogs: 4 sigurg
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] starting stressors
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 4 stressors started
18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] started
(instance 0 on CPU 57)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] started
(instance 1 on CPU 11)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] started
(instance 2 on CPU 84)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] started
(instance 3 on CPU 60)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: process [222796]
using socket port 16000
18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: process [222797]
using socket port 16001
18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: process [222798]
using socket port 16002
18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: process [222799]
using socket port 16003
18:35:14 DEBUG| [stdout] stress-ng: fail: [222796] sigurg: bind failed on
port 16000, errno=98 (Address already in use)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] exited
(instance 0 on CPU 57)
18:35:14 DEBUG| [stdout] stress-ng: error: [222795] sigurg: [222796]
terminated with an error, exit status=2 (stressor failed)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222796]
terminated (stressor failed)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] exited
(instance 1 on CPU 11)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222797]
terminated (success)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] exited
(instance 2 on CPU 84)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222798]
terminated (success)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] exited
(instance 3 on CPU 60)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222799]
terminated (success)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] metrics-check: all
stressor metrics validated and sane
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] skipped: 0
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] passed: 3: sigurg (3)
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] failed: 1: sigurg (1)
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] metrics untrustworthy: 0
18:35:14 DEBUG| [stdout] stress-ng: info: [222795] unsuccessful run
completed in 0 secs
18:35:14 DEBUG| [stdout]
18:35:14 DEBUG| [stdout]
18:35:14 DEBUG| [stdout]
18:35:14 DEBUG| [stdout] Summary:
18:35:14 DEBUG| [stdout] Stressors run: 1
18:35:14 DEBUG| [stdout] Skipped: 0,
18:35:14 DEBUG| [stdout] Failed: 1, sigurg
18:35:14 DEBUG| [stdout] Oopsed: 0,
18:35:14 DEBUG| [stdout] Oomed: 0,
18:35:14 DEBUG| [stdout] Passed: 0,
18:35:14 DEBUG| [stdout] Badret: 0,
18:35:14 DEBUG| [stdout]
18:35:14 DEBUG| [stdout] Tests took 0 seconds to run
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/2097652/+subscriptions
--
Mailing list: https://launchpad.net/~canonical-ubuntu-qa
Post to : [email protected]
Unsubscribe : https://launchpad.net/~canonical-ubuntu-qa
More help : https://help.launchpad.net/ListHelp