apport information

** Attachment added: "ProcCpuinfoMinimal.txt"
   https://bugs.launchpad.net/bugs/2034447/+attachment/5697977/+files/ProcCpuinfoMinimal.txt

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/2034447

Title:
  `refcount_t: underflow; use-after-free.` on hidon w/ 5.15.0-85-generic

Status in linux package in Ubuntu:
  Incomplete

Bug description:
  Seeing a kernel panic on hidon (an NVIDIA DGX H100) after booting the
  5.15.0-85-generic kernel:

  [   58.935877] ------------[ cut here ]------------
  [   58.935893] refcount_t: underflow; use-after-free.
  [   58.935920] WARNING: CPU: 207 PID: 2985 at lib/refcount.c:28 refcount_warn_saturate+0xf7/0x150
  [   58.935943] Modules linked in: x86_pkg_temp_thermal(+) intel_powerclamp 
coretemp nls_iso8859_1 rapl irdma(+) i40e qat_4xxx(+) isst_if_mbox_pci 
intel_qat pmt_telemetry pmt_crashlog idxd(+) isst_if_mmio pmt_class 
isst_if_common authenc idxd_bus intel_th_gth mei_me intel_th_pci intel_th mei 
switchtec ipmi_ssif acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler mac_hid 
sch_fq_codel dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua ramoops 
reed_solomon pstore_blk pstore_zone efi_pstore ip_tables x_tables autofs4 btrfs 
blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy 
async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 multipath linear 
mlx5_ib ib_uverbs ib_core ast i2c_algo_bit drm_vram_helper drm_ttm_helper ttm 
drm_kms_helper raid0 mlx5_core syscopyarea sysfillrect sysimgblt 
crct10dif_pclmul fb_sys_fops crc32_pclmul ixgbe cec mlxfw ghash_clmulni_intel 
aesni_intel psample crypto_simd xfrm_algo ice rc_core cryptd tls nvme i2c_i801 
dca xhci_pci intel_pmt drm
  [   58.936077]  pci_hyperv_intf i2c_ismt i2c_smbus
  [   58.936080] QAT: Could not find a device on node 1
  [   58.936080] QAT: Could not find a device on node 1
  [   58.936080] QAT: Could not find a device on node 1
  [   58.936080] QAT: Could not find a device on node 1
  [   58.936080] QAT: Could not find a device on node 1
  [   58.936080] QAT: Could not find a device on node 1
  [   58.936083]  mdio
  [   58.936096]  xhci_pci_renesas nvme_core wmi pinctrl_emmitsburg
  [   58.936106] CPU: 207 PID: 2985 Comm: systemd-udevd Not tainted 5.15.0-85-generic #95-Ubuntu
  [   58.936115] Hardware name: NVIDIA DGXH100/DGXH100, BIOS 1.0.7 05/08/2023
  [   58.936119] RIP: 0010:refcount_warn_saturate+0xf7/0x150
  [   58.936130] Code: eb 9e 0f b6 1d 5e e6 b9 01 80 fb 01 0f 87 f4 63 6f 00 83 
e3 01 75 89 48 c7 c7 88 c3 23 9e c6 05 42 e6 b9 01 01 e8 d8 e4 6b 00 <0f> 0b e9 
6f ff ff ff 0f b6 1d 2d e6 b9 01 80 fb 01 0f 87 b1 63 6f
  [   58.936135] RSP: 0018:ff4d5d94b2c7fa28 EFLAGS: 00010282
  [   58.936142] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 
0000000000000027
  [   58.936146] RDX: ff314dbbbf9e0588 RSI: 0000000000000001 RDI: 
ff314dbbbf9e0580
  [   58.936149] RBP: ff4d5d94b2c7fa30 R08: 0000000000000026 R09: 
ff4d5d94b2c7f9c0
  [   58.936153] R10: 0000000000000028 R11: 0000000000000001 R12: 
0000000000000000
  [   58.936156] R13: ff314cbfdbcb6900 R14: ff314cbfdbcb67b8 R15: 
ff314cbfd24b4000
  [   58.936159] FS:  00007fadd2f6c8c0(0000) GS:ff314dbbbf9c0000(0000) 
knlGS:0000000000000000
  [   58.936163] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  [   58.936167] CR2: 00007fadd243b584 CR3: 000000012972c006 CR4: 
0000000000771ee0
  [   58.936171] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
0000000000000000
  [   58.936174] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 
0000000000000400
  [   58.936177] PKRU: 55555554
  [   58.936179] Call Trace:
  [   58.936184]  <TASK>
  [   58.936188]  ? show_trace_log_lvl+0x1d6/0x2ea
  [   58.936204]  ? show_trace_log_lvl+0x1d6/0x2ea
  [   58.936212]  ? crypto_mod_put+0x6b/0x80
  [   58.936225]  ? show_regs.part.0+0x23/0x29
  [   58.936232]  ? show_regs.cold+0x8/0xd
  [   58.936239]  ? refcount_warn_saturate+0xf7/0x150
  [   58.936246]  ? __warn+0x8c/0x100
  [   58.936255]  ? refcount_warn_saturate+0xf7/0x150
  [   58.936263]  ? report_bug+0xa4/0xd0
  [   58.936274]  ? down_trylock+0x2e/0x40
  [   58.936285]  ? handle_bug+0x39/0x90
  [   58.936296]  ? exc_invalid_op+0x19/0x70
  [   58.936301]  ? asm_exc_invalid_op+0x1b/0x20
  [   58.936310]  ? refcount_warn_saturate+0xf7/0x150
  [   58.936317]  ? refcount_warn_saturate+0xf7/0x150
  [   58.936323]  crypto_mod_put+0x6b/0x80
  [   58.936329]  crypto_destroy_tfm+0x4e/0xa0
  [   58.936336]  pkcs1pad_exit_tfm+0x15/0x20
  [   58.936345]  crypto_akcipher_exit_tfm+0x13/0x20
  [   58.936352]  crypto_destroy_tfm+0x43/0xa0
  [   58.936358]  public_key_verify_signature+0x2dc/0x3c0
  [   58.936366]  ? find_asymmetric_key+0xd2/0x1d0
  [   58.936374]  ? kfree+0x1f7/0x250
  [   58.936385]  public_key_verify_signature_2+0x15/0x20
  [   58.936389]  verify_signature+0x37/0x60
  [   58.936393]  pkcs7_validate_trust_one.constprop.0+0x156/0x1e0
  [   58.936400]  pkcs7_validate_trust+0x4a/0xa0
  [   58.936406]  verify_pkcs7_message_sig+0x83/0x120
  [   58.936418]  verify_pkcs7_signature+0x4f/0x80
  [   58.936424]  mod_verify_sig+0xb5/0xf0
  [   58.936435]  load_module+0x275/0xbc0
  [   58.936440]  ? kernel_read_file_from_fd+0x56/0xa0
  [   58.936450]  __do_sys_finit_module+0xbf/0x120
  [   58.936496]  __x64_sys_finit_module+0x18/0x20
  [   58.936504]  do_syscall_64+0x59/0xc0
  [   58.936510]  ? exit_to_user_mode_prepare+0x37/0xb0
  [   58.936521]  ? syscall_exit_to_user_mode+0x35/0x50
  [   58.936530]  ? __x64_sys_mmap+0x33/0x50
  [   58.936539]  ? do_syscall_64+0x69/0xc0
  [   58.936544]  ? syscall_exit_to_user_mode+0x35/0x50
  [   58.936550]  ? do_syscall_64+0x69/0xc0
  [   58.936555]  entry_SYSCALL_64_after_hwframe+0x62/0xcc
  [   58.936560] RIP: 0033:0x7fadd3663a3d
  [   58.936566] Code: 5b 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 48 89 
f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 
f0 ff ff 73 01 c3 48 8b 0d c3 a3 0f 00 f7 d8 64 89 01 48
  [   58.936570] RSP: 002b:00007ffef20c7b08 EFLAGS: 00000246 ORIG_RAX: 
0000000000000139
  [   58.936576] RAX: ffffffffffffffda RBX: 000055651a57b310 RCX: 
00007fadd3663a3d
  [   58.936579] RDX: 0000000000000000 RSI: 00007fadd37fc441 RDI: 
0000000000000011
  [   58.936582] RBP: 0000000000020000 R08: 0000000000000000 R09: 
00007ffef20c7c40
  [   58.936585] R10: 0000000000000011 R11: 0000000000000246 R12: 
00007fadd37fc441
  [   58.936587] R13: 000055651a54b780 R14: 000055651a539530 R15: 
000055651a554b40
  [   58.936592]  </TASK>
  [   58.936595] ---[ end trace 6b4de64023014d9a ]---
  [   58.942796] #PF: supervisor read access in kernel mode
  [   58.942802] #PF: error_code(0x0000) - not-present page
  [   58.942806] PGD 0 
  [   58.942810] Oops: 0000 [#1] SMP NOPTI
  [   59.008727] ------------[ cut here ]------------
  [   59.013012] CPU: 2 PID: 0 Comm: swapper/2 Tainted: G        W         5.15.0-85-generic #95-Ubuntu
  [   59.013022] Hardware name: NVIDIA DGXH100/DGXH100, BIOS 1.0.7 05/08/2023
  [   59.013024] RIP: 0010:qat_rsa_cb+0x3a/0x130 [intel_qat]
  [   59.018446] WARNING: CPU: 128 PID: 1536 at kernel/rcu/tree.c:3323 kfree_rcu_work+0x2d1/0x390
  [   59.023838] Code: 41 55 41 54 53 48 8b 5f 08 44 0f b6 67 05 48 8b 83 e0 00 
00 00 48 8b 33 41 c1 fc 06 4c 8b ab e8 00 00 00 48 8b 90 85 00 00 00 <48> 8b 52 
20 4c 8b 72 64 ba ea ff ff ff 49 81 c6 d0 00 00 00 41 83
  [   59.023844] RSP: 0018:ff4d5d94803fce68 EFLAGS: 00010246
  [   59.029256] Modules linked in:
  [   59.034454] RAX: ff314bc0710e9d18 RBX: ff314bc0710e9d00 RCX: 
0000000000000001
  [   59.039857]  x86_pkg_temp_thermal(+)
  [   59.049538] RDX: 0080030000000000 RSI: 000000013a8e7400 RDI: 
ff314bc06f1db240
  [   59.049544] RBP: ff4d5d94803fce88 R08: ff314bc05bd667c0 R09: 
1615e897948f1e86
  [   59.146810]  intel_powerclamp
  [   59.151904] R10: ffffffff9ea060c0 R11: ff4d5d94803fcff8 R12: 
0000000000000000
  [   59.151909] R13: ff314bc0710e9cb8 R14: ff314bc05bd667c0 R15: 
0000000000000002
  [   59.157316]  coretemp
  [   59.162713] FS:  0000000000000000(0000) GS:ff314cbbbe480000(0000) 
knlGS:0000000000000000
  [   59.162719] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  [   59.168121]  nls_iso8859_1
  [   59.173517] CR2: 0080030000000020 CR3: 000000011be7c004 CR4: 
0000000000771ee0
  [   59.173523] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
0000000000000000
  [   59.178927]  rapl
  [   59.184324] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 
0000000000000400
  [   59.184330] PKRU: 55555554
  [   59.186521]  irdma(+)
  [   59.193086] Call Trace:
  [   59.193092]  <IRQ>
  [   59.202495]  i40e
  [   59.210042]  ? show_trace_log_lvl+0x1d6/0x2ea
  [   59.215942]  qat_4xxx(+)
  [   59.237039]  ? show_trace_log_lvl+0x1d6/0x2ea
  [   59.242933]  isst_if_mbox_pci
  [   59.250963]  ? qat_alg_asym_callback+0x1f/0x40 [intel_qat]
  [   59.259003]  intel_qat
  [   59.267036]  ? show_regs.part.0+0x23/0x29
  [   59.275080]  pmt_telemetry
  [   59.283106]  ? __die_body.cold+0x8/0xd
  [   59.292225]  pmt_crashlog
  [   59.298696]  ? __die+0x2b/0x37
  [   59.306736]  idxd(+)
  [   59.314768]  ? page_fault_oops+0x13b/0x170
  [   59.322811]  isst_if_mmio
  [   59.325864]  ? wake_affine+0x111/0x310
  [   59.328639]  pmt_class
  [   59.331016]  ? do_user_addr_fault+0x321/0x670
  [   59.335935]  isst_if_common
  [   59.340842]  ? ttwu_queue_wakelist+0x131/0x1c0
  [   59.345182]  authenc
  [   59.349699]  ? exc_page_fault+0x77/0x170
  [   59.353833]  idxd_bus
  [   59.359034]  ? asm_exc_page_fault+0x27/0x30
  [   59.362676]  intel_th_gth
  [   59.367879]  ? qat_rsa_cb+0x3a/0x130 [intel_qat]
  [   59.371823]  mei_me
  [   59.375952]  qat_alg_asym_callback+0x1f/0x40 [intel_qat]
  [   59.379898]  intel_th_pci
  [   59.384225]  adf_ring_response_handler+0xc1/0x190 [intel_qat]
  [   59.388951]  intel_th
  [   59.394150]  ? sysvec_call_function_single+0x4e/0x90
  [   59.394161]  adf_response_handler+0x1c/0x40 [intel_qat]
  [   59.399365]  mei
  [   59.403496]  tasklet_action_common.constprop.0+0xeb/0xf0
  [   59.408030]  switchtec
  [   59.412448]  tasklet_hi_action+0x1f/0x30
  [   59.417564]  ipmi_ssif
  [   59.422080]  __do_softirq+0xd6/0x2e7
  [   59.422089]  irq_exit_rcu+0x94/0xc0
  [   59.427687]  acpi_ipmi
  [   59.432592]  common_interrupt+0x8e/0xa0
  [   59.436246]  ipmi_si
  [   59.441838]  </IRQ>
  [   59.441842]  <TASK>
  [   59.441844]  asm_common_interrupt+0x27/0x40
  [   59.446174]  ipmi_devintf
  [   59.452640] RIP: 0010:cpuidle_enter_state+0xd9/0x620
  [   59.457370]  ipmi_msghandler
  [   59.462569] Code: 3d dc 69 98 62 e8 a7 66 67 ff 49 89 c7 0f 1f 44 00 00 31 
ff e8 e8 73 67 ff 80 7d d0 00 0f 85 61 01 00 00 fb 66 0f 1f 44 00 00 <45> 85 f6 
0f 88 6d 01 00 00 4d 63 ee 49 83 fd 09 0f 87 e7 03 00 00
  [   59.467491]  mac_hid
  [   59.471620] RSP: 0018:ff4d5d94803f7e28 EFLAGS: 00000246
  [   59.475666]  sch_fq_codel
  [   59.480967] RAX: ff314cbbbe4b14c0 RBX: ff7f5c947e493c60 RCX: 
0000000000000000
  [   59.480971] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 
0000000000000000
  [   59.485886]  dm_multipath
  [   59.490798] RBP: ff4d5d94803f7e78 R08: 0000000db8cd49ba R09: 
00000000000c3500
  [   59.490802] R10: 0000000000000004 R11: 071c71c71c71c71c R12: 
ffffffff9ecd6a20
  [   59.494844]  scsi_dh_rdac
  [   59.500239] R13: 0000000000000002 R14: 0000000000000002 R15: 
0000000db8cd49ba
  [   59.500247]  ? cpuidle_enter_state+0xc8/0x620
  [   59.505646]  scsi_dh_emc
  [   59.509969]  cpuidle_enter+0x2e/0x50
  [   59.514211]  scsi_dh_alua
  [   59.519605]  cpuidle_idle_call+0x142/0x1e0
  [   59.523840]  ramoops
  [   59.529529]  do_idle+0x83/0xf0
  [   59.533562]  reed_solomon
  [   59.554660]  cpu_startup_entry+0x20/0x30
  [   59.559735] ens6f0 speed is unknown, defaulting to 1000
  [   59.559980] ens6f0 speed is unknown, defaulting to 1000
  [   59.563190]  pstore_blk
  [   59.571216]  start_secondary+0x12a/0x180
  [   59.579260]  pstore_zone
  [   59.587284]  secondary_startup_64_no_verify+0xc2/0xcb
  [   59.595325]  efi_pstore
  [   59.603358]  </TASK>
  [   59.603362] Modules linked in: x86_pkg_temp_thermal(+)
  [   59.605841]  ip_tables
  [   59.611040]  intel_powerclamp coretemp nls_iso8859_1
  [   59.616841]  x_tables
  [   59.622625]  rapl irdma(+)
  [   59.624916]  autofs4
  [   59.629044]  i40e qat_4xxx(+) isst_if_mbox_pci
  [   59.634259]  btrfs
  [   59.644335]  intel_qat pmt_telemetry pmt_crashlog idxd(+) isst_if_mmio
  [   59.651894]  blake2b_generic
  [   59.657780]  pmt_class isst_if_common authenc
  [   59.667283]  zstd_compress
  [   59.688374]  idxd_bus intel_th_gth mei_me intel_th_pci
  [   59.694268]  raid10
  [   59.697714]  intel_th mei switchtec
  [   59.705754]  raid456
  [   59.709783]  ipmi_ssif acpi_ipmi ipmi_si
  [   59.717825]  async_raid6_recov
  [   59.725850]  ipmi_devintf ipmi_msghandler mac_hid
  [   59.729214]  async_memcpy
  [   59.737240]  sch_fq_codel dm_multipath scsi_dh_rdac
  [   59.745279]  async_pq
  [   59.747844]  scsi_dh_emc
  [   59.756957]  async_xor
  [   59.763424]  scsi_dh_alua ramoops reed_solomon
  [   59.766495]  async_tx
  [   59.774518]  pstore_blk pstore_zone efi_pstore
  [   59.782556]  xor
  [   59.784735]  ip_tables x_tables autofs4
  [   59.792780]  raid6_pq
  [   59.795837]  btrfs
  [   59.798415]  libcrc32c
  [   59.801178]  blake2b_generic zstd_compress raid10
  [   59.803462]  raid1
  [   59.805644]  raid456 async_raid6_recov async_memcpy
  [   59.810562]  multipath
  [   59.813424]  async_pq async_xor async_tx
  [   59.818344]  linear
  [   59.821693]  xor raid6_pq libcrc32c
  [   59.827881]  mlx5_ib
  [   59.830544]  raid1 multipath linear mlx5_ib
  [   59.835063]  ib_uverbs
  [   59.838121]  ib_uverbs ib_core ast
  [   59.842361]  ib_core
  [   59.845323]  i2c_algo_bit drm_vram_helper drm_ttm_helper
  [   59.848778]  ast
  [   59.851251]  ttm
  [   59.855874]  i2c_algo_bit
  [   59.858835]  drm_kms_helper raid0 mlx5_core syscopyarea
  [   59.863071]  drm_vram_helper
  [   59.865737]  sysfillrect sysimgblt crct10dif_pclmul
  [   59.870657]  drm_ttm_helper
  [   59.873808]  fb_sys_fops crc32_pclmul ixgbe
  [   59.878825]  ttm
  [   59.881299]  cec mlxfw ghash_clmulni_intel
  [   59.885738]  drm_kms_helper
  [   59.888301]  aesni_intel
  [   59.893023]  raid0
  [   59.895981]  psample
  [   59.901192]  mlx5_core
  [   59.903565]  crypto_simd
  [   59.909555]  syscopyarea
  [   59.912515]  xfrm_algo ice rc_core cryptd tls
  [   59.918998]  sysfillrect
  [   59.921563]  nvme i2c_i801 dca xhci_pci
  [   59.927166]  sysimgblt
  [   59.933050]  intel_pmt drm pci_hyperv_intf
  [   59.935136]  crct10dif_pclmul
  [   59.941124]  i2c_ismt i2c_smbus mdio
  [   59.943796]  fb_sys_fops
  [   59.948217]  xhci_pci_renesas nvme_core wmi pinctrl_emmitsburg
  [   59.950895]  crc32_pclmul
  [   59.954928] CR2: 0080030000000020
  [   59.958869]  ixgbe
  [   59.961538] ---[ end trace 6b4de64023014d9b ]---
  [   59.961540] BUG: kernel NULL pointer dereference, address: 0000000000000030
  [   59.961549] #PF: supervisor read access in kernel mode
  [   59.961554] #PF: error_code(0x0000) - not-present page
  [   59.961558] PGD 1853c2067 P4D 0 
  [   59.961566] Oops: 0000 [#2] SMP NOPTI
  [   59.961573] CPU: 211 PID: 2869 Comm: systemd-journal Tainted: G      D W         5.15.0-85-generic #95-Ubuntu
  [   59.961580] Hardware name: NVIDIA DGXH100/DGXH100, BIOS 1.0.7 05/08/2023
  [   59.961583] RIP: 0010:seccomp_run_filters+0x8e/0x150
  [   59.961598] Code: 21 d0 48 98 48 0f a3 01 0f 92 c0 41 be 00 00 ff 7f 84 c0 
75 52 41 be 00 00 ff 7f 4d 8b bc 24 98 00 00 00 e8 04 63 f1 ff 66 90 <49> 8b 47 
30 49 8d 77 48 4c 89 ef ff d0 0f 1f 00 89 c3 e8 1b 31 f2
  [   59.961604] RSP: 0018:ff4d5d949e32bd48 EFLAGS: 00010202
  [   59.961611] RAX: 0000000000000001 RBX: 0000000000000000 RCX: 
ff314bc05b526b10
  [   59.961614] RDX: ff314bc0a7128000 RSI: ff4d5d949e32bd90 RDI: 
ff4d5d949e32bd98
  [   59.961618] RBP: ff4d5d949e32bd80 R08: 0000000000002000 R09: 
00007ffdc82eeb90
  [   59.961622] R10: 0000000000000009 R11: 0000000000000000 R12: 
ff314bc05b526b00
  [   59.961626] R13: ff4d5d949e32bd98 R14: 000000007fff0000 R15: 
0000000000000000
  [   59.961630] FS:  00007f85a1778900(0000) GS:ff314dbbbfac0000(0000) 
knlGS:0000000000000000
  [   59.961636] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  [   59.961640] CR2: 0000000000000030 CR3: 000000012b0ae006 CR4: 
0000000000771ee0
  [   59.961644] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
0000000000000000
  [   59.961648] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 
0000000000000400
  [   59.961651] PKRU: 55555554
  [   59.961654] Call Trace:
  [   59.961657]  <TASK>
  [   59.961661]  ? show_trace_log_lvl+0x1d6/0x2ea
  [   59.961672]  ? show_trace_log_lvl+0x1d6/0x2ea
  [   59.961680]  ? __seccomp_filter+0x4a/0x4a0
  [   59.961687]  ? show_regs.part.0+0x23/0x29
  [   59.961694]  ? __die_body.cold+0x8/0xd
  [   59.961702]  ? __die+0x2b/0x37
  [   59.961711]  ? page_fault_oops+0x13b/0x170
  [   59.961723]  ? do_user_addr_fault+0x321/0x670
  [   59.961730]  ? devkmsg_poll+0x5a/0xa0
  [   59.961742]  ? exc_page_fault+0x77/0x170
  [   59.961752]  ? asm_exc_page_fault+0x27/0x30
  [   59.961760]  ? seccomp_run_filters+0x8e/0x150
  [   59.961765]  ? seccomp_run_filters+0x8c/0x150
  [   59.961771]  __seccomp_filter+0x4a/0x4a0
  [   59.961778]  __secure_computing+0xa9/0x120
  [   59.961783]  syscall_trace_enter.constprop.0+0xa7/0x1c0
  [   59.961793]  syscall_enter_from_user_mode+0x2f/0x40
  [   59.961801]  do_syscall_64+0x37/0xc0
  [   59.961807]  ? __secure_computing+0xa9/0x120
  [   59.961814]  ? syscall_trace_enter.constprop.0+0xa7/0x1c0
  [   59.961820]  ? exit_to_user_mode_prepare+0x37/0xb0
  [   59.961828]  ? syscall_exit_to_user_mode+0x35/0x50
  [   59.961835]  ? __do_sys_gettid+0x1b/0x30
  [   59.961844]  ? do_syscall_64+0x69/0xc0
  [   59.961849]  ? exit_to_user_mode_prepare+0x37/0xb0
  [   59.961856]  ? irqentry_exit_to_user_mode+0x17/0x20
  [   59.961863]  ? irqentry_exit+0x1d/0x30
  [   59.961870]  ? exc_page_fault+0x89/0x170
  [   59.961877]  entry_SYSCALL_64_after_hwframe+0x62/0xcc
  [   59.961882] RIP: 0033:0x7f85a21479cc
  [   59.961888] Code: ec 28 48 89 54 24 18 48 89 74 24 10 89 7c 24 08 e8 b9 c0 
f7 ff 48 8b 54 24 18 48 8b 74 24 10 41 89 c0 8b 7c 24 08 31 c0 0f 05 <48> 3d 00 
f0 ff ff 77 34 44 89 c7 48 89 44 24 08 e8 ff c0 f7 ff 48
  [   59.961893] RSP: 002b:00007ffdc82ee2e0 EFLAGS: 00000246 ORIG_RAX: 
0000000000000000
  [   59.961899] RAX: ffffffffffffffda RBX: 00007ffdc82f0c00 RCX: 
00007f85a21479cc
  [   59.961903] RDX: 0000000000002000 RSI: 00007ffdc82eeb90 RDI: 
0000000000000009
  [   59.961906] RBP: 00007ffdc82f0cd0 R08: 0000000000000000 R09: 
0000000000000001
  [   59.961909] R10: 00007ffdc8373080 R11: 0000000000000246 R12: 
0000000000000000
  [   59.961913] R13: 00007ffdc82eeb90 R14: 0000000000000000 R15: 
0000000000000001
  [   59.961918]  </TASK>
  [   59.961920] Modules linked in: x86_pkg_temp_thermal(+) intel_powerclamp 
coretemp nls_iso8859_1 rapl irdma(+) i40e qat_4xxx(+) isst_if_mbox_pci 
intel_qat pmt_telemetry pmt_crashlog idxd(+) isst_if_mmio pmt_class 
isst_if_common authenc idxd_bus intel_th_gth mei_me intel_th_pci intel_th mei 
switchtec ipmi_ssif acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler mac_hid 
sch_fq_codel dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua ramoops 
reed_solomon pstore_blk pstore_zone efi_pstore ip_tables x_tables autofs4 btrfs 
blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy 
async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 multipath linear 
mlx5_ib ib_uverbs ib_core ast i2c_algo_bit drm_vram_helper drm_ttm_helper ttm 
drm_kms_helper raid0 mlx5_core syscopyarea sysfillrect sysimgblt 
crct10dif_pclmul fb_sys_fops crc32_pclmul ixgbe cec mlxfw ghash_clmulni_intel 
aesni_intel psample crypto_simd xfrm_algo ice rc_core cryptd tls nvme i2c_i801 
dca xhci_pci intel_pmt drm
  [   59.962150]  pci_hyperv_intf i2c_ismt i2c_smbus mdio xhci_pci_renesas 
nvme_core wmi pinctrl_emmitsburg
  [   59.962177] CR2: 0000000000000030
  [   59.962183] ---[ end trace 6b4de64023014d9c ]---
  [   59.965868]  cec
  [   60.046995] RIP: 0010:qat_rsa_cb+0x3a/0x130 [intel_qat]
  [   60.052384]  mlxfw
  [   60.055637] Code: 41 55 41 54 53 48 8b 5f 08 44 0f b6 67 05 48 8b 83 e0 00 
00 00 48 8b 33 41 c1 fc 06 4c 8b ab e8 00 00 00 48 8b 90 85 00 00 00 <48> 8b 52 
20 4c 8b 72 64 ba ea ff ff ff 49 81 c6 d0 00 00 00 41 83
  [   60.135449] RIP: 0010:qat_rsa_cb+0x3a/0x130 [intel_qat]
  [   60.135491] Code: 41 55 41 54 53 48 8b 5f 08 44 0f b6 67 05 48 8b 83 e0 00 
00 00 48 8b 33 41 c1 fc 06 4c 8b ab e8 00 00 00 48 8b 90 85 00 00 00 <48> 8b 52 
20 4c 8b 72 64 ba ea ff ff ff 49 81 c6 d0 00 00 00 41 83
  [   60.135498] RSP: 0018:ff4d5d94803fce68 EFLAGS: 00010246
  [   60.135506] RAX: ff314bc0710e9d18 RBX: ff314bc0710e9d00 RCX: 
0000000000000001
  [   60.135512] RDX: 0080030000000000 RSI: 000000013a8e7400 RDI: 
ff314bc06f1db240
  [   60.135518] RBP: ff4d5d94803fce88 R08: ff314bc05bd667c0 R09: 
1615e897948f1e86
  [   60.135524] R10: ffffffff9ea060c0 R11: ff4d5d94803fcff8 R12: 
0000000000000000
  [   60.135529] R13: ff314bc0710e9cb8 R14: ff314bc05bd667c0 R15: 
0000000000000002
  [   60.135532] FS:  0000000000000000(0000) GS:ff314cbbbe480000(0000) 
knlGS:0000000000000000
  [   60.135538] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  [   60.135544] CR2: 0080030000000020 CR3: 000000011be7c004 CR4: 
0000000000771ee0
  [   60.135548] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
0000000000000000
  [   60.135552] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 
0000000000000400
  [   60.135556] PKRU: 55555554
  [   60.135560] Kernel panic - not syncing: Fatal exception in interrupt
  [   61.196485] RSP: 0018:ff4d5d94803fce68 EFLAGS: 00010246
  [   61.202378] RAX: ff314bc0710e9d18 RBX: ff314bc0710e9d00 RCX: 
0000000000000001
  [   61.210411] RDX: 0080030000000000 RSI: 000000013a8e7400 RDI: 
ff314bc06f1db240
  [   61.218449] RBP: ff4d5d94803fce88 R08: ff314bc05bd667c0 R09: 
1615e897948f1e86
  [   61.226481] R10: ffffffff9ea060c0 R11: ff4d5d94803fcff8 R12: 
0000000000000000
  [   61.234518] R13: ff314bc0710e9cb8 R14: ff314bc05bd667c0 R15: 
0000000000000002
  [   61.242555] FS:  00007f85a1778900(0000) GS:ff314dbbbfac0000(0000) 
knlGS:0000000000000000
  [   61.251666] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  [   61.258143] CR2: 0000000000000030 CR3: 000000012b0ae006 CR4: 
0000000000771ee0
  [   61.266176] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
0000000000000000
  [   61.274213] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 
0000000000000400
  [   61.282245] PKRU: 55555554
  [   61.285383] Kernel Offset: 0x1bc00000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
  [   61.375816] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---
  --- 
  ProblemType: Bug
  AlsaDevices:
   total 0
   crw-rw---- 1 root audio 116,  1 Sep  6 12:25 seq
   crw-rw---- 1 root audio 116, 33 Sep  6 12:25 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.20.11-0ubuntu82.5
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
  CRDA: N/A
  CasperMD5CheckResult: unknown
  DistroRelease: Ubuntu 22.04
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
  Lsusb:
   Bus 002 Device 002: ID 0451:8140 Texas Instruments, Inc. TUSB8041 4-Port Hub
   Bus 002 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
   Bus 001 Device 002: ID 0451:8142 Texas Instruments, Inc. TUSB8041 4-Port Hub
   Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
  Lsusb-t:
   /:  Bus 02.Port 1: Dev 1, Class=root_hub, Driver=xhci_hcd/10p, 10000M
       |__ Port 10: Dev 2, If 0, Class=Hub, Driver=hub/4p, 5000M
   /:  Bus 01.Port 1: Dev 1, Class=root_hub, Driver=xhci_hcd/16p, 480M
       |__ Port 10: Dev 2, If 0, Class=Hub, Driver=hub/4p, 480M
  MachineType: NVIDIA DGXH100
  Package: linux (not installed)
  PciMultimedia:
   
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   LANG=C.UTF-8
   SHELL=/bin/bash
  ProcFB: 0 astdrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.15.0-83-generic root=UUID=43a0d64c-490d-4a20-a5e2-8c1ba57e2600 ro sysrq_always_enabled console=ttyS0,115200n8 iommu=pt
  ProcVersionSignature: Ubuntu 5.15.0-83.92-generic 5.15.116
  RelatedPackageVersions:
   linux-restricted-modules-5.15.0-83-generic N/A
   linux-backports-modules-5.15.0-83-generic  N/A
   linux-firmware                             20220329.git681281e4-0ubuntu3.18
  RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
  Tags:  jammy uec-images
  Uname: Linux 5.15.0-83-generic x86_64
  UpgradeStatus: No upgrade log present (probably fresh install)
  UserGroups: N/A
  _MarkForUpload: True
  dmi.bios.date: 05/08/2023
  dmi.bios.release: 1.0
  dmi.bios.vendor: NVIDIA
  dmi.bios.version: 1.0.7
  dmi.board.asset.tag: Default string
  dmi.board.name: DGXH100
  dmi.board.vendor: NVIDIA
  dmi.board.version: 555.07L01.0001
  dmi.chassis.asset.tag: 00000000000000000000000000000000
  dmi.chassis.type: 23
  dmi.chassis.vendor: NVIDIA
  dmi.chassis.version: 920-24387-2540-000
  dmi.modalias: dmi:bvnNVIDIA:bvr1.0.7:bd05/08/2023:br1.0:svnNVIDIA:pnDGXH100:pvrA.5:rvnNVIDIA:rnDGXH100:rvr555.07L01.0001:cvnNVIDIA:ct23:cvr920-24387-2540-000:sku920-24387-2540-000:
  dmi.product.family: DGX
  dmi.product.name: DGXH100
  dmi.product.sku: 920-24387-2540-000
  dmi.product.version: A.5
  dmi.sys.vendor: NVIDIA
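
For anyone triaging the raw addresses in the panic output above: the "Kernel Offset" line gives the KASLR slide, so a kernel-image address taken from a register dump can be mapped back to its link-time value by subtracting that offset. The following is a minimal sketch of that arithmetic only. The function names are hypothetical, the constants are copied from the log, and it assumes the address really lies inside the kernel image (heap and module addresses fall outside the relocation range and are rejected).

```python
from typing import Optional

# Values copied from the "Kernel Offset" line of the panic output.
KASLR_OFFSET = 0x1BC00000
LINK_TIME_BASE = 0xFFFFFFFF81000000
RELOC_START = 0xFFFFFFFF80000000
RELOC_END = 0xFFFFFFFFBFFFFFFF


def runtime_text_base() -> int:
    """Address at which the kernel text was actually placed on this boot."""
    return LINK_TIME_BASE + KASLR_OFFSET


def to_link_time(addr: int) -> Optional[int]:
    """Undo the KASLR slide for an address inside the relocation range.

    Returns None for addresses outside the range (for example heap
    pointers), since the offset does not apply to them.
    """
    if not (RELOC_START <= addr <= RELOC_END):
        return None
    return addr - KASLR_OFFSET


if __name__ == "__main__":
    print(hex(runtime_text_base()))
    # R10 from the qat_rsa_cb register dump.
    print(hex(to_link_time(0xFFFFFFFF9EA060C0)))
    # RAX from the same dump is a heap pointer, so this prints None.
    print(to_link_time(0xFF314BC0710E9D18))
```

Frames that the kernel already symbolized, such as qat_rsa_cb+0x3a/0x130, are expressed relative to the symbol and do not need this adjustment.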

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2034447/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
