Re: crash probablement lié à amdgpu

2023-09-27 Par sujet NoSpam

Bonjour

Le 27/09/2023 à 13:03, LECOQ Vincent a écrit :
[...]
Depuis mon dernier apt full-upgrade hier, je constate un crash assez 
rapide de ma session gnome wayland.

[...]
Linux b550 6.5.0-1-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.5.3-1 
(2023-09-13) x86_64 GNU/Linux

oktail@b550:~$ cat /etc/debian_version
trixie/sid


[...]

Les plaisirs de SID :) Ouvrir un ticket de bug est la solution et 
revenir aux versions antérieurs des paquets concernés.




crash probablement lié à amdgpu

2023-09-27 Par sujet LECOQ Vincent
Bonjour,

J'ai jeté un oeil (rapide, trop?) aux bugs ouverts sans trouver, donc je
rapporte ma petite misère.
Depuis mon dernier apt full-upgrade hier, je constate un crash assez rapide
de ma session gnome wayland.
mon dmesg indique alors:
[ 4765.695352] [ cut here ]
[ 4765.695354] WARNING: CPU: 2 PID: 721753 at
drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:615 amdgpu_irq_put+0x46/0x70
[amdgpu]
[ 4765.695512] Modules linked in: rfcomm snd_seq_dummy snd_hrtimer snd_seq
rpcsec_gss_krb5 auth_rpcgss nf_tables nfnetlink nfsv4 dns_resolver nfs
lockd grace fscache netfs qrtr cmac algif_hash algif_skcipher af_alg bnep
sunrpc binfmt_misc nls_ascii nls_cp437 intel_rapl_msr vfat
intel_rapl_common fat edac_mce_amd mt7921e btusb mt7921_common btrtl btbcm
kvm_amd mt76_connac_lib btintel btmtk mt76 bluetooth kvm mac80211
sha3_generic jitterentropy_rng irqbypass uvcvideo drbg videobuf2_vmalloc
libarc4 ghash_clmulni_intel uvc videobuf2_memops snd_hda_codec_hdmi
ansi_cprng videobuf2_v4l2 sha512_ssse3 snd_hda_intel ecdh_generic
sha512_generic snd_usb_audio videodev snd_intel_dspcfg ecc cfg80211
snd_intel_sdw_acpi snd_usbmidi_lib snd_hda_codec snd_rawmidi
videobuf2_common snd_seq_device aesni_intel snd_hda_core crypto_simd mc
cryptd snd_pci_acp6x snd_hwdep snd_pci_acp5x snd_pcm rfkill
snd_rn_pci_acp3x rapl wmi_bmof snd_timer snd_acp_config snd_soc_acpi snd
pcspkr sp5100_tco k10temp ccp watchdog snd_pci_acp3x soundcore joydev sg
[ 4765.695578]  evdev msr parport_pc ppdev lp parport fuse loop efi_pstore
configfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 btrfs
blake2b_generic efivarfs raid10 raid456 async_raid6_recov async_memcpy
async_pq async_xor async_tx xor raid6_pq libcrc32c crc32c_generic raid1
raid0 multipath linear md_mod hid_cmedia amdgpu hid_generic amdxcp
drm_buddy gpu_sched i2c_algo_bit drm_suballoc_helper usbhid uas
drm_display_helper hid usb_storage sd_mod cec rc_core dm_mod drm_ttm_helper
ttm ahci drm_kms_helper libahci nvme xhci_pci xhci_hcd nvme_core libata drm
t10_pi usbcore scsi_mod igc crc32_pclmul crc64_rocksoft crc32c_intel crc64
crc_t10dif crct10dif_generic crct10dif_pclmul i2c_piix4 crct10dif_common
usb_common scsi_common video wmi gpio_amdpt gpio_generic button
[ 4765.695636] CPU: 2 PID: 721753 Comm: kworker/u64:2 Tainted: GW
   6.5.0-1-amd64 #1  Debian 6.5.3-1
[ 4765.695639] Hardware name: BESSTAR TECH LIMITED B550/B550, BIOS 5.17
03/31/2022
[ 4765.695640] Workqueue: amdgpu-reset-dev drm_sched_job_timedout
[gpu_sched]
[ 4765.695646] RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 4765.695796] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04
88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 cf 5d 1d c4 e9 5a fd ff ff
<0f> 0b b8 ea ff ff ff e9 be 5d 1d c4 b8 ea ff ff ff e9 b4 5d 1d c4
[ 4765.695798] RSP: 0018:bc5f85a17c80 EFLAGS: 00010246
[ 4765.695800] RAX: 9642e26b1370 RBX: 96420e88 RCX:

[ 4765.695801] RDX:  RSI: 96420e8a78a8 RDI:
96420e88
[ 4765.695802] RBP: 96420e88 R08: eb8d0e5d R09:
eb8d0e5cc001
[ 4765.695803] R10: 0002 R11:  R12:
1050
[ 4765.695804] R13: 96420e8c1218 R14: 964358662000 R15:

[ 4765.695806] FS:  () GS:9650de28()
knlGS:
[ 4765.695807] CS:  0010 DS:  ES:  CR0: 80050033
[ 4765.695808] CR2: 7f6a18805760 CR3: 00010c2ae000 CR4:
00750ee0
[ 4765.695810] PKRU: 5554
[ 4765.695810] Call Trace:
[ 4765.695813]  
[ 4765.695815]  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 4765.695963]  ? __warn+0x81/0x130
[ 4765.695970]  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 4765.696108]  ? report_bug+0x191/0x1c0
[ 4765.696112]  ? handle_bug+0x3c/0x80
[ 4765.696116]  ? exc_invalid_op+0x17/0x70
[ 4765.696118]  ? asm_exc_invalid_op+0x1a/0x20
[ 4765.696123]  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 4765.696250]  gfx_v9_0_hw_fini+0x35/0x710 [amdgpu]
[ 4765.696380]  amdgpu_device_ip_suspend_phase2+0x101/0x1a0 [amdgpu]
[ 4765.696497]  ? amdgpu_device_ip_suspend_phase1+0x6f/0xe0 [amdgpu]
[ 4765.696614]  amdgpu_device_ip_suspend+0x36/0x70 [amdgpu]
[ 4765.696731]  amdgpu_device_pre_asic_reset+0xd3/0x2a0 [amdgpu]
[ 4765.696849]  amdgpu_device_gpu_recover+0x4c6/0xd70 [amdgpu]
[ 4765.696968]  amdgpu_job_timedout+0x186/0x270 [amdgpu]
[ 4765.697112]  ? srso_alias_return_thunk+0x5/0x7f
[ 4765.697118]  drm_sched_job_timedout+0x7a/0x110 [gpu_sched]
[ 4765.697124]  process_one_work+0x1e1/0x3f0
[ 4765.697128]  worker_thread+0x51/0x390
[ 4765.697130]  ? _raw_spin_lock_irqsave+0x27/0x60
[ 4765.697133]  ? __pfx_worker_thread+0x10/0x10
[ 4765.697134]  kthread+0xf7/0x130
[ 4765.697137]  ? __pfx_kthread+0x10/0x10
[ 4765.697140]  ret_from_fork+0x34/0x50
[ 4765.697143]  ? __pfx_kthread+0x10/0x10
[ 4765.697146]  ret_from_fork_asm+0x1b/0x30
[ 4765.697152]  
[ 4765.697153] ---[ end trace  ]---
[ 4765.697161] [ cut here ]

L'écran clignote du noir