Hi, I notice a bug report on Bugzilla [1]. Quoting from it:
> ith Kernel 6.5.5, I've noticed a sudden system slowdown during a heavy > compilation workload even though plenty of free RAM still available (only > around 9 GB used out of 48 GB RAM in total). I first thought of being OOM, > but as stated, plenty of free RAM was still available. The system freezed for > a couple of seconds, became responsive again for some seconds but this cycle > repeated itself until rebooting the system. While the compilation workload > could finish (albeit slowly), the system became next to unusable for anything > else. > > In the logs, I've seen the following: > > [ 4394.309138] ------------[ cut here ]------------ > [ 4394.309141] ------------[ cut here ]------------ > [ 4394.309143] WARNING: CPU: 11 PID: 436221 at kernel/rcu/tree_nocb.h:124 > call_rcu+0xa85/0x14e0 > [ 4394.309142] WARNING: CPU: 8 PID: 366 at kernel/rcu/tree_nocb.h:104 > lazy_rcu_shrink_scan+0x311/0x340 > [ 4394.309150] Modules linked in: > [ 4394.309151] Modules linked in: > [ 4394.309151] vfat > [ 4394.309152] vfat > [ 4394.309153] fat > [ 4394.309153] fat > [ 4394.309155] snd_hda_codec_realtek > [ 4394.309155] snd_hda_codec_realtek > [ 4394.309156] snd_hda_codec_generic > [ 4394.309156] snd_hda_codec_generic ledtrig_audio > [ 4394.309158] ledtrig_audio intel_rapl_msr > [ 4394.309159] intel_rapl_msr > [ 4394.309160] snd_hda_codec_hdmi > [ 4394.309160] snd_hda_codec_hdmi > [ 4394.309162] intel_rapl_common > [ 4394.309162] intel_rapl_common > [ 4394.309163] sb_edac > [ 4394.309163] sb_edac x86_pkg_temp_thermal > [ 4394.309165] x86_pkg_temp_thermal intel_powerclamp > [ 4394.309166] intel_powerclamp snd_hda_intel > [ 4394.309167] snd_hda_intel snd_intel_dspcfg > [ 4394.309168] snd_intel_dspcfg snd_hda_codec > [ 4394.309170] snd_hda_codec snd_hwdep > [ 4394.309171] snd_hwdep > [ 4394.309172] coretemp > [ 4394.309172] coretemp > [ 4394.309173] crct10dif_pclmul > [ 4394.309173] crct10dif_pclmul snd_hda_core > [ 4394.309174] snd_hda_core crc32_pclmul > [ 4394.309175] crc32_pclmul > [ 4394.309177] polyval_clmulni > [ 4394.309177] polyval_clmulni polyval_generic > [ 4394.309178] polyval_generic gf128mul > [ 4394.309179] gf128mul > [ 4394.309180] ghash_clmulni_intel > [ 4394.309180] ghash_clmulni_intel sha512_ssse3 > [ 4394.309181] sha512_ssse3 snd_pcm > [ 4394.309182] snd_pcm > [ 4394.309184] sha256_ssse3 > [ 4394.309184] sha256_ssse3 > [ 4394.309185] sha1_ssse3 > [ 4394.309185] sha1_ssse3 > [ 4394.309186] aesni_intel > [ 4394.309186] aesni_intel > [ 4394.309188] snd_timer > [ 4394.309188] snd_timer crypto_simd > [ 4394.309189] crypto_simd cryptd > [ 4394.309190] cryptd i2c_i801 > [ 4394.309191] i2c_i801 acpi_cpufreq > [ 4394.309192] acpi_cpufreq i2c_smbus > [ 4394.309194] i2c_smbus snd > [ 4394.309195] snd igb > [ 4394.309196] igb lpc_ich > [ 4394.309197] lpc_ich > [ 4394.309198] soundcore mei_wdt > [ 4394.309199] soundcore mei_wdt > [ 4394.309200] razerkbd(O) > [ 4394.309201] razerkbd(O) > [ 4394.309201] mousedev sch_fq_codel > [ 4394.309202] mousedev sch_fq_codel > [ 4394.309203] usbip_host usbip_core > [ 4394.309204] usbip_host usbip_core > [ 4394.309206] pkcs8_key_parser crypto_user > [ 4394.309207] pkcs8_key_parser crypto_user > [ 4394.309208] fuse loop > [ 4394.309209] fuse > [ 4394.309210] zram > [ 4394.309210] loop zram > [ 4394.309211] bpf_preload ip_tables > [ 4394.309212] bpf_preload > [ 4394.309214] x_tables > [ 4394.309214] ip_tables x_tables > [ 4394.309215] ext4 > [ 4394.309216] ext4 > [ 4394.309216] crc32c_generic mbcache > [ 4394.309218] crc32c_generic mbcache > [ 4394.309219] crc16 > [ 4394.309220] crc16 > [ 4394.309220] jbd2 usbhid > [ 4394.309221] jbd2 usbhid > [ 4394.309222] amdgpu mfd_core > [ 4394.309223] amdgpu > [ 4394.309224] drm_buddy drm_suballoc_helper > [ 4394.309225] mfd_core > [ 4394.309226] crc32c_intel > [ 4394.309226] drm_buddy drm_suballoc_helper > [ 4394.309227] video drm_ttm_helper > [ 4394.309228] crc32c_intel > [ 4394.309229] ttm i2c_algo_bit > [ 4394.309230] video drm_ttm_helper > [ 4394.309231] drm_display_helper cec > [ 4394.309232] ttm i2c_algo_bit > [ 4394.309234] xhci_pci gpu_sched > [ 4394.309235] drm_display_helper > [ 4394.309236] xhci_pci_renesas > [ 4394.309236] cec xhci_pci > [ 4394.309237] amdxcp wmi > [ 4394.309238] gpu_sched xhci_pci_renesas > > [ 4394.309240] amdxcp wmi > [ 4394.309240] CPU: 8 PID: 366 Comm: kswapd0 Tainted: G O > 6.5.5-2.1-cachyos-lto #1 cc6c6175d85fa32a794326555b105a9657ffd076 > [ 4394.309244] Hardware name: LENOVO GAMING TF/X99-TF Gaming, BIOS CX99DE26 > 10/10/2020 > [ 4394.309242] CPU: 11 PID: 436221 Comm: cc1plus Tainted: G O > 6.5.5-2.1-cachyos-lto #1 cc6c6175d85fa32a794326555b105a9657ffd076 > [ 4394.309245] RIP: 0010:lazy_rcu_shrink_scan+0x311/0x340 > [ 4394.309246] Hardware name: LENOVO GAMING TF/X99-TF Gaming, BIOS CX99DE26 > 10/10/2020 > [ 4394.309248] Code: 03 00 00 75 25 4c 89 e7 e8 ac 81 87 00 f0 41 ff 8e 74 01 > 00 00 41 f6 86 e8 00 00 00 04 0f 85 1a fe ff ff 0f 0b e9 13 fe ff ff <0f> 0b > eb d7 48 c7 c7 20 ff ec 9c e8 ff d1 86 00 31 c0 49 83 fd 01 > [ 4394.309248] RIP: 0010:call_rcu+0xa85/0x14e0 > [ 4394.309250] RSP: 0000:ffff8e82a3fdba88 EFLAGS: 00010093 > [ 4394.309251] Code: 02 00 00 00 0f 85 f0 00 00 00 f6 83 e8 00 00 00 01 74 ac > 48 8b 83 80 00 00 00 48 83 38 00 0f 94 c0 89 44 24 08 e9 06 f9 ff ff <0f> 0b > 0f ae e8 e9 d3 f6 ff ff 0f 0b e9 cc f6 ff ff 4c 89 ef e8 62 > > [ 4394.309253] RAX: 0000000000000008 RBX: 000000000000000b RCX: > 0000000000000000 > [ 4394.309254] RSP: 0000:ffff8e7a8a033388 EFLAGS: 00010002 > [ 4394.309255] RDX: ffff8e84cf8b0148 RSI: 0000000000000286 RDI: > ffff8e84cf8f0140 > > [ 4394.309257] RBP: 0000000000000001 R08: ffff8e84cf82c540 R09: > 0000000000000094 > [ 4394.309258] RAX: 000000000000000b RBX: ffff8e84cf8eff40 RCX: > 0000000000001388 > [ 4394.309258] R10: 0000000000000001 R11: 0000000000000000 R12: > ffff8e84cf8f0140 > [ 4394.309260] RDX: 40000077ed480115 RSI: ffff8e84cf8effb8 RDI: > ffff8e7db6814da8 > [ 4394.309260] R13: 0000000000000006 R14: ffff8e84cf8eff40 R15: > 0000000000000202 > [ 4394.309262] FS: 0000000000000000(0000) GS:ffff8e84cf800000(0000) > knlGS:0000000000000000 > [ 4394.309262] RBP: 0000000000000002 R08: 0000000000000040 R09: > ffff8e7db6814d90 > [ 4394.309264] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [ 4394.309264] R10: 000000000000000c R11: ffff8e843519d4f0 R12: > ffff8e84cf8f0148 > [ 4394.309266] CR2: 00007f86c9ef0000 CR3: 0000000163681001 CR4: > 00000000001706e0 > [ 4394.309266] R13: 0000000000000002 R14: 0000000000000000 R15: > ffff8e7db6814da8 > [ 4394.309268] Call Trace: > [ 4394.309268] FS: 00007f1566065b40(0000) GS:ffff8e84cf8c0000(0000) > knlGS:0000000000000000 > [ 4394.309270] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [ 4394.309271] <TASK> > [ 4394.309272] CR2: 00007f1548773000 CR3: 00000007a5401003 CR4: > 00000000001706e0 > [ 4394.309274] Call Trace: > [ 4394.309276] <TASK> > [ 4394.309274] ? __warn+0x9e/0x160 > [ 4394.309278] ? lazy_rcu_shrink_scan+0x311/0x340 > [ 4394.309278] ? __warn+0x9e/0x160 > [ 4394.309282] ? call_rcu+0xa85/0x14e0 > [ 4394.309281] ? report_bug+0x147/0x180 > [ 4394.309284] ? handle_bug+0x48/0x80 > [ 4394.309285] ? report_bug+0x147/0x180 > [ 4394.309287] ? exc_invalid_op+0x16/0x40 > [ 4394.309289] ? handle_bug+0x48/0x80 > [ 4394.309290] ? asm_exc_invalid_op+0x16/0x20 > [ 4394.309292] ? exc_invalid_op+0x16/0x40 > [ 4394.309294] ? lazy_rcu_shrink_scan+0x311/0x340 > [ 4394.309295] ? asm_exc_invalid_op+0x16/0x20 > [ 4394.309297] ? lazy_rcu_shrink_scan+0x10e/0x340 > [ 4394.309299] shrink_slab+0x95a/0xae0 > [ 4394.309300] ? call_rcu+0xa85/0x14e0 > [ 4394.309303] shrink_node+0x1b48/0x29c0 > [ 4394.309303] ? sysvec_apic_timer_interrupt+0x11/0x80 > [ 4394.309308] kswapd+0x9fd/0x13e0 > [ 4394.309307] xas_store+0xb7a/0x1480 > [ 4394.309311] ? schedule+0x61a/0x1d20 > [ 4394.309313] shadow_lru_isolate+0x103/0x1c0 > [ 4394.309314] kthread+0x1bd/0x260 > [ 4394.309317] ? check_move_unevictable_folios+0x4a0/0x4a0 > [ 4394.309318] ? xas_store+0x1480/0x1480 > [ 4394.309321] ? count_shadow_nodes+0x460/0x460 > [ 4394.309321] ? __blk_mq_get_driver_tag+0x180/0x180 > [ 4394.309325] ret_from_fork+0x30/0x40 > [ 4394.309324] __list_lru_walk_one+0x10f/0x1c0 > [ 4394.309328] ? __blk_mq_get_driver_tag+0x180/0x180 > [ 4394.309331] ret_from_fork_asm+0x11/0x20 > [ 4394.309331] ? count_shadow_nodes+0x460/0x460 > [ 4394.309334] </TASK> > [ 4394.309334] list_lru_walk_one_irq+0x53/0x80 > [ 4394.309335] ---[ end trace 0000000000000000 ]--- > [ 4394.309338] shrink_slab+0x95a/0xae0 > [ 4394.309341] shrink_node+0x1b48/0x29c0 > [ 4394.309344] ? __rmqueue_pcplist+0x2e4/0x1660 > [ 4394.309350] do_try_to_free_pages+0x21b/0x700 > [ 4394.309355] try_to_free_pages+0x193/0xb80 > [ 4394.309361] __folio_alloc+0xa65/0x6e80 > [ 4394.309364] ? __alloc_pages+0x19b/0x3580 > [ 4394.309370] ? asm_sysvec_apic_timer_interrupt+0x16/0x20 > [ 4394.309374] ? __pte_offset_map_lock+0x143/0x200 > [ 4394.309379] do_pte_missing+0x647/0x2da0 > [ 4394.309385] ? task_tick_fair+0x745/0x2880 > [ 4394.309390] handle_mm_fault+0x97d/0xdc0 > [ 4394.309395] do_user_addr_fault+0x66a/0x2f80 > [ 4394.309401] exc_page_fault+0x66/0x160 > [ 4394.309405] asm_exc_page_fault+0x22/0x30 > [ 4394.309408] RIP: 0033:0x7f1560da1c5d > [ 4394.309425] Code: 4c 17 f0 c3 66 0f 1f 84 00 00 00 00 00 48 8b 4c 16 f8 48 > 8b 36 48 89 37 48 89 4c 17 f8 c3 c5 fe 6f 54 16 e0 c5 fe 6f 5c 16 c0 <c5> fe > 7f 07 c5 fe 7f 4f 20 c5 fe 7f 54 17 e0 c5 fe 7f 5c 17 c0 c5 > [ 4394.309426] RSP: 002b:00007ffcbe93e318 EFLAGS: 00010246 > [ 4394.309428] RAX: 00007f1548773000 RBX: 00007f1548773000 RCX: > 000000000000001f > [ 4394.309430] RDX: 0000000000000080 RSI: 00007f155c0dc880 RDI: > 00007f1548773000 > [ 4394.309431] RBP: 00007ffcbe93e370 R08: 0000000000000000 R09: > 00000007f1548773 > [ 4394.309433] R10: 00007f1548773000 R11: 000000000000a795 R12: > 00007f155c0dc880 > [ 4394.309434] R13: 0000000000000026 R14: 0000000000000026 R15: > 0000000000000003 > [ 4394.309437] </TASK> > [ 4394.309438] ---[ end trace 0000000000000000 ]--- See Bugzilla for the full thread and attached full dmesg output. Thanks. [1]: https://bugzilla.kernel.org/show_bug.cgi?id=217948 -- An old man doll... just what I always wanted! - Clara
