[Bug 1973153] Re: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel", "howzit-kernel"
Patches are now queued for all applicable stable trees (linux-4.14.y+) -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1973153 Title: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel", "howzit-kernel" To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1973153/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1973153] Re: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel", "howzit-kernel"
A fix for this is currently queued in linux-next: commit 1bbc21785b7336619fb6a67f1fff5afdaf229acc Author: Lorenzo Pieralisi Date: Thu Apr 7 11:51:20 2022 +0100 ACPI: sysfs: Fix BERT error region memory mapping My plan is to propose this for stable after it lands during the merge window. ** Changed in: ubuntu-kernel-tests Status: New => Invalid ** Changed in: linux (Ubuntu) Status: Incomplete => In Progress ** Changed in: linux (Ubuntu Jammy) Status: Incomplete => In Progress -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1973153 Title: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel", "howzit-kernel" To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1973153/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1973153] Re: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel", "howzit-kernel"
** Changed in: linux (Ubuntu) Assignee: (unassigned) => dann frazier (dannf) ** Changed in: linux (Ubuntu Jammy) Assignee: (unassigned) => dann frazier (dannf) -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1973153 Title: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel", "howzit-kernel" To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1973153/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1973153] Re: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel"
This would fail on another ARM64 node howzit-kernel as well with Jammy 5.15.0-27-generic But without the messages from mpt3sas_cm0 dmesg output: [ 2945.964263] LTP: starting read_all_sys (read_all -d /sys -q -r 3) [ 2947.922462] WARNING! power/level is deprecated; use power/control instead [ 2949.808190] bdi 7:6: the stable_pages_required attribute has been removed. Use the stable_writes queue attribute instead. [ 2949.931340] No UUID available providing old NGUID [ 2949.937332] No UUID available providing old NGUID [ 2949.937814] No UUID available providing old NGUID [ 2950.630986] Unable to handle kernel paging request at virtual address 80007cf003bf [ 2950.631121] Unable to handle kernel paging request at virtual address 80007cf003bf [ 2950.631160] Unable to handle kernel paging request at virtual address 80007cf003bf [ 2950.631168] Mem abort info: [ 2950.631172] ESR = 0x9621 [ 2950.631175] EC = 0x25: DABT (current EL), IL = 32 bits [ 2950.631180] SET = 0, FnV = 0 [ 2950.631183] EA = 0, S1PTW = 0 [ 2950.631186] FSC = 0x21: alignment fault [ 2950.631190] Data abort info: [ 2950.631193] ISV = 0, ISS = 0x0021 [ 2950.631196] CM = 0, WnR = 0 [ 2950.631199] swapper pgtable: 4k pages, 48-bit VAs, pgdp=400a02db7000 [ 2950.631204] [80007cf003bf] pgd=181b3003, p4d=181b3003, pud=18000c3dc003, pmd=18007a118003, pte=006888230f0f [ 2950.631228] Internal error: Oops: 9621 [#1] SMP [ 2950.631236] Modules linked in: arm_spe_pmu efi_pstore nls_iso8859_1 input_leds joydev acpi_ipmi ipmi_ssif arm_cmn xgene_hwmon arm_dmc620_pmu arm_dsu_pmu cppc_cpufreq acpi_tad sch_fq_codel dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua ipmi_devintf ipmi_msghandler ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor xor_neon raid6_pq libcrc32c raid1 raid0 multipath linear hid_generic usbhid cdc_ether hid usbnet mlx5_ib ib_uverbs ib_core uas usb_storage ast drm_vram_helper drm_ttm_helper ttm i2c_algo_bit drm_kms_helper crct10dif_ce ghash_ce syscopyarea sha2_ce sysfillrect sysimgblt sha256_arm64 fb_sys_fops mlx5_core sha1_ce cec rc_core nvme mlxfw drm xhci_pci psample nvme_core xhci_pci_renesas tls aes_neon_bs aes_neon_blk aes_ce_blk crypto_simd cryptd aes_ce_cipher [ 2950.631420] CPU: 5 PID: 46308 Comm: read_all Not tainted 5.15.0-27-generic #28-Ubuntu [ 2950.631426] Hardware name: WIWYNN Mt.Jade Server System B81.030Z1.0007/Mt.Jade Motherboard, BIOS 1.6.20210526 (SCP: 1.06.20210526) 2021/05/26 [ 2950.631430] pstate: 8049 (Nzcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--) [ 2950.631437] pc : __memcpy+0x168/0x260 [ 2950.631456] lr : memory_read_from_buffer+0x58/0x80 [ 2950.631470] sp : 8000776ebb60 [ 2950.631473] x29: 8000776ebb60 x28: 07fff4eb x27: [ 2950.631485] x26: x25: x24: 07ff8ac2ae20 [ 2950.631495] x23: 8000776ebc60 x22: 03ff x21: 8000776ebbc8 [ 2950.631506] x20: 03ff x19: 03ff x18: [ 2950.631516] x17: x16: x15: [ 2950.631526] x14: x13: x12: [ 2950.631536] x11: x10: x9 : [ 2950.631546] x8 : x7 : x6 : [ 2950.631556] x5 : 07fff26a6bff x4 : 80007cf003ff x3 : 07fff26a6b80 [ 2950.631566] x2 : ffef x1 : 80007cf003c0 x0 : 07fff26a6800 [ 2950.631576] Call trace: [ 2950.631579] __memcpy+0x168/0x260 [ 2950.631585] acpi_data_show+0x5c/0x90 [ 2950.631596] sysfs_kf_bin_read+0x78/0xa0 [ 2950.631607] kernfs_file_read_iter+0x9c/0x1a4 [ 2950.631613] kernfs_fop_read_iter+0x34/0x50 [ 2950.631619] new_sync_read+0xf0/0x184 [ 2950.631629] vfs_read+0x158/0x1f0 [ 2950.631635] ksys_read+0x74/0x100 [ 2950.631640] __arm64_sys_read+0x28/0x34 [ 2950.631645] invoke_syscall+0x78/0x100 [ 2950.631657] el0_svc_common.constprop.0+0x54/0x184 [ 2950.631664] do_el0_svc+0x34/0x9c [ 2950.631670] el0_svc+0x48/0x1b0 [ 2950.631681] el0t_64_sync_handler+0xa4/0x130 [ 2950.631686] el0t_64_sync+0x1a4/0x1a8 [ 2950.631697] Code: a984346c a9c4342c f1010042 54fffee8 (a97c3c8e) [ 2950.631703] ---[ end trace 36f3d711c3548ceb ]--- [ 2950.638926] Mem abort info: [ 2950.646846] Mem abort info: [ 2950.646852] ESR = 0x9621 [ 2950.646855] EC = 0x25: DABT (current EL), IL = 32 bits [ 2950.646860] SET = 0, FnV = 0 [ 2950.646861] EA = 0, S1PTW = 0 [ 2950.646866] FSC = 0x21: alignment fault [ 2950.654786] ESR = 0x9621 [ 2950.654792] EC = 0x25: DABT (current EL), IL = 32 bits [ 2950.654797] SET = 0, FnV = 0 [ 2950.654800] EA = 0, S1PTW = 0 [ 2950.657589] Data abort info: [ 2950.657593] ISV = 0, ISS = 0x0021 [ 2950.657596] CM = 0, WnR = 0 [ 2950.657599] swapper pgtable: 4k pages, 48-bit VAs, pgdp=400
[Bug 1973153] Re: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel"
** Summary changed: - kernel oops tiggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel" + kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel" ** Description changed: Issue found on Jammy 5.17.0-8-generic #8~22.04.2-Ubuntu and Jammy 5.15.0-27-generic with ARM64 node helo-kernel only. It looks like this is hardware-specific. The read_all_sys test in fs from ubuntu_ltp will cause kernel oops and test will timeout. + + Steps to reproduce this: + git clone -b sru git://git.launchpad.net/~canonical-kernel-team/+git/ltp + cd ltp + make autotools + ./configure + make + make install + echo "read_all_sys read_all -d /sys -q -r 3" > /tmp/fs + sudo /opt/ltp/runltp -f /tmp/fs Test log: <<>> tag=read_all_sys stime=1652343855 cmdline="read_all -d /sys -q -r 3" contacts="" analysis=exit <<>> incrementing stop tst_test.c:1456: TINFO: Timeout per run is 0h 30m 00s Test timeouted, sending SIGKILL! tst_test.c:1500: TINFO: Killed the leftover descendant processes tst_test.c:1506: TINFO: If you are running on slow machine, try exporting LTP_TIMEOUT_MUL > 1 tst_test.c:1508: TBROK: Test killed! (timeout?) Summary: passed 0 failed 0 broken 1 skipped 0 warnings 0 <<>> initiation_status="ok" duration=1800 termination_type=exited termination_id=2 corefile=no cutime=39 cstime=140 <<>> dmesg output: [ 1614.203083] LTP: starting read_all_sys (read_all -d /sys -q -r 3) [ 1617.509566] mpt3sas :8d:00.0: invalid VPD tag 0x00 (size 0) at offset 0; assume missing optional EEPROM [ 1617.543837] mpt3sas_cm0: host_trace_buffer_size_show: host_trace_buffer is not registered [ 1617.550373] mpt3sas_cm0: host_trace_buffer_size_show: host_trace_buffer is not registered [ 1617.550381] mpt3sas_cm0: host_trace_buffer_size_show: host_trace_buffer is not registered [ 1617.550474] mpt3sas_cm0: host_trace_buffer_show: host_trace_buffer is not registered [ 1617.550504] mpt3sas_cm0: host_trace_buffer_show: host_trace_buffer is not registered [ 1617.550593] mpt3sas_cm0: BRM_status_show: BRM attribute is only for warpdrive [ 1617.550622] mpt3sas_cm0: BRM_status_show: BRM attribute is only for warpdrive [ 1617.598371] mpt3sas_cm0: host_trace_buffer_show: host_trace_buffer is not registered [ 1617.606183] mpt3sas_cm0: BRM_status_show: BRM attribute is only for warpdrive [ 1617.641990] mpt3sas :8d:00.0: VPD access failed. This is likely a firmware bug on this device. Contact the card vendor for a firmware update [ 1617.973894] WARNING! power/level is deprecated; use power/control instead [ 1619.368112] bdi 7:6: the stable_pages_required attribute has been removed. Use the stable_writes queue attribute instead. [ 1627.430319] Unable to handle kernel paging request at virtual address 800033b503bf [ 1627.438256] Mem abort info: [ 1627.441086] ESR = 0x9621 [ 1627.444133] EC = 0x25: DABT (current EL), IL = 32 bits [ 1627.449469] SET = 0, FnV = 0 [ 1627.452515] EA = 0, S1PTW = 0 [ 1627.455676] FSC = 0x21: alignment fault [ 1627.459701] Data abort info: [ 1627.462597] ISV = 0, ISS = 0x0021 [ 1627.466449] CM = 0, WnR = 0 [ 1627.469434] swapper pgtable: 4k pages, 48-bit VAs, pgdp=f4aba000 [ 1627.476160] [800033b503bf] pgd=10bffcfff003, p4d=10bffcfff003, pud=10bffcffe003, pmd=1008948f5003, pte=006880213f0f [ 1627.488712] Internal error: Oops: 9621 [#1] SMP [ 1627.493585] Modules linked in: efi_pstore nls_iso8859_1 joydev input_leds acpi_ipmi ipmi_ssif thunderx2_pmu cppc_cpufreq sch_fq_codel dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua ipmi_devintf ipmi_msghandler ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor xor_neon raid6_pq libcrc32c raid1 raid0 multipath linear hid_generic usbhid uas hid usb_storage ast drm_vram_helper drm_ttm_helper i2c_smbus ttm i2c_algo_bit drm_kms_helper syscopyarea crct10dif_ce sysfillrect ghash_ce sysimgblt sha2_ce fb_sys_fops cec sha256_arm64 rc_core sha1_ce qede mpt3sas qed raid_class drm scsi_transport_sas xhci_pci ahci xhci_pci_renesas gpio_xlp i2c_xlp9xx aes_neon_bs aes_neon_blk aes_ce_blk aes_ce_cipher [ 1627.500861] Unable to handle kernel paging request at virtual address 800033b503bf [ 1627.562614] CPU: 71 PID: 4190 Comm: read_all Not tainted 5.17.0-8-generic #8~22.04.2-Ubuntu [ 1627.562623] Hardware name: To be filled by O.E.M. Saber/Saber, BIOS 0ACKL030 06/04/2020 [ 1627.562626] pstate: 8049 (Nzcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--) [ 1627.570544] Mem abort info: [ 1627.571497] Unable to handle kernel paging request at virtual address 800033b503bf [ 1627.571505] Mem abort info: [ 1627.571508] ESR = 0x9621 [ 1627.571511] EC = 0x25: DABT (current EL), IL = 32 bits [ 1627.571516] SET = 0