[Bug 1973153] Re: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel", "howzit-kernel"

2022-05-26 Thread dann frazier
Patches are now queued for all applicable stable trees (linux-4.14.y+)

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1973153

Title:
  kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 /
  J-5.17 ARM64 "helo-kernel", "howzit-kernel"

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1973153/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1973153] Re: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel", "howzit-kernel"

2022-05-21 Thread dann frazier
A fix for this is currently queued in linux-next:

commit 1bbc21785b7336619fb6a67f1fff5afdaf229acc
Author: Lorenzo Pieralisi 
Date:   Thu Apr 7 11:51:20 2022 +0100

    ACPI: sysfs: Fix BERT error region memory mapping

My plan is to propose this for stable after it lands during the merge
window.

** Changed in: ubuntu-kernel-tests
   Status: New => Invalid

** Changed in: linux (Ubuntu)
   Status: Incomplete => In Progress

** Changed in: linux (Ubuntu Jammy)
   Status: Incomplete => In Progress

[Bug 1973153] Re: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel", "howzit-kernel"

2022-05-19 Thread dann frazier
** Changed in: linux (Ubuntu)
 Assignee: (unassigned) => dann frazier (dannf)

** Changed in: linux (Ubuntu Jammy)
 Assignee: (unassigned) => dann frazier (dannf)


[Bug 1973153] Re: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel"

2022-05-12 Thread Po-Hsu Lin
This also fails on another ARM64 node, howzit-kernel, with Jammy
5.15.0-27-generic, but without the messages from mpt3sas_cm0.

dmesg output:
[ 2945.964263] LTP: starting read_all_sys (read_all -d /sys -q -r 3)
[ 2947.922462] WARNING! power/level is deprecated; use power/control instead
[ 2949.808190] bdi 7:6: the stable_pages_required attribute has been removed. Use the stable_writes queue attribute instead.
[ 2949.931340] No UUID available providing old NGUID
[ 2949.937332] No UUID available providing old NGUID
[ 2949.937814] No UUID available providing old NGUID
[ 2950.630986] Unable to handle kernel paging request at virtual address 80007cf003bf
[ 2950.631121] Unable to handle kernel paging request at virtual address 80007cf003bf
[ 2950.631160] Unable to handle kernel paging request at virtual address 80007cf003bf
[ 2950.631168] Mem abort info:
[ 2950.631172]   ESR = 0x9621
[ 2950.631175]   EC = 0x25: DABT (current EL), IL = 32 bits
[ 2950.631180]   SET = 0, FnV = 0
[ 2950.631183]   EA = 0, S1PTW = 0
[ 2950.631186]   FSC = 0x21: alignment fault
[ 2950.631190] Data abort info:
[ 2950.631193]   ISV = 0, ISS = 0x0021
[ 2950.631196]   CM = 0, WnR = 0
[ 2950.631199] swapper pgtable: 4k pages, 48-bit VAs, pgdp=400a02db7000
[ 2950.631204] [80007cf003bf] pgd=181b3003, p4d=181b3003, pud=18000c3dc003, pmd=18007a118003, pte=006888230f0f
[ 2950.631228] Internal error: Oops: 9621 [#1] SMP
[ 2950.631236] Modules linked in: arm_spe_pmu efi_pstore nls_iso8859_1 input_leds joydev acpi_ipmi ipmi_ssif arm_cmn xgene_hwmon arm_dmc620_pmu arm_dsu_pmu cppc_cpufreq acpi_tad sch_fq_codel dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua ipmi_devintf ipmi_msghandler ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor xor_neon raid6_pq libcrc32c raid1 raid0 multipath linear hid_generic usbhid cdc_ether hid usbnet mlx5_ib ib_uverbs ib_core uas usb_storage ast drm_vram_helper drm_ttm_helper ttm i2c_algo_bit drm_kms_helper crct10dif_ce ghash_ce syscopyarea sha2_ce sysfillrect sysimgblt sha256_arm64 fb_sys_fops mlx5_core sha1_ce cec rc_core nvme mlxfw drm xhci_pci psample nvme_core xhci_pci_renesas tls aes_neon_bs aes_neon_blk aes_ce_blk crypto_simd cryptd aes_ce_cipher
[ 2950.631420] CPU: 5 PID: 46308 Comm: read_all Not tainted 5.15.0-27-generic #28-Ubuntu
[ 2950.631426] Hardware name: WIWYNN Mt.Jade Server System B81.030Z1.0007/Mt.Jade Motherboard, BIOS 1.6.20210526 (SCP: 1.06.20210526) 2021/05/26
[ 2950.631430] pstate: 8049 (Nzcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[ 2950.631437] pc : __memcpy+0x168/0x260
[ 2950.631456] lr : memory_read_from_buffer+0x58/0x80
[ 2950.631470] sp : 8000776ebb60
[ 2950.631473] x29: 8000776ebb60 x28: 07fff4eb x27: 
[ 2950.631485] x26:  x25:  x24: 07ff8ac2ae20
[ 2950.631495] x23: 8000776ebc60 x22: 03ff x21: 8000776ebbc8
[ 2950.631506] x20: 03ff x19: 03ff x18: 
[ 2950.631516] x17:  x16:  x15: 
[ 2950.631526] x14:  x13:  x12: 
[ 2950.631536] x11:  x10:  x9 : 
[ 2950.631546] x8 :  x7 :  x6 : 
[ 2950.631556] x5 : 07fff26a6bff x4 : 80007cf003ff x3 : 07fff26a6b80
[ 2950.631566] x2 : ffef x1 : 80007cf003c0 x0 : 07fff26a6800
[ 2950.631576] Call trace:
[ 2950.631579]  __memcpy+0x168/0x260
[ 2950.631585]  acpi_data_show+0x5c/0x90
[ 2950.631596]  sysfs_kf_bin_read+0x78/0xa0
[ 2950.631607]  kernfs_file_read_iter+0x9c/0x1a4
[ 2950.631613]  kernfs_fop_read_iter+0x34/0x50
[ 2950.631619]  new_sync_read+0xf0/0x184
[ 2950.631629]  vfs_read+0x158/0x1f0
[ 2950.631635]  ksys_read+0x74/0x100
[ 2950.631640]  __arm64_sys_read+0x28/0x34
[ 2950.631645]  invoke_syscall+0x78/0x100
[ 2950.631657]  el0_svc_common.constprop.0+0x54/0x184
[ 2950.631664]  do_el0_svc+0x34/0x9c
[ 2950.631670]  el0_svc+0x48/0x1b0
[ 2950.631681]  el0t_64_sync_handler+0xa4/0x130
[ 2950.631686]  el0t_64_sync+0x1a4/0x1a8
[ 2950.631697] Code: a984346c a9c4342c f1010042 54fffee8 (a97c3c8e) 
[ 2950.631703] ---[ end trace 36f3d711c3548ceb ]---
[ 2950.638926] Mem abort info:
[ 2950.646846] Mem abort info:
[ 2950.646852]   ESR = 0x9621
[ 2950.646855]   EC = 0x25: DABT (current EL), IL = 32 bits
[ 2950.646860]   SET = 0, FnV = 0
[ 2950.646861]   EA = 0, S1PTW = 0
[ 2950.646866]   FSC = 0x21: alignment fault
[ 2950.654786]   ESR = 0x9621
[ 2950.654792]   EC = 0x25: DABT (current EL), IL = 32 bits
[ 2950.654797]   SET = 0, FnV = 0
[ 2950.654800]   EA = 0, S1PTW = 0
[ 2950.657589] Data abort info:
[ 2950.657593]   ISV = 0, ISS = 0x0021
[ 2950.657596]   CM = 0, WnR = 0
[ 2950.657599] swapper pgtable: 4k pages, 48-bit VAs, 
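
The faulting instruction is the parenthesised word on the Code: line above,
a97c3c8e. As a minimal sketch, assuming that word is an A64 64-bit LDP in the
signed-offset form and taking x4 exactly as it is printed in the register dump,
the following shows how the reported fault address falls out. The function name
decode_ldp_signed_offset is made up for this illustration and is not part of
any kernel tooling.

```python
def decode_ldp_signed_offset(insn: int):
    """Decode an A64 64-bit LDP/STP word in the signed-offset form.

    Returns (is_load, rt, rt2, rn, byte_offset).
    Raises ValueError for any other encoding.
    """
    # Bits 31:23 are fixed for this form: opc=10, 101, V=0, 010.
    if (insn & 0xFF800000) != 0xA9000000:
        raise ValueError("not a 64-bit LDP/STP signed-offset encoding")
    is_load = bool((insn >> 22) & 1)   # L bit: 1 = LDP, 0 = STP
    imm7 = (insn >> 15) & 0x7F         # signed 7-bit immediate
    if imm7 & 0x40:
        imm7 -= 0x80                   # sign-extend
    rt2 = (insn >> 10) & 0x1F
    rn = (insn >> 5) & 0x1F
    rt = insn & 0x1F
    return is_load, rt, rt2, rn, imm7 * 8  # immediate is scaled by 8 bytes


# Values copied from the log as printed (assumption: only the low bits matter
# for the alignment check).
FAULT_INSN = 0xA97C3C8E
X4_AS_PRINTED = 0x80007CF003FF

is_load, rt, rt2, rn, offset = decode_ldp_signed_offset(FAULT_INSN)
access_addr = X4_AS_PRINTED + offset
print(f"{'ldp' if is_load else 'stp'} x{rt}, x{rt2}, [x{rn}, #{offset}]")
print(f"access address {access_addr:#x}, low 3 bits {access_addr & 7}")
```

The decoded base register is x4 and the offset is -64, so the access lands on
the address reported in the "Unable to handle kernel paging request" lines,
which is not 8-byte aligned. On arm64 an unaligned access to a Device-type
mapping raises an alignment fault, which would be consistent with
FSC = 0x21 if the region read by acpi_data_show was mapped that way.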

[Bug 1973153] Re: kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel"

2022-05-12 Thread Po-Hsu Lin
** Summary changed:

- kernel oops tiggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel"
+ kernel oops triggered by read_all_sys in ubuntu_ltp/fs on J-5.15 / J-5.17 ARM64 "helo-kernel"

** Description changed:

  Issue found on Jammy 5.17.0-8-generic #8~22.04.2-Ubuntu and Jammy
  5.15.0-27-generic with ARM64 node helo-kernel only.
  
  It looks like this is hardware-specific.
  
  The read_all_sys test in fs from ubuntu_ltp causes a kernel oops, and
  the test then times out.
+ 
+ Steps to reproduce this:
+ git clone -b sru git://git.launchpad.net/~canonical-kernel-team/+git/ltp
+ cd ltp
+ make autotools
+ ./configure
+ make
+ make install
+ echo "read_all_sys read_all -d /sys -q -r 3" > /tmp/fs
+ sudo /opt/ltp/runltp -f /tmp/fs
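
A narrower check than the full read_all run may be possible. The sketch below
is an assumption-laden illustration, not part of LTP: the helper name
read_in_chunks is hypothetical, the 1023-byte chunk size mirrors the 0x3ff
count visible in the register dump elsewhere in this thread, and the sysfs path
mentioned afterwards is inferred from acpi_data_show in the call trace and the
title of the fix.

```python
import os


def read_in_chunks(path: str, chunk_size: int = 1023) -> int:
    """Read `path` to EOF with plain read() calls and return the byte count.

    Unbuffered os.read() is used so each call maps to one read syscall,
    similar to how a sysfs walker would read a file.
    """
    total = 0
    fd = os.open(path, os.O_RDONLY)
    try:
        while True:
            data = os.read(fd, chunk_size)
            if not data:  # empty result means EOF
                break
            total += len(data)
    finally:
        os.close(fd)
    return total
```

Calling read_in_chunks("/sys/firmware/acpi/tables/data/BERT") as root would,
under the assumptions above, exercise the same acpi_data_show path. On an
affected kernel that read is expected to oops, so it belongs on a test machine
only.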
  
  Test log:
  <<>>
  tag=read_all_sys stime=1652343855
  cmdline="read_all -d /sys -q -r 3"
  contacts=""
  analysis=exit
  <<>>
  incrementing stop
  tst_test.c:1456: TINFO: Timeout per run is 0h 30m 00s
  Test timeouted, sending SIGKILL!
  tst_test.c:1500: TINFO: Killed the leftover descendant processes
  tst_test.c:1506: TINFO: If you are running on slow machine, try exporting LTP_TIMEOUT_MUL > 1
  tst_test.c:1508: TBROK: Test killed! (timeout?)
  
  Summary:
  passed   0
  failed   0
  broken   1
  skipped  0
  warnings 0
  <<>>
  initiation_status="ok"
  duration=1800 termination_type=exited termination_id=2 corefile=no
  cutime=39 cstime=140
  <<>>
  
  dmesg output:
  [ 1614.203083] LTP: starting read_all_sys (read_all -d /sys -q -r 3)
  [ 1617.509566] mpt3sas :8d:00.0: invalid VPD tag 0x00 (size 0) at offset 0; assume missing optional EEPROM
  [ 1617.543837] mpt3sas_cm0: host_trace_buffer_size_show: host_trace_buffer is not registered
  [ 1617.550373] mpt3sas_cm0: host_trace_buffer_size_show: host_trace_buffer is not registered
  [ 1617.550381] mpt3sas_cm0: host_trace_buffer_size_show: host_trace_buffer is not registered
  [ 1617.550474] mpt3sas_cm0: host_trace_buffer_show: host_trace_buffer is not registered
  [ 1617.550504] mpt3sas_cm0: host_trace_buffer_show: host_trace_buffer is not registered
  [ 1617.550593] mpt3sas_cm0: BRM_status_show: BRM attribute is only for warpdrive
  [ 1617.550622] mpt3sas_cm0: BRM_status_show: BRM attribute is only for warpdrive
  [ 1617.598371] mpt3sas_cm0: host_trace_buffer_show: host_trace_buffer is not registered
  [ 1617.606183] mpt3sas_cm0: BRM_status_show: BRM attribute is only for warpdrive
  [ 1617.641990] mpt3sas :8d:00.0: VPD access failed.  This is likely a firmware bug on this device.  Contact the card vendor for a firmware update
  [ 1617.973894] WARNING! power/level is deprecated; use power/control instead
  [ 1619.368112] bdi 7:6: the stable_pages_required attribute has been removed. Use the stable_writes queue attribute instead.
  [ 1627.430319] Unable to handle kernel paging request at virtual address 800033b503bf
  [ 1627.438256] Mem abort info:
  [ 1627.441086]   ESR = 0x9621
  [ 1627.444133]   EC = 0x25: DABT (current EL), IL = 32 bits
  [ 1627.449469]   SET = 0, FnV = 0
  [ 1627.452515]   EA = 0, S1PTW = 0
  [ 1627.455676]   FSC = 0x21: alignment fault
  [ 1627.459701] Data abort info:
  [ 1627.462597]   ISV = 0, ISS = 0x0021
  [ 1627.466449]   CM = 0, WnR = 0
  [ 1627.469434] swapper pgtable: 4k pages, 48-bit VAs, pgdp=f4aba000
  [ 1627.476160] [800033b503bf] pgd=10bffcfff003, p4d=10bffcfff003, pud=10bffcffe003, pmd=1008948f5003, pte=006880213f0f
  [ 1627.488712] Internal error: Oops: 9621 [#1] SMP
  [ 1627.493585] Modules linked in: efi_pstore nls_iso8859_1 joydev input_leds acpi_ipmi ipmi_ssif thunderx2_pmu cppc_cpufreq sch_fq_codel dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua ipmi_devintf ipmi_msghandler ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor xor_neon raid6_pq libcrc32c raid1 raid0 multipath linear hid_generic usbhid uas hid usb_storage ast drm_vram_helper drm_ttm_helper i2c_smbus ttm i2c_algo_bit drm_kms_helper syscopyarea crct10dif_ce sysfillrect ghash_ce sysimgblt sha2_ce fb_sys_fops cec sha256_arm64 rc_core sha1_ce qede mpt3sas qed raid_class drm scsi_transport_sas xhci_pci ahci xhci_pci_renesas gpio_xlp i2c_xlp9xx aes_neon_bs aes_neon_blk aes_ce_blk aes_ce_cipher
  [ 1627.500861] Unable to handle kernel paging request at virtual address 800033b503bf
  [ 1627.562614] CPU: 71 PID: 4190 Comm: read_all Not tainted 5.17.0-8-generic #8~22.04.2-Ubuntu
  [ 1627.562623] Hardware name: To be filled by O.E.M. Saber/Saber, BIOS 0ACKL030 06/04/2020
  [ 1627.562626] pstate: 8049 (Nzcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
  [ 1627.570544] Mem abort info:
  [ 1627.571497] Unable to handle kernel paging request at virtual address 800033b503bf
  [ 1627.571505] Mem abort info:
  [ 1627.571508]   ESR = 0x9621
  [ 1627.571511]   EC = 0x25: DABT (current EL), IL = 32 bits
  [ 1627.571516]   SET =