I came back to this and found that I now can get a failure w/ error
messages when applying the fix (see comment #4) to bionic - see crash
log below. So, I figured I could just bisect between v4.15 and v5.11
upstream w/ the fix applied and and figure out what other change(s) are
required to avoid the crash. Unfortunately, I hit a kernel 5.0.0-rc5+
where the same build sometimes crashes (w/ the below backtrace) and
sometimes boots fine. So it seems as though there maybe an underlying
race. If that race is truly fixed in newer kernels, bisection will
probably not be the best tool to find the fix since the failure case
isn't 100% reproducible.

== bionic kernel w/ patch applied ==
[   12.160242] CPU: All CPU(s) started at EL2
[   12.165438] alternatives: patching kernel code
[   12.186187] Unable to handle kernel paging request at virtual address 
8dcaae1e1004
[   12.194589] Mem abort info:
[   12.197676]   ESR = 0x96000004
[   12.201055]   Exception class = DABT (current EL), IL = 32 bits
[   12.207619]   SET = 0, FnV = 0
[   12.210996]   EA = 0, S1PTW = 0
[   12.214471] Data abort info:
[   12.217654]   ISV = 0, ISS = 0x00000004
[   12.221902]   CM = 0, WnR = 0
[   12.225186] [00008dcaae1e1004] user address but active_mm is swapper
[   12.232238] Internal error: Oops: 96000004 [#1] SMP
[   12.237644] Modules linked in:
[   12.241026] Process swapper/0 (pid: 1, stack limit = 0x        (ptrval))
[   12.248459] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 4.15.18+ #1
[   12.255216] pstate: 80800009 (Nzcv daif -PAN +UAO)
[   12.260531] pc : build_sched_domains+0xb04/0xfd0
[   12.265651] lr : build_sched_domains+0xae0/0xfd0
[   12.270768] sp : ffff00000843bd20
[   12.274434] x29: ffff00000843bd20 x28: ffffc5dfb98c0f80 
[   12.280320] x27: 00000000ffffffff x26: ffff3815115f2000 
[   12.286211] x25: 0000000000000100 x24: 0000000000000000 
[   12.292102] x23: ffff381511d69894 x22: ffffc5dfb9891600 
[   12.297988] x21: ffff381511d68e38 x20: ffffe5ffbb5fd200 
[   12.303880] x19: 0000000000000000 x18: ffffc5dfbfaec188 
[   12.309767] x17: 000000004cae2fed x16: 00000000804179ac 
[   12.315658] x15: 00000000bcf71eef x14: 0000000085f50aeb 
[   12.321546] x13: 0000000021ce98a4 x12: 00000000ffffff80 
[   12.327433] x11: ffff7f97feee5500 x10: 00000000fb44ed3c 
[   12.333319] x9 : 0000000000003b1b x8 : 0000000000000000 
[   12.339205] x7 : ffffc5dfbe007c00 x6 : 0000000000000002 
[   12.345098] x5 : ffffffffffffffff x4 : 0000000000000000 
[   12.350986] x3 : 0000000000000000 x2 : 00008dcaae1e1000 
[   12.356871] x1 : 0000000000000004 x0 : 0000000000000004 
[   12.362761] Call trace:
[   12.365463]  build_sched_domains+0xb04/0xfd0
[   12.370196]  sched_init_domains+0x88/0xb0
[   12.374640]  sched_init_smp+0x3c/0x90
[   12.378696]  kernel_init_freeable+0xf4/0x240
[   12.383432]  kernel_init+0x1c/0x114
[   12.387294]  ret_from_fork+0x10/0x18
[   12.391254] Code: b4000201 93407e78 aa0103e0 f8787aa2 (f8626800) 
[   12.398067] ---[ end trace a7ac5adb59ec4af4 ]---
[   12.403191] Kernel panic - not syncing: Attempted to kill init! 
exitcode=0x0000000b
[   12.403191]

== kernel that sometimes boots OK w/ fix applied, sometimes doesn't ==
[   11.975494] alternatives: patching kernel code
[   11.985402] Unable to handle kernel paging request at virtual address 000067
44c1718004
[   11.994200] Mem abort info:
[   11.997287]   ESR = 0x96000004
[   12.000667]   Exception class = DABT (current EL), IL = 32 bits
[   12.007236]   SET = 0, FnV = 0
[   12.010617]   EA = 0, S1PTW = 0
[   12.014092] Data abort info:
[   12.017278]   ISV = 0, ISS = 0x00000004
[   12.021528]   CM = 0, WnR = 0
[   12.024810] [00006744c1718004] user address but active_mm is swapper
[   12.031859] Internal error: Oops: 96000004 [#1] SMP
[   12.037266] Modules linked in:
[   12.040648] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 5.0.0-rc5+ #7
[   12.047601] pstate: 80800009 (Nzcv daif -PAN +UAO)
[   12.052917] pc : build_sched_domains+0x9f4/0x1138
[   12.058133] lr : build_sched_domains+0x9d0/0x1138
[   12.063342] sp : ffff00001043bcf0
[   12.067011] x29: ffff00001043bcf0 x28: ffffb75d3ae21a00 
[   12.072900] x27: ffff50187e5dc730 x26: ffffb75d3a806e80 
[   12.078788] x25: ffff50187e5dd3a4 x24: ffffb75d3a8077a0 
[   12.084675] x23: 0000000000000000 x22: ffff50187e5dd3a4 
[   12.090561] x21: ffff50187e5dc730 x20: ffffd77cfb981400 
[   12.096452] x19: 0000000000000000 x18: 0000000000000014 
[   12.102342] x17: 00000000c60b0fdd x16: 00000000eb2df79d 
[   12.108231] x15: 000000001a6f88f6 x14: 00000000a5b719f8 
[   12.114122] x13: 00000000006ba184 x12: 000000004b281177 
[   12.120013] x11: ffff7f5df3eebf80 x10: 00000000cf4217a7 
[   12.125901] x9 : 0000000000003570 x8 : 0000000000210d00 
[   12.131791] x7 : ffffd77cfbaee580 x6 : 0000000000000002 
[   12.137680] x5 : ffffd77d7fe741c0 x4 : ffffffffffffffff 
[   12.143571] x3 : 0000000000000000 x2 : 00006744c1718000 
[   12.149460] x1 : 0000000000000004 x0 : 0000000000000004 
[   12.155352] Process swapper/0 (pid: 1, stack limit = 0x(____ptrval____))
[   12.162785] Call trace:
[   12.165490]  build_sched_domains+0x9f4/0x1138
[   12.170314]  sched_init_domains+0x88/0xb0
[   12.174760]  sched_init_smp+0x3c/0x90
[   12.178812]  kernel_init_freeable+0x180/0x320
[   12.183641]  kernel_init+0x1c/0x110
[   12.187502]  ret_from_fork+0x10/0x18
[   12.191460] Code: b4000201 93407e77 aa0103e0 f8777aa2 (f8626800) 
[   12.198259] ---[ end trace 90837fdb22e7ef78 ]---
[   12.203390] Kernel panic - not syncing: Attempted to kill init! 
exitcode=0x0000000b
[   12.211906] SMP: stopping secondary CPUs
[   12.216276] ---[ end Kernel panic - not syncing: Attempted to kill init! 
exitcode=0x0000000b ]---

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

Status in kunpeng920:
  New
Status in ubuntu-kernel-tests:
  New
Status in linux package in Ubuntu:
  Incomplete
Status in linux source package in Bionic:
  Incomplete
Status in linux source package in Focal:
  Incomplete
Status in linux source package in Hirsute:
  Incomplete

Bug description:
  On scobee-kernel(arm64) with hirsute:linux(5.11.0-41.45) for
  sru-20211108 there are several reports about the sched domain not
  covering the full range. The same does not happen on kuzzle. But 32 is
  a bit of a suspicious number.

    Running tests.......
    cpuset_sched_domains 1 TINFO: CPUs are numbered continuously starting at 0 
(0-127)
    cpuset_sched_domains 1 TINFO: Nodes are numbered continuously starting at 0 
(0-3)
    cpuset_sched_domains 1 TINFO: root group load balance test
    cpuset_sched_domains 1 TINFO:      sched load balance: 0
    cpuset_sched_domains 1 TINFO: CPU hotplug:
    cpuset_check_domains    1  TPASS  :  check_sched_domains passed
    cpuset_sched_domains 1 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 3 TINFO: root group load balance test
    cpuset_sched_domains 3 TINFO:      sched load balance: 1
    cpuset_sched_domains 3 TINFO: CPU hotplug:
    cpuset_check_domains    1  TFAIL  :  cpuset_sched_domains_check.c:110: 
cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95).
    cpuset_sched_domains 3 TFAIL: partition sched domains failed.
    cpuset_sched_domains 5 TINFO: root group load balance test
    cpuset_sched_domains 5 TINFO:      sched load balance: 0
    cpuset_sched_domains 5 TINFO: CPU hotplug:
    cpuset_check_domains    1  TPASS  :  check_sched_domains passed
    cpuset_sched_domains 5 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 7 TINFO: root group load balance test
    cpuset_sched_domains 7 TINFO:      sched load balance: 0
    cpuset_sched_domains 7 TINFO: CPU hotplug:
    cpuset_check_domains    1  TPASS  :  check_sched_domains passed
    cpuset_sched_domains 7 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 9 TINFO: root group load balance test
    cpuset_sched_domains 9 TINFO:      sched load balance: 1
    cpuset_sched_domains 9 TINFO: CPU hotplug:
    cpuset_check_domains    1  TFAIL  :  cpuset_sched_domains_check.c:110: 
cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95).
    cpuset_sched_domains 9 TFAIL: partition sched domains failed.
    cpuset_sched_domains 11 TINFO: root group load balance test
    cpuset_sched_domains 11 TINFO:      sched load balance: 1
    cpuset_sched_domains 11 TINFO: CPU hotplug:
    cpuset_check_domains    1  TFAIL  :  cpuset_sched_domains_check.c:110: 
cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95).
    cpuset_sched_domains 11 TFAIL: partition sched domains failed.
    cpuset_sched_domains 13 TINFO: general group load balance test
    cpuset_sched_domains 13 TINFO: root group info:
    cpuset_sched_domains 13 TINFO:      sched load balance: 0
    cpuset_sched_domains 13 TINFO: general group info:
    cpuset_sched_domains 13 TINFO:      cpus: -
    cpuset_sched_domains 13 TINFO:      sched load balance: 1
    cpuset_check_domains    1  TPASS  :  check_sched_domains passed
    cpuset_sched_domains 13 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 15 TINFO: general group load balance test
    cpuset_sched_domains 15 TINFO: root group info:
    cpuset_sched_domains 15 TINFO:      sched load balance: 0
    cpuset_sched_domains 15 TINFO: general group info:
    cpuset_sched_domains 15 TINFO:      cpus: 1
    cpuset_sched_domains 15 TINFO:      sched load balance: 0
    cpuset_check_domains    1  TPASS  :  check_sched_domains passed
    cpuset_sched_domains 15 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 17 TINFO: general group load balance test
    cpuset_sched_domains 17 TINFO: root group info:
    cpuset_sched_domains 17 TINFO:      sched load balance: 1
    cpuset_sched_domains 17 TINFO: general group info:
    cpuset_sched_domains 17 TINFO:      cpus: -
    cpuset_sched_domains 17 TINFO:      sched load balance: 1
    cpuset_check_domains    1  TFAIL  :  cpuset_sched_domains_check.c:110: 
cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95).
    cpuset_sched_domains 17 TFAIL: partition sched domains failed.
    cpuset_sched_domains 19 TINFO: general group load balance test
    cpuset_sched_domains 19 TINFO: root group info:
    cpuset_sched_domains 19 TINFO:      sched load balance: 1
    cpuset_sched_domains 19 TINFO: general group info:
    cpuset_sched_domains 19 TINFO:      cpus: 1
    cpuset_sched_domains 19 TINFO:      sched load balance: 1
    cpuset_check_domains    1  TFAIL  :  cpuset_sched_domains_check.c:110: 
cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95).
    cpuset_sched_domains 19 TFAIL: partition sched domains failed.
    cpuset_sched_domains 21 TINFO: general group load balance test
    cpuset_sched_domains 21 TINFO: root group info:
    cpuset_sched_domains 21 TINFO:      sched load balance: 0
    cpuset_sched_domains 21 TINFO: general group info:
    cpuset_sched_domains 21 TINFO:      cpus: 1,2
    cpuset_sched_domains 21 TINFO:      sched load balance: 0
    cpuset_check_domains    1  TPASS  :  check_sched_domains passed
    cpuset_sched_domains 21 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 23 TINFO: general group load balance test
    cpuset_sched_domains 23 TINFO: root group info:
    cpuset_sched_domains 23 TINFO:      sched load balance: 0
    cpuset_sched_domains 23 TINFO: general group info:
    cpuset_sched_domains 23 TINFO:      cpus: 1,2
    cpuset_sched_domains 23 TINFO:      sched load balance: 1
    cpuset_check_domains    1  TPASS  :  check_sched_domains passed
    cpuset_sched_domains 23 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 25 TINFO: general group load balance test
    cpuset_sched_domains 25 TINFO: root group info:
    cpuset_sched_domains 25 TINFO:      sched load balance: 0
    cpuset_sched_domains 25 TINFO: general group info:
    cpuset_sched_domains 25 TINFO:      cpus: 
0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127
    cpuset_sched_domains 25 TINFO:      sched load balance: 1
    cpuset_check_domains    1  TFAIL  :  cpuset_sched_domains_check.c:110: 
cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95).
    cpuset_sched_domains 25 TFAIL: partition sched domains failed.
    cpuset_sched_domains 27 TINFO: general group load balance test
    cpuset_sched_domains 27 TINFO: root group info:
    cpuset_sched_domains 27 TINFO:      sched load balance: 0
    cpuset_sched_domains 27 TINFO: general group1 info:
    cpuset_sched_domains 27 TINFO:      cpus: 1
    cpuset_sched_domains 27 TINFO:      sched load balance: 1
    cpuset_sched_domains 27 TINFO: general group2 info:
    cpuset_sched_domains 27 TINFO:      cpus: 0
    cpuset_sched_domains 27 TINFO:      sched load balance: 1
    cpuset_sched_domains 27 TINFO: CPU hotplug: none
    cpuset_sched_domains 27 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 29 TINFO: general group load balance test
    cpuset_sched_domains 29 TINFO: root group info:
    cpuset_sched_domains 29 TINFO:      sched load balance: 0
    cpuset_sched_domains 29 TINFO: general group1 info:
    cpuset_sched_domains 29 TINFO:      cpus: 1,2
    cpuset_sched_domains 29 TINFO:      sched load balance: 1
    cpuset_sched_domains 29 TINFO: general group2 info:
    cpuset_sched_domains 29 TINFO:      cpus: 0-3
    cpuset_sched_domains 29 TINFO:      sched load balance: 0
    cpuset_sched_domains 29 TINFO: CPU hotplug: none
    cpuset_sched_domains 29 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 31 TINFO: general group load balance test
    cpuset_sched_domains 31 TINFO: root group info:
    cpuset_sched_domains 31 TINFO:      sched load balance: 0
    cpuset_sched_domains 31 TINFO: general group1 info:
    cpuset_sched_domains 31 TINFO:      cpus: 1,2
    cpuset_sched_domains 31 TINFO:      sched load balance: 1
    cpuset_sched_domains 31 TINFO: general group2 info:
    cpuset_sched_domains 31 TINFO:      cpus: 0,3
    cpuset_sched_domains 31 TINFO:      sched load balance: 1
    cpuset_sched_domains 31 TINFO: CPU hotplug: none
    cpuset_sched_domains 31 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 33 TINFO: general group load balance test
    cpuset_sched_domains 33 TINFO: root group info:
    cpuset_sched_domains 33 TINFO:      sched load balance: 0
    cpuset_sched_domains 33 TINFO: general group1 info:
    cpuset_sched_domains 33 TINFO:      cpus: 1,2
    cpuset_sched_domains 33 TINFO:      sched load balance: 1
    cpuset_sched_domains 33 TINFO: general group2 info:
    cpuset_sched_domains 33 TINFO:      cpus: 1,3
    cpuset_sched_domains 33 TINFO:      sched load balance: 1
    cpuset_sched_domains 33 TINFO: CPU hotplug: none
    cpuset_sched_domains 33 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 35 TINFO: general group load balance test
    cpuset_sched_domains 35 TINFO: root group info:
    cpuset_sched_domains 35 TINFO:      sched load balance: 0
    cpuset_sched_domains 35 TINFO: general group1 info:
    cpuset_sched_domains 35 TINFO:      cpus: 1,2
    cpuset_sched_domains 35 TINFO:      sched load balance: 1
    cpuset_sched_domains 35 TINFO: general group2 info:
    cpuset_sched_domains 35 TINFO:      cpus: 1,3
    cpuset_sched_domains 35 TINFO:      sched load balance: 1
    cpuset_sched_domains 35 TINFO: CPU hotplug: offline
    cpuset_sched_domains 35 TPASS: partition sched domains succeeded.
    cpuset_sched_domains 37 TINFO: general group load balance test
    cpuset_sched_domains 37 TINFO: root group info:
    cpuset_sched_domains 37 TINFO:      sched load balance: 0
    cpuset_sched_domains 37 TINFO: general group1 info:
    cpuset_sched_domains 37 TINFO:      cpus: 1,2
    cpuset_sched_domains 37 TINFO:      sched load balance: 1
    cpuset_sched_domains 37 TINFO: general group2 info:
    cpuset_sched_domains 37 TINFO:      cpus: 1,3
    cpuset_sched_domains 37 TINFO:      sched load balance: 1
    cpuset_sched_domains 37 TINFO: CPU hotplug: online
    cpuset_sched_domains 37 TPASS: partition sched domains succeeded.
    INFO: ltp-pan reported some tests FAIL
    LTP Version: 20210927
    INFO: Test end time: Sat Nov  6 19:28:17 UTC 2021

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to