[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
= bionic verification = ubuntu@d06-4:~$ cat /proc/version Linux version 4.15.0-179-generic (buildd@bos02-arm64-025) (gcc version 7.5.0 (Ubuntu/Linaro 7.5.0-3ubuntu1~18.04)) #188-Ubuntu SMP Tue May 10 20:51:17 UTC 2022 ubuntu@d06-4:~$ grep domain2 /proc/schedstat | wc -l 128 ubuntu@d06-4:~$ grep domain3 /proc/schedstat | wc -l 128 ubuntu@d06-4:~$ ** Tags removed: verification-needed-bionic ** Tags added: verification-done-bionic -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
** Description changed: [Impact] The LTP cpuset_sched_domains test, authored by Miao Xie, fails on a Kunpeng920 server that has 4 NUMA nodes: - https://launchpad.net/bugs/1951289 + https://launchpad.net/bugs/1951289 This does appear to be a real bug. /proc/schedstat displays 4 domain levels for CPUs on 2 of the nodes, but only 3 levels for the others 2 (see below). I assume this means the scheduler is making suboptimal decisions about where to place/move processes. [Test Case] On a 128 core Kunpeng 920 system, observe that half the CPUs are missing a 3rd level scheduling domain: ubuntu@d06-4:~$ grep domain2 /proc/schedstat | wc -l 128 ubuntu@d06-4:~$ grep domain3 /proc/schedstat | wc -l 64 - ubuntu@d06-4:~$ + ubuntu@d06-4:~$ [What Could Go Wrong] - This changes the code used for populating sched domains, so it could potentially break on other systems, potentially leading to poor scheduling characteristics (higher latencies, lower overall throughput etc). + This changes the code used for populating sched domains, so it could potentially break on other systems, leading to poor scheduling characteristics (higher latencies, lower overall throughput etc). -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
This bug is awaiting verification that the linux/4.15.0-179.188 kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-bionic' to 'verification-done-bionic'. If the problem still exists, change the tag 'verification-needed-bionic' to 'verification-failed-bionic'. If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you! ** Tags added: verification-needed-bionic -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
** Changed in: linux (Ubuntu Bionic) Status: In Progress => Fix Committed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
** Description changed: - On scobee-kernel(arm64) with hirsute:linux(5.11.0-41.45) for - sru-20211108 there are several reports about the sched domain not - covering the full range. The same does not happen on kuzzle. But 32 is a - bit of a suspicious number. + [Impact] + The LTP cpuset_sched_domains test, authored by Miao Xie, fails on a Kunpeng920 + server that has 4 NUMA nodes: + https://launchpad.net/bugs/1951289 - Running tests... - cpuset_sched_domains 1 TINFO: CPUs are numbered continuously starting at 0 (0-127) - cpuset_sched_domains 1 TINFO: Nodes are numbered continuously starting at 0 (0-3) - cpuset_sched_domains 1 TINFO: root group load balance test - cpuset_sched_domains 1 TINFO: sched load balance: 0 - cpuset_sched_domains 1 TINFO: CPU hotplug: - cpuset_check_domains1 TPASS : check_sched_domains passed - cpuset_sched_domains 1 TPASS: partition sched domains succeeded. - cpuset_sched_domains 3 TINFO: root group load balance test - cpuset_sched_domains 3 TINFO: sched load balance: 1 - cpuset_sched_domains 3 TINFO: CPU hotplug: - cpuset_check_domains1 TFAIL : cpuset_sched_domains_check.c:110: cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95). - cpuset_sched_domains 3 TFAIL: partition sched domains failed. - cpuset_sched_domains 5 TINFO: root group load balance test - cpuset_sched_domains 5 TINFO: sched load balance: 0 - cpuset_sched_domains 5 TINFO: CPU hotplug: - cpuset_check_domains1 TPASS : check_sched_domains passed - cpuset_sched_domains 5 TPASS: partition sched domains succeeded. - cpuset_sched_domains 7 TINFO: root group load balance test - cpuset_sched_domains 7 TINFO: sched load balance: 0 - cpuset_sched_domains 7 TINFO: CPU hotplug: - cpuset_check_domains1 TPASS : check_sched_domains passed - cpuset_sched_domains 7 TPASS: partition sched domains succeeded. - cpuset_sched_domains 9 TINFO: root group load balance test - cpuset_sched_domains 9 TINFO: sched load balance: 1 - cpuset_sched_domains 9 TINFO: CPU hotplug: - cpuset_check_domains1 TFAIL : cpuset_sched_domains_check.c:110: cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95). - cpuset_sched_domains 9 TFAIL: partition sched domains failed. - cpuset_sched_domains 11 TINFO: root group load balance test - cpuset_sched_domains 11 TINFO: sched load balance: 1 - cpuset_sched_domains 11 TINFO: CPU hotplug: - cpuset_check_domains1 TFAIL : cpuset_sched_domains_check.c:110: cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95). - cpuset_sched_domains 11 TFAIL: partition sched domains failed. - cpuset_sched_domains 13 TINFO: general group load balance test - cpuset_sched_domains 13 TINFO: root group info: - cpuset_sched_domains 13 TINFO: sched load balance: 0 - cpuset_sched_domains 13 TINFO: general group info: - cpuset_sched_domains 13 TINFO: cpus: - - cpuset_sched_domains 13 TINFO: sched load balance: 1 - cpuset_check_domains1 TPASS : check_sched_domains passed - cpuset_sched_domains 13 TPASS: partition sched domains succeeded. - cpuset_sched_domains 15 TINFO: general group load balance test - cpuset_sched_domains 15 TINFO: root group info: - cpuset_sched_domains 15 TINFO: sched load balance: 0 - cpuset_sched_domains 15 TINFO: general group info: - cpuset_sched_domains 15 TINFO: cpus: 1 - cpuset_sched_domains 15 TINFO: sched load balance: 0 - cpuset_check_domains1 TPASS : check_sched_domains passed - cpuset_sched_domains 15 TPASS: partition sched domains succeeded. - cpuset_sched_domains 17 TINFO: general group load balance test - cpuset_sched_domains 17 TINFO: root group info: - cpuset_sched_domains 17 TINFO: sched load balance: 1 - cpuset_sched_domains 17 TINFO: general group info: - cpuset_sched_domains 17 TINFO: cpus: - - cpuset_sched_domains 17 TINFO: sched load balance: 1 - cpuset_check_domains1 TFAIL : cpuset_sched_domains_check.c:110: cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95). - cpuset_sched_domains 17 TFAIL: partition sched domains failed. - cpuset_sched_domains 19 TINFO: general group load balance test - cpuset_sched_domains 19 TINFO: root group info: - cpuset_sched_domains 19 TINFO: sched load balance: 1 - cpuset_sched_domains 19 TINFO: general group info: - cpuset_sched_domains 19 TINFO: cpus: 1 - cpuset_sched_domains 19 TINFO: sched load balance: 1 - cpuset_check_domains1 TFAIL : cpuset_sched_domains_check.c:110: cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95). - cpuset_sched_domains 19 TFAIL: partition sched domains failed. - cpuset_sched_domains 21 TINFO: general group load balance test - cpuset_sched_domains 21 TINFO: root group info: - cpuset_sched_domains 21 TINFO: sched load balance: 0 -
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
** Changed in: kunpeng920/ubuntu-20.04 Status: In Progress => Fix Committed ** Changed in: kunpeng920/ubuntu-18.04-hwe Status: In Progress => Fix Committed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
5.4.183 has now been merged into focal ** Changed in: linux (Ubuntu Focal) Status: In Progress => Fix Committed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
Released upstream in 5.4.183. The inclusion of the changes from that stable release is being tracked in bug 1969239. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
Not a bug in the test. ** Changed in: linux (Ubuntu Hirsute) Status: Incomplete => Won't Fix ** Changed in: linux (Ubuntu) Status: Incomplete => Fix Released ** Changed in: linux (Ubuntu Bionic) Status: Incomplete => In Progress ** Changed in: linux (Ubuntu Focal) Status: Incomplete => In Progress ** Changed in: linux (Ubuntu Bionic) Assignee: (unassigned) => dann frazier (dannf) ** Changed in: linux (Ubuntu Focal) Assignee: (unassigned) => dann frazier (dannf) ** Changed in: kunpeng920 Status: New => In Progress ** Also affects: kunpeng920/ubuntu-18.04-hwe Importance: Undecided Status: New ** Also affects: kunpeng920/ubuntu-18.04 Importance: Undecided Status: New ** Also affects: kunpeng920/upstream-kernel Importance: Undecided Status: New ** Also affects: kunpeng920/ubuntu-20.04 Importance: Undecided Status: New ** Changed in: ubuntu-kernel-tests Status: New => Invalid ** Changed in: kunpeng920/upstream-kernel Milestone: None => linux-v5.12 ** Changed in: kunpeng920/ubuntu-18.04 Status: New => In Progress ** Changed in: kunpeng920/ubuntu-18.04 Assignee: (unassigned) => dann frazier (dannf) ** Changed in: kunpeng920/ubuntu-18.04-hwe Status: New => In Progress ** Changed in: kunpeng920/ubuntu-18.04-hwe Assignee: (unassigned) => dann frazier (dannf) ** Changed in: kunpeng920/ubuntu-20.04 Status: New => In Progress ** Changed in: kunpeng920/ubuntu-20.04 Assignee: (unassigned) => dann frazier (dannf) ** Changed in: kunpeng920/upstream-kernel Status: New => Fix Released -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
After some debugging, I realized the above is the same issue that this commit fixed upstream: commit 71e5f6644fb2f3304fcb310145ded234a37e7cc1 Author: Dietmar Eggemann Date: Mon Feb 1 10:53:53 2021 +0100 sched/topology: Fix sched_domain_topology_level alloc in sched_init_numa() I've backported these fixes and submitted them to stable (5.10.y, 5.4.y & 4.19.y): https://www.spinics.net/lists/stable/msg539011.html https://www.spinics.net/lists/stable/msg539981.html 4.14.y's code is too different for these changes to easily apply. I'll wait for them to bake there to shake out any regressions before submitting to Ubuntu. I expect that focal will pick up the fix from there naturally, but bionic will need an explicit submission since they won't make it into a 4.14.y release. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
Here's a decoded backtrace of the 5.0-rc5+ crash (commit 41ceb5e8 w/ the fix from commit 620a6dc4075 applied), which looks quite plausible. static void __free_domain_allocs(struct s_data *d, enum s_alloc what, const struct cpumask *cpu_map) { [...] 1196case sa_sd_storage: 1197__sdt_free(cpu_map); /* Fall through */ [...] } static void __sdt_free(const struct cpumask *cpu_map) { [...] 1781if (sdd->sd) { 1782sd = *per_cpu_ptr(sdd->sd, j); <<< crash here [...] } static int build_sched_domains(const struct cpumask *cpu_map, struct sched_domain_attr *a\ ttr) { [...] error: 1989__free_domain_allocs(, alloc_state, cpu_map); 1990 1991return ret; } [ 11.975494] alternatives: patching kernel code [ 11.985402] Unable to handle kernel paging request at virtual address 67 44c1718004 [ 11.994200] Mem abort info: [ 11.997287] ESR = 0x9604 [ 12.000667] Exception class = DABT (current EL), IL = 32 bits [ 12.007236] SET = 0, FnV = 0 [ 12.010617] EA = 0, S1PTW = 0 [ 12.014092] Data abort info: [ 12.017278] ISV = 0, ISS = 0x0004 [ 12.021528] CM = 0, WnR = 0 [ 12.024810] [6744c1718004] user address but active_mm is swapper [ 12.031859] Internal error: Oops: 9604 [#1] SMP [ 12.037266] Modules linked in: [ 12.040648] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 5.0.0-rc5+ #7 [ 12.047601] pstate: 8089 (Nzcv daif -PAN +UAO) [ 12.052917] pc : build_sched_domains (/home/ubuntu/linux/kernel/sched/topology.c:1782 /home/ubuntu/linux/kernel/sched/topology.c:1197 /home/ubuntu/linux/kernel/sched/topology.c:1989) [ 12.058133] lr : build_sched_domains (/home/ubuntu/linux/kernel/sched/topology.c:1778 /home/ubuntu/linux/kernel/sched/topology.c:1197 /home/ubuntu/linux/kernel/sched/topology.c:1989) [ 12.063342] sp : 1043bcf0 [ 12.067011] x29: 1043bcf0 x28: b75d3ae21a00 [ 12.072900] x27: 50187e5dc730 x26: b75d3a806e80 [ 12.078788] x25: 50187e5dd3a4 x24: b75d3a8077a0 [ 12.084675] x23: x22: 50187e5dd3a4 [ 12.090561] x21: 50187e5dc730 x20: d77cfb981400 [ 12.096452] x19: x18: 0014 [ 12.102342] x17: c60b0fdd x16: eb2df79d [ 12.108231] x15: 1a6f88f6 x14: a5b719f8 [ 12.114122] x13: 006ba184 x12: 4b281177 [ 12.120013] x11: 7f5df3eebf80 x10: cf4217a7 [ 12.125901] x9 : 3570 x8 : 00210d00 [ 12.131791] x7 : d77cfbaee580 x6 : 0002 [ 12.137680] x5 : d77d7fe741c0 x4 : [ 12.143571] x3 : x2 : 6744c1718000 [ 12.149460] x1 : 0004 x0 : 0004 [ 12.155352] Process swapper/0 (pid: 1, stack limit = 0x(ptrval)) [ 12.162785] Call trace: [ 12.165490] build_sched_domains (/home/ubuntu/linux/kernel/sched/topology.c:1782 /home/ubuntu/linux/kernel/sched/topology.c:1197 /home/ubuntu/linux/kernel/sched/topology.c:1989) [ 12.170314] sched_init_domains (/home/ubuntu/linux/kernel/sched/topology.c:2064) [ 12.174760] sched_init_smp (/home/ubuntu/linux/kernel/sched/core.c:5876) [ 12.178812] kernel_init_freeable (/home/ubuntu/linux/init/main.c:950 /home/ubuntu/linux/init/main.c:964 /home/ubuntu/linux/init/main.c:982 /home/ubuntu/linux/init/main.c:1135) [ 12.183641] kernel_init (/home/ubuntu/linux/init/main.c:1057) [ 12.187502] ret_from_fork (/home/ubuntu/linux/arch/arm64/kernel/entry.S:1073) [ 12.191460] Code: b4000201 93407e77 aa0103e0 f8777aa2 (f8626800) All code 0: b4000201cbz x1, 0x40 4: 93407e77sxtwx23, w19 8: aa0103e0mov x0, x1 c: f8777aa2ldr x2, [x21, x23, lsl #3] 10:* f8626800ldr x0, [x0, x2]<-- trapping instruction Code starting with the faulting instruction === 0: f8626800ldr x0, [x0, x2] [ 12.198259] ---[ end trace 90837fdb22e7ef78 ]--- [ 12.203390] Kernel panic - not syncing: Attempted to kill init! exitcode=0x000b [ 12.211906] SMP: stopping secondary CPUs [ 12.216276] ---[ end Kernel panic - not syncing: Attempted to kill init! exitcode=0x000b ]--- -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
I came back to this and found that I now can get a failure w/ error messages when applying the fix (see comment #4) to bionic - see crash log below. So, I figured I could just bisect between v4.15 and v5.11 upstream w/ the fix applied and and figure out what other change(s) are required to avoid the crash. Unfortunately, I hit a kernel 5.0.0-rc5+ where the same build sometimes crashes (w/ the below backtrace) and sometimes boots fine. So it seems as though there maybe an underlying race. If that race is truly fixed in newer kernels, bisection will probably not be the best tool to find the fix since the failure case isn't 100% reproducible. == bionic kernel w/ patch applied == [ 12.160242] CPU: All CPU(s) started at EL2 [ 12.165438] alternatives: patching kernel code [ 12.186187] Unable to handle kernel paging request at virtual address 8dcaae1e1004 [ 12.194589] Mem abort info: [ 12.197676] ESR = 0x9604 [ 12.201055] Exception class = DABT (current EL), IL = 32 bits [ 12.207619] SET = 0, FnV = 0 [ 12.210996] EA = 0, S1PTW = 0 [ 12.214471] Data abort info: [ 12.217654] ISV = 0, ISS = 0x0004 [ 12.221902] CM = 0, WnR = 0 [ 12.225186] [8dcaae1e1004] user address but active_mm is swapper [ 12.232238] Internal error: Oops: 9604 [#1] SMP [ 12.237644] Modules linked in: [ 12.241026] Process swapper/0 (pid: 1, stack limit = 0x(ptrval)) [ 12.248459] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 4.15.18+ #1 [ 12.255216] pstate: 8089 (Nzcv daif -PAN +UAO) [ 12.260531] pc : build_sched_domains+0xb04/0xfd0 [ 12.265651] lr : build_sched_domains+0xae0/0xfd0 [ 12.270768] sp : 0843bd20 [ 12.274434] x29: 0843bd20 x28: c5dfb98c0f80 [ 12.280320] x27: x26: 3815115f2000 [ 12.286211] x25: 0100 x24: [ 12.292102] x23: 381511d69894 x22: c5dfb9891600 [ 12.297988] x21: 381511d68e38 x20: e5ffbb5fd200 [ 12.303880] x19: x18: c5dfbfaec188 [ 12.309767] x17: 4cae2fed x16: 804179ac [ 12.315658] x15: bcf71eef x14: 85f50aeb [ 12.321546] x13: 21ce98a4 x12: ff80 [ 12.327433] x11: 7f97feee5500 x10: fb44ed3c [ 12.19] x9 : 3b1b x8 : [ 12.339205] x7 : c5dfbe007c00 x6 : 0002 [ 12.345098] x5 : x4 : [ 12.350986] x3 : x2 : 8dcaae1e1000 [ 12.356871] x1 : 0004 x0 : 0004 [ 12.362761] Call trace: [ 12.365463] build_sched_domains+0xb04/0xfd0 [ 12.370196] sched_init_domains+0x88/0xb0 [ 12.374640] sched_init_smp+0x3c/0x90 [ 12.378696] kernel_init_freeable+0xf4/0x240 [ 12.383432] kernel_init+0x1c/0x114 [ 12.387294] ret_from_fork+0x10/0x18 [ 12.391254] Code: b4000201 93407e78 aa0103e0 f8787aa2 (f8626800) [ 12.398067] ---[ end trace a7ac5adb59ec4af4 ]--- [ 12.403191] Kernel panic - not syncing: Attempted to kill init! exitcode=0x000b [ 12.403191] == kernel that sometimes boots OK w/ fix applied, sometimes doesn't == [ 11.975494] alternatives: patching kernel code [ 11.985402] Unable to handle kernel paging request at virtual address 67 44c1718004 [ 11.994200] Mem abort info: [ 11.997287] ESR = 0x9604 [ 12.000667] Exception class = DABT (current EL), IL = 32 bits [ 12.007236] SET = 0, FnV = 0 [ 12.010617] EA = 0, S1PTW = 0 [ 12.014092] Data abort info: [ 12.017278] ISV = 0, ISS = 0x0004 [ 12.021528] CM = 0, WnR = 0 [ 12.024810] [6744c1718004] user address but active_mm is swapper [ 12.031859] Internal error: Oops: 9604 [#1] SMP [ 12.037266] Modules linked in: [ 12.040648] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 5.0.0-rc5+ #7 [ 12.047601] pstate: 8089 (Nzcv daif -PAN +UAO) [ 12.052917] pc : build_sched_domains+0x9f4/0x1138 [ 12.058133] lr : build_sched_domains+0x9d0/0x1138 [ 12.063342] sp : 1043bcf0 [ 12.067011] x29: 1043bcf0 x28: b75d3ae21a00 [ 12.072900] x27: 50187e5dc730 x26: b75d3a806e80 [ 12.078788] x25: 50187e5dd3a4 x24: b75d3a8077a0 [ 12.084675] x23: x22: 50187e5dd3a4 [ 12.090561] x21: 50187e5dc730 x20: d77cfb981400 [ 12.096452] x19: x18: 0014 [ 12.102342] x17: c60b0fdd x16: eb2df79d [ 12.108231] x15: 1a6f88f6 x14: a5b719f8 [ 12.114122] x13: 006ba184 x12: 4b281177 [ 12.120013] x11: 7f5df3eebf80 x10: cf4217a7 [ 12.125901] x9 : 3570 x8 : 00210d00 [ 12.131791] x7 : d77cfbaee580 x6 : 0002 [ 12.137680] x5 : d77d7fe741c0 x4 : [ 12.143571] x3 : x2 : 6744c1718000 [ 12.149460] x1 : 0004 x0 : 0004 [ 12.155352] Process swapper/0 (pid: 1, stack limit =
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
** Changed in: kunpeng920 Importance: Undecided => Low ** Changed in: kunpeng920 Assignee: (unassigned) => dann frazier (dannf) -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
The fix above also cherry-picks back to bionic, but strangely it causes the bionic kernel to fail to boot. I don't see any kernel messages after the EFI stub. I tried adding "earlycon" to get more debug info, but that somehow avoids the problem and boots fine w/ the fix. With earlycon, I can verify that the LTP test now passes. There's just something else missing. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32
** Also affects: linux (Ubuntu) Importance: Undecided Status: New ** Also affects: linux (Ubuntu Bionic) Importance: Undecided Status: New ** Also affects: linux (Ubuntu Hirsute) Importance: Undecided Status: New ** Also affects: linux (Ubuntu Focal) Importance: Undecided Status: New -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1951289 Title: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25 report incorrect sched domain for cpu#32 To manage notifications about this bug go to: https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs