[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-05-13 Thread dann frazier
= bionic verification =
ubuntu@d06-4:~$ cat /proc/version
Linux version 4.15.0-179-generic (buildd@bos02-arm64-025) (gcc version 7.5.0 
(Ubuntu/Linaro 7.5.0-3ubuntu1~18.04)) #188-Ubuntu SMP Tue May 10 20:51:17 UTC 
2022
ubuntu@d06-4:~$ grep domain2 /proc/schedstat | wc -l
128
ubuntu@d06-4:~$ grep domain3 /proc/schedstat | wc -l
128
ubuntu@d06-4:~$ 


** Tags removed: verification-needed-bionic
** Tags added: verification-done-bionic

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-05-13 Thread dann frazier
** Description changed:

  [Impact]
  The LTP cpuset_sched_domains test, authored by Miao Xie, fails on a Kunpeng920
  server that has 4 NUMA nodes:
-   https://launchpad.net/bugs/1951289
+   https://launchpad.net/bugs/1951289
  
  This does appear to be a real bug. /proc/schedstat displays 4 domain levels 
for
  CPUs on 2 of the nodes, but only 3 levels for the others 2 (see below).
  I assume this means the scheduler is making suboptimal decisions about
  where to place/move processes.
  
  [Test Case]
  On a 128 core Kunpeng 920 system, observe that half the CPUs are missing a 
3rd level scheduling domain:
  
  ubuntu@d06-4:~$ grep domain2 /proc/schedstat  | wc -l
  128
  ubuntu@d06-4:~$ grep domain3 /proc/schedstat  | wc -l
  64
- ubuntu@d06-4:~$ 
+ ubuntu@d06-4:~$
  
  [What Could Go Wrong]
- This changes the code used for populating sched domains, so it could 
potentially break on other systems, potentially leading to poor scheduling 
characteristics (higher latencies, lower overall throughput etc).
+ This changes the code used for populating sched domains, so it could 
potentially break on other systems, leading to poor scheduling characteristics 
(higher latencies, lower overall throughput etc).

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-05-11 Thread Ubuntu Kernel Bot
This bug is awaiting verification that the linux/4.15.0-179.188 kernel
in -proposed solves the problem. Please test the kernel and update this
bug with the results. If the problem is solved, change the tag
'verification-needed-bionic' to 'verification-done-bionic'. If the
problem still exists, change the tag 'verification-needed-bionic' to
'verification-failed-bionic'.

If verification is not done by 5 working days from today, this fix will
be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how
to enable and use -proposed. Thank you!


** Tags added: verification-needed-bionic

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-05-05 Thread Luke Nowakowski-Krijger
** Changed in: linux (Ubuntu Bionic)
   Status: In Progress => Fix Committed

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-05-03 Thread dann frazier
** Description changed:

- On scobee-kernel(arm64) with hirsute:linux(5.11.0-41.45) for
- sru-20211108 there are several reports about the sched domain not
- covering the full range. The same does not happen on kuzzle. But 32 is a
- bit of a suspicious number.
+ [Impact]
+ The LTP cpuset_sched_domains test, authored by Miao Xie, fails on a Kunpeng920
+ server that has 4 NUMA nodes:
+   https://launchpad.net/bugs/1951289
  
-   Running tests...
-   cpuset_sched_domains 1 TINFO: CPUs are numbered continuously starting at 0 
(0-127)
-   cpuset_sched_domains 1 TINFO: Nodes are numbered continuously starting at 0 
(0-3)
-   cpuset_sched_domains 1 TINFO: root group load balance test
-   cpuset_sched_domains 1 TINFO:  sched load balance: 0
-   cpuset_sched_domains 1 TINFO: CPU hotplug:
-   cpuset_check_domains1  TPASS  :  check_sched_domains passed
-   cpuset_sched_domains 1 TPASS: partition sched domains succeeded.
-   cpuset_sched_domains 3 TINFO: root group load balance test
-   cpuset_sched_domains 3 TINFO:  sched load balance: 1
-   cpuset_sched_domains 3 TINFO: CPU hotplug:
-   cpuset_check_domains1  TFAIL  :  cpuset_sched_domains_check.c:110: 
cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95).
-   cpuset_sched_domains 3 TFAIL: partition sched domains failed.
-   cpuset_sched_domains 5 TINFO: root group load balance test
-   cpuset_sched_domains 5 TINFO:  sched load balance: 0
-   cpuset_sched_domains 5 TINFO: CPU hotplug:
-   cpuset_check_domains1  TPASS  :  check_sched_domains passed
-   cpuset_sched_domains 5 TPASS: partition sched domains succeeded.
-   cpuset_sched_domains 7 TINFO: root group load balance test
-   cpuset_sched_domains 7 TINFO:  sched load balance: 0
-   cpuset_sched_domains 7 TINFO: CPU hotplug:
-   cpuset_check_domains1  TPASS  :  check_sched_domains passed
-   cpuset_sched_domains 7 TPASS: partition sched domains succeeded.
-   cpuset_sched_domains 9 TINFO: root group load balance test
-   cpuset_sched_domains 9 TINFO:  sched load balance: 1
-   cpuset_sched_domains 9 TINFO: CPU hotplug:
-   cpuset_check_domains1  TFAIL  :  cpuset_sched_domains_check.c:110: 
cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95).
-   cpuset_sched_domains 9 TFAIL: partition sched domains failed.
-   cpuset_sched_domains 11 TINFO: root group load balance test
-   cpuset_sched_domains 11 TINFO:  sched load balance: 1
-   cpuset_sched_domains 11 TINFO: CPU hotplug:
-   cpuset_check_domains1  TFAIL  :  cpuset_sched_domains_check.c:110: 
cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95).
-   cpuset_sched_domains 11 TFAIL: partition sched domains failed.
-   cpuset_sched_domains 13 TINFO: general group load balance test
-   cpuset_sched_domains 13 TINFO: root group info:
-   cpuset_sched_domains 13 TINFO:  sched load balance: 0
-   cpuset_sched_domains 13 TINFO: general group info:
-   cpuset_sched_domains 13 TINFO:  cpus: -
-   cpuset_sched_domains 13 TINFO:  sched load balance: 1
-   cpuset_check_domains1  TPASS  :  check_sched_domains passed
-   cpuset_sched_domains 13 TPASS: partition sched domains succeeded.
-   cpuset_sched_domains 15 TINFO: general group load balance test
-   cpuset_sched_domains 15 TINFO: root group info:
-   cpuset_sched_domains 15 TINFO:  sched load balance: 0
-   cpuset_sched_domains 15 TINFO: general group info:
-   cpuset_sched_domains 15 TINFO:  cpus: 1
-   cpuset_sched_domains 15 TINFO:  sched load balance: 0
-   cpuset_check_domains1  TPASS  :  check_sched_domains passed
-   cpuset_sched_domains 15 TPASS: partition sched domains succeeded.
-   cpuset_sched_domains 17 TINFO: general group load balance test
-   cpuset_sched_domains 17 TINFO: root group info:
-   cpuset_sched_domains 17 TINFO:  sched load balance: 1
-   cpuset_sched_domains 17 TINFO: general group info:
-   cpuset_sched_domains 17 TINFO:  cpus: -
-   cpuset_sched_domains 17 TINFO:  sched load balance: 1
-   cpuset_check_domains1  TFAIL  :  cpuset_sched_domains_check.c:110: 
cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95).
-   cpuset_sched_domains 17 TFAIL: partition sched domains failed.
-   cpuset_sched_domains 19 TINFO: general group load balance test
-   cpuset_sched_domains 19 TINFO: root group info:
-   cpuset_sched_domains 19 TINFO:  sched load balance: 1
-   cpuset_sched_domains 19 TINFO: general group info:
-   cpuset_sched_domains 19 TINFO:  cpus: 1
-   cpuset_sched_domains 19 TINFO:  sched load balance: 1
-   cpuset_check_domains1  TFAIL  :  cpuset_sched_domains_check.c:110: 
cpu32's sched domain is wrong(Domain: 0-127, CPU's Sched Domain: 0-95).
-   cpuset_sched_domains 19 TFAIL: partition sched domains failed.
-   cpuset_sched_domains 21 TINFO: general group load balance test
-   cpuset_sched_domains 21 TINFO: root group info:
-   cpuset_sched_domains 21 TINFO:  sched load balance: 0
-   

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-05-03 Thread dann frazier
** Changed in: kunpeng920/ubuntu-20.04
   Status: In Progress => Fix Committed

** Changed in: kunpeng920/ubuntu-18.04-hwe
   Status: In Progress => Fix Committed

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-05-03 Thread dann frazier
5.4.183 has now been merged into focal

** Changed in: linux (Ubuntu Focal)
   Status: In Progress => Fix Committed

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-04-27 Thread dann frazier
Released upstream in 5.4.183. The inclusion of the changes from that
stable release is being tracked in bug 1969239.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-03-10 Thread dann frazier
Not a bug in the test.

** Changed in: linux (Ubuntu Hirsute)
   Status: Incomplete => Won't Fix

** Changed in: linux (Ubuntu)
   Status: Incomplete => Fix Released

** Changed in: linux (Ubuntu Bionic)
   Status: Incomplete => In Progress

** Changed in: linux (Ubuntu Focal)
   Status: Incomplete => In Progress

** Changed in: linux (Ubuntu Bionic)
 Assignee: (unassigned) => dann frazier (dannf)

** Changed in: linux (Ubuntu Focal)
 Assignee: (unassigned) => dann frazier (dannf)

** Changed in: kunpeng920
   Status: New => In Progress

** Also affects: kunpeng920/ubuntu-18.04-hwe
   Importance: Undecided
   Status: New

** Also affects: kunpeng920/ubuntu-18.04
   Importance: Undecided
   Status: New

** Also affects: kunpeng920/upstream-kernel
   Importance: Undecided
   Status: New

** Also affects: kunpeng920/ubuntu-20.04
   Importance: Undecided
   Status: New

** Changed in: ubuntu-kernel-tests
   Status: New => Invalid

** Changed in: kunpeng920/upstream-kernel
Milestone: None => linux-v5.12

** Changed in: kunpeng920/ubuntu-18.04
   Status: New => In Progress

** Changed in: kunpeng920/ubuntu-18.04
 Assignee: (unassigned) => dann frazier (dannf)

** Changed in: kunpeng920/ubuntu-18.04-hwe
   Status: New => In Progress

** Changed in: kunpeng920/ubuntu-18.04-hwe
 Assignee: (unassigned) => dann frazier (dannf)

** Changed in: kunpeng920/ubuntu-20.04
   Status: New => In Progress

** Changed in: kunpeng920/ubuntu-20.04
 Assignee: (unassigned) => dann frazier (dannf)

** Changed in: kunpeng920/upstream-kernel
   Status: New => Fix Released

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-03-07 Thread dann frazier
After some debugging, I realized the above is the same issue that this
commit fixed upstream:

commit 71e5f6644fb2f3304fcb310145ded234a37e7cc1
Author: Dietmar Eggemann 
Date:   Mon Feb 1 10:53:53 2021 +0100

sched/topology: Fix sched_domain_topology_level alloc in
sched_init_numa()

I've backported these fixes and submitted them to stable (5.10.y, 5.4.y
& 4.19.y):

https://www.spinics.net/lists/stable/msg539011.html
https://www.spinics.net/lists/stable/msg539981.html

4.14.y's code is too different for these changes to easily apply.

I'll wait for them to bake there to shake out any regressions before
submitting to Ubuntu. I expect that focal will pick up the fix from
there naturally, but bionic will need an explicit submission since they
won't make it into a 4.14.y release.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-02-18 Thread dann frazier
Here's a decoded backtrace of the 5.0-rc5+ crash (commit 41ceb5e8 w/ the
fix from commit 620a6dc4075 applied), which looks quite plausible.

static void __free_domain_allocs(struct s_data *d, enum s_alloc what,
 const struct cpumask *cpu_map)
{
[...]
1196case sa_sd_storage:
1197__sdt_free(cpu_map);
/* Fall through */
[...]
}

static void __sdt_free(const struct cpumask *cpu_map)
{
[...]
1781if (sdd->sd) {
1782sd = *per_cpu_ptr(sdd->sd, j);  <<< crash here
[...]
}

static int
build_sched_domains(const struct cpumask *cpu_map, struct sched_domain_attr *a\
ttr)
{
[...]
error:
1989__free_domain_allocs(, alloc_state, cpu_map);
1990
1991return ret;
}


[ 11.975494] alternatives: patching kernel code
[ 11.985402] Unable to handle kernel paging request at virtual address 67
44c1718004
[ 11.994200] Mem abort info:
[ 11.997287] ESR = 0x9604
[ 12.000667] Exception class = DABT (current EL), IL = 32 bits
[ 12.007236] SET = 0, FnV = 0
[ 12.010617] EA = 0, S1PTW = 0
[ 12.014092] Data abort info:
[ 12.017278] ISV = 0, ISS = 0x0004
[ 12.021528] CM = 0, WnR = 0
[ 12.024810] [6744c1718004] user address but active_mm is swapper
[ 12.031859] Internal error: Oops: 9604 [#1] SMP
[ 12.037266] Modules linked in:
[ 12.040648] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 5.0.0-rc5+ #7
[ 12.047601] pstate: 8089 (Nzcv daif -PAN +UAO)
[   12.052917] pc : build_sched_domains 
(/home/ubuntu/linux/kernel/sched/topology.c:1782 
/home/ubuntu/linux/kernel/sched/topology.c:1197 
/home/ubuntu/linux/kernel/sched/topology.c:1989) 
[   12.058133] lr : build_sched_domains 
(/home/ubuntu/linux/kernel/sched/topology.c:1778 
/home/ubuntu/linux/kernel/sched/topology.c:1197 
/home/ubuntu/linux/kernel/sched/topology.c:1989) 
[ 12.063342] sp : 1043bcf0
[ 12.067011] x29: 1043bcf0 x28: b75d3ae21a00
[ 12.072900] x27: 50187e5dc730 x26: b75d3a806e80
[ 12.078788] x25: 50187e5dd3a4 x24: b75d3a8077a0
[ 12.084675] x23:  x22: 50187e5dd3a4
[ 12.090561] x21: 50187e5dc730 x20: d77cfb981400
[ 12.096452] x19:  x18: 0014
[ 12.102342] x17: c60b0fdd x16: eb2df79d
[ 12.108231] x15: 1a6f88f6 x14: a5b719f8
[ 12.114122] x13: 006ba184 x12: 4b281177
[ 12.120013] x11: 7f5df3eebf80 x10: cf4217a7
[ 12.125901] x9 : 3570 x8 : 00210d00
[ 12.131791] x7 : d77cfbaee580 x6 : 0002
[ 12.137680] x5 : d77d7fe741c0 x4 : 
[ 12.143571] x3 :  x2 : 6744c1718000
[ 12.149460] x1 : 0004 x0 : 0004
[ 12.155352] Process swapper/0 (pid: 1, stack limit = 0x(ptrval))
[ 12.162785] Call trace:
[   12.165490] build_sched_domains 
(/home/ubuntu/linux/kernel/sched/topology.c:1782 
/home/ubuntu/linux/kernel/sched/topology.c:1197 
/home/ubuntu/linux/kernel/sched/topology.c:1989) 
[   12.170314] sched_init_domains 
(/home/ubuntu/linux/kernel/sched/topology.c:2064) 
[   12.174760] sched_init_smp (/home/ubuntu/linux/kernel/sched/core.c:5876) 
[   12.178812] kernel_init_freeable (/home/ubuntu/linux/init/main.c:950 
/home/ubuntu/linux/init/main.c:964 /home/ubuntu/linux/init/main.c:982 
/home/ubuntu/linux/init/main.c:1135) 
[   12.183641] kernel_init (/home/ubuntu/linux/init/main.c:1057) 
[   12.187502] ret_from_fork 
(/home/ubuntu/linux/arch/arm64/kernel/entry.S:1073) 
[ 12.191460] Code: b4000201 93407e77 aa0103e0 f8777aa2 (f8626800)
All code

   0:   b4000201cbz x1, 0x40
   4:   93407e77sxtwx23, w19
   8:   aa0103e0mov x0, x1
   c:   f8777aa2ldr x2, [x21, x23, lsl #3]
  10:*  f8626800ldr x0, [x0, x2]<-- trapping instruction

Code starting with the faulting instruction
===
   0:   f8626800ldr x0, [x0, x2]
[ 12.198259] ---[ end trace 90837fdb22e7ef78 ]---
[ 12.203390] Kernel panic - not syncing: Attempted to kill init! 
exitcode=0x000b
[ 12.211906] SMP: stopping secondary CPUs
[ 12.216276] ---[ end Kernel panic - not syncing: Attempted to kill init! 
exitcode=0x000b ]---

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-02-18 Thread dann frazier
I came back to this and found that I now can get a failure w/ error
messages when applying the fix (see comment #4) to bionic - see crash
log below. So, I figured I could just bisect between v4.15 and v5.11
upstream w/ the fix applied and and figure out what other change(s) are
required to avoid the crash. Unfortunately, I hit a kernel 5.0.0-rc5+
where the same build sometimes crashes (w/ the below backtrace) and
sometimes boots fine. So it seems as though there maybe an underlying
race. If that race is truly fixed in newer kernels, bisection will
probably not be the best tool to find the fix since the failure case
isn't 100% reproducible.

== bionic kernel w/ patch applied ==
[   12.160242] CPU: All CPU(s) started at EL2
[   12.165438] alternatives: patching kernel code
[   12.186187] Unable to handle kernel paging request at virtual address 
8dcaae1e1004
[   12.194589] Mem abort info:
[   12.197676]   ESR = 0x9604
[   12.201055]   Exception class = DABT (current EL), IL = 32 bits
[   12.207619]   SET = 0, FnV = 0
[   12.210996]   EA = 0, S1PTW = 0
[   12.214471] Data abort info:
[   12.217654]   ISV = 0, ISS = 0x0004
[   12.221902]   CM = 0, WnR = 0
[   12.225186] [8dcaae1e1004] user address but active_mm is swapper
[   12.232238] Internal error: Oops: 9604 [#1] SMP
[   12.237644] Modules linked in:
[   12.241026] Process swapper/0 (pid: 1, stack limit = 0x(ptrval))
[   12.248459] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 4.15.18+ #1
[   12.255216] pstate: 8089 (Nzcv daif -PAN +UAO)
[   12.260531] pc : build_sched_domains+0xb04/0xfd0
[   12.265651] lr : build_sched_domains+0xae0/0xfd0
[   12.270768] sp : 0843bd20
[   12.274434] x29: 0843bd20 x28: c5dfb98c0f80 
[   12.280320] x27:  x26: 3815115f2000 
[   12.286211] x25: 0100 x24:  
[   12.292102] x23: 381511d69894 x22: c5dfb9891600 
[   12.297988] x21: 381511d68e38 x20: e5ffbb5fd200 
[   12.303880] x19:  x18: c5dfbfaec188 
[   12.309767] x17: 4cae2fed x16: 804179ac 
[   12.315658] x15: bcf71eef x14: 85f50aeb 
[   12.321546] x13: 21ce98a4 x12: ff80 
[   12.327433] x11: 7f97feee5500 x10: fb44ed3c 
[   12.19] x9 : 3b1b x8 :  
[   12.339205] x7 : c5dfbe007c00 x6 : 0002 
[   12.345098] x5 :  x4 :  
[   12.350986] x3 :  x2 : 8dcaae1e1000 
[   12.356871] x1 : 0004 x0 : 0004 
[   12.362761] Call trace:
[   12.365463]  build_sched_domains+0xb04/0xfd0
[   12.370196]  sched_init_domains+0x88/0xb0
[   12.374640]  sched_init_smp+0x3c/0x90
[   12.378696]  kernel_init_freeable+0xf4/0x240
[   12.383432]  kernel_init+0x1c/0x114
[   12.387294]  ret_from_fork+0x10/0x18
[   12.391254] Code: b4000201 93407e78 aa0103e0 f8787aa2 (f8626800) 
[   12.398067] ---[ end trace a7ac5adb59ec4af4 ]---
[   12.403191] Kernel panic - not syncing: Attempted to kill init! 
exitcode=0x000b
[   12.403191]

== kernel that sometimes boots OK w/ fix applied, sometimes doesn't ==
[   11.975494] alternatives: patching kernel code
[   11.985402] Unable to handle kernel paging request at virtual address 67
44c1718004
[   11.994200] Mem abort info:
[   11.997287]   ESR = 0x9604
[   12.000667]   Exception class = DABT (current EL), IL = 32 bits
[   12.007236]   SET = 0, FnV = 0
[   12.010617]   EA = 0, S1PTW = 0
[   12.014092] Data abort info:
[   12.017278]   ISV = 0, ISS = 0x0004
[   12.021528]   CM = 0, WnR = 0
[   12.024810] [6744c1718004] user address but active_mm is swapper
[   12.031859] Internal error: Oops: 9604 [#1] SMP
[   12.037266] Modules linked in:
[   12.040648] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 5.0.0-rc5+ #7
[   12.047601] pstate: 8089 (Nzcv daif -PAN +UAO)
[   12.052917] pc : build_sched_domains+0x9f4/0x1138
[   12.058133] lr : build_sched_domains+0x9d0/0x1138
[   12.063342] sp : 1043bcf0
[   12.067011] x29: 1043bcf0 x28: b75d3ae21a00 
[   12.072900] x27: 50187e5dc730 x26: b75d3a806e80 
[   12.078788] x25: 50187e5dd3a4 x24: b75d3a8077a0 
[   12.084675] x23:  x22: 50187e5dd3a4 
[   12.090561] x21: 50187e5dc730 x20: d77cfb981400 
[   12.096452] x19:  x18: 0014 
[   12.102342] x17: c60b0fdd x16: eb2df79d 
[   12.108231] x15: 1a6f88f6 x14: a5b719f8 
[   12.114122] x13: 006ba184 x12: 4b281177 
[   12.120013] x11: 7f5df3eebf80 x10: cf4217a7 
[   12.125901] x9 : 3570 x8 : 00210d00 
[   12.131791] x7 : d77cfbaee580 x6 : 0002 
[   12.137680] x5 : d77d7fe741c0 x4 :  
[   12.143571] x3 :  x2 : 6744c1718000 
[   12.149460] x1 : 0004 x0 : 0004 
[   12.155352] Process swapper/0 (pid: 1, stack limit = 

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2022-02-17 Thread Andrew Cloke
** Changed in: kunpeng920
   Importance: Undecided => Low

** Changed in: kunpeng920
 Assignee: (unassigned) => dann frazier (dannf)

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2021-12-06 Thread dann frazier
The fix above also cherry-picks back to bionic, but strangely it causes
the bionic kernel to fail to boot. I don't see any kernel messages after
the EFI stub. I tried adding "earlycon" to get more debug info, but that
somehow avoids the problem and boots fine w/ the fix. With earlycon, I
can verify that the LTP test now passes. There's just something else
missing.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1951289] Re: ubuntu_ltp_controllers:cpuset_sched_domains: tests 3, 9, 11, 17, 19, 25 report incorrect sched domain for cpu#32

2021-11-24 Thread dann frazier
** Also affects: linux (Ubuntu)
   Importance: Undecided
   Status: New

** Also affects: linux (Ubuntu Bionic)
   Importance: Undecided
   Status: New

** Also affects: linux (Ubuntu Hirsute)
   Importance: Undecided
   Status: New

** Also affects: linux (Ubuntu Focal)
   Importance: Undecided
   Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1951289

Title:
  ubuntu_ltp_controllers:cpuset_sched_domains: tests 3,9,11,17,19,25
  report incorrect sched domain for cpu#32

To manage notifications about this bug go to:
https://bugs.launchpad.net/kunpeng920/+bug/1951289/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs