Verification on Trusty ESM updates. $ uname -rv 3.13.0-175-generic #226-Ubuntu SMP Fri Nov 8 15:26:34 UTC 2019
The problem does not happen with the test-case. (kmod.c updated for the new kernel uname strings, same instruction addresses from test build apply.) Test case snippets: Migrated from CPUs 0-3/node0 to 4-7/node1. [ 1629.548016] cpu 4/pid 1924/task TWO :: change_prot_numa() :: address = 0x7fc0ade00000, end = 0x7fc0ae000000 [ 1631.552332] cpu 5/pid 1925/task ONE :: do_huge_pmd_numa_page() :: addr/mask = 0x7fc0ade00000, addr = 0x7fc0ade00000, pmd = 0x80003ffecba009e6, *pmdp = 0x80003ffecba009e6, pmdp = 0xffff880139881b78 [ 1631.552356] cpu 4/pid 1924/task TWO :: do_huge_pmd_numa_page() :: addr/mask = 0x7fc0ade00000, addr = 0x7fc0ade00000, pmd = 0x80003ffecba009e6, *pmdp = 0x80003ffecba009e6, pmdp = 0xffff880139881b78 [ 1631.552361] cpu 4/pid 1924/task TWO :: Stage 1. T2 awaking up T1. Task 2 at entry point. [ 1631.552364] cpu 4/pid 1924/task TWO :: Stage 2. T2 waiting on T1. Task 2 waiting for Task 1 at exit/migration point. [ 1631.556003] cpu 5/pid 1925/task ONE :: Stage 1. T1 waiting on T2. Task 1 at entry point, checking for Task 2 at entry point too. [ 1631.556003] cpu 5/pid 1925/task ONE :: Stage 1. T1 resumed by T2 [ 1631.572927] cpu 5/pid 1925/task ONE :: migrate_misplaced_transhuge_page() :: addr/mask = 0x7fc0ade00000, addr = 0x7fc0ade00000, pmd = 0x80003ffecba009e6, *pmdp = 0x80003ffecba009e6, pmdp = 0xffff880139881b78, node = 1 [ 1631.576916] cpu 5/pid 1925/task ONE :: Stage 2. T1 awaking up T2. Task 1 at migration point. [ 1631.576916] cpu 5/pid 1925/task ONE :: Stage 3. T1 waiting on T2. Task 1 waiting for Task 2 to check migration status. [ 1631.556007] cpu 4/pid 1924/task TWO :: Stage 2. T2 resumed by T1 [ 1631.579727] cpu 4/pid 1924/task TWO :: Stage 3. T2 awaking up T1. Task 2 has checked migration status. [ 1631.579728] cpu 4/pid 1924/task TWO :: Stage 4. T2 waiting on T1. Task 2 waiting for Task 1 to clear the PMD. [ 1631.576916] cpu 5/pid 1925/task ONE :: Stage 3. T1 resumed by T2 [ 1631.590927] cpu 5/pid 1925/task ONE :: Stage 4. T1 awaking up T2. Task 1 has cleared the PMD. [ 1631.593172] cpu 5/pid 1925/task ONE :: Stage 4. T1 warning up T2: it may BUG! [ 1631.594209] cpu 5/pid 1925/task ONE :: Stage 5. T1 sleep for 5s. [ 1632.576116] cpu 4/pid 1924/task TWO :: Stage 4. T2 resumed by T1 [ 1632.579828] cpu 4/pid 1924/task TWO :: Stage 4. T2 sleep for 1s. any BUG afterward? [ 1636.592130] cpu 5/pid 1925/task ONE :: Stage 5. T1 slept for 5s. Migrated from CPUs 4-7/node1 to 0-3/node0. [ 123.072019] cpu 0/pid 1340/task TWO :: change_prot_numa() :: address = 0x7fb236c00000, end = 0x7fb236e00000 [ 125.076242] cpu 1/pid 1341/task ONE :: do_huge_pmd_numa_page() :: addr/mask = 0x7fb236c00000, addr = 0x7fb236c00000, pmd = 0x80003ffdc98009e6, *pmdp = 0x80003ffdc98009e6, pmdp = 0xffff8802380b0db0 [ 125.076246] cpu 0/pid 1340/task TWO :: do_huge_pmd_numa_page() :: addr/mask = 0x7fb236c00000, addr = 0x7fb236c00000, pmd = 0x80003ffdc98009e6, *pmdp = 0x80003ffdc98009e6, pmdp = 0xffff8802380b0db0 [ 125.076249] cpu 0/pid 1340/task TWO :: Stage 1. T2 awaking up T1. Task 2 at entry point. [ 125.076253] cpu 0/pid 1340/task TWO :: Stage 2. T2 waiting on T1. Task 2 waiting for Task 1 at exit/migration point. [ 125.080047] cpu 1/pid 1341/task ONE :: Stage 1. T1 waiting on T2. Task 1 at entry point, checking for Task 2 at entry point too. [ 125.080047] cpu 1/pid 1341/task ONE :: Stage 1. T1 resumed by T2 [ 125.094778] cpu 1/pid 1341/task ONE :: migrate_misplaced_transhuge_page() :: addr/mask = 0x7fb236c00000, addr = 0x7fb236c00000, pmd = 0x80003ffdc98009e6, *pmdp = 0x80003ffdc98009e6, pmdp = 0xffff8802380b0db0, node = 0 [ 125.098769] cpu 1/pid 1341/task ONE :: Stage 2. T1 awaking up T2. Task 1 at migration point. [ 125.098769] cpu 1/pid 1341/task ONE :: Stage 3. T1 waiting on T2. Task 1 waiting for Task 2 to check migration status. [ 125.080004] cpu 0/pid 1340/task TWO :: Stage 2. T2 resumed by T1 [ 125.101893] cpu 0/pid 1340/task TWO :: Stage 3. T2 awaking up T1. Task 2 has checked migration status. [ 125.101894] cpu 0/pid 1340/task TWO :: Stage 4. T2 waiting on T1. Task 2 waiting for Task 1 to clear the PMD. [ 125.098769] cpu 1/pid 1341/task ONE :: Stage 3. T1 resumed by T2 [ 125.113262] cpu 1/pid 1341/task ONE :: Stage 4. T1 awaking up T2. Task 1 has cleared the PMD. [ 125.115574] cpu 1/pid 1341/task ONE :: Stage 4. T1 warning up T2: it may BUG! [ 125.116643] cpu 1/pid 1341/task ONE :: Stage 5. T1 sleep for 5s. [ 126.104253] cpu 0/pid 1340/task TWO :: Stage 4. T2 resumed by T1 [ 126.107601] cpu 0/pid 1340/task TWO :: Stage 4. T2 sleep for 1s. any BUG afterward? [ 130.117013] cpu 1/pid 1341/task ONE :: Stage 5. T1 slept for 5s. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1813018 Title: Kernel Oops - unable to handle kernel paging request; RIP is at wait_migrate_huge_page+0x51/0x70 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1813018/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs