On Thu, 2019-06-13 at 15:11 +0300, Mike Rapoport wrote: > The log Qian Cai posted at [1] and partially cited below confirms that the > failure happens when *user* PGDs are allocated and the addition of > __GFP_ACCOUNT to gfp flags used by pgd_alloc() only uncovered another > issue. > > I'm still failing to reproduce it with qemu and I'm not really familiar > with slub/memcg code to say anything smart about it. Will keep looking. > > Note, that as failures start way after efi_virtmap_init() that allocates a > PGD for efi_mm, there are no real fixes required for the original series, > except that the check for mm == &init_mm I copied for some reason from > powerpc is bogus and can be removed.
Yes, there is more places are not happy with __GFP_ACCOUNT other than efi_mm. For example, [ 132.786842][ T1501] kobject_add_internal failed for pgd_cache(49:systemd- udevd.service) (error: -2 parent: cgroup) [ 132.795589][ T1889] CPU: 9 PID: 1889 Comm: systemd-udevd Tainted: G W 5.2.0-rc4-next-20190613+ #8 [ 132.807356][ T1889] Hardware name: HPE Apollo 70 /C01_APACHE_MB , BIOS L50_5.13_1.0.9 03/01/2019 [ 132.817872][ T1889] Call trace: [ 132.821017][ T1889] dump_backtrace+0x0/0x268 [ 132.825372][ T1889] show_stack+0x20/0x2c [ 132.829380][ T1889] dump_stack+0xb4/0x108 [ 132.833475][ T1889] pgd_alloc+0x34/0x5c [ 132.837396][ T1889] mm_init+0x27c/0x32c [ 132.841315][ T1889] dup_mm+0x84/0x7b4 [ 132.845061][ T1889] copy_process+0xf20/0x24cc [ 132.849500][ T1889] _do_fork+0xa4/0x66c [ 132.853420][ T1889] __arm64_sys_clone+0x114/0x1b4 [ 132.858208][ T1889] el0_svc_handler+0x198/0x260 [ 132.862821][ T1889] el0_svc+0x8/0xc > > I surely can add pgd_alloc_kernel() to be used by the EFI code to make sure > we won't run into issues with memcg in the future. > > [ 82.125966] Freeing unused kernel memory: 28672K > [ 87.940365] Checked W+X mappings: passed, no W+X pages found > [ 87.946769] Run /init as init process > [ 88.040040] systemd[1]: System time before build time, advancing clock. > [ 88.054593] systemd[1]: Failed to insert module 'autofs4': No such file or > directory > [ 88.374129] modprobe (1726) used greatest stack depth: 28464 bytes left > [ 88.470108] systemd[1]: systemd 239 running in system mode. (+PAM +AUDIT > +SELINUX +IMA -APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT > +GNUTLS +ACL +XZ +LZ4 +SECCOMP +BLKID +ELFUTILS +KMOD +IDN2 -IDN +PCRE2 > default-hierarchy=legacy) > [ 88.498398] systemd[1]: Detected architecture arm64. > [ 88.506517] systemd[1]: Running in initial RAM disk. > [ 89.621995] mkdir (1730) used greatest stack depth: 27872 bytes left > [ 90.222658] random: systemd: uninitialized urandom read (16 bytes read) > [ 90.230072] systemd[1]: Reached target Swap. > [ 90.240205] random: systemd: uninitialized urandom read (16 bytes read) > [ 90.251088] systemd[1]: Reached target Timers. > [ 90.261303] random: systemd: uninitialized urandom read (16 bytes read) > [ 90.271209] systemd[1]: Listening on udev Control Socket. > [ 90.283238] systemd[1]: Reached target Local File Systems. > [ 90.296232] systemd[1]: Reached target Slices. > [ 90.307239] systemd[1]: Listening on udev Kernel Socket. > [ 90.608597] kobject_add_internal failed for pgd_cache(13:init.scope) > (error: -2 parent: cgroup) > [ 90.678007] kobject_add_internal failed for pgd_cache(13:init.scope)(error: > -2 parent: cgroup) > [ 90.713260] kobject_add_internal failed for pgd_cache(21:systemd-tmpfiles- > setup.service) (error: -2 parent: cgroup) > [ 90.820012] systemd-tmpfile (1759) used greatest stack depth: 27184 bytes > left > [ 90.861942] kobject_add_internal failed for pgd_cache(13:init.scope) error: > -2 parent: cgroup) > > > Thanks, > > Mark. > > > > [1] https://cailca.github.io/files/dmesg.txt >