On 10/12/20 1:19 π.μ., Andreas Beckmann wrote:
but I'm not sure whether it is worth backporting them,
since you most likely will be affected by
#973729 - nvidia-uvm does not work with Linux 5.9
which is fixed in 455.45.01

Well, I did the replace you suggested below and even though the modules load, I don't get a display, here is what dmesg gives:

[   15.889326] NVRM: GPU 0000:01:00.0: DMA address not in addressable range of device (0x80020054de80000-0x80020054de8ffff, 0x800000000000000-0x80000ffffffffff) [   15.889341] NVRM: GPU 0000:01:00.0: DMA address not in addressable range of device (0x80020054de80000-0x80020054de8ffff, 0x800000000000000-0x80000ffffffffff) [   15.890377] NVRM: GPU 0000:01:00.0: DMA address not in addressable range of device (0x80020054de80000-0x80020054de8ffff, 0x800000000000000-0x80000ffffffffff) [   15.890564] NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x24:0x1e:1224) [   15.890601] NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0 [   15.995590] NVRM: GPU 0030:01:00.0: DMA address not in addressable range of device (0x80020054a310000-0x80020054a31ffff, 0x800000000000000-0x80000ffffffffff) [   15.995601] NVRM: GPU 0030:01:00.0: DMA address not in addressable range of device (0x80020054a310000-0x80020054a31ffff, 0x800000000000000-0x80000ffffffffff) [   15.996482] NVRM: GPU 0030:01:00.0: DMA address not in addressable range of device (0x80020054a310000-0x80020054a31ffff, 0x800000000000000-0x80000ffffffffff) [   15.996650] NVRM: GPU 0030:01:00.0: RmInitAdapter failed! (0x24:0x1e:1224) [   15.996705] NVRM: GPU 0030:01:00.0: rm_init_adapter failed, device minor number 1
[   34.850800] ------------[ cut here ]------------
[   34.850801] remap_4k_pfn called with wrong pfn value
[   34.850966] WARNING: CPU: 5 PID: 1584 at arch/powerpc/include/asm/book3s/64/hash-64k.h:166 nvidia_mmap_helper+0x6bc/0x800 [nvidia] [   34.850967] Modules linked in: xt_conntrack(E) nf_conntrack_netlink(E) xfrm_user(E) xfrm_algo(E) xt_addrtype(E) br_netfilter(E) overlay(E) xt_CHECKSUM(E) nft_chain_nat(E) xt_MASQUERADE(E) nf_nat(E) nf_conntrack(E) nf_defrag_ipv6(E) nf_defrag_ipv4(E) libcrc32c(E) nft_counter(E) xt_tcpudp(E) nft_compat(E) bridge(E) stp(E) llc(E) nf_tables(E) nfnetlink(E) rfkill(E) nvidia_drm(POE) drm_kms_helper(E) syscopyarea(E) sysfillrect(E) sysimgblt(E) fb_sys_fops(E) nvidia_modeset(POE) nvidia(POE) binfmt_misc(E) evdev(E) joydev(E) snd_hda_codec_hdmi(E) snd_hda_intel(E) snd_intel_dspcfg(E) snd_hda_codec(E) snd_hda_core(E) snd_hwdep(E) snd_pcm(E) snd_timer(E) ctr(E) cbc(E) snd(E) vmx_crypto(E) soundcore(E) gf128mul(E) ofpart(E) ipmi_powernv(E) powernv_flash(E) ipmi_devintf(E) mtd(E) ipmi_msghandler(E) opal_prd(E) at24(E) regmap_i2c(E) parport_pc(E) lp(E) drm(E) parport(E) sunrpc(E) fuse(E) configfs(E) drm_panel_orientation_quirks(E) ip_tables(E) x_tables(E) autofs4(E) ext4(E) crc16(E) mbcache(E) jbd2(E) [   34.850994]  crc32c_generic(E) ecb(E) aes_generic(E) libaes(E) xts(E) hid_generic(E) usbhid(E) hid(E) dm_crypt(E) dm_mod(E) xhci_pci(E) xhci_hcd(E) tg3(E) usbcore(E) nvme(E) libphy(E) nvme_core(E) ptp(E) pps_core(E) usb_common(E) t10_pi(E) crc_t10dif(E) crct10dif_generic(E) crct10dif_common(E) [   34.851008] CPU: 5 PID: 1584 Comm: Xorg Tainted: P OE     5.9.0-4-powerpc64le #1 Debian 5.9.11-1 [   34.851009] NIP:  c00800000e44c664 LR: c00800000e44c660 CTR: 0000000000000000 [   34.851010] REGS: c000000007493750 TRAP: 0700   Tainted: P           OE      (5.9.0-4-powerpc64le Debian 5.9.11-1) [   34.851011] MSR:  9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE>  CR: 28042222  XER: 00000000
[   34.851014] CFAR: c0000000001314e4 IRQMASK: 0
               GPR00: c00800000e44c660 c0000000074939e0 c00800000f09ec00 0000000000000028                GPR04: 0000000000000001 0000000000000004 0000000000000027 c0000005ff6cbf90                GPR08: 0000000000000023 ffffffffffffffd8 0000000000000027 0000000000000000                GPR12: 0000000000002000 c0000005fffea600 00000001473aaad0 00007fffeac1ac14                GPR16: 0000000000000000 0000000000000000 0000000000000013 0000000000000008                GPR20: 0000000000000000 0000000000010000 0000000000001000 0060000002400000                GPR24: c0000005f7085e08 c0000005f7085800 0000000000002000 0000000000000003                GPR28: 0006000000240000 c0000005f7085800 c0000005fa441800 c0000005f38fbb80 [   34.851105] NIP [c00800000e44c664] nvidia_mmap_helper+0x6bc/0x800 [nvidia]
[   34.851187] LR [c00800000e44c660] nvidia_mmap_helper+0x6b8/0x800 [nvidia]
[   34.851188] Call Trace:
[   34.851270] [c0000000074939e0] [c00800000e44c660] nvidia_mmap_helper+0x6b8/0x800 [nvidia] (unreliable) [   34.851353] [c000000007493ac0] [c00800000e44c814] nvidia_mmap+0x6c/0xc0 [nvidia] [   34.851434] [c000000007493b00] [c00800000e4400ec] nvidia_frontend_mmap+0x54/0x80 [nvidia]
[   34.851438] [c000000007493b20] [c0000000003bf51c] mmap_region+0x4cc/0x840
[   34.851439] [c000000007493c00] [c0000000003bfcac] do_mmap+0x41c/0x650
[   34.851441] [c000000007493c90] [c0000000003871f4] vm_mmap_pgoff+0x124/0x190 [   34.851442] [c000000007493d20] [c0000000003bce70] ksys_mmap_pgoff+0x240/0x2b0
[   34.851444] [c000000007493da0] [c000000000013b88] sys_mmap+0x48/0x90
[   34.851446] [c000000007493dc0] [c00000000002f54c] system_call_exception+0x14c/0x230 [   34.851447] [c000000007493e20] [c00000000000d948] system_call_common+0xe8/0x218
[   34.851448] Instruction dump:
[   34.851449] 4bfffe68 60000000 60420000 eae10098 4bfffaf0 f9210070 39210070 4bffffb8 [   34.851453] 3d220000 e8698690 48bebdd5 e8410018 <0fe00000> 7f03c378 3bc0fff5 48bed289
[   34.851457] ---[ end trace 0fa4d523132d0cdc ]---
[   35.320351] ------------[ cut here ]------------
[   35.320352] remap_4k_pfn called with wrong pfn value
[   35.320524] WARNING: CPU: 4 PID: 1584 at arch/powerpc/include/asm/book3s/64/hash-64k.h:166 nvidia_mmap_helper+0x6bc/0x800 [nvidia] [   35.320524] Modules linked in: xt_conntrack(E) nf_conntrack_netlink(E) xfrm_user(E) xfrm_algo(E) xt_addrtype(E) br_netfilter(E) overlay(E) xt_CHECKSUM(E) nft_chain_nat(E) xt_MASQUERADE(E) nf_nat(E) nf_conntrack(E) nf_defrag_ipv6(E) nf_defrag_ipv4(E) libcrc32c(E) nft_counter(E) xt_tcpudp(E) nft_compat(E) bridge(E) stp(E) llc(E) nf_tables(E) nfnetlink(E) rfkill(E) nvidia_drm(POE) drm_kms_helper(E) syscopyarea(E) sysfillrect(E) sysimgblt(E) fb_sys_fops(E) nvidia_modeset(POE) nvidia(POE) binfmt_misc(E) evdev(E) joydev(E) snd_hda_codec_hdmi(E) snd_hda_intel(E) snd_intel_dspcfg(E) snd_hda_codec(E) snd_hda_core(E) snd_hwdep(E) snd_pcm(E) snd_timer(E) ctr(E) cbc(E) snd(E) vmx_crypto(E) soundcore(E) gf128mul(E) ofpart(E) ipmi_powernv(E) powernv_flash(E) ipmi_devintf(E) mtd(E) ipmi_msghandler(E) opal_prd(E) at24(E) regmap_i2c(E) parport_pc(E) lp(E) drm(E) parport(E) sunrpc(E) fuse(E) configfs(E) drm_panel_orientation_quirks(E) ip_tables(E) x_tables(E) autofs4(E) ext4(E) crc16(E) mbcache(E) jbd2(E) [   35.320549]  crc32c_generic(E) ecb(E) aes_generic(E) libaes(E) xts(E) hid_generic(E) usbhid(E) hid(E) dm_crypt(E) dm_mod(E) xhci_pci(E) xhci_hcd(E) tg3(E) usbcore(E) nvme(E) libphy(E) nvme_core(E) ptp(E) pps_core(E) usb_common(E) t10_pi(E) crc_t10dif(E) crct10dif_generic(E) crct10dif_common(E) [   35.320560] CPU: 4 PID: 1584 Comm: Xorg Tainted: P        W OE     5.9.0-4-powerpc64le #1 Debian 5.9.11-1 [   35.320561] NIP:  c00800000e44c664 LR: c00800000e44c660 CTR: 0000000000000000 [   35.320562] REGS: c000000007493750 TRAP: 0700   Tainted: P        W  OE      (5.9.0-4-powerpc64le Debian 5.9.11-1) [   35.320562] MSR:  9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE>  CR: 22042222  XER: 00000000
[   35.320565] CFAR: c0000000001314e4 IRQMASK: 0
               GPR00: c00800000e44c660 c0000000074939e0 c00800000f09ec00 0000000000000028                GPR04: 0000000000000001 0000000000000004 0000000000000027 c0000005ff61bf90                GPR08: 0000000000000023 ffffffffffffffd8 0000000000000027 0000000000000000                GPR12: 0000000000002000 c0000005fffeb800 00000001473b1c70 00007fffeac1ac14                GPR16: 0000000000000000 0000000000000000 0000000000000012 0000000000000008                GPR20: 0000000000000000 0000000000010000 0000000000001000 0062000002400000                GPR24: c00020055901de08 c00020055901d800 0000000000002000 0000000000000003                GPR28: 0006200000240000 c00020055901d800 c0000005fa311800 c0000005f754d610 [   35.320661] NIP [c00800000e44c664] nvidia_mmap_helper+0x6bc/0x800 [nvidia]
[   35.320747] LR [c00800000e44c660] nvidia_mmap_helper+0x6b8/0x800 [nvidia]
[   35.320748] Call Trace:
[   35.320834] [c0000000074939e0] [c00800000e44c660] nvidia_mmap_helper+0x6b8/0x800 [nvidia] (unreliable) [   35.320921] [c000000007493ac0] [c00800000e44c814] nvidia_mmap+0x6c/0xc0 [nvidia] [   35.321006] [c000000007493b00] [c00800000e4400ec] nvidia_frontend_mmap+0x54/0x80 [nvidia]
[   35.321009] [c000000007493b20] [c0000000003bf51c] mmap_region+0x4cc/0x840
[   35.321010] [c000000007493c00] [c0000000003bfcac] do_mmap+0x41c/0x650
[   35.321012] [c000000007493c90] [c0000000003871f4] vm_mmap_pgoff+0x124/0x190 [   35.321013] [c000000007493d20] [c0000000003bce70] ksys_mmap_pgoff+0x240/0x2b0
[   35.321016] [c000000007493da0] [c000000000013b88] sys_mmap+0x48/0x90
[   35.321017] [c000000007493dc0] [c00000000002f54c] system_call_exception+0x14c/0x230 [   35.321019] [c000000007493e20] [c00000000000d948] system_call_common+0xe8/0x218
[   35.321019] Instruction dump:
[   35.321021] 4bfffe68 60000000 60420000 eae10098 4bfffaf0 f9210070 39210070 4bffffb8 [   35.321025] 3d220000 e8698690 48bebdd5 e8410018 <0fe00000> 7f03c378 3bc0fff5 48bed289
[   35.321029] ---[ end trace 0fa4d523132d0cdd ]---


PPS: the first time I hear that someone is actually trying to use
the ppc64el packages ;-)

I had the talos II for a few years and was using it for VSX work, but as it's still a very powerful machine, I decided to use it for some ML/DL work and got myself a pair of Titan X cards to use. Hopefully it should work, I'd hate to have to revert to using an x86 box just for this :-/

Regards

Konstantinos

Reply via email to