We weren't able to use lustre (2.4.2/2.5.x) with kernel 3.12.4 (staging kernel driver - kernel panics on mount) After compiling 3.13-rc we have a working system (though it has not yet being put through its' paces except for some simultaneous dd)
On Wed, Jan 8, 2014 at 5:02 PM, E.S. Rosenberg <esr+lus...@mail.hebrew.edu>wrote: > Wouldn't it make more sense to downgrade/install 2.4.2 in that case? > > > On Wed, Jan 8, 2014 at 4:58 PM, Oliver Mangold < > oliver.mang...@emea.nec.com> wrote: > >> Hello, >> >> we installed Lustre 2.5.0 servers at a customer site and ran into >> massive stability problems. Because of this we would like to downgrade >> to 2.1.6 for now, until 2.5.0 is more mature. Is this possible? I tried >> to do 'tunefs.lustre --writeconf' on the MGT (formatted with 2.5.0) and >> mount it with the 2.1.6 kernel. It results in an immediate kernel crash >> with the console output shown below. Any suggestions? >> >> Best regards, >> >> Oliver >> >> --- >> Lustre: MGS MGS started >> LustreError: 5476:0:(mgc_request.c:76:mgc_name2resid()) missing name: >> -sptlrpc >> Lustre: 5528:0:(ldlm_lib.c:952:target_handle_connect()) MGS: connection >> from bd919a64-44f9-3a8d-88e8-be602f7d6de1@0@lo t0 exp (null) cur >> 1389192193 last 0 >> Lustre: MGC10.188.20.31@o2ib: Reactivating import >> LustreError: 5476:0:(mgc_request.c:286:config_log_add()) can't create >> sptlrpc log: -sptlrpc >> LustreError: 15b-f: MGC10.188.20.31@o2ib: The configuration from log >> '-params'failed from the MGS (-22). Make sure this client and the MGS >> are running compatible versions of Lustre. >> LustreError: 15c-8: MGC10.188.20.31@o2ib: The configuration from log >> '-params' failed (-22). This may be the result of communication errors >> between this node and the MGS, a bad configuration, or other errors. See >> the syslog for more information. >> BUG: unable to handle kernel paging request at 00000000deadbeef >> IP: [<ffffffff8127ebe9>] strnlen+0x9/0x40 >> PGD 638aca067 PUD 0 >> Oops: 0000 [#1] SMP >> last sysfs file: /sys/module/ldiskfs/initstate >> CPU 4 >> Modules linked in: fsfilt_ldiskfs(U) exportfs mgs(U) mgc(U) ldiskfs(U) >> ipmi_devintf sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf >> lustre(U) lov(U) osc(U) lquota(U) mdc(U) fid(U) fld(U) ksocklnd(U) >> ko2iblnd(U) ptlrpc(U) obdclass(U) lnet(U) lvfs(U) libcfs(U) ib_ipoib >> rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 >> e1000e mlx4_ib ib_sa ib_mad ib_core mlx4_en mlx4_core mptsas mptscsih >> mptbase igb ptp pps_core microcode serio_raw i2c_i801 i2c_core iTCO_wdt >> iTCO_vendor_support ioatdma dca i7core_edac edac_core shpchp squashfs >> ext4 mbcache jbd2 raid1 sg sd_mod crc_t10dif mpt2sas(U) >> scsi_transport_sas raid_class ahci dm_multipath dm_mirror dm_region_hash >> dm_log dm_mod scsi_dh_rdac [last unloaded: scsi_wait_scan] >> >> Pid: 5476, comm: mount.lustre Not tainted >> 2.6.32-358.11.1.el6_lustre.x86_64 #1 Supermicro X8DTH-i/6/iF/6F/X8DTH >> RIP: 0010:[<ffffffff8127ebe9>] [<ffffffff8127ebe9>] strnlen+0x9/0x40 >> RSP: 0018:ffff880632073a58 EFLAGS: 00010286 >> RAX: ffffffff817a29dd RBX: ffff88061dc95000 RCX: 0000000000000002 >> RDX: 00000000deadbeef RSI: ffffffffffffffff RDI: 00000000deadbeef >> RBP: ffff880632073a58 R08: 0000000000000073 R09: 0000000000000004 >> R10: 0000000000000001 R11: ffff88033a243a6b R12: ffff88061dc94a34 >> R13: 00000000deadbeef R14: 00000000ffffffff R15: 0000000000000000 >> FS: 00007f62ee6ef700(0000) GS:ffff88034ac00000(0000) >> knlGS:0000000000000000 >> CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b >> CR2: 00000000deadbeef CR3: 0000000639dd2000 CR4: 00000000000007e0 >> DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 >> DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 >> Process mount.lustre (pid: 5476, threadinfo ffff880632072000, task >> ffff88063835b500) >> Stack: >> ffff880632073a98 ffffffff8127fea0 0000000000000000 ffff88061dc94a34 >> <d> ffffffffa051c8c7 ffffffffa051c8c5 ffff880632073c18 ffff88061dc95000 >> <d> ffff880632073b38 ffffffff812812e8 0000000000000004 0000000affffffff >> Call Trace: >> [<ffffffff8127fea0>] string+0x40/0x100 >> [<ffffffff812812e8>] vsnprintf+0x218/0x5e0 >> [<ffffffffa03ab19b>] ? cfs_set_ptldebug_header+0x2b/0xc0 [libcfs] >> [<ffffffffa03b65a3>] libcfs_debug_vmsg2+0x2c3/0xb60 [libcfs] >> [<ffffffffa04e4952>] ? lustre_process_log+0x812/0xad0 [obdclass] >> [<ffffffffa03b6e81>] libcfs_debug_msg+0x41/0x50 [libcfs] >> [<ffffffffa04ef0f1>] lustre_fill_super+0xd51/0x13a0 [obdclass] >> [<ffffffff8127f10a>] ? strlcpy+0x4a/0x60 >> [<ffffffffa04ee3a0>] ? lustre_fill_super+0x0/0x13a0 [obdclass] >> [<ffffffff8118433f>] get_sb_nodev+0x5f/0xa0 >> [<ffffffffa04dfad5>] lustre_get_sb+0x25/0x30 [obdclass] >> [<ffffffff8118395b>] vfs_kern_mount+0x7b/0x1b0 >> [<ffffffff81183b02>] do_kern_mount+0x52/0x130 >> [<ffffffff811a3ce2>] do_mount+0x2d2/0x8d0 >> [<ffffffff811a4370>] sys_mount+0x90/0xe0 >> [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b >> Code: 66 90 48 83 c2 01 80 3a 00 75 f7 48 89 d0 48 29 f8 c9 c3 66 66 66 >> 66 66 66 2e 0f 1f 84 00 00 00 00 00 55 48 85 f6 48 89 e5 74 2e <80> 3f >> 00 74 29 48 83 ee 01 48 89 f8 eb 12 66 0f 1f 84 00 00 00 >> RIP [<ffffffff8127ebe9>] strnlen+0x9/0x40 >> RSP <ffff880632073a58> >> CR2: 00000000deadbeef >> ---[ end trace bdafbd6d71d51ae3 ]--- >> Kernel panic - not syncing: Fatal exception >> Pid: 5476, comm: mount.lustre Tainted: G D --------------- >> 2.6.32-358.11.1.el6_lustre.x86_64 #1 >> Call Trace: >> [<ffffffff8150d7c8>] ? panic+0xa7/0x16f >> [<ffffffff815119f4>] ? oops_end+0xe4/0x100 >> [<ffffffff81046bfb>] ? no_context+0xfb/0x260 >> [<ffffffff81513996>] ? notifier_call_chain+0x16/0x80 >> [<ffffffff81046e85>] ? __bad_area_nosemaphore+0x125/0x1e0 >> [<ffffffff81513996>] ? notifier_call_chain+0x16/0x80 >> [<ffffffff81046fae>] ? bad_area+0x4e/0x60 >> [<ffffffff81047760>] ? __do_page_fault+0x3d0/0x480 >> [<ffffffff8106e485>] ? __call_console_drivers+0x75/0x90 >> [<ffffffff8100bb8e>] ? apic_timer_interrupt+0xe/0x20 >> [<ffffffff8106f241>] ? vprintk+0x251/0x560 >> [<ffffffff8151391e>] ? do_page_fault+0x3e/0xa0 >> [<ffffffff81510cd5>] ? page_fault+0x25/0x30 >> [<ffffffff8127ebe9>] ? strnlen+0x9/0x40 >> [<ffffffff8127fea0>] ? string+0x40/0x100 >> [<ffffffff812812e8>] ? vsnprintf+0x218/0x5e0 >> [<ffffffffa03ab19b>] ? cfs_set_ptldebug_header+0x2b/0xc0 [libcfs] >> [<ffffffffa03b65a3>] ? libcfs_debug_vmsg2+0x2c3/0xb60 [libcfs] >> [<ffffffffa04e4952>] ? lustre_process_log+0x812/0xad0 [obdclass] >> [<ffffffffa03b6e81>] ? libcfs_debug_msg+0x41/0x50 [libcfs] >> [<ffffffffa04ef0f1>] ? lustre_fill_super+0xd51/0x13a0 [obdclass] >> [<ffffffff8127f10a>] ? strlcpy+0x4a/0x60 >> [<ffffffffa04ee3a0>] ? lustre_fill_super+0x0/0x13a0 [obdclass] >> [<ffffffff8118433f>] ? get_sb_nodev+0x5f/0xa0 >> [<ffffffffa04dfad5>] ? lustre_get_sb+0x25/0x30 [obdclass] >> [<ffffffff8118395b>] ? vfs_kern_mount+0x7b/0x1b0 >> [<ffffffff81183b02>] ? do_kern_mount+0x52/0x130 >> [<ffffffff811a3ce2>] ? do_mount+0x2d2/0x8d0 >> [<ffffffff811a4370>] ? sys_mount+0x90/0xe0 >> [<ffffffff8100b072>] ? system_call_fastpath+0x16/0x1b >> >> -- >> Dr. Oliver Mangold >> System Analyst >> NEC Deutschland GmbH >> HPC Division >> Hessbrühlstraße 21b >> 70565 Stuttgart >> Germany >> Phone: +49 711 78055 13 >> Mail: oliver.mang...@emea.nec.com >> _______________________________________________ >> Lustre-discuss mailing list >> Lustre-discuss@lists.lustre.org >> http://lists.lustre.org/mailman/listinfo/lustre-discuss >> > >
_______________________________________________ Lustre-discuss mailing list Lustre-discuss@lists.lustre.org http://lists.lustre.org/mailman/listinfo/lustre-discuss