Wouldn't it make more sense to downgrade/install 2.4.2 in that case?
On Wed, Jan 8, 2014 at 4:58 PM, Oliver Mangold <oliver.mang...@emea.nec.com> wrote:

> Hello,
>
> we installed Lustre 2.5.0 servers at a customer site and ran into
> massive stability problems. Because of this we would like to downgrade
> to 2.1.6 for now, until 2.5.0 is more mature. Is this possible? I tried
> to do 'tunefs.lustre --writeconf' on the MGT (formatted with 2.5.0) and
> mount it with the 2.1.6 kernel. It results in an immediate kernel crash
> with the console output shown below. Any suggestions?
>
> Best regards,
>
> Oliver
>
> ---
> Lustre: MGS MGS started
> LustreError: 5476:0:(mgc_request.c:76:mgc_name2resid()) missing name:
> -sptlrpc
> Lustre: 5528:0:(ldlm_lib.c:952:target_handle_connect()) MGS: connection
> from bd919a64-44f9-3a8d-88e8-be602f7d6de1@0@lo t0 exp (null) cur
> 1389192193 last 0
> Lustre: MGC10.188.20.31@o2ib: Reactivating import
> LustreError: 5476:0:(mgc_request.c:286:config_log_add()) can't create
> sptlrpc log: -sptlrpc
> LustreError: 15b-f: MGC10.188.20.31@o2ib: The configuration from log
> '-params'failed from the MGS (-22). Make sure this client and the MGS
> are running compatible versions of Lustre.
> LustreError: 15c-8: MGC10.188.20.31@o2ib: The configuration from log
> '-params' failed (-22). This may be the result of communication errors
> between this node and the MGS, a bad configuration, or other errors. See
> the syslog for more information.
> BUG: unable to handle kernel paging request at 00000000deadbeef
> IP: [<ffffffff8127ebe9>] strnlen+0x9/0x40
> PGD 638aca067 PUD 0
> Oops: 0000 [#1] SMP
> last sysfs file: /sys/module/ldiskfs/initstate
> CPU 4
> Modules linked in: fsfilt_ldiskfs(U) exportfs mgs(U) mgc(U) ldiskfs(U)
> ipmi_devintf sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf
> lustre(U) lov(U) osc(U) lquota(U) mdc(U) fid(U) fld(U) ksocklnd(U)
> ko2iblnd(U) ptlrpc(U) obdclass(U) lnet(U) lvfs(U) libcfs(U) ib_ipoib
> rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6
> e1000e mlx4_ib ib_sa ib_mad ib_core mlx4_en mlx4_core mptsas mptscsih
> mptbase igb ptp pps_core microcode serio_raw i2c_i801 i2c_core iTCO_wdt
> iTCO_vendor_support ioatdma dca i7core_edac edac_core shpchp squashfs
> ext4 mbcache jbd2 raid1 sg sd_mod crc_t10dif mpt2sas(U)
> scsi_transport_sas raid_class ahci dm_multipath dm_mirror dm_region_hash
> dm_log dm_mod scsi_dh_rdac [last unloaded: scsi_wait_scan]
>
> Pid: 5476, comm: mount.lustre Not tainted
> 2.6.32-358.11.1.el6_lustre.x86_64 #1 Supermicro X8DTH-i/6/iF/6F/X8DTH
> RIP: 0010:[<ffffffff8127ebe9>] [<ffffffff8127ebe9>] strnlen+0x9/0x40
> RSP: 0018:ffff880632073a58 EFLAGS: 00010286
> RAX: ffffffff817a29dd RBX: ffff88061dc95000 RCX: 0000000000000002
> RDX: 00000000deadbeef RSI: ffffffffffffffff RDI: 00000000deadbeef
> RBP: ffff880632073a58 R08: 0000000000000073 R09: 0000000000000004
> R10: 0000000000000001 R11: ffff88033a243a6b R12: ffff88061dc94a34
> R13: 00000000deadbeef R14: 00000000ffffffff R15: 0000000000000000
> FS: 00007f62ee6ef700(0000) GS:ffff88034ac00000(0000)
> knlGS:0000000000000000
> CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> CR2: 00000000deadbeef CR3: 0000000639dd2000 CR4: 00000000000007e0
> DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> Process mount.lustre (pid: 5476, threadinfo ffff880632072000, task
> ffff88063835b500)
> Stack:
> ffff880632073a98 ffffffff8127fea0 0000000000000000 ffff88061dc94a34
> <d> ffffffffa051c8c7 ffffffffa051c8c5 ffff880632073c18 ffff88061dc95000
> <d> ffff880632073b38 ffffffff812812e8 0000000000000004 0000000affffffff
> Call Trace:
> [<ffffffff8127fea0>] string+0x40/0x100
> [<ffffffff812812e8>] vsnprintf+0x218/0x5e0
> [<ffffffffa03ab19b>] ? cfs_set_ptldebug_header+0x2b/0xc0 [libcfs]
> [<ffffffffa03b65a3>] libcfs_debug_vmsg2+0x2c3/0xb60 [libcfs]
> [<ffffffffa04e4952>] ? lustre_process_log+0x812/0xad0 [obdclass]
> [<ffffffffa03b6e81>] libcfs_debug_msg+0x41/0x50 [libcfs]
> [<ffffffffa04ef0f1>] lustre_fill_super+0xd51/0x13a0 [obdclass]
> [<ffffffff8127f10a>] ? strlcpy+0x4a/0x60
> [<ffffffffa04ee3a0>] ? lustre_fill_super+0x0/0x13a0 [obdclass]
> [<ffffffff8118433f>] get_sb_nodev+0x5f/0xa0
> [<ffffffffa04dfad5>] lustre_get_sb+0x25/0x30 [obdclass]
> [<ffffffff8118395b>] vfs_kern_mount+0x7b/0x1b0
> [<ffffffff81183b02>] do_kern_mount+0x52/0x130
> [<ffffffff811a3ce2>] do_mount+0x2d2/0x8d0
> [<ffffffff811a4370>] sys_mount+0x90/0xe0
> [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
> Code: 66 90 48 83 c2 01 80 3a 00 75 f7 48 89 d0 48 29 f8 c9 c3 66 66 66
> 66 66 66 2e 0f 1f 84 00 00 00 00 00 55 48 85 f6 48 89 e5 74 2e <80> 3f
> 00 74 29 48 83 ee 01 48 89 f8 eb 12 66 0f 1f 84 00 00 00
> RIP [<ffffffff8127ebe9>] strnlen+0x9/0x40
> RSP <ffff880632073a58>
> CR2: 00000000deadbeef
> ---[ end trace bdafbd6d71d51ae3 ]---
> Kernel panic - not syncing: Fatal exception
> Pid: 5476, comm: mount.lustre Tainted: G D ---------------
> 2.6.32-358.11.1.el6_lustre.x86_64 #1
> Call Trace:
> [<ffffffff8150d7c8>] ? panic+0xa7/0x16f
> [<ffffffff815119f4>] ? oops_end+0xe4/0x100
> [<ffffffff81046bfb>] ? no_context+0xfb/0x260
> [<ffffffff81513996>] ? notifier_call_chain+0x16/0x80
> [<ffffffff81046e85>] ? __bad_area_nosemaphore+0x125/0x1e0
> [<ffffffff81513996>] ? notifier_call_chain+0x16/0x80
> [<ffffffff81046fae>] ? bad_area+0x4e/0x60
> [<ffffffff81047760>] ? __do_page_fault+0x3d0/0x480
> [<ffffffff8106e485>] ? __call_console_drivers+0x75/0x90
> [<ffffffff8100bb8e>] ? apic_timer_interrupt+0xe/0x20
> [<ffffffff8106f241>] ? vprintk+0x251/0x560
> [<ffffffff8151391e>] ? do_page_fault+0x3e/0xa0
> [<ffffffff81510cd5>] ? page_fault+0x25/0x30
> [<ffffffff8127ebe9>] ? strnlen+0x9/0x40
> [<ffffffff8127fea0>] ? string+0x40/0x100
> [<ffffffff812812e8>] ? vsnprintf+0x218/0x5e0
> [<ffffffffa03ab19b>] ? cfs_set_ptldebug_header+0x2b/0xc0 [libcfs]
> [<ffffffffa03b65a3>] ? libcfs_debug_vmsg2+0x2c3/0xb60 [libcfs]
> [<ffffffffa04e4952>] ? lustre_process_log+0x812/0xad0 [obdclass]
> [<ffffffffa03b6e81>] ? libcfs_debug_msg+0x41/0x50 [libcfs]
> [<ffffffffa04ef0f1>] ? lustre_fill_super+0xd51/0x13a0 [obdclass]
> [<ffffffff8127f10a>] ? strlcpy+0x4a/0x60
> [<ffffffffa04ee3a0>] ? lustre_fill_super+0x0/0x13a0 [obdclass]
> [<ffffffff8118433f>] ? get_sb_nodev+0x5f/0xa0
> [<ffffffffa04dfad5>] ? lustre_get_sb+0x25/0x30 [obdclass]
> [<ffffffff8118395b>] ? vfs_kern_mount+0x7b/0x1b0
> [<ffffffff81183b02>] ? do_kern_mount+0x52/0x130
> [<ffffffff811a3ce2>] ? do_mount+0x2d2/0x8d0
> [<ffffffff811a4370>] ? sys_mount+0x90/0xe0
> [<ffffffff8100b072>] ? system_call_fastpath+0x16/0x1b
>
> --
> Dr. Oliver Mangold
> System Analyst
> NEC Deutschland GmbH
> HPC Division
> Hessbrühlstraße 21b
> 70565 Stuttgart
> Germany
> Phone: +49 711 78055 13
> Mail: oliver.mang...@emea.nec.com
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss@lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss
>