Just installed a new node on the cluster, imaged just like the rest, but it was unable to mount lustre on boot. I tried to mount but got the following from dmesg:
Lustre: OBD class driver, http://www.lustre.org/ Lustre: Lustre Version: 1.8.4 Lustre: Build Version: 1.8.4-20100726215630-PRISTINE-2.6.18-194.3.1.el5_lustre.1.8.4 Lustre: Added LNI 192.168.255.194@tcp [8/256/0/180] Lustre: Accept secure, port 988 Lustre: Lustre Client File System; http://www.lustre.org/ Lustre: 4872:0:(client.c:1476:ptlrpc_expire_one_request()) @@@ Request x1373071042674689 sent from MGC192.168.5.104@tcp to NID 192.168.5.104@tcp 5s ago has timed out (5s prior to deadline). req@ffff811070397800 x1373071042674689/t0 o250->MGS@MGC192.168.5.104@tcp_0:26/25 lens 368/584 e 0 to 1 dl 1309462593 ref 1 fl Rpc:N/0/0 rc 0/0 eth0: no IPv6 routers present Lustre: 4872:0:(client.c:1476:ptlrpc_expire_one_request()) @@@ Request x1373071042674691 sent from MGC192.168.5.104@tcp to NID 192.168.5.105@tcp 5s ago has timed out (5s prior to deadline). req@ffff81107dc57000 x1373071042674691/t0 o250->MGS@MGC192.168.5.104@tcp_1:26/25 lens 368/584 e 0 to 1 dl 1309462618 ref 1 fl Rpc:N/0/0 rc 0/0 LustreError: 4735:0:(client.c:858:ptlrpc_import_delay_req()) @@@ IMP_INVALID req@ffff81107039b800 x1373071042674692/t0 o501->MGS@MGC192.168.5.104@tcp_1:26/25 lens 264/432 e 0 to 1 dl 0 ref 1 fl Rpc:/0/0 rc 0/0 LustreError: 15c-8: MGC192.168.5.104@tcp: The configuration from log 'lustre-client' failed (-108). This may be the result of communication errors between this node and the MGS, a bad configuration, or other errors. See the syslog for more information. LustreError: 4735:0:(llite_lib.c:1086:ll_fill_super()) Unable to process log: -108 Lustre: client ffff81106881fc00 umount complete LustreError: 4735:0:(obd_mount.c:2050:lustre_fill_super()) Unable to mount (-108) and from /var/log/messages: Jun 30 14:52:18 compute-6-3 kernel: LustreError: 4395:0:(client.c:858:ptlrpc_import_delay_req()) @@@ IMP_INVALID req@ffff81106f017c00 x1373072007364612/t0 o501->MGS@MGC192.168.5.104@tcp_1:26/25 lens 264/432 e 0 to 1 dl 0 ref 1 fl Rpc:/0/0 rc 0/0 Jun 30 14:52:18 compute-6-3 kernel: LustreError: 15c-8: MGC192.168.5.104@tcp: The configuration from log 'lustre-client' failed (-108). This may be the result of communication errors between this node and the MGS, a bad configuration, or other errors. See the syslog for more information. Jun 30 14:52:18 compute-6-3 kernel: LustreError: 4395:0:(llite_lib.c:1086:ll_fill_super()) Unable to process log: -108 Jun 30 14:52:18 compute-6-3 kernel: LustreError: 4395:0:(obd_mount.c:2050:lustre_fill_super()) Unable to mount (-108) Only after I ran lctl ping x.x.x.x to the MDS/MGS was I able to manually mount lustre. I got the idea to run lctl ping from a post from someone with the same problem but over infinaband, we are using ethernet here. David -- Personally, I liked the university. They gave us money and facilities, we didn't have to produce anything! You've never been out of college! You don't know what it's like out there! I've worked in the private sector. They expect results. -Ray Ghostbusters _______________________________________________ Lustre-discuss mailing list Lustre-discuss@lists.lustre.org http://lists.lustre.org/mailman/listinfo/lustre-discuss