I just installed a new node on the cluster, imaged just like the rest,
but it was unable to mount Lustre on boot. I tried to mount it manually
but got the following from dmesg:

Lustre: OBD class driver, http://www.lustre.org/
Lustre:     Lustre Version: 1.8.4
Lustre:     Build Version:
1.8.4-20100726215630-PRISTINE-2.6.18-194.3.1.el5_lustre.1.8.4
Lustre: Added LNI 192.168.255.194@tcp [8/256/0/180]
Lustre: Accept secure, port 988
Lustre: Lustre Client File System; http://www.lustre.org/
Lustre: 4872:0:(client.c:1476:ptlrpc_expire_one_request()) @@@ Request
x1373071042674689 sent from MGC192.168.5.104@tcp to NID
192.168.5.104@tcp 5s ago has timed out (5s prior to deadline).
  req@ffff811070397800 x1373071042674689/t0
o250->MGS@MGC192.168.5.104@tcp_0:26/25 lens 368/584 e 0 to 1 dl
1309462593 ref 1 fl Rpc:N/0/0 rc 0/0
eth0: no IPv6 routers present
Lustre: 4872:0:(client.c:1476:ptlrpc_expire_one_request()) @@@ Request
x1373071042674691 sent from MGC192.168.5.104@tcp to NID
192.168.5.105@tcp 5s ago has timed out (5s prior to deadline).
  req@ffff81107dc57000 x1373071042674691/t0
o250->MGS@MGC192.168.5.104@tcp_1:26/25 lens 368/584 e 0 to 1 dl
1309462618 ref 1 fl Rpc:N/0/0 rc 0/0
LustreError: 4735:0:(client.c:858:ptlrpc_import_delay_req()) @@@
IMP_INVALID  req@ffff81107039b800 x1373071042674692/t0
o501->MGS@MGC192.168.5.104@tcp_1:26/25 lens 264/432 e 0 to 1 dl 0 ref
1 fl Rpc:/0/0 rc 0/0
LustreError: 15c-8: MGC192.168.5.104@tcp: The configuration from log
'lustre-client' failed (-108). This may be the result of communication
errors between this node and the MGS, a bad configuration, or other
errors. See the syslog for more information.
LustreError: 4735:0:(llite_lib.c:1086:ll_fill_super()) Unable to
process log: -108
Lustre: client ffff81106881fc00 umount complete
LustreError: 4735:0:(obd_mount.c:2050:lustre_fill_super()) Unable to
mount  (-108)

and from /var/log/messages:

Jun 30 14:52:18 compute-6-3 kernel: LustreError:
4395:0:(client.c:858:ptlrpc_import_delay_req()) @@@ IMP_INVALID
req@ffff81106f017c00 x1373072007364612/t0
o501->MGS@MGC192.168.5.104@tcp_1:26/25 lens 264/432 e 0 to 1 dl 0 ref
1 fl Rpc:/0/0 rc 0/0
Jun 30 14:52:18 compute-6-3 kernel: LustreError: 15c-8:
MGC192.168.5.104@tcp: The configuration from log 'lustre-client'
failed (-108). This may be the result of communication errors between
this node and the MGS, a bad configuration, or other errors. See the
syslog for more information.
Jun 30 14:52:18 compute-6-3 kernel: LustreError:
4395:0:(llite_lib.c:1086:ll_fill_super()) Unable to process log: -108
Jun 30 14:52:18 compute-6-3 kernel: LustreError:
4395:0:(obd_mount.c:2050:lustre_fill_super()) Unable to mount  (-108)

Only after I ran lctl ping x.x.x.x against the MDS/MGS was I able to
manually mount Lustre.

I got the idea to run lctl ping from a post by someone with the same
problem, but over InfiniBand. We are using Ethernet here.
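
For anyone hitting the same thing, here is a rough sketch of how the
workaround could be scripted: pull the NIDs that timed out from the dmesg
text and build one lctl ping command per NID. The function names and the
SAMPLE excerpt are made up for illustration (the excerpt is copied from the
dmesg output above), and the script only prints the commands. It does not
run them, and it does not explain why the ping is needed in the first place.

```python
import re

# Matches the NID that follows "to NID" in ptlrpc_expire_one_request()
# timeout messages. \s+ also covers the line wrap seen in pasted logs.
NID_PATTERN = re.compile(r"to NID\s+(\d+\.\d+\.\d+\.\d+@\w+)")

# Excerpt of the dmesg output from this post, used only as sample input.
SAMPLE = """\
Lustre: 4872:0:(client.c:1476:ptlrpc_expire_one_request()) @@@ Request
x1373071042674689 sent from MGC192.168.5.104@tcp to NID
192.168.5.104@tcp 5s ago has timed out (5s prior to deadline).
Lustre: 4872:0:(client.c:1476:ptlrpc_expire_one_request()) @@@ Request
x1373071042674691 sent from MGC192.168.5.104@tcp to NID
192.168.5.105@tcp 5s ago has timed out (5s prior to deadline).
"""


def timed_out_nids(dmesg_text):
    """Return the NIDs named in timeout messages, in order, without repeats."""
    nids = []
    for nid in NID_PATTERN.findall(dmesg_text):
        if nid not in nids:
            nids.append(nid)
    return nids


def ping_commands(dmesg_text):
    """Build one 'lctl ping <nid>' command line per timed-out NID."""
    return ["lctl ping " + nid for nid in timed_out_nids(dmesg_text)]


if __name__ == "__main__":
    for command in ping_commands(SAMPLE):
        print(command)
```

After running the printed commands as root, retry the usual mount command.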

David

-- 
Personally, I liked the university. They gave us money and facilities,
we didn't have to produce anything! You've never been out of college!
You don't know what it's like out there! I've worked in the private
sector. They expect results. -Ray Ghostbusters