[lustre-discuss] Frequent, silent OSS hangs on multi-homed system

2019-08-14 Thread Kirk, Benjamin (JSC-EG311)
Hi, I'm love some ideas to debug what has become a frequent annoyance for us. At the high level, we're observing fairly frequent OSS hangs, with absolutely no console or logging activity. Our BMC watchdogs then reboot the OSS and ~6 minutes later everything is back in line. This has been an

Re: [lustre-discuss] Lustre/ZFS snapshots mount error

2018-09-25 Thread Kirk, Benjamin (JSC-EG311)
, Kirk, Benjamin (JSC-EG311) mailto:benjamin.k...@nasa.gov>> wrote: I just opened an LU on the issue https://jira.whamcloud.com/browse/LU-11411 for anyone interested. Thanks a lot! -Ben On Aug 27, 2018, at 4:56 PM, Andreas Dilger mailto:adil...@whamcloud.com>> wrote: It's p

Re: [lustre-discuss] Lustre/ZFS snapshots mount error

2018-09-20 Thread Kirk, Benjamin (JSC-EG311)
s well. That doesn't have to be a separate MGS node, just a separate filesystem (ZFS fileset in the same zpool) on the MDS node. Cheers, Andreas On Aug 27, 2018, at 10:18, Kirk, Benjamin (JSC-EG311) mailto:benjamin.k...@nasa.gov>> wrote: Hi all, We have two filesystems, fsA & fsB (eadc below)

Re: [lustre-discuss] lustre-discuss Digest, Vol 150, Issue 14

2018-09-11 Thread Kirk, Benjamin (JSC-EG311)
Meteorologisches Institut > Ludwig-Maximilians-Universit?t M?nchen > Theresienstr. 37, 80333 M?nchen, Germany > Am 03.09.2018 um 08:16 schrieb Yong, Fan: > I would say that it is not your operations order caused trouble. Instead, it > is related with the snapshot mount logic. As me

Re: [lustre-discuss] Lustre/ZFS snapshots mount error

2018-08-28 Thread Kirk, Benjamin (JSC-EG311)
nder snapshot mode. -- Cheers, Nasf -Original Message- From: lustre-discuss [mailto:lustre-discuss-boun...@lists.lustre.org] On Behalf Of Andreas Dilger Sent: Tuesday, August 28, 2018 5:57 AM To: Kirk, Benjamin (JSC-EG311) mailto:benjamin.k...@nasa.gov>> Cc: lustre-discuss@lists.

[lustre-discuss] Lustre/ZFS snapshots mount error

2018-08-27 Thread Kirk, Benjamin (JSC-EG311)
Hi all, We have two filesystems, fsA & fsB (eadc below). Both of which get snapshots taken daily, rotated over a week. It’s a beautiful feature we’ve been using in production ever since it was introduced with 2.10. -) We’ve got Lustre/ZFS 2.10.4 on CentOS 7.5. -) Both fsA & fsB have