Re: [Lustre-discuss] Failover / reliability using SAD direct-attached storage

2011-07-21 Thread Tyler Hawes
Perhaps that was a freudian slip that I titled the thread "SAD Direct Storage" :) ___ Lustre-discuss mailing list Lustre-discuss@lists.lustre.org http://lists.lustre.org/mailman/listinfo/lustre-discuss

[Lustre-discuss] Failover / reliability using SAD direct-attached storage

2011-07-21 Thread Tyler Hawes
Apologies if this is a bit newbie, but I'm just getting started, really. I'm still in design / testing stage and looking to wrap my head around a few things. I'm most familiar with Fibre Channel storage. As I understand it, you configure a pair of OSS per OST, one actively serving it, the other pa

[Lustre-discuss] Lustre error

2011-07-21 Thread Ekaterina Popova
Hello! We have seen such error on our Lustre client (lustre-2.0.65) today. After that we even can't reboot our machine. Only hard reset helped. Lustre client is installed on working node in clustre. Its CPU usage is high (100%). Several jobs are working on it. How can I solve such problem? What

Re: [Lustre-discuss] IO-Node issue

2011-07-21 Thread DaMiri Young
Hi Wojciech, Stopping heartbeat sounds like a logical next step. Before I do that though I tried a fsck dry run using e2fsprogs v1.14.10 and got: --- # e2fsck -n -v /dev/dm-11 e2fsck 1.41.10.sun2 (24-Feb-2010) device /dev/dm-11 mounted by