Hi,
I am doing research in parallel computing and our cluster is using Lustre. I currently found that if one OST is down, I can not read any part of the file, regardless of the part is or is not located at the failed OST.
One extreme example is when I have 3 OST's: ost1, ost2 and ost3, and one file f is stored on those 3 nodes, beginning with ost1(I used #lfs setstripe /mnt/lustre 65536 0 3). When I tried to read the very first character from f with the fgetc(f), I found that my Lustre client still tries to read from all the 3 OST's. And when I shutdown ost3, the fgetc(f) can not be finished(program halts).
So is it a normal thing or I misconfigured Lustre? Thanks!
I am doing research in parallel computing and our cluster is using Lustre. I currently found that if one OST is down, I can not read any part of the file, regardless of the part is or is not located at the failed OST.
One extreme example is when I have 3 OST's: ost1, ost2 and ost3, and one file f is stored on those 3 nodes, beginning with ost1(I used #lfs setstripe /mnt/lustre 65536 0 3). When I tried to read the very first character from f with the fgetc(f), I found that my Lustre client still tries to read from all the 3 OST's. And when I shutdown ost3, the fgetc(f) can not be finished(program halts).
So is it a normal thing or I misconfigured Lustre? Thanks!
_______________________________________________ Lustre-devel mailing list [EMAIL PROTECTED] https://mail.clusterfs.com/mailman/listinfo/lustre-devel
