Dear all, thanks for all of your replies.
Am 09.03.20 um 13:32 schrieb Andreas Dilger: > It would be better to run a full e2fsck, since that not only rebuilds > the quota tables, but also ensures that the values going into the quota > tables are correct. Since the time taken by "tune2fs -O quota" is > almost the same as running e2fsck, it is better to do it the right way. We already ran e2fsck -f on all LUNs after every crash, so it seems that was all we could do, right? During the last days (since thursday), our Lustre instance was surprisingly stable. We lowered a bit the load by limiting the # of running jobs which might also helped to stablize the system. We enabled kdump, so if another crash is happening anytime soon, we hope to get at least a dump for a hint where the problem is. Thanks again Torsten -- Dr. Torsten Harenberg harenb...@physik.uni-wuppertal.de Bergische Universitaet Fakultät 4 - Physik Tel.: +49 (0)202 439-3521 Gaussstr. 20 Fax : +49 (0)202 439-2811 42097 Wuppertal
smime.p7s
Description: S/MIME Cryptographic Signature
_______________________________________________ lustre-discuss mailing list lustre-discuss@lists.lustre.org http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org