We enabled quotas on two new file systems and are now seeing many
messages like the following in our logs:
Lustre: 31473:0:(quota_interface.c:460:quota_chk_acq_common()) still haven't managed to acquire quota space from the quota master after 10 retries (err=0, rc=0): 2 Time(s)
Looking at the code, the thread is clearly going through the loop at
least 10 times. However, rc is always zero when the message is printed,
so the acquire() call is apparently succeeding on the 10th try. If I'm
reading the code correctly, for that to happen the thread has already
waited at least 45s (cumulatively), which is a long time for us. It
seems that such a long wait would cause other complaints, but we
aren't seeing anything obvious.
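To make the 45s estimate concrete, here is a rough sketch of the
arithmetic. It is only a model, not code from quota_interface.c: it
assumes the thread sleeps i seconds before retry i (a linear backoff
with a 1s step), which is one schedule that happens to add up to 45s
over 10 retries. The function name and the step size are hypothetical.

```python
# Hypothetical backoff model; NOT taken from the Lustre source.
# Assumption: before retry i (i = 0, 1, ..., retries-1) the thread
# sleeps i * step_seconds, i.e. the wait grows linearly per retry.

def cumulative_wait(retries, step_seconds=1):
    """Total seconds slept across `retries` attempts under linear backoff."""
    return sum(i * step_seconds for i in range(retries))


if __name__ == "__main__":
    # Print the running total for a few retry counts.
    for n in (1, 5, 10):
        print(f"after {n:2d} retries: {cumulative_wait(n)}s waited")
```

If the real schedule differs (for example, a fixed or capped sleep),
the total would change accordingly, so this only illustrates how a
cumulative figure of that size could arise.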
Is this normal? Are others seeing the same messages? Is there some
tuning we should be doing?
Note that one of the file systems is 1.8.5 while the other is 2.0.
Thx,
Charlie Taylor
UF HPC Center
_______________________________________________
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss