>Sorry, to get back to this that late but I was busy as hell since the
>beginning of the year.


Thank you for your time!


>Has the issue repeated since then?


Yes, it's happening all the time but meanwhile i wrote a script which is 
monitoring the problem and killing freezed processes when it occurs. But i 
don't like it much, it's not a solution for me :( i also noticed, that problem 
is always affecting the whole server but not so much as freezed cgroup. Depends 
on number of freezed processes, sometimes it has almost no imapct on the rest 
of the server, sometimes the whole server is lagging much.

I have another old problem which is maybe also related to this. I wasn't 
connecting it with this before but now i'm not sure. Two of our servers, which 
are affected by this cgroup problem, are also randomly freezing completely (few 
times per month). These are the symptoms:
 - servers are answering to ping
 - it is possible to connect via SSH but connection is freezed after sending 
the password
 - it is possible to login via console but it is freezed after typeing the login
These symptoms are very similar to HDD problems or HDD overload (but there is 
no overload for sure). The only way to fix it is, probably, hard rebooting the 
server (didn't find any other way). What do you think? Can this be related? 
Maybe HDDs are locked in the similar way the cgroups are - we already found out 
that cgroup freezeing is related also to HDD activity. Maybe there is a little 
chance that the whole HDD subsystem ends in deadlock?


>You said you didn't apply other than the above mentioned patch. Could
>you apply also debugging part of the patches I have sent?
>In case you don't have it handy then it should be this one:


Just to be sure - am i supposed to apply this two patches?
http://watchdog.sk/lkml/patches/


azur
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to