I saw a similar situation when a container had its CPU limit set to 0 via the
cgroup interface.
As a result, the container never got any CPU time, but its processes stayed in
the Running state.
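
In case it helps with the cpu.stat check suggested in the quoted message, here
is a rough sketch for pulling throttled_time out of a cpu.stat file. It assumes
the file uses the stock cgroup v1 cpu controller format (nr_periods,
nr_throttled, throttled_time, with throttled_time in nanoseconds); I have not
verified that on this particular kernel. The function names and the example
path are only illustrative.

```shell
#!/bin/sh
# Print the throttled_time value from cpu.stat-formatted text on stdin.
# Exits non-zero if no throttled_time line is present.
throttled_time_ns() {
    awk '$1 == "throttled_time" { print $2; found = 1 } END { exit !found }'
}

# Print how much throttled_time grew between two saved cpu.stat snapshots.
# A value that keeps growing suggests the group is still being throttled.
throttled_delta_ns() {
    before=$(throttled_time_ns < "$1") || return 1
    after=$(throttled_time_ns < "$2") || return 1
    echo $(( after - before ))
}

# Hypothetical usage on the host (the path layout is an assumption):
#   throttled_time_ns < /proc/vz/fairsched/111/cpu.stat
```

Taking two snapshots a few seconds apart and comparing them says more than a
single reading, since throttled_time is a running total.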

On 04.02.2016 22:02, Kir Kolyshkin wrote:
> Hi Bogdan,
> 
> This looks very much like a cpu scheduler lockup, as many of the processes
> belonging to the container are in R state but not running.
> 
> Can you try resetting the cpulimit for the container in question, something 
> like
> 
> vzctl set $CTID --cpulimit 0
> 
> and see if anything changes?
> 
> Also, could you take a look at cpu.stat for some of the processes that are in
> such a state? Say, this one:
> root      107398  0.0  0.0  25460   396 ?        Rs   12:19   0:00 vzctl exec 111 ps
> 
> cat /proc/vz/fairsched/107398/cpu.stat
> 
> If throttled_time is big, it means my hypothesis makes sense.
> 
> I am also ccing Vladimir, who knows a thing or two about our fair cpu 
> scheduler.
> 
> Kir.
> 
> On 02/04/2016 05:48 AM, Bogdan-Stefan Rotariu wrote:
>> Hi there,
>>
>> We are having issues with one container that cannot be stopped, suspended, or
>> killed; all commands remain in the S or Rs state.
>> Any idea how to stop this container without rebooting the host machine?
>> We tried to kill all the processes, but they do not die.
>>
>>       CTID      NPROC STATUS    IP_ADDR         HOSTNAME
>>        111        100 running   a.b.c.d server.name
>>
>>
>> [3839648.976835] CPT ERR: ffff8803dd109000,111 :foreign process 14243/9892(bash) inside CT (e.g. vzctl enter or vzctl exec).
>> [3839648.976842] CPT ERR: ffff8803dd109000,111 :suspend is impossible now.
>> [3839649.977756] CPT ERR: ffff8803dd109000,111 :foreign process 14243/9892(bash) inside CT (e.g. vzctl enter or vzctl exec).
>> [3839649.977764] CPT ERR: ffff8803dd109000,111 :suspend is impossible now.
>> [3839650.978718] CPT ERR: ffff8803dd109000,111 :foreign process 14243/9892(bash) inside CT (e.g. vzctl enter or vzctl exec).
>> [3839650.978726] CPT ERR: ffff8803dd109000,111 :suspend is impossible now.
>> [3839665.639557] CPT ERR: ffff880839216000,111 :foreign process 14243/9892(bash) inside CT (e.g. vzctl enter or vzctl exec).
>> [3839665.639564] CPT ERR: ffff880839216000,111 :suspend is impossible now.
>> [3839666.640019] CPT ERR: ffff880839216000,111 :foreign process 14243/9892(bash) inside CT (e.g. vzctl enter or vzctl exec).
>>
>> root       19890  0.0  0.0  25460   376 ?        Rs   03:34   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root       39626  0.0  0.0  25460   376 ?        Rs   03:44   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root       65503  0.0  0.0  27560   412 ?        Rs   11:59   0:00 vzctl enter 111
>> root       65508  0.0  0.0  27560   416 ?        Rs   11:59   0:00 vzctl enter 111
>> root       65522  0.0  0.0  27560   416 ?        Rs   11:59   0:00 vzctl enter 111
>> root       73329  0.0  0.0  25460   372 ?        Rs   12:00   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root       73371  0.0  0.0  25460   380 ?        Rs   12:00   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root       74865  0.0  0.0  25464   408 ?        Rs   12:00   0:00 vzctl stop 111
>> root       75864  0.0  0.0  25464   412 ?        Rs   12:04   0:00 vzctl stop 111
>> root       85384  0.0  0.0  25464   404 ?        Rs   12:08   0:00 vzctl stop 111
>> root       96674  0.0  0.0  25464   412 ?        Rs   12:12   0:00 vzctl stop 111
>> root       96787  0.0  0.0  25464   408 ?        Rs   12:13   0:00 vzctl stop 111 --fast
>> root      107300  0.0  0.0  27560   412 ?        Rs   12:18   0:00 vzctl enter 111
>> root      107398  0.0  0.0  25460   396 ?        Rs   12:19   0:00 vzctl exec 111 ps
>> root      116638  0.0  0.0 108168  1368 ?        S    12:21   0:00 sh -c /usr/sbin/vzctl exec 111 cat /proc/meminfo | grep --max-count=1 'MemFree' | awk '{print $2}'
>> root      116639  0.0  0.0  25460  1024 ?        S    12:21   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      116642  0.0  0.0  25460   364 ?        S    12:21   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      116643  0.0  0.0  25460   384 ?        Rs   12:21   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      116650  0.0  0.0  25460   384 ?        Rs   12:21   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      117653  0.0  0.0  25460   380 ?        Rs   12:22   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      117746  0.0  0.0 108168  1368 ?        S    12:22   0:00 sh -c /usr/sbin/vzctl exec 111 cat /proc/meminfo | grep --max-count=1 'MemFree' | awk '{print $2}'
>> root      117747  0.0  0.0 108168  1368 ?        S    12:22   0:00 sh -c /usr/sbin/vzctl exec 111 cat /proc/meminfo | grep --max-count=1 'MemFree' | awk '{print $2}'
>> root      117748  0.0  0.0  25460  1016 ?        S    12:22   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      117749  0.0  0.0  25460  1020 ?        S    12:22   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      117754  0.0  0.0  25460   360 ?        S    12:22   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      117755  0.0  0.0  25460   356 ?        S    12:22   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      117756  0.0  0.0  25460   380 ?        Rs   12:22   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      117757  0.0  0.0  25460   376 ?        Rs   12:22   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      118191  0.0  0.0 108168  1372 ?        S    12:22   0:00 sh -c /usr/sbin/vzctl exec 111 cat /proc/meminfo | grep --max-count=1 'MemTotal' | awk '{print $2}'
>> root      118192  0.0  0.0  25460  1020 ?        S    12:22   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      118195  0.0  0.0  25460   360 ?        S    12:22   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      118196  0.0  0.0  25460   380 ?        Rs   12:22   0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo
>> root      126585  0.0  0.0  25464   408 ?        Rs   12:25   0:00 vzctl stop 111
>> root      129412  0.0  0.0  25464   352 ?        Rs   12:26   0:00 vzctl stop 111
>> root      138146  0.0  0.0  25464   404 ?        Rs   12:28   0:00 vzctl stop 111
>> root      147844  0.0  0.0  25464   408 ?        Rs   12:33   0:00 vzctl stop 111
>> root      157178  0.0  0.0  25464   412 ?        Rs   12:36   0:00 vzctl stop 111
>> root      158300  0.0  0.0  25464   400 ?        Rs   12:39   0:00 vzctl stop 111
>> root      179962  0.0  0.0  25464   408 ?        Rs   12:49   0:00 vzctl stop 111
>> root      180039  0.0  0.0  25464   408 ?        Rs   12:49   0:00 vzctl stop 111
>> root      220918  0.0  0.0  25464   412 ?        Rs   13:04   0:00 vzctl stop 111
>> root      240631  0.0  0.0  25464   408 ?        Rs   13:14   0:00 vzctl stop 111
>> root      247169  0.0  0.0  25464   412 ?        Rs   13:15   0:00 vzctl stop 111
>> root      250371  0.0  0.0  25464   400 ?        Rs   13:19   0:00 vzctl stop 111 --fast
>>
>>
>> _______________________________________________
>> Users mailing list
>> Users@openvz.org
>> https://lists.openvz.org/mailman/listinfo/users
