Hello, Peter.

On Tue, Oct 24, 2017 at 10:35:04AM +0200, Peter Zijlstra wrote:
> > The more I think about showing cpu stat in cgroup.stat, the uglier it
> > seems. 
> 
> I've not been paying much attention to this, could you elaborate on the
> problems there?

Sure, so, on cgroup2, the basic stat collects cpu usage info whether
cpu controller is enabled or not.  As the hot path overhead is always
per-cpu and constant, there's no reason to not to and always having
the information is useful, especially as enabling cpu isn't free of
side-effects.  This is similar to what cpuacct did in cgroup1 but
cgroup2's single hierarchy makes the dedicated controller unnecessary.

The issue at hand with this patch is how the stat is presented when
the controller is not enabled.  There were two alternatives.

1. Make it available in the core file cgroup.stat with subsystem
   prefix ("cpu." for cpu).

   This is easy to implement but a bit ugly because when the
   controller is enabled, the same information is available in two
   places - cgroup.stat and cpu.stat.  Access from userspace becomes
   ugly too especially in cases where we make more fields available in
   basic stat.

2. Make $SUBSYS.stat available whether the controller is enabled or
   not.  Show the basic stat when the controller is disabled, show the
   full thing when enabled.

   This is somewhat more complicated to implement and having a
   subsystem specific file around when the controller is not enabled
   might be a bit confusing.  However, it ensures that a given piece
   of information is available in only one place and makes it less
   awkward to make more information available through basic stat.

The original implementation went for #1 and this patch switches it to
#2.

> > This patch flips it so that "cpu.stat" is always available
> > with basic cpu stat instead.  It only changes the presentation and
> > changes to the scheduler code is minimal.  Will route with the other
> > cpu controller changes through cgroup/for-4.15 unless there are
> > objections.
> 
> And this is -v2 only? I'm a little lost on how all that connects.

Yeap, basic stat is v2 only.  We can't do it against multiple
hierarchies with constant overhead.

Thanks.

-- 
tejun

Reply via email to