Re: [RFC PATCH 1/8] memcg: Enable fine-grained control of over memory.high action

Chris Down Mon, 17 Aug 2020 10:55:22 -0700

Waiman Long writes:

On 8/17/20 10:30 AM, Chris Down wrote:
Astractly, I think this really overcomplicates the API a lot. Ifthese are truly generally useful (and I think that remains to bedemonstrated), they should be additions to the existing API, ratherthan a sidestep with prctl.
This patchset is derived from customer requests. With existing API, Isuppose you mean the memory cgroup API. Right? The reason to useprctl() is that there are users out there who want some kind ofper-process control instead of for a whole group of processes unlessthe users try to create one cgroup per process which is not veryefficient.

If using one cgroup per process is inefficient, then that's what needs to befixed. Making the API extremely complex to reason about for every user isn't agood compromise when we're talking about an already niche use case.

I also worry about some other more concrete things:
1. Doesn't this allow unprivileged applications to potentiallybypass memory.high constraints set by a system administrator?
The memory.high constraint is for triggering memory reclaim. The newmitigation actions introduced by this patchset will only be applied ifmemory reclaim alone fails to limit the physical memory consumption.The current memory cgroup memory reclaim code will not be affected bythis patchset.

memory.high isn't only for triggering memory reclaim, it's also about activethrottling when the application fails to come under. Fundamentally it'ssupposed to indicate the point at which we expect the application to eithercooperate or get forcibly descheduled -- take a look at where we callschedule_timeout_killable.

I really struggle to think about how all of those things should interact inthis patchset.

2. What's the purpose of PR_MEMACT_KILL, compared to memory.max?
A user can use this to specify which processes are less important andcan be sacrificed first instead of the other more important ones incase they are really in a OOM situation. IOW, users can specify theorder where OOM kills can happen.

You can already do that with something like oomd, which has way moreflexibility than this. Why codify this in the kernel instead of in a userspaceagent?

Re: [RFC PATCH 1/8] memcg: Enable fine-grained control of over memory.high action

Reply via email to