Andrey Vagin <[email protected]> writes: > Currently this operation requires the global CAP_SYS_RESOURCE. > It's required, because a task can exceed limits (RLIMIT_DATA, > RLIMIT_STACK). > > So let's allow task to change these parameters if a proper limit is > unlimited. > > When we restore a task we need to set up text, data and data heap sizes > from userspace to the values a task had at checkpoint time. > > Currently we can not restore these parameters, if a task lives in > a non-root user name space, because it has no capabilities in the > parent namespace.
My brain hurts just looking at this patch and how you are justifying it. For the resources you are mucking with below all you have to do is to verify that you are below the appropriate rlimit at all times and no CAP_SYS_RESOURCE check is needed. You only need CAP_SYS_RESOURCE to exceed your per process limits. All you have to do is to fix the current code to properly enforce the limits. This half-assed code that forgets the permission checks if rlimit is set to rlimit_inifinity is wrong. Eric > Cc: Andrew Morton <[email protected]> > Cc: Oleg Nesterov <[email protected]> > Cc: Al Viro <[email protected]> > Cc: Kees Cook <[email protected]> > Cc: "Eric W. Biederman" <[email protected]> > Cc: Stephen Rothwell <[email protected]> > Cc: Pavel Emelyanov <[email protected]> > Cc: Aditya Kali <[email protected]> > Signed-off-by: Andrey Vagin <[email protected]> > --- > kernel/sys.c | 19 +++++++++++++++++-- > 1 file changed, 17 insertions(+), 2 deletions(-) > > diff --git a/kernel/sys.c b/kernel/sys.c > index c0a58be..939370c 100644 > --- a/kernel/sys.c > +++ b/kernel/sys.c > @@ -1701,8 +1701,23 @@ static int prctl_set_mm(int opt, unsigned long addr, > if (arg5 || (arg4 && opt != PR_SET_MM_AUXV)) > return -EINVAL; > > - if (!capable(CAP_SYS_RESOURCE)) > - return -EPERM; > + if (!capable(CAP_SYS_RESOURCE)) { > + switch (opt) { > + case PR_SET_MM_START_DATA: > + case PR_SET_MM_END_DATA: > + case PR_SET_MM_START_BRK: > + case PR_SET_MM_BRK: > + if (rlim < RLIM_INFINITY) > + return -EPERM; > + break; > + case PR_SET_MM_START_STACK: > + if (rlimit(RLIMIT_STACK) < RLIM_INFINITY) > + return -EPERM; > + break; > + default: > + return -EPERM; > + } > + } > > if (opt == PR_SET_MM_EXE_FILE) > return prctl_set_mm_exe_file(mm, (unsigned int)addr); -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [email protected] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/

