Thanks for the hints and the pointers. We found that this
(Thu Aug 6 03:30:01 2015) [sssd[nss]] [id_callback] (0x0010): The Monitor returned an error [org.freedesktop.DBus.Error.NoReply] and this always happens when there are jobs with heavy disc IO and the nodes (see plot attached from this particular node. SGE then goes into error state with wnfg055/messages:08/06/2015 03:26:36| main|wnfg055|E|can't start job "5538749": can't get password entry for user "___". Either user does not exist or error with NIS/LDAP etc. (user name replaced) Is there any way of telling sssd to wait longer for an answer? We already tried to get the load down but it's difficult to identify which jobs are causing this, we have a large variety of users with many different applications. Best regards Torsten -- <><><><><><><><><><><><><><><><><><><><><><><><><><><><><><><><><> <> <> <> Dr. Torsten Harenberg harenb...@physik.uni-wuppertal.de <> <> Bergische Universitaet <> <> FB C - Physik Tel.: +49 (0)202 439-3521 <> <> Gaussstr. 20 Fax : +49 (0)202 439-2811 <> <> 42097 Wuppertal <> <> <> <><><><><><><>< Of course it runs NetBSD http://www.netbsd.org ><> -- Manage your subscription for the Freeipa-users mailing list: https://www.redhat.com/mailman/listinfo/freeipa-users Go to http://freeipa.org for more info on the project