I submitted an array job with -r y. One of the tasks was transferring to a node (state t) when that node went down but despite max_unheard+reschedule_unknown being exceeded neither that task nor another task on the same node was rescheduled. A manual qmod -rq seems to work but just working would be better. Is this a known problem?
William
_______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
