Hi folks,

The aurora-admin drain command currently has a hardcoded limit of 5 minutes for waiting on a node to drain; once that timeout expires, the command fails.

This doesn't work well when tasks are expected to be in killing state for longer than that, for example if the scheduler's transient_task_state_timeout was adjusted. Any objection to making this 5-minute MAX_STATUS_WAIT in aurora-admin drain configurable?

https://github.com/apache/aurora/blob/master/src/main/python/apache/aurora/admin/host_maintenance.py#L40

Thanks,
Mauricio
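
To illustrate the proposal, here is a minimal sketch of a drain wait loop whose timeout is a parameter rather than a constant. It is not the actual host_maintenance.py code: the function name wait_until_drained, its parameters, and the is_drained callback are all hypothetical, and the 5-minute default simply mirrors the current limit so existing behavior would not change.

```python
import time

# Assumption: keep today's 5-minute limit as the default value.
DEFAULT_MAX_STATUS_WAIT_SECS = 5 * 60


def wait_until_drained(is_drained,
                       timeout_secs=DEFAULT_MAX_STATUS_WAIT_SECS,
                       poll_interval_secs=5.0,
                       clock=time.monotonic,
                       sleep=time.sleep):
    """Poll is_drained() until it returns True or timeout_secs elapses.

    is_drained is a hypothetical zero-argument callable that reports
    whether the host has finished draining. clock and sleep are
    injectable so the loop can be tested without real waiting.

    Returns True if the host drained in time, False on timeout.
    """
    deadline = clock() + timeout_secs
    while True:
        if is_drained():
            return True
        if clock() >= deadline:
            return False
        sleep(poll_interval_secs)
```

With something like this, the caller could pass a larger timeout_secs (for example, taken from a command-line option) when transient_task_state_timeout has been raised on the scheduler.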