> You need to give a reproducible example where you cannot just use > --halt 0 and then later --resume-failed when you have fixed the > bug/the input data.
My use case is processing large amounts of data on a heavily-subscribed cluster with *days* of queue time. Re-queueing is expensive. Cheers, Ben
