Hi, we have a custom implementation to provision Jenkins agents. It works pretty well and allows us to provision more than 400 agents and scale down when they finish. We have a known bug that happens when we spin up more than 100 agents at the same time: about 1-2% of the time, a pipeline tries to reuse an agent that is about to be removed, because of the way we take the agent offline and destroy it. To improve this, we are thinking about listening to executor events or node/computer events, but I have not found a way to do that without modifying existing code. The nodes are Swarm nodes. Another solution could be to somehow turn those Swarm agents into one-shot agents. Does anyone have an idea how we can approach this?