ksharlandjiev opened a new pull request, #57753:
URL: https://github.com/apache/airflow/pull/57753

   
   ## Problem
   
   SSM operators currently fail when commands return non-zero exit codes, 
making it impossible to:
   - Route workflows based on different exit codes
   - Handle commands where non-zero exit codes represent valid business states 
(e.g., partial success, warnings)
   - Implement conditional retry logic based on specific exit codes
   - Migrate from traditional schedulers like Autosys that support exit code 
routing
   
   Users have been forced to implement manual polling workarounds with custom 
Python tasks to handle these scenarios.
   
   ## Proposal
   
   Add a `fail_on_nonzero_exit` parameter (default: `True`) to 
`SsmRunCommandOperator`, `SsmRunCommandCompletedSensor`, and 
`SsmRunCommandTrigger`.
   
   When set to `False`:
   - Tasks complete successfully regardless of command exit codes
   - Exit codes can be retrieved with `SsmGetCommandInvocationOperator` for 
routing decisions
   - AWS-level failures (TimedOut, Cancelled) still raise exceptions
   - Command-level failures (non-zero exit codes) are tolerated
   
   The default value of `True` maintains existing behavior for backward 
compatibility.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to