goingforstudying-ctrl opened a new pull request, #68250:
URL: https://github.com/apache/airflow/pull/68250

   Fixes Bug 3 from #68240.
   
   ## What
   
   `StackdriverRemoteLogIO.processors` → `proc()` calls `transport.send()` 
without
   any error handling.  In Airflow 3's supervisor model, `REMOTE_TASK_LOG` 
applies
   to ALL supervised components (scheduler, dag-processor, triggerer, workers).
   A single IAM misconfiguration or gRPC error would crash the entire process.
   
   Observed: dag-processor pod enters `CrashLoopBackOff` on every log emit when 
the
   Kubernetes Service Account lacks the `logging.logEntries.create` IAM binding.
   
   ## Fix
   
   Wrap `_transport.send()` in `try/except Exception` and emit a
   `logging.warning` instead of propagating.  Log delivery is best-effort;
   a Cloud Logging error should never kill a task-executing process.
   
   ## Changes
   
   - Guarded `_transport.send()` call in `proc()` with try/except
   - New test `test_processors_survives_transport_send_failure` verifies:
     - proc() returns the event (doesn't crash)
     - warning is logged with the failure details
   
   relates to #68240


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to