The GitHub Actions job "Nightly (beta)" on flink.git has failed.
Run started by GitHub user github-actions[bot] (triggered by 
github-actions[bot]).

Head commit for run:
3a2324a705d4e36aec3f412cb0c35767a658553a / Robert Young 
<[email protected]>
[FLINK-34569][e2e] fail fast if AWS cli container fails to start (#24491)

* [FLINK-34569][e2e] Fail fast if aws cli container fails to run

Why:
An end-to-end test run failed and in the test logs you could see that the
AWS cli container failed to start. Because of the way it's organised the
failure in the subshell did not cause a failure and AWSCLI_CONTAINER_ID was
empty. This lead to a loop trying to docker exec a command in a container
named "" and the test taking 15 minutes to time out. This change speeds up
the failure.

Note that we use 'return' to prevent an immediate failure of the script so
that we have the potential to implement a simple retry.

Signed-off-by: Robert Young <[email protected]>

* [FLINK-34569][e2e] Add naive retry when creating aws cli container

Why:
An end-to-end test run failed with what looked like a transient network
exception when pulling the aws cli image. This retries once.

Signed-off-by: Robert Young <[email protected]>

* [FLINK-34569][e2e] Remove jq containers after user

Why:
A large pile of exited jq containers were left in docker after
an operation was retried repeatedly.

Signed-off-by: Robert Young <[email protected]>

* [FLINK-34569][e2e] Clean up after failed awscli container run

Why:
If for some reason the command can return a non-zero exit code and also
create a container, this will remove it so we don't have an orphan sitting
stranded.

Signed-off-by: Robert Young <[email protected]>

---------

Signed-off-by: Robert Young <[email protected]>

Report URL: https://github.com/apache/flink/actions/runs/9394040903

With regards,
GitHub Actions via GitBox

Reply via email to