shunping opened a new issue, #37115: URL: https://github.com/apache/beam/issues/37115
### What happened? We have seen recently some flakiness in a few workflows (e.g. https://github.com/apache/beam/actions/runs/20245962422/job/58125988587, https://github.com/apache/beam/issues/30525#issuecomment-3572201375). The following error can be seen if such workflows failed: ``` Current runner version: '2.318.0' ... Error: System.ArgumentOutOfRangeException: Specified argument was out of the range of valid values. (Parameter ''using: node24' is not supported, use 'docker', 'node12', 'node16' or 'node20' instead.') at GitHub.Runner.Worker.ActionManifestManager.ConvertRuns(IExecutionContext executionContext, TemplateContext templateContext, TemplateToken inputsToken, String fileRelativePath, MappingToken outputs) at GitHub.Runner.Worker.ActionManifestManager.Load(IExecutionContext executionContext, String manifestFile) Error: Failed to load gradle/actions/4d9f0ba0025fe599b4ebab900eb7f3a1d93ef4c2/setup-gradle/action.yml ``` The reason is that node24 is added at `2.327.1`, so it failed if an older github runner is used: https://github.com/actions/runner/releases/tag/v2.327.1. Looking at the log, I see that there is actually a updating step when the runner started. If it is successful, we will see: <img width="1836" height="850" alt="Image" src="https://github.com/user-attachments/assets/4fb3bf18-663f-410b-b1f0-5499dc5aab09" /> However, when the workflow is run before the runner finished updating, we see <img width="2692" height="1198" alt="Image" src="https://github.com/user-attachments/assets/af6712cb-0827-45a5-8929-3f138600a0b6" /> After some digging, I see the ARC images for apache-beam-testing were built via github workflow: https://github.com/apache/beam/actions/workflows/build_runner_image.yml and stored at https://pantheon.corp.google.com/artifacts/docker/apache-beam-testing/us-central1/beam-github-actions/beam-arc-runner. However, we are still pinning an old version (`3063b55757509dad1c14751c9f2aa5905826d9a0` created at Aug 10, 2024) at https://github.com/apache/beam/blob/d7cd81a1649bc665581951d2330c4b8acd19ed72/.github/gh-actions-self-hosted-runners/arc/environments/beam.env#L38 ### Issue Failure Failure: Test is flaky ### Issue Priority Priority: 1 (unhealthy code / failing or flaky postcommit so we cannot be sure the product is healthy) ### Issue Components - [ ] Component: Python SDK - [ ] Component: Java SDK - [ ] Component: Go SDK - [ ] Component: Typescript SDK - [ ] Component: IO connector - [ ] Component: Beam YAML - [ ] Component: Beam examples - [ ] Component: Beam playground - [ ] Component: Beam katas - [ ] Component: Website - [ ] Component: Infrastructure - [ ] Component: Spark Runner - [ ] Component: Flink Runner - [ ] Component: Samza Runner - [ ] Component: Twister2 Runner - [ ] Component: Hazelcast Jet Runner - [ ] Component: Google Cloud Dataflow Runner -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
