[ 
https://issues.apache.org/jira/browse/FLINK-21103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17282316#comment-17282316
 ] 

Robert Metzger commented on FLINK-21103:
----------------------------------------

Another case: 
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=13183&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529

this seems suspicious
{code}
2021-02-10T08:31:29.2869114Z Feb 10 08:31:29 Waiting for job to process up to 
1000 records, current progress: 0 records ...
2021-02-10T08:31:36.9183883Z Feb 10 08:31:36 Waiting for job to process up to 
1000 records, current progress: 0 records ...
2021-02-10T08:31:44.4554214Z Feb 10 08:31:44 Waiting for job to process up to 
1000 records, current progress: 0 records ...
2021-02-10T08:31:51.8955990Z Feb 10 08:31:51 Waiting for job to process up to 
1000 records, current progress: 0 records ...
2021-02-10T08:31:59.2788548Z Feb 10 08:31:59 Waiting for job to process up to 
1000 records, current progress: 0 records ...
2021-02-10T08:32:06.5614731Z Feb 10 08:32:06 Waiting for job to process up to 
1000 records, current progress: 0 records ...
2021-02-10T08:32:10.9063618Z 
/home/vsts/work/1/s/flink-end-to-end-tests/test-scripts/test_stream_state_ttl.sh:
 line 78: local: can only be used in a function
2021-02-10T08:32:10.9066458Z Feb 10 08:32:10 Checking for non-empty .out 
files...
2021-02-10T08:32:10.9262590Z Feb 10 08:32:10 No non-empty .out files.
2021-02-10T08:32:10.9358556Z Feb 10 08:32:10 Checking of logs skipped.
2021-02-10T08:32:10.9359359Z Feb 10 08:32:10 
2021-02-10T08:32:10.9361173Z Feb 10 08:32:10 [PASS] 'State TTL Heap backend 
end-to-end test' passed after 95 minutes and 38 seconds! Test exited with exit 
code 0.
{code}

Looks like this test is causing the e2e job to time out.

> E2e tests time out on azure
> ---------------------------
>
>                 Key: FLINK-21103
>                 URL: https://issues.apache.org/jira/browse/FLINK-21103
>             Project: Flink
>          Issue Type: Bug
>          Components: Build System / Azure Pipelines, Tests
>    Affects Versions: 1.13.0
>            Reporter: Dawid Wysakowicz
>            Priority: Major
>              Labels: test-stability
>             Fix For: 1.13.0
>
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=12377&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529
> {code}
> Creating worker2 ... done
> Jan 22 13:16:17 Waiting for hadoop cluster to come up. We have been trying 
> for 0 seconds, retrying ...
> Jan 22 13:16:22 Waiting for hadoop cluster to come up. We have been trying 
> for 5 seconds, retrying ...
> Jan 22 13:16:27 Waiting for hadoop cluster to come up. We have been trying 
> for 10 seconds, retrying ...
> Jan 22 13:16:32 Waiting for hadoop cluster to come up. We have been trying 
> for 15 seconds, retrying ...
> Jan 22 13:16:37 Waiting for hadoop cluster to come up. We have been trying 
> for 20 seconds, retrying ...
> Jan 22 13:16:43 Waiting for hadoop cluster to come up. We have been trying 
> for 26 seconds, retrying ...
> Jan 22 13:16:48 Waiting for hadoop cluster to come up. We have been trying 
> for 31 seconds, retrying ...
> Jan 22 13:16:53 Waiting for hadoop cluster to come up. We have been trying 
> for 36 seconds, retrying ...
> Jan 22 13:16:58 Waiting for hadoop cluster to come up. We have been trying 
> for 41 seconds, retrying ...
> Jan 22 13:17:03 Waiting for hadoop cluster to come up. We have been trying 
> for 46 seconds, retrying ...
> Jan 22 13:17:08 We only have 0 NodeManagers up. We have been trying for 0 
> seconds, retrying ...
> 21/01/22 13:17:10 INFO client.RMProxy: Connecting to ResourceManager at 
> master.docker-hadoop-cluster-network/172.19.0.3:8032
> 21/01/22 13:17:11 INFO client.AHSProxy: Connecting to Application History 
> server at master.docker-hadoop-cluster-network/172.19.0.3:10200
> Jan 22 13:17:11 We now have 2 NodeManagers up.
> ==========================================================================================
> === WARNING: This E2E Run took already 80% of the allocated time budget of 
> 250 minutes ===
> ==========================================================================================
> ======================================================================================================
> === WARNING: This E2E Run will time out in the next few minutes. Starting to 
> upload the log output ===
> ======================================================================================================
> ##[error]The task has timed out.
> Async Command Start: Upload Artifact
> Uploading 1 files
> File upload succeed.
> Upload '/tmp/_e2e_watchdog.output.0' to file container: 
> '#/11824779/e2e-timeout-logs'
> Associated artifact 140921 with build 12377
> Async Command End: Upload Artifact
> Async Command Start: Upload Artifact
> Uploading 1 files
> File upload succeed.
> Upload '/tmp/_e2e_watchdog.output.1' to file container: 
> '#/11824779/e2e-timeout-logs'
> Associated artifact 140921 with build 12377
> Async Command End: Upload Artifact
> Async Command Start: Upload Artifact
> Uploading 1 files
> File upload succeed.
> Upload '/tmp/_e2e_watchdog.output.2' to file container: 
> '#/11824779/e2e-timeout-logs'
> Associated artifact 140921 with build 12377
> Async Command End: Upload Artifact
> Finishing: Run e2e tests
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to