[ https://issues.apache.org/jira/browse/FLINK-26105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17493772#comment-17493772 ]
Matthias Pohl commented on FLINK-26105: --------------------------------------- I updated the title and added additional affected versions because this issue is also present in older versions of Flink. > Rolling log filenames cause end-to-end test to fail (example test failure > "Running HA (hashmap, async)") > -------------------------------------------------------------------------------------------------------- > > Key: FLINK-26105 > URL: https://issues.apache.org/jira/browse/FLINK-26105 > Project: Flink > Issue Type: Bug > Components: Runtime / Coordination > Affects Versions: 1.15.0, 1.13.6, 1.14.3 > Reporter: Yun Gao > Assignee: Matthias Pohl > Priority: Critical > Labels: pull-request-available, test-stability > > {code:java} > Feb 14 01:31:29 Killed TM @ 255483 > Feb 14 01:31:29 Starting new TM. > Feb 14 01:31:42 Killed TM @ 258722 > Feb 14 01:31:42 Starting new TM. > Feb 14 01:32:00 Checking for non-empty .out files... > Feb 14 01:32:00 No non-empty .out files. > Feb 14 01:32:00 FAILURE: A JM did not take over. > Feb 14 01:32:00 One or more tests FAILED. > Feb 14 01:32:00 Stopping job timeout watchdog (with pid=250820) > Feb 14 01:32:00 Killing JM watchdog @ 252644 > Feb 14 01:32:00 Killing TM watchdog @ 253262 > Feb 14 01:32:00 [FAIL] Test script contains errors. > Feb 14 01:32:00 Checking of logs skipped. > Feb 14 01:32:00 > Feb 14 01:32:00 [FAIL] 'Running HA (hashmap, async) end-to-end test' failed > after 2 minutes and 51 seconds! Test exited with exit code 1 > Feb 14 01:32:00 > 01:32:00 ##[group]Environment Information > Feb 14 01:32:01 Searching for .dump, .dumpstream and related files in > '/home/vsts/work/1/s' > dmesg: read kernel buffer failed: Operation not permitted > Feb 14 01:32:06 Stopping taskexecutor daemon (pid: 259377) on host > fv-az313-602. > Feb 14 01:32:07 Stopping standalonesession daemon (pid: 256528) on host > fv-az313-602. > Feb 14 01:32:08 Stopping zookeeper... > Feb 14 01:32:08 Stopping zookeeper daemon (pid: 251023) on host fv-az313-602. > Feb 14 01:32:09 Skipping taskexecutor daemon (pid: 251636), because it is not > running anymore on fv-az313-602. > Feb 14 01:32:09 Skipping taskexecutor daemon (pid: 255483), because it is not > running anymore on fv-az313-602. > Feb 14 01:32:09 Skipping taskexecutor daemon (pid: 258722), because it is not > running anymore on fv-az313-602. > The STDIO streams did not close within 10 seconds of the exit event from > process '/usr/bin/bash'. This may indicate a child process inherited the > STDIO streams and has not yet exited. > ##[error]Bash exited with code '1'. > {code} > https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=31347&view=logs&j=e9d3d34f-3d15-59f4-0e3e-35067d100dfe&t=f8a6d3eb-38cf-5cca-9a99-d0badeb5fe62&l=8020 -- This message was sent by Atlassian Jira (v8.20.1#820001)