[ 
https://issues.apache.org/jira/browse/FLINK-33502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17787253#comment-17787253
 ] 

Matthias Pohl commented on FLINK-33502:
---------------------------------------

You have to download the build artifacts for the corresponding stage (in this
case {{tests}}). The archive contains the {{watchdog}} file, which holds the
CI log content. Aside from that, you have the JUnit fork logs {{mvn-*.log}} (4
since we have four surefire forks). I usually use {{grep -Hirn "<testname>" .}}
to see all occurrences. Usually, that lists the {{watchdog}} file and one
{{mvn-*.log}} file.

Here is the example for the last build failure:
{code:java}
$ unzip logs-ci-test_ci_tests-1699014739.zip
$ grep -Hirn HybridShuffleITCase .
mvn-3.log:103507:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridFullExchangesRestart[enableNewHybridMode=false] is running.
mvn-3.log:104521:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridFullExchangesRestart[enableNewHybridMode=false] successfully run.
mvn-3.log:104525:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridFullExchangesRestart[enableNewHybridMode=true] is running.
mvn-3.log:105557:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridFullExchangesRestart[enableNewHybridMode=true] successfully run.
mvn-3.log:105561:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridSelectiveExchangesRestart[enableNewHybridMode=false] is running.
mvn-3.log:107414:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridSelectiveExchangesRestart[enableNewHybridMode=false] successfully run.
mvn-3.log:107418:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridSelectiveExchangesRestart[enableNewHybridMode=true] is running.
mvn-3.log:109414:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridSelectiveExchangesRestart[enableNewHybridMode=true] successfully run.
mvn-3.log:109418:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridFullExchanges[enableNewHybridMode=false] is running.
mvn-3.log:110391:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridFullExchanges[enableNewHybridMode=false] successfully run.
mvn-3.log:110395:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridFullExchanges[enableNewHybridMode=true] is running.
mvn-3.log:111388:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridFullExchanges[enableNewHybridMode=true] successfully run.
mvn-3.log:111392:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridSelectiveExchanges[enableNewHybridMode=false] is running.
mvn-3.log:112354:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridSelectiveExchanges[enableNewHybridMode=false] successfully run.
mvn-3.log:112358:Test org.apache.flink.test.runtime.HybridShuffleITCase.testHybridSelectiveExchanges[enableNewHybridMode=true] is running.
watchdog:7740:Nov 03 12:47:49 12:47:49.161 [INFO] Running org.apache.flink.test.runtime.HybridShuffleITCase
watchdog:8567:Nov 03 13:14:12 13:14:12.059 [ERROR] org.apache.flink.test.runtime.HybridShuffleITCase
watchdog:8573:Nov 03 13:14:12 13:14:12.059 [ERROR] org.apache.flink.test.runtime.HybridShuffleITCase
watchdog:8610:Nov 03 13:14:12 13:14:12.059 [ERROR] org.apache.flink.test.runtime.HybridShuffleITCase
{code}
Here you see that {{mvn-3.log}} contains the test logs. The last run of
{{HybridShuffleITCase}}
({{testHybridSelectiveExchanges[enableNewHybridMode=true]}}) seems to never
terminate: there is an "is running" line but no matching "successfully run"
line.
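
The manual check described above (a test that logs "is running" without a later "successfully run") could be automated roughly as follows. This is only a sketch: the two message patterns are taken from the grep output in this comment, the function names are made up for illustration, and any other terminal message (for instance one logged on test failure) would need its own pattern.

```python
import re
from pathlib import Path

# Assumed message shapes, copied from the grep output in this comment:
#   "Test <name> is running."
#   "Test <name> successfully run."
STARTED = re.compile(r"Test (.+) is running\.")
FINISHED = re.compile(r"Test (.+) successfully run\.")


def find_unfinished_tests(lines):
    """Return {test name: line number} for tests that started but never finished."""
    running = {}
    for number, line in enumerate(lines, start=1):
        started = STARTED.search(line)
        if started:
            running[started.group(1)] = number
            continue
        finished = FINISHED.search(line)
        if finished:
            # The test completed, so it is no longer a hang candidate.
            running.pop(finished.group(1), None)
    return running


def scan_log_directory(directory="."):
    """Check every mvn-*.log file in the extracted artifact directory."""
    report = {}
    for path in sorted(Path(directory).glob("mvn-*.log")):
        with path.open(errors="replace") as handle:
            unfinished = find_unfinished_tests(handle)
        if unfinished:
            report[path.name] = unfinished
    return report


if __name__ == "__main__":
    for file_name, tests in scan_log_directory().items():
        for test, line_number in tests.items():
            print(f"{file_name}:{line_number}: {test} never finished")
```

Run from the directory where the archive was unzipped, this should point at the same entry the grep output ends on, provided the fork logs use exactly these two messages.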

Does this help? Strangely, other tests pass afterwards which makes the logs 
hard to browse... :-/

> HybridShuffleITCase caused a fatal error
> ----------------------------------------
>
>                 Key: FLINK-33502
>                 URL: https://issues.apache.org/jira/browse/FLINK-33502
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Network
>    Affects Versions: 1.19.0
>            Reporter: Matthias Pohl
>            Priority: Major
>              Labels: test-stability
>
> [https://github.com/XComp/flink/actions/runs/6789774296/job/18458197040#step:12:9177]
> {code:java}
> Error: 21:21:35 21:21:35.379 [ERROR] Error occurred in starting fork, check output in log
> 9168Error: 21:21:35 21:21:35.379 [ERROR] Process Exit Code: 239
> 9169Error: 21:21:35 21:21:35.379 [ERROR] Crashed tests:
> 9170Error: 21:21:35 21:21:35.379 [ERROR] org.apache.flink.test.runtime.HybridShuffleITCase
> 9171Error: 21:21:35 21:21:35.379 [ERROR] org.apache.maven.surefire.booter.SurefireBooterForkException: ExecutionException The forked VM terminated without properly saying goodbye. VM crash or System.exit called?
> 9172Error: 21:21:35 21:21:35.379 [ERROR] Command was /bin/sh -c cd /root/flink/flink-tests && /usr/lib/jvm/jdk-11.0.19+7/bin/java -XX:+UseG1GC -Xms256m -XX:+IgnoreUnrecognizedVMOptions --add-opens=java.base/java.util=ALL-UNNAMED --add-opens=java.base/java.io=ALL-UNNAMED -Xmx1536m -jar /root/flink/flink-tests/target/surefire/surefirebooter10811559899200556131.jar /root/flink/flink-tests/target/surefire 2023-11-07T20-32-50_466-jvmRun4 surefire6242806641230738408tmp surefire_1603959900047297795160tmp
> 9173Error: 21:21:35 21:21:35.379 [ERROR] Error occurred in starting fork, check output in log
> 9174Error: 21:21:35 21:21:35.379 [ERROR] Process Exit Code: 239
> 9175Error: 21:21:35 21:21:35.379 [ERROR] Crashed tests:
> 9176Error: 21:21:35 21:21:35.379 [ERROR] org.apache.flink.test.runtime.HybridShuffleITCase
> 9177Error: 21:21:35 21:21:35.379 [ERROR]      at org.apache.maven.plugin.surefire.booterclient.ForkStarter.awaitResultsDone(ForkStarter.java:532)
> 9178Error: 21:21:35 21:21:35.379 [ERROR]      at org.apache.maven.plugin.surefire.booterclient.ForkStarter.runSuitesForkPerTestSet(ForkStarter.java:479)
> 9179Error: 21:21:35 21:21:35.379 [ERROR]      at org.apache.maven.plugin.surefire.booterclient.ForkStarter.run(ForkStarter.java:322)
> 9180Error: 21:21:35 21:21:35.379 [ERROR]      at org.apache.maven.plugin.surefire.booterclient.ForkStarter.run(ForkStarter.java:266)
> [...] {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
