[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17794241#comment-17794241 ]

Matthias Pohl commented on FLINK-33727:
---------------------------------------

The following build failed before the revert was pushed:
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=55160&view=logs&j=0c940707-2659-5648-cbe6-a1ad63045f0a&t=075c2716-8010-5565-fe08-3c4bb45824a4

> JoinRestoreTest is failing on AZP
> ---------------------------------
>
> Key: FLINK-33727
> URL: https://issues.apache.org/jira/browse/FLINK-33727
> Project: Flink
> Issue Type: Bug
> Components: Table SQL / Planner
> Affects Versions: 1.19.0
> Reporter: Sergey Nuyanzin
> Assignee: Sergey Nuyanzin
> Priority: Critical
> Labels: pull-request-available, test-stability
> Fix For: 1.19.0
>
>
> Since {{JoinRestoreTest}} was introduced in FLINK-33470 it seems to be a reason
> {noformat}
> Dec 02 04:42:26 04:42:26.408 [ERROR] Failures:
> Dec 02 04:42:26 04:42:26.408 [ERROR] JoinRestoreTest>RestoreTestBase.testRestore:283
> Dec 02 04:42:26 Expecting actual:
> Dec 02 04:42:26 ["+I[9, carol, apple, 9000]",
> Dec 02 04:42:26 "+I[8, bill, banana, 8000]",
> Dec 02 04:42:26 "+I[6, jerry, pen, 6000]"]
> Dec 02 04:42:26 to contain exactly in any order:
> Dec 02 04:42:26 ["+I[Adam, null]",
> Dec 02 04:42:26 "+I[Baker, Research]",
> Dec 02 04:42:26 "+I[Charlie, Human Resources]",
> Dec 02 04:42:26 "+I[Charlie, HR]",
> Dec 02 04:42:26 "+I[Don, Sales]",
> Dec 02 04:42:26 "+I[Victor, null]",
> Dec 02 04:42:26 "+I[Helena, Engineering]",
> Dec 02 04:42:26 "+I[Juliet, Engineering]",
> Dec 02 04:42:26 "+I[Ivana, Research]",
> Dec 02 04:42:26 "+I[Charlie, People Operations]"]
> {noformat}
> examples of failures
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=55120&view=logs&j=0c940707-2659-5648-cbe6-a1ad63045f0a&t=075c2716-8010-5565-fe08-3c4bb45824a4&l=12099
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=55129&view=logs&j=a9db68b9-a7e0-54b6-0f98-010e0aff39e2&t=cdd32e0b-6047-565b-c58f-14054472f1be&l=11786
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=55136&view=logs&j=0c940707-2659-5648-cbe6-a1ad63045f0a&t=075c2716-8010-5565-fe08-3c4bb45824a4&l=12099
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=55137&view=logs&j=0c940707-2659-5648-cbe6-a1ad63045f0a&t=075c2716-8010-5565-fe08-3c4bb45824a4&l=11779

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792761#comment-17792761 ]

Dawid Wysakowicz commented on FLINK-33727:
------------------------------------------

Thanks for taking care of it [~Sergey Nuyanzin] and sorry for the problems.
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792651#comment-17792651 ]

Sergey Nuyanzin commented on FLINK-33727:
-----------------------------------------

Reverted in 18b67b104e025b142a8321e5163edf7fbd439580 and 026bd4be9bafce86ced42d2a07e8b8820f7e6d9d
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792599#comment-17792599 ]

Jim Hughes commented on FLINK-33727:
------------------------------------

That works for me; I approved that PR.
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792595#comment-17792595 ]

Sergey Nuyanzin commented on FLINK-33727:
-----------------------------------------

{quote}
I apologize for taking out CI! The order of the merges resulted in PRs not testing all of the programs together in one branch until they were merged. (Which is why I'm willing to suggest disabling all RestoreTests temporarily or reverting the commits which caused the issue.)
{quote}
I noticed that FLINK-33470 deleted {{JoinJsonPlanTest}} and {{JoinJsonPlanITCase}}, and I would guess the idea was to replace them with the new tests. In that case, just disabling the new tests is not an option, because we would end up with the old tests removed while the new ones are still not ready.

If a revert is preferable to the current PR, that is OK. I created a revert PR: https://github.com/apache/flink/pull/23861
If it is OK, we can close the PR for this issue, commit the revert PR, and reopen FLINK-33470, where further work can continue.
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792538#comment-17792538 ]

Jim Hughes commented on FLINK-33727:
------------------------------------

{quote}If so, I'm curious whether it would be more helpful for others to have at least a comment about that in the sources
{quote}
Absolutely! The RestoreTest framework is new, and this discussion shows that there are a number of non-obvious assumptions. [~bvarghese] and I are new to Flink, and as we've worked with it, we have extended it to have the features necessary to test various capabilities.

I apologize for taking out CI! The order of the merges resulted in PRs not testing all of the programs together in one branch until they were merged. (Which is why I'm willing to suggest disabling all RestoreTests temporarily or reverting the commits which caused the issue.)

As an additional improvement to the RestoreTestBase, we could save the SQL text in a file and fail if it is changed. Of course, folks could then still update the test files, which ought to be "immutable" in some sense.
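The idea of saving the SQL text and failing when it changes could look roughly like the sketch below. This is only an illustration in plain Java, not code from `RestoreTestBase`: the class `SavedSqlCheckSketch`, its methods, and the temporary-file location are all hypothetical names chosen for the example.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

/** Hypothetical sketch: store the SQL at generation time, compare at restore time. */
final class SavedSqlCheckSketch {

    /** Creates a scratch file; a real test would use a file checked in next to the plan. */
    static Path newTempFile() {
        try {
            return Files.createTempFile("restore-test", ".sql");
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Generation step: write the SQL text alongside the other test artifacts. */
    static void saveSql(Path file, String sql) {
        try {
            Files.write(file, sql.getBytes(StandardCharsets.UTF_8));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Restore step: true only if the program's SQL still equals the stored text. */
    static boolean sqlUnchanged(Path file, String currentSql) {
        try {
            String saved = new String(Files.readAllBytes(file), StandardCharsets.UTF_8);
            return saved.equals(currentSql);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Path file = newTempFile();
        saveSql(file, "INSERT INTO sink SELECT * FROM src");
        // Same text: the check passes.
        System.out.println(sqlUnchanged(file, "INSERT INTO sink SELECT * FROM src"));
        // Edited text: the check reports a mismatch, so the test could fail loudly.
        System.out.println(sqlUnchanged(file, "INSERT INTO sink SELECT a FROM src"));
    }
}
```

As noted in the comment, such a check only catches edits to the SQL in the test program; it cannot stop someone from also rewriting the saved file.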
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792536#comment-17792536 ]

Sergey Nuyanzin commented on FLINK-33727:
-----------------------------------------

{quote}
Presently, the method `generateTestSetupFiles` is disabled and only run by test developers before a PR is submitted. This method takes the SQL, gets and saves a compiled plan, runs through the beforeRestore data making some comparisons, and finally stops the job and takes a savepoint.
{quote}
If so, I'm curious whether it would be more helpful for others to have at least a comment about that in the sources. IMHO it could help others who edit the surrounding code without being aware of this. Since there is currently no such information in the code, and the code is not part of the enabled tests, it could simply be broken without either the contributor or the reviewer noticing.
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792535#comment-17792535 ]

Jim Hughes commented on FLINK-33727:
------------------------------------

As an alternative to the existing PR and to disabling the RestoreTests, I'm totally fine with you reverting the commits from my PR:

[https://github.com/apache/flink/pull/23680]
[https://github.com/apache/flink/commit/e886dfdda6cd927548c8af0a88e78171e7ba34a8]
[https://github.com/apache/flink/commit/5edc7d7b18e88cc86e84d197202d8cbb40621864]
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792534#comment-17792534 ]

Jim Hughes commented on FLINK-33727:
------------------------------------

{quote}..., however, the tests continue to pass. Is that expected?
{quote}
"Yes", but the reason is a little confusing. The RestoreTest framework has two methods: `generateTestSetupFiles` and `testRestore`.

Presently, the method `generateTestSetupFiles` is disabled and only run by test developers before a PR is submitted. This method takes the SQL, gets and saves a compiled plan, runs through the beforeRestore data making some comparisons, and finally stops the job and takes a savepoint.

The second method uses the compiled plan and the savepoint. Since you are only running the second method, changing the SQL is irrelevant and not tested (unless you manually run `generateTestSetupFiles`).
{quote}What is wrong with the current PR for this JIRA?
{quote}
The CI failure shows that the RestoreTestBase has some limitations/assumptions around state which we need to address. The current PR fixes CI but does not address those; rather, it works around them.

I'd prefer that we fix the limitations rather than work around them. That's why I'm suggesting that we disable the RestoreTests as a whole until Monday, when Dawid and Timo can weigh in.
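The two-phase behaviour described above can be modelled in a few lines of plain Java. This is a toy sketch, not the Flink implementation: only the method names `generateTestSetupFiles` and `testRestore` come from the comment, while the class name, the in-memory map standing in for the saved plan and savepoint files, and the fake "compile" check are assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;

/** Toy model of a two-phase restore test; not the Flink RestoreTestBase API. */
final class TwoPhaseRestoreSketch {

    /** Stands in for the compiled plan and savepoint files saved by phase 1. */
    private final Map<String, String> savedArtifacts = new HashMap<>();

    /** Phase 1: run manually by the test developer; the only phase that reads the SQL. */
    void generateTestSetupFiles(String testName, String sql) {
        // Hypothetical stand-in for real SQL compilation.
        if (!sql.trim().toUpperCase().startsWith("INSERT INTO")) {
            throw new IllegalArgumentException("cannot compile: " + sql);
        }
        savedArtifacts.put(testName, "plan-for[" + sql.trim() + "]");
    }

    /** Phase 2: run in CI; uses only the saved artifacts and ignores the SQL. */
    String testRestore(String testName, String sqlIgnored) {
        String plan = savedArtifacts.get(testName);
        if (plan == null) {
            throw new IllegalStateException("no saved plan for " + testName);
        }
        return plan;
    }

    public static void main(String[] args) {
        TwoPhaseRestoreSketch sketch = new TwoPhaseRestoreSketch();
        sketch.generateTestSetupFiles("join", "INSERT INTO sink SELECT * FROM src");
        // The SQL passed here is broken, but phase 2 never looks at it.
        System.out.println(sketch.testRestore("join", "INSRT INTO snk SELEC"));
    }
}
```

In this model, editing the SQL has no effect on `testRestore` until phase 1 is run again, which matches the observation that a syntax error in the SQL does not make the tests fail.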
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792533#comment-17792533 ]

Sergey Nuyanzin commented on FLINK-33727:
-----------------------------------------

{quote}
I'm pretty sure that if you removed the run SQL, that'd be like removing the section in a JUnit test function which does anything and then asserts that it works. (That'd explain why the tests pass without it.)
{quote}
I don't understand it. As another experiment, I changed the SQL to invalid SQL with a syntax error; however, the tests continue to pass. Is that expected? If so, it is not clear what the reason is for having it, or how to check that it tests what it is expected to test.

{quote}
If you are looking to sort things immediately, I'd suggest adding `Disabled` to `testRestore` here: https://github.com/apache/flink/blob/master/flink-table/flink-table-planner/src/test/java/org/apache/flink/table/planner/plan/nodes/exec/testutils/RestoreTestBase.java#L229
{quote}
What is wrong with the current PR for this JIRA?
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792532#comment-17792532 ]

Jim Hughes commented on FLINK-33727:
------------------------------------

{quote}... It seems it relies on some internal state...
{quote}
The tests have internal state. Your testing shows that it is not being reset between test classes! Thanks for digging into that; that will help us identify what we need to sort out with the RestoreTestBase.

If you are looking to sort things immediately, I'd suggest adding `Disabled` to `testRestore` here: [https://github.com/apache/flink/blob/master/flink-table/flink-table-planner/src/test/java/org/apache/flink/table/planner/plan/nodes/exec/testutils/RestoreTestBase.java#L229]

That'd turn off all of these tests until [~twalthr] [~dwysakowicz] and I have a solution.
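One way state can leak between test classes is sketched below. The thread does not say what the actual mechanism in the RestoreTest framework is, so this is purely an assumed scenario: a JVM-wide static registry of sink rows keyed by table name, with the hypothetical names `SharedSinkStateSketch`, `SINK_RESULTS`, and `MySink`.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Assumed scenario: a static registry shared by every test class in the JVM. */
final class SharedSinkStateSketch {

    /** Hypothetical registry of rows written to each sink table, keyed by table name. */
    private static final Map<String, List<String>> SINK_RESULTS = new HashMap<>();

    static void write(String table, String row) {
        SINK_RESULTS.computeIfAbsent(table, k -> new ArrayList<>()).add(row);
    }

    /** Returns a copy of the rows recorded for the table so far. */
    static List<String> read(String table) {
        return new ArrayList<>(SINK_RESULTS.getOrDefault(table, new ArrayList<>()));
    }

    /** The reset step that a per-class teardown would need to call. */
    static void reset() {
        SINK_RESULTS.clear();
    }

    public static void main(String[] args) {
        // First test class writes to a sink named "MySink".
        write("MySink", "+I[9, carol, apple, 9000]");
        // Second test class reuses the same name without a reset in between.
        write("MySink", "+I[Baker, Research]");
        System.out.println(read("MySink")); // rows from both test classes

        reset();
        write("MySink", "+I[Baker, Research]");
        System.out.println(read("MySink")); // only the second test's row
    }
}
```

Under this assumption, an assertion in the second test class would see rows belonging to the first one, which is the same shape as the mismatch in the failure log (rows from an unrelated test where join results were expected).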
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792531#comment-17792531 ]

Jim Hughes commented on FLINK-33727:
------------------------------------

> is there any reason to have same name?

Yes and no. The SQL text will need to reference the input and output tables. In other restore tests, it makes sense to have functions which generate sources / sinks, so being able to reuse a table name is nice.

> what is the reason to have {{runSql}} this in {{DeduplicationTestPrograms?}}

`runSql` adds the steps which will actually execute something to the TestProgram. I'm pretty sure that if you removed the run SQL, that'd be like removing the section in a JUnit test function which does anything and then asserts that it works. (That'd explain why the tests pass without it.)

In some sense, I see the TestProgram and RestoreTestBase as setting up a Builder/DSL for JUnit tests that a) test things about CompiledPlans and b) make sure that a streaming job can be restored sensibly.
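The Builder/DSL view can be illustrated with a small stand-alone sketch. It is not the Flink `TestProgram` class: apart from `runSql`, which is mentioned in the comment, every name here (`TestProgramSketch`, `setupSource`, `setupSink`, `executableSteps`) is hypothetical.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

/** Hypothetical builder-style test program; only the name runSql is from the thread. */
final class TestProgramSketch {

    final String id;
    final List<String> steps;

    private TestProgramSketch(String id, List<String> steps) {
        this.id = id;
        this.steps = steps;
    }

    static Builder of(String id) {
        return new Builder(id);
    }

    /** Counts the steps that would actually execute a statement. */
    long executableSteps() {
        return steps.stream().filter(s -> s.startsWith("SQL:")).count();
    }

    static final class Builder {
        private final String id;
        private final List<String> steps = new ArrayList<>();

        Builder(String id) {
            this.id = id;
        }

        Builder setupSource(String table) {
            steps.add("SOURCE:" + table);
            return this;
        }

        Builder setupSink(String table) {
            steps.add("SINK:" + table);
            return this;
        }

        /** Adds the step that does the work; without it the program only declares tables. */
        Builder runSql(String sql) {
            steps.add("SQL:" + sql);
            return this;
        }

        TestProgramSketch build() {
            return new TestProgramSketch(id, Collections.unmodifiableList(new ArrayList<>(steps)));
        }
    }

    public static void main(String[] args) {
        TestProgramSketch program = TestProgramSketch.of("dedup")
                .setupSource("src")
                .setupSink("sink")
                .runSql("INSERT INTO sink SELECT * FROM src")
                .build();
        System.out.println(program.steps);
    }
}
```

In this sketch a program built without `runSql` has zero executable steps, which is the analogue of a JUnit test whose body has been removed.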
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792499#comment-17792499 ]

Sergey Nuyanzin commented on FLINK-33727:

[~jhughes] there is also a question about {{DeduplicationRestoreTest}}: what is the reason to have {{runSql}} in {{DeduplicationTestPrograms}}, e.g. for {{org.apache.flink.table.planner.plan.nodes.exec.stream.DeduplicationTestPrograms#DEDUPLICATE}}

{code:java}
.runSql(
        "insert into deduplicate_sink "
                + "select order_id, user, product, order_time \n"
                + "FROM ("
                + " SELECT *,"
                + "ROW_NUMBER() OVER (PARTITION BY product ORDER BY event_time ASC) AS row_num\n"
                + " FROM MyTable)"
                + "WHERE row_num = 1")
{code}

I'm asking because I tried removing it just to see what happens, and the tests kept passing. So from the tests' point of view there seems to be no difference whether {{DeduplicationTestPrograms}} has it or not. Or did I miss anything?
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792458#comment-17792458 ]

Sergey Nuyanzin commented on FLINK-33727:

It seems that {{MySink}} is a kind of "state holder" for the tests...

{quote}
(Right now, the DeduplicationTestPrograms and JoinTestPrograms both use sinks called "MySink".)
{quote}

Is there any reason to have the same name?
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792449#comment-17792449 ]

Sergey Nuyanzin commented on FLINK-33727:

I don't think it is related to concurrent execution. I found a way to reproduce it locally 100% of the time: open IntelliJ IDEA and run all tests for {{RestoreTestBase}}.

Moreover, I started commenting out tests and realised that if at least one test, e.g. {{ExpandRestoreTest}}, runs before {{JoinRestoreTest}}, then {{JoinRestoreTest}} fails 100% of the time, at least in my environment. If I also comment out {{ExpandRestoreTest}}, it starts passing. It seems it relies on some internal state...
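An order dependence of this kind is what one would expect if results were kept in JVM-wide state keyed by table name. The sketch below is only a model of that suspicion: `SharedSinkSketch` and its methods are hypothetical and do not correspond to any Flink class, and the rows are borrowed from the failure log in the issue description.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical model of a JVM-wide sink registry; not a Flink class. */
final class SharedSinkSketch {
    /** Static map shared by every test in the JVM, keyed by sink table name. */
    private static final Map<String, List<String>> RESULTS = new HashMap<>();

    static void write(String sink, String row) {
        RESULTS.computeIfAbsent(sink, k -> new ArrayList<>()).add(row);
    }

    /** Returns a copy of everything written to the sink so far. */
    static List<String> read(String sink) {
        return new ArrayList<>(RESULTS.getOrDefault(sink, new ArrayList<>()));
    }

    /** Drops all recorded rows; a test harness could call this between tests. */
    static void clearAll() {
        RESULTS.clear();
    }

    public static void main(String[] args) {
        // An earlier test writes to "MySink" and never cleans up.
        write("MySink", "+I[9, carol, apple, 9000]");
        // A later test reuses the same name and finds the stale row as well.
        write("MySink", "+I[Adam, null]");
        System.out.println("without reset: " + read("MySink"));

        // Resetting the shared state between tests removes the order dependence.
        clearAll();
        write("MySink", "+I[Adam, null]");
        System.out.println("with reset:    " + read("MySink"));
    }
}
```

In this model the later test only fails when some other test has run first in the same JVM, which matches the reproduction described above without requiring any concurrency.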
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792413#comment-17792413 ]

Jim Hughes commented on FLINK-33727:

From a quick look, the data is coming from `DeduplicationTestPrograms.java`. I believe this shows that the various `RestoreTest`s are being executed concurrently and are interfering with each other.

Two obvious ideas would be:

1. Have each RestoreTest use differently named sinks/sources. (Right now, the DeduplicationTestPrograms and JoinTestPrograms both use sinks called "MySink".)
2. Do something at the JUnit level so that implementations of RestoreTestBase do not run concurrently.

Thoughts?
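The first idea could be approached with a naming convention that derives each sink name from the test program it belongs to. The helper below is a minimal sketch under that assumption; `SinkNamingSketch`, `sinkNameFor` and the program ids are hypothetical and not part of Flink.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Locale;

/** Hypothetical naming helper; the convention is an assumption, not Flink's. */
final class SinkNamingSketch {
    /** Derives a per-program sink name, e.g. "join-inner" becomes "join_inner_sink". */
    static String sinkNameFor(String programId) {
        return programId.toLowerCase(Locale.ROOT).replace('-', '_') + "_sink";
    }

    /** True if no two names in the list are equal. */
    static boolean allDistinct(List<String> names) {
        return new HashSet<>(names).size() == names.size();
    }

    public static void main(String[] args) {
        List<String> shared = List.of("MySink", "MySink");
        List<String> perProgram = List.of(sinkNameFor("deduplicate"), sinkNameFor("join-inner"));
        System.out.println("shared names distinct:      " + allDistinct(shared));
        System.out.println("per-program names distinct: " + allDistinct(perProgram));
        System.out.println(perProgram);
    }
}
```

For the second idea, JUnit 5 offers the `@Execution(ExecutionMode.SAME_THREAD)` and `@ResourceLock` annotations to serialize tests that touch a shared resource. Whether either applies here depends on how the build actually schedules these tests, which this thread does not settle.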
[jira] [Commented] (FLINK-33727) JoinRestoreTest is failing on AZP
[ https://issues.apache.org/jira/browse/FLINK-33727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792391#comment-17792391 ]

Sergey Nuyanzin commented on FLINK-33727:

[~dwysakowicz], [~jhughes] could you please have a look here?