Hi Aman, I created https://issues.apache.org/jira/browse/HIVE-27087 to look into TestMiniSparkOnYarnCliDriver failures. I have a working theory of what might be going on there. I am still investigating what is the right way to fix it though.
Thanks, Vihang On Fri, Feb 10, 2023 at 10:26 AM Aman Raj <raja...@microsoft.com.invalid> wrote: > Hi Vihang, > > Yes the tests are failing locally as well with the same issue. > > Thanks, > Aman. > > Get Outlook for Android<https://aka.ms/AAb9ysg> > ________________________________ > From: Vihang Karajgaonkar <vihang.karajgaon...@databricks.com.INVALID> > Sent: Friday, February 10, 2023 11:22:15 PM > To: dev@hive.apache.org <dev@hive.apache.org> > Subject: Re: [EXTERNAL] Re: Branch-3 backports and build stability > > [You don't often get email from vihang.karajgaon...@databricks.com.invalid. > Learn why this is important at > https://aka.ms/LearnAboutSenderIdentification ] > > Thanks a lot Stamatis for starting this thread. I really appreciate all the > efforts to stabilize branch-3 to get it to a releasable state and I agree > that we should get it to a green state before opening it for PRs not > related to test failures. I can help with the effort as well. > > If we want to get the branch back to green state soon, have we considered > disabling the tests which are clearly flaky? (e.g pass on some builds and > fail on the other build with no new code changes). If we don't do that, we > will keep playing whack a mole with those tests. I propose for such tests > we should disable them and create tickets to unflake them separately. This > will help us get back to a green state faster. > > Hi Aman, > For TestMiniSparkOnYarnCliDriver failures, you probably should also look > into the spark driver/application logs and see if there are infrastructure > errors (e.g OOMs). Are these tests failing when you run locally? > > Thanks, > Vihang > > On Tue, Feb 7, 2023 at 10:05 PM Aman Raj <raja...@microsoft.com.invalid> > wrote: > > > +1, > > Thanks Stamatis and Lazlo for helping in the test case fixes till now. > > > > Team, > > I need help in fixing the following tests in Hive. I have tried different > > approaches but no luck till now. > > I am facing some issues in fixing the following tests : > > org.apache.hadoop.hive.cli.TestMiniSparkOnYarnCliDriver > > > > Issue : > > PREHOOK: Input: default@src > > PREHOOK: Output: default@src > > Failed to monitor Job[-1] with exception > > 'java.lang.IllegalStateException(Connection to remote Spark driver was > > lost)' Last known state = SENT > > Failed to execute spark task, with exception > > 'java.lang.IllegalStateException(RPC channel is closed.)' > > FAILED: Execution Error, return code 1 from > > org.apache.hadoop.hive.ql.exec.spark.SparkTask. RPC channel is closed. > > > > History : > > Initially the tests had failed with errors which I fixed in the following > > task : > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fissues.apache.org%2Fjira%2Fbrowse%2FHIVE-26940&data=05%7C01%7Crajaman%40microsoft.com%7C8ab90a50295341aa10f808db0b8f9959%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C638116483653266848%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=rNJF2%2BdnjYOzBsOn3nQO9UBeVLDctMOvNzJ%2BetpghPA%3D&reserved=0 > > > > Does anyone know what the issue is here ? There are 6-7 failures because > > of this test case. Link to the failed test cases for the stacktrace : > > > https://nam06.safelinks.protection.outlook.com/?url=http%3A%2F%2Fci.hive.apache.org%2Fblue%2Forganizations%2Fjenkins%2Fhive-precommit%2Fdetail%2FPR-3949%2F2%2Ftests%2F&data=05%7C01%7Crajaman%40microsoft.com%7C8ab90a50295341aa10f808db0b8f9959%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C638116483653266848%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=bGQ725R7D6bLTIr7eiTbGlmDNC0WBV2N4j4JRuffed4%3D&reserved=0 > > Thanks, > > Aman. > > > > ________________________________ > > From: László Bodor <bodorlaszlo0...@gmail.com> > > Sent: Tuesday, February 7, 2023 4:46 PM > > To: dev@hive.apache.org <dev@hive.apache.org> > > Subject: [EXTERNAL] Re: Branch-3 backports and build stability > > > > +1 > > also, if I merged something that I thought was for test stability (but > > instead it was a feature), excuse me :) > > for reference, the whole green test initiative is tracked under this > > umbrella: > > > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fissues.apache.org%2Fjira%2Fbrowse%2FHIVE-26836&data=05%7C01%7Crajaman%40microsoft.com%7C8ab90a50295341aa10f808db0b8f9959%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C638116483653266848%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=mISUrifau3G3jMxVC37bhn8Di76hApAUio5%2BCCQzHy4%3D&reserved=0 > > > > Stamatis Zampetakis <zabe...@gmail.com> ezt írta (időpont: 2023. febr. > 7., > > K, 12:09): > > > > > Hi all, > > > > > > The build in branch-3 is not yet green; there are ~25 test failures. It > > is > > > a common practice that we shouldn't push changes on top of a broken > build > > > unless they are addressing test failures. > > > > > > Some people (mainly Aman Raj, Chris Nauroth, and Laszlo Bodor) are > > working > > > hard to stabilize the build for quite some time now. If you want to > help > > > out then start by reviewing, merging, and fixing things around test > > > failures. > > > > > > It's not yet the time to bring new features, upgrades, bugs, etc., in > > > branch-3. I would encourage committers to not approve such changes > till > > we > > > get back to a stable branch. > > > > > > Best, > > > Stamatis > > > > > >