[ 
https://issues.apache.org/jira/browse/HIVE-12538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030446#comment-15030446
 ] 

Nemon Lou commented on HIVE-12538:
----------------------------------

A way to reproduce:
1, open beeline
2, run the following SQL:
{noformat}
create table if not exists test(id int);
set hive.execution.engine=spark;
set spark.yarn.queue=QueueA;
select count(*) from test;
select count(*) from test;
{noformat}
3,check yarn UI,and there will be two yarn applications .

Howerver,changing orders in step 2(the last setting command before actual query 
is not a spark related parameter),the problem will be gone:
{noformat}
create table if not exists test(id int);
set spark.yarn.queue=QueueA;
set hive.execution.engine=spark;
select count(*) from test;
select count(*) from test;
{noformat}



> After set spark related config, SparkSession never get reused
> -------------------------------------------------------------
>
>                 Key: HIVE-12538
>                 URL: https://issues.apache.org/jira/browse/HIVE-12538
>             Project: Hive
>          Issue Type: Bug
>          Components: Spark
>    Affects Versions: 1.3.0
>            Reporter: Nemon Lou
>
> Hive on Spark yarn-cluster mode.
> After setting "set spark.yarn.queue=QueueA;" ,
> run the query "select count(*) from test"  3 times and you will find  3 
> different yarn applications.
> Two of the yarn applications in FINISHED & SUCCEEDED state,and one in RUNNING 
> & UNDEFINED state waiting for next work.
> And if you submit one more "select count(*) from test" ,the third one will be 
> in FINISHED & SUCCEEDED state and a new yarn application will start up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to