[ 
https://issues.apache.org/jira/browse/OOZIE-3439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16774007#comment-16774007
 ] 

Shubham edited comment on OOZIE-3439 at 2/21/19 11:52 AM:
----------------------------------------------------------

[~kmarton],

Hive1 action log files have different log entries, so the pattern is different 
for Hive1 (see https://issues.apache.org/jira/browse/OOZIE-2112).

Hive1:
{code:java}
2019-02-20 14:01:36,055 [main] INFO 
org.apache.hadoop.yarn.client.api.impl.YarnClientImpl - Submitted application 
application_1550671202870_0002
2019-02-20 14:01:46,498 [main] INFO 
org.apache.hadoop.hive.ql.exec.tez.TezSessionPoolManager - The current user: 
hive, session user: hive
2019-02-20 14:01:46,498 [main] INFO 
org.apache.hadoop.hive.ql.exec.tez.TezSessionPoolManager - Current queue name 
is default incoming queue name  

{code}
But for Hive2, we do not have the same log entries for the YARN application.

Hive2:
{code:java}
INFO : Status: Running (Executing on YARN cluster with App id 
application_1550671202870_0004)

ESC[2K--------------------------------------------------------------------------------
ESC[2KESC[36;1m VERTICES STATUS TOTAL COMPLETED RUNNING PENDING FAILED KILLED
ESC[22;0mESC[2K--------------------------------------------------------------------------------
ESC[2KMap 1 .......... SUCCEEDED 1 1 0 0 0 0
ESC[2K--------------------------------------------------------------------------------
ESC[2KESC[31;1mVERTICES: 01/01 [==========================>>] 100% ELAPSED 
TIME: 5.10 s 
ESC[22;0mESC[2K------------------------------------------------------------

{code}
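
To make the difference concrete, here is a minimal sketch that runs the three existing Hive2 patterns (copied from the issue description) against one line from each log excerpt above. The class name `ExistingPatternCheck` and the helper `firstMatch` are hypothetical, and the sketch assumes the log is scanned line by line with `Matcher.find()`; it is not Oozie's actual parsing code. The wrapped log lines are joined back into single lines.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class ExistingPatternCheck {
    // Patterns copied from the issue description (HIVE2_JOB_IDS_PATTERNS).
    static final Pattern[] PATTERNS = {
        Pattern.compile("Ended Job = (job_\\S*)"),
        Pattern.compile("Submitted application (application[0-9_]*)"),
        Pattern.compile("Running with YARN Application = (application[0-9_]*)")
    };

    // Hypothetical helper: returns the first captured ID in the line,
    // or null when none of the patterns is found.
    static String firstMatch(String line) {
        for (Pattern p : PATTERNS) {
            Matcher m = p.matcher(line);
            if (m.find()) {
                return m.group(1);
            }
        }
        return null;
    }

    public static void main(String[] args) {
        String hive1Line = "2019-02-20 14:01:36,055 [main] INFO "
                + "org.apache.hadoop.yarn.client.api.impl.YarnClientImpl - "
                + "Submitted application application_1550671202870_0002";
        String hive2Line = "INFO : Status: Running (Executing on YARN cluster "
                + "with App id application_1550671202870_0004)";

        System.out.println("Hive1 line: " + firstMatch(hive1Line));
        System.out.println("Hive2 line: " + firstMatch(hive2Line));
    }
}
```

Under these assumptions the Hive1 line yields an application ID through the "Submitted application" pattern, while the Hive2 line yields nothing, which is consistent with the missing child job URL.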
 

 


> Hive2 action is not parsing application ID for TEZ from log file properly
> -------------------------------------------------------------------------
>
>                 Key: OOZIE-3439
>                 URL: https://issues.apache.org/jira/browse/OOZIE-3439
>             Project: Oozie
>          Issue Type: Bug
>          Components: action
>    Affects Versions: trunk
>            Reporter: Shubham
>            Assignee: Shubham
>            Priority: Major
>         Attachments: OOZIE-3439-001.patch
>
>
> The Oozie workflow does not populate ChildJobUrl for the Hive2 action, while 
> the Hive1 action is able to find child job IDs.
> I looked at the code and found that the pattern is not correct for the Hive2 
> action logs generated in usercache.
> {code:java}
> static final Pattern[] HIVE2_JOB_IDS_PATTERNS = {
>     Pattern.compile("Ended Job = (job_\\S*)"),
>     Pattern.compile("Submitted application (application[0-9_]*)"),
>     Pattern.compile("Running with YARN Application = (application[0-9_]*)")
> };
> {code}
> Adding the pattern below should help extract the Tez application ID for the 
> Hive2 action:
> {code:java}
> Pattern.compile("Executing on YARN cluster with App id (application[0-9_]*)"),
> {code}
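
As a quick check of the proposed pattern, the following sketch applies it to the Hive2 status line quoted in the comment on this issue. The class name `ProposedPatternCheck` and the helper `extractAppId` are hypothetical and exist only for illustration; the sketch assumes a `Matcher.find()` search over a single log line and does not reproduce how Oozie wires the pattern into HIVE2_JOB_IDS_PATTERNS.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class ProposedPatternCheck {
    // Pattern proposed in the issue description.
    static final Pattern TEZ_APP_ID = Pattern.compile(
            "Executing on YARN cluster with App id (application[0-9_]*)");

    // Hypothetical helper: returns the captured application ID,
    // or null when the line does not contain the expected text.
    static String extractAppId(String line) {
        Matcher m = TEZ_APP_ID.matcher(line);
        return m.find() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        String hive2Line = "INFO : Status: Running (Executing on YARN cluster "
                + "with App id application_1550671202870_0004)";
        System.out.println(extractAppId(hive2Line));
    }
}
```

Because the character class `[0-9_]` excludes the closing parenthesis, the captured group stops at the end of the application ID rather than swallowing the trailing `)`.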



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
