[ https://issues.apache.org/jira/browse/OOZIE-3439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16774007#comment-16774007 ]
Shubham edited comment on OOZIE-3439 at 2/21/19 11:52 AM:
----------------------------------------------------------

[~kmarton], Hive1 action log files have different log entries, so the pattern is different for Hive1 (https://issues.apache.org/jira/browse/OOZIE-2112).

Hive1:
{code:java}
2019-02-20 14:01:36,055 [main] INFO org.apache.hadoop.yarn.client.api.impl.YarnClientImpl - Submitted application application_1550671202870_0002
2019-02-20 14:01:46,498 [main] INFO org.apache.hadoop.hive.ql.exec.tez.TezSessionPoolManager - The current user: hive, session user: hive
2019-02-20 14:01:46,498 [main] INFO org.apache.hadoop.hive.ql.exec.tez.TezSessionPoolManager - Current queue name is default incoming queue name
{code}

But for Hive2, we do not have the same log entries for the YARN application.

Hive2:
{code:java}
INFO : Status: Running (Executing on YARN cluster with App id application_1550671202870_0004)
ESC[2K--------------------------------------------------------------------------------
ESC[2KESC[36;1m VERTICES STATUS TOTAL COMPLETED RUNNING PENDING FAILED KILLED
ESC[22;0mESC[2K--------------------------------------------------------------------------------
ESC[2KMap 1 .......... SUCCEEDED 1 1 0 0 0 0
ESC[2K--------------------------------------------------------------------------------
ESC[2KESC[31;1mVERTICES: 01/01 [==========================>>] 100% ELAPSED TIME: 5.10 s
ESC[22;0mESC[2K------------------------------------------------------------
{code}

> Hive2 action is not parsing application ID for TEZ from log file properly
> -------------------------------------------------------------------------
>
>                 Key: OOZIE-3439
>                 URL: https://issues.apache.org/jira/browse/OOZIE-3439
>             Project: Oozie
>          Issue Type: Bug
>          Components: action
>    Affects Versions: trunk
>            Reporter: Shubham
>            Assignee: Shubham
>            Priority: Major
>         Attachments: OOZIE-3439-001.patch
>
>
> Oozie workflow does not populate ChildJobUrl for the Hive2 action, while the Hive1
> action is able to find child job ids.
> I looked at the code and found that the pattern is not correct for Hive2 action
> logs generated in usercache.
> {code:java}
> static final Pattern[] HIVE2_JOB_IDS_PATTERNS = {
>     Pattern.compile("Ended Job = (job_\\S*)"),
>     Pattern.compile("Submitted application (application[0-9_]*)"),
>     Pattern.compile("Running with YARN Application = (application[0-9_]*)")
> };
> {code}
> Adding the pattern below should help in getting the Hive2 action's Tez application id:
> {code:java}
> Pattern.compile("Executing on YARN cluster with App id (application[0-9_]*)"),
> {code}
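
To illustrate the proposal, here is a minimal standalone sketch that runs the three existing patterns plus the proposed fourth one against the log lines quoted in the comment. The class name `Hive2AppIdSketch` and the helper `extractIds` are hypothetical and exist only for this example; they are not Oozie's actual log-parsing code, which may collect IDs differently.

```java
import java.util.LinkedHashSet;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical demo class, not part of Oozie.
final class Hive2AppIdSketch {

    // The three patterns quoted in the issue, plus the proposed fourth one.
    static final Pattern[] HIVE2_JOB_IDS_PATTERNS = {
        Pattern.compile("Ended Job = (job_\\S*)"),
        Pattern.compile("Submitted application (application[0-9_]*)"),
        Pattern.compile("Running with YARN Application = (application[0-9_]*)"),
        Pattern.compile("Executing on YARN cluster with App id (application[0-9_]*)")
    };

    // Hypothetical helper: scans the log line by line and returns every
    // captured job or application ID, in order of first appearance.
    static Set<String> extractIds(String log) {
        Set<String> ids = new LinkedHashSet<>();
        for (String line : log.split("\\R")) {
            for (Pattern pattern : HIVE2_JOB_IDS_PATTERNS) {
                Matcher matcher = pattern.matcher(line);
                if (matcher.find()) {
                    ids.add(matcher.group(1));
                }
            }
        }
        return ids;
    }

    public static void main(String[] args) {
        // Hive2 log line taken from the comment above.
        String hive2Line =
            "INFO : Status: Running (Executing on YARN cluster with App id application_1550671202870_0004)";
        System.out.println(extractIds(hive2Line));
    }
}
```

With only the first three patterns, the Hive2 line above yields no match, which is consistent with ChildJobUrl staying empty; the fourth pattern captures the ID because `[0-9_]*` stops at the closing parenthesis.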