[ 
https://issues.apache.org/jira/browse/HIVE-24758?focusedWorklogId=550835&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-550835
 ]

ASF GitHub Bot logged work on HIVE-24758:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 10/Feb/21 18:40
            Start Date: 10/Feb/21 18:40
    Worklog Time Spent: 10m 
      Work Description: belugabehr commented on a change in pull request #1963:
URL: https://github.com/apache/hive/pull/1963#discussion_r573979578



##########
File path: ql/src/java/org/apache/hadoop/hive/ql/exec/tez/TezTask.java
##########
@@ -236,6 +239,10 @@ public int execute() {
           throw new HiveException("Operation cancelled");
         }
 
+        // Log all the info required to find the various logs for this query
+        LOG.info("HS2 Host: [{}], Query ID: [{}], Dag ID: [{}], DAG Session ID: [{}]", getHostNameIP(), queryId,

Review comment:
       Hey @pgaref, thanks for the valuable input.
   
   On one hand, logging the host name may seem redundant: anyone reading this 
log file already knows which host wrote it.  However, as I understand the code, 
these log messages are also sent (redirected) to the client via the Thrift RPC 
APIs.  That makes the host name useful for client-side debugging, because it is 
not otherwise clear which instance of HS2 is processing the query when, for 
example, a load balancer sits between the client and HS2.  I had thought about 
including some sort of unique HS2 ID as well, but I did not find such a 
capability in the project already and did not want to introduce one here.
   
   Together, these 4 pieces of information allow a client to report a problem 
to the admin and allow the admin to grab all the log files: the HS2 logs and 
the YARN Tez DAG logs.
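
   To illustrate the point, here is a minimal, standalone sketch of the kind of 
line the change emits. It is not the code in the pull request: the class name 
`QueryLogLocator`, the helper `hostNameIp()`, and all ID values are 
hypothetical, and it uses `String.format` rather than the project's logger so 
that it runs with the JDK alone.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

// Hypothetical sketch; names and values are illustrative, not taken from Hive.
public class QueryLogLocator {

  // Returns "hostname/ip" for the local machine, or "unknown" if it cannot be resolved.
  static String hostNameIp() {
    try {
      return InetAddress.getLocalHost().toString();
    } catch (UnknownHostException e) {
      return "unknown";
    }
  }

  // Builds a single line carrying the four identifiers needed to locate the logs.
  static String format(String host, String queryId, String dagId, String dagSessionId) {
    return String.format(
        "HS2 Host: [%s], Query ID: [%s], Dag ID: [%s], DAG Session ID: [%s]",
        host, queryId, dagId, dagSessionId);
  }

  public static void main(String[] args) {
    // Placeholder IDs; a real run would take these from the query and the Tez session.
    System.out.println(format(hostNameIp(), "query_example", "dag_example", "session_example"));
  }
}
```

   A client that receives this one line can hand all four values to the admin 
without knowing which HS2 instance sat behind the load balancer.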




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 550835)
    Time Spent: 1h 10m  (was: 1h)

> Log Tez Task DAG ID, DAG Session ID, HS2 Hostname
> -------------------------------------------------
>
>                 Key: HIVE-24758
>                 URL: https://issues.apache.org/jira/browse/HIVE-24758
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: David Mollitor
>            Assignee: David Mollitor
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> In order to get the logs for a particular query, submitted to Tez on YARN, 
> the following pieces of information are required:
> * YARN Application ID
> * TEZ DAG ID
> * HS2 Host that ran the job
> Include this information in TezTask output.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
