Joe McDonnell created IMPALA-12639:
--------------------------------------

             Summary: Divert the output of the Hive warm up statement during dataload
                 Key: IMPALA-12639
                 URL: https://issues.apache.org/jira/browse/IMPALA-12639
             Project: IMPALA
          Issue Type: Improvement
          Components: Infrastructure
    Affects Versions: Impala 4.4.0
            Reporter: Joe McDonnell
            Assignee: Joe McDonnell


During dataload, testdata/bin/create-load-data.sh runs a couple of Hive 
statements to warm up Hive. These produce hundreds of lines of output on the 
console (see the excerpt below). That output should be diverted to a log file 
to avoid the noise. Using the run-step function is the standard way to do that.
{noformat}
20:37:29 Running warm up Hive statements
20:37:30 SLF4J: Class path contains multiple SLF4J bindings.
20:37:30 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/apache-hive-3.1.3000.7.2.18.0-369-bin/lib/log4j-slf4j-impl-2.18.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:30 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/hadoop-3.1.1.7.2.18.0-369/share/hadoop/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:30 SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
20:37:31 SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
20:37:32 SLF4J: Class path contains multiple SLF4J bindings.
20:37:32 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/apache-hive-3.1.3000.7.2.18.0-369-bin/lib/log4j-slf4j-impl-2.18.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:32 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/hadoop-3.1.1.7.2.18.0-369/share/hadoop/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:32 SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
20:37:32 SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
20:37:32 Connecting to jdbc:hive2://localhost:11050/default;
20:37:32 Connected to: Apache Hive (version 3.1.3000.7.2.18.0-369)
20:37:32 Driver: Hive JDBC (version 3.1.3000.7.2.18.0-369)
20:37:32 Transaction isolation: TRANSACTION_REPEATABLE_READ
20:37:34 INFO  : Compiling 
command(queryId=jenkins_20231214203732_f74ae90f-84e3-44ef-ae4e-d71d14be1326): 
create database if not exists functional
20:37:34 INFO  : Semantic Analysis Completed (retrial = false)
20:37:34 INFO  : Created Hive schema: Schema(fieldSchemas:null, properties:null)
20:37:34 INFO  : Completed compiling 
command(queryId=jenkins_20231214203732_f74ae90f-84e3-44ef-ae4e-d71d14be1326); 
Time taken: 1.139 seconds
20:37:34 INFO  : Executing 
command(queryId=jenkins_20231214203732_f74ae90f-84e3-44ef-ae4e-d71d14be1326): 
create database if not exists functional
20:37:34 INFO  : Starting task [Stage-0:DDL] in serial mode
20:37:34 INFO  : Completed executing 
command(queryId=jenkins_20231214203732_f74ae90f-84e3-44ef-ae4e-d71d14be1326); 
Time taken: 0.226 seconds
20:37:34 INFO  : OK
20:37:34 No rows affected (1.572 seconds)
20:37:34 Beeline version 3.1.3000.7.2.18.0-369 by Apache Hive
20:37:34 Closing: 0: jdbc:hive2://localhost:11050/default;
20:37:35 SLF4J: Class path contains multiple SLF4J bindings.
20:37:35 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/apache-hive-3.1.3000.7.2.18.0-369-bin/lib/log4j-slf4j-impl-2.18.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:35 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/hadoop-3.1.1.7.2.18.0-369/share/hadoop/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:35 SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
20:37:35 SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
20:37:36 SLF4J: Class path contains multiple SLF4J bindings.
20:37:36 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/apache-hive-3.1.3000.7.2.18.0-369-bin/lib/log4j-slf4j-impl-2.18.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:36 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/hadoop-3.1.1.7.2.18.0-369/share/hadoop/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:36 SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
20:37:36 SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
20:37:37 Connecting to jdbc:hive2://localhost:11050/default;
20:37:37 Connected to: Apache Hive (version 3.1.3000.7.2.18.0-369)
20:37:37 Driver: Hive JDBC (version 3.1.3000.7.2.18.0-369)
20:37:37 Transaction isolation: TRANSACTION_REPEATABLE_READ
20:37:37 INFO  : Compiling 
command(queryId=jenkins_20231214203737_686a63ee-b02a-4760-949a-ac10809853a7): 
create table if not exists hive_warm_up_tbl (i int)
20:37:37 INFO  : Semantic Analysis Completed (retrial = false)
20:37:37 INFO  : Created Hive schema: Schema(fieldSchemas:null, properties:null)
20:37:37 INFO  : Completed compiling 
command(queryId=jenkins_20231214203737_686a63ee-b02a-4760-949a-ac10809853a7); 
Time taken: 0.075 seconds
20:37:37 INFO  : Executing 
command(queryId=jenkins_20231214203737_686a63ee-b02a-4760-949a-ac10809853a7): 
create table if not exists hive_warm_up_tbl (i int)
20:37:37 INFO  : Starting task [Stage-0:DDL] in serial mode
20:37:37 INFO  : Completed executing 
command(queryId=jenkins_20231214203737_686a63ee-b02a-4760-949a-ac10809853a7); 
Time taken: 0.101 seconds
20:37:37 INFO  : OK
20:37:37 No rows affected (0.253 seconds)
20:37:37 Beeline version 3.1.3000.7.2.18.0-369 by Apache Hive
20:37:37 Closing: 0: jdbc:hive2://localhost:11050/default;
20:37:38 SLF4J: Class path contains multiple SLF4J bindings.
20:37:38 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/apache-hive-3.1.3000.7.2.18.0-369-bin/lib/log4j-slf4j-impl-2.18.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:38 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/hadoop-3.1.1.7.2.18.0-369/share/hadoop/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:38 SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
20:37:38 SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
20:37:40 SLF4J: Class path contains multiple SLF4J bindings.
20:37:40 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/apache-hive-3.1.3000.7.2.18.0-369-bin/lib/log4j-slf4j-impl-2.18.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:40 SLF4J: Found binding in 
[jar:file:/data0/jenkins/workspace/impala-private-basic-parameterized/Impala-Toolchain/cdp_components-45689292/hadoop-3.1.1.7.2.18.0-369/share/hadoop/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
20:37:40 SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
20:37:40 SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]
20:37:40 Connecting to jdbc:hive2://localhost:11050/default;
20:37:40 Connected to: Apache Hive (version 3.1.3000.7.2.18.0-369)
20:37:40 Driver: Hive JDBC (version 3.1.3000.7.2.18.0-369)
20:37:40 Transaction isolation: TRANSACTION_REPEATABLE_READ
20:37:50 INFO  : Compiling 
command(queryId=jenkins_20231214203740_c0783dce-45c9-469e-b087-2b7f2f9ab77f): 
insert overwrite table hive_warm_up_tbl values (1)
20:37:50 INFO  : Semantic Analysis Completed (retrial = false)
20:37:50 INFO  : Created Hive schema: 
Schema(fieldSchemas:[FieldSchema(name:col1, type:int, comment:null)], 
properties:null)
20:37:50 INFO  : Completed compiling 
command(queryId=jenkins_20231214203740_c0783dce-45c9-469e-b087-2b7f2f9ab77f); 
Time taken: 0.949 seconds
20:37:50 INFO  : Executing 
command(queryId=jenkins_20231214203740_c0783dce-45c9-469e-b087-2b7f2f9ab77f): 
insert overwrite table hive_warm_up_tbl values (1)
20:37:50 INFO  : Query ID = 
jenkins_20231214203740_c0783dce-45c9-469e-b087-2b7f2f9ab77f
20:37:50 INFO  : Total jobs = 3
20:37:50 INFO  : Launching Job 1 out of 3
20:37:50 INFO  : Starting task [Stage-1:MAPRED] in serial mode
20:37:50 INFO  : Subscribed to counters: [] for queryId: 
jenkins_20231214203740_c0783dce-45c9-469e-b087-2b7f2f9ab77f
20:37:50 INFO  : Tez session hasn't been created yet. Opening session
20:37:50 INFO  : Dag name: insert overwrite table hive_warm_up_tb...(1) 
(Stage-1)
20:37:50 INFO  : HS2 Host: 
[impala-ec2-centos79-m6i-4xlarge-xldisk-0293.vpc.cloudera.com], Query ID: 
[jenkins_20231214203740_c0783dce-45c9-469e-b087-2b7f2f9ab77f], Dag ID: 
[dag_1702614944517_0001_1], DAG Session ID: [application_1702614944517_0001]
20:37:50 INFO  : Status: Running (Executing on YARN cluster with App id 
application_1702614944517_0001)
20:37:50 
20:37:51 
----------------------------------------------------------------------------------------------
20:37:51         VERTICES      MODE        STATUS  TOTAL  COMPLETED  
RUNNING  PENDING  FAILED  KILLED  
20:37:51 
----------------------------------------------------------------------------------------------
20:37:51 Map 1            container        INITED      1          0        
0        1       0       0  
20:37:51 
----------------------------------------------------------------------------------------------
20:37:51 VERTICES: 00/01  [>>--------------------------] 0%    
ELAPSED TIME: 2.74 s     
20:37:51 
----------------------------------------------------------------------------------------------
20:37:52 
----------------------------------------------------------------------------------------------
20:37:52         VERTICES      MODE        STATUS  TOTAL  COMPLETED  
RUNNING  PENDING  FAILED  KILLED  
20:37:52 
----------------------------------------------------------------------------------------------
20:37:52 Map 1            container        INITED      1          0        
0        1       0       0  
20:37:52 
----------------------------------------------------------------------------------------------
20:37:52 VERTICES: 00/01  [>>--------------------------] 0%    
ELAPSED TIME: 3.74 s     
20:37:52 
----------------------------------------------------------------------------------------------
20:37:53 
----------------------------------------------------------------------------------------------
20:37:53         VERTICES      MODE        STATUS  TOTAL  COMPLETED  
RUNNING  PENDING  FAILED  KILLED  
20:37:53 
----------------------------------------------------------------------------------------------
20:37:53 Map 1            container       RUNNING      1          0        
1        0       0       0  
20:37:53 
----------------------------------------------------------------------------------------------
20:37:53 VERTICES: 00/01  [>>--------------------------] 0%    
ELAPSED TIME: 4.74 s     
20:37:53 
----------------------------------------------------------------------------------------------
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54         VERTICES      MODE        STATUS  TOTAL  COMPLETED  
RUNNING  PENDING  FAILED  KILLED  
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54 Map 1 .......... container     SUCCEEDED      1          1        
0        0       0       0  
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54 VERTICES: 01/01  [==========================>>] 100%  
ELAPSED TIME: 5.75 s     
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54         VERTICES      MODE        STATUS  TOTAL  COMPLETED  
RUNNING  PENDING  FAILED  KILLED  
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54 Map 1 .......... container     SUCCEEDED      1          1        
0        0       0       0  
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54 VERTICES: 01/01  [==========================>>] 100%  
ELAPSED TIME: 5.88 s     
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54 INFO  : Starting task [Stage-7:CONDITIONAL] in serial mode
20:37:54 INFO  : Stage-4 is selected by condition resolver.
20:37:54 INFO  : Stage-3 is filtered out by condition resolver.
20:37:54 INFO  : Stage-5 is filtered out by condition resolver.
20:37:54 INFO  : Starting task [Stage-4:MOVE] in serial mode
20:37:54 INFO  : Moving data to directory 
hdfs://localhost:20500/test-warehouse/hive_warm_up_tbl/.hive-staging_hive_2023-12-14_20-37-40_924_4404023279633860277-1/-ext-10000
 from 
hdfs://localhost:20500/test-warehouse/hive_warm_up_tbl/.hive-staging_hive_2023-12-14_20-37-40_924_4404023279633860277-1/-ext-10002
20:37:54 INFO  : Starting task [Stage-2:DEPENDENCY_COLLECTION] in serial mode
20:37:54 INFO  : Starting task [Stage-0:MOVE] in serial mode
20:37:54 INFO  : Loading data to table default.hive_warm_up_tbl from 
hdfs://localhost:20500/test-warehouse/hive_warm_up_tbl/.hive-staging_hive_2023-12-14_20-37-40_924_4404023279633860277-1/-ext-10000
20:37:54 INFO  : Completed executing 
command(queryId=jenkins_20231214203740_c0783dce-45c9-469e-b087-2b7f2f9ab77f); 
Time taken: 12.35 seconds
20:37:54 INFO  : OK
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54         VERTICES      MODE        STATUS  TOTAL  COMPLETED  
RUNNING  PENDING  FAILED  KILLED  
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54 Map 1 .......... container     SUCCEEDED      1          1        
0        0       0       0  
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54 VERTICES: 01/01  [==========================>>] 100%  
ELAPSED TIME: 5.88 s     
20:37:54 
----------------------------------------------------------------------------------------------
20:37:54 1 row affected (13.325 seconds)
20:37:54 Beeline version 3.1.3000.7.2.18.0-369 by Apache Hive
20:37:54 Closing: 0: jdbc:hive2://localhost:11050/default;{noformat}
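
As a rough illustration of the diversion proposed above, here is a minimal bash sketch. It does not reproduce Impala's actual run-step implementation: run_step_sketch, fake_hive_warm_up, and the hive-warm-up.log file name are hypothetical stand-ins, and the real warm-up would invoke Hive rather than echo sample lines. The idea is only that the console gets a one-line summary while stdout and stderr go to a log file.

```shell
#!/usr/bin/env bash
# Sketch only: run_step_sketch is a hypothetical stand-in, not Impala's run-step.
set -u

# Assumption: logs go under LOG_DIR; fall back to a temp dir for this demo.
LOG_DIR="${LOG_DIR:-$(mktemp -d)}"

# Run a command with stdout and stderr sent to a log file,
# printing only a short status summary on the console.
run_step_sketch() {
  local msg="$1"
  local log_file="${LOG_DIR}/$2"
  shift 2
  echo "${msg} (logging to ${log_file})..."
  if "$@" > "${log_file}" 2>&1; then
    echo "    OK"
  else
    echo "    FAILED (see ${log_file})"
    return 1
  fi
}

# Placeholder for the real warm-up statements; it only imitates noisy output.
fake_hive_warm_up() {
  echo "SLF4J: Class path contains multiple SLF4J bindings." >&2
  echo "INFO  : OK"
}

run_step_sketch "Running warm up Hive statements" hive-warm-up.log fake_hive_warm_up
```

With something along these lines, the console shows the "Running warm up Hive statements" message and a status, and the SLF4J and Beeline output stays available in the log file for debugging a failed warm-up.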



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
