Are you using the MR execution mode? It is deprecated; see the warning in your output: "Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine"
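If it helps, the engine in effect can be checked from the same Beeline session and switched for that session only. This is a minimal sketch, not a confirmed fix: it assumes Tez is installed and configured on the cluster, which the report below does not say.

```sql
-- Print the execution engine currently in effect for this session.
SET hive.execution.engine;

-- Switch this session to Tez.
-- Assumption: Tez is installed and configured on the cluster.
SET hive.execution.engine=tez;

-- Re-run the join from the report.
SELECT emp.emp_id, emp.emp_name, dept.dept_name
FROM emp
JOIN dept ON emp.dept_id = dept.dept_id;
```

A session-level SET leaves hive-site.xml untouched, so the change is easy to undo while testing.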
On 2024/06/19 04:59:22 张三 wrote:
> ---------- Forwarded message ---------
> From: 张三 <zsan94...@gmail.com>
> Date: Wed, Jun 19, 2024, 12:55
> Subject: about hive4.0.0 error
> To: <u...@hive.apache.org>
>
>
> Dear Hive team,
>
> I encountered an issue while using the latest version of Hive, 4.0.0. I am
> not sure if this is a bug. The details are as follows: I am using Hadoop
> version 3.3.6 and have set up a fully distributed cluster. The Hive version
> is 4.0.0, with one master server and one client. Currently, I am connecting
> to Hive via the Beeline client. When executing a join in Hive, I
> encountered the following error:
>
> ```
>
> 0: jdbc:hive2://node1:10000> SELECT emp.emp_id, emp.emp_name, dept.dept_name
> . . . . . . . . . . . . . .> FROM emp
> . . . . . . . . . . . . . .> JOIN dept ON emp.dept_id = dept.dept_id;
> INFO : Compiling command(queryId=root_20240619122107_48e50dbd-9e12-4743-9652-f266165811bc): SELECT emp.emp_id, emp.emp_name, dept.dept_name
> FROM emp
> JOIN dept ON emp.dept_id = dept.dept_id
> INFO : No Stats for default@emp, Columns: emp_name, dept_id, emp_id
> INFO : No Stats for default@dept, Columns: dept_name, dept_id
> INFO : Semantic Analysis Completed (retrial = false)
> INFO : Created Hive schema: Schema(fieldSchemas:[FieldSchema(name:emp.emp_id, type:int, comment:null), FieldSchema(name:emp.emp_name, type:string, comment:null), FieldSchema(name:dept.dept_name, type:string, comment:null)], properties:null)
> INFO : Completed compiling command(queryId=root_20240619122107_48e50dbd-9e12-4743-9652-f266165811bc); Time taken: 0.894 seconds
> INFO : Concurrency mode is disabled, not creating a lock manager
> INFO : Executing command(queryId=root_20240619122107_48e50dbd-9e12-4743-9652-f266165811bc): SELECT emp.emp_id, emp.emp_name, dept.dept_name
> FROM emp
> JOIN dept ON emp.dept_id = dept.dept_id
> WARN : Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. tez) or using Hive 1.X releases.
> INFO : Query ID = root_20240619122107_48e50dbd-9e12-4743-9652-f266165811bc
> INFO : Total jobs = 1
> INFO : Starting task [Stage-4:MAPREDLOCAL] in serial mode
> ERROR : Execution failed with exit status: 1
> ERROR : Obtaining error information
> ERROR :
> Task failed!
> Task ID:
> Stage-4
>
> Logs:
>
> ERROR : /tmp/root/hive.log
> ERROR : FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.mr.MapredLocalTask
> INFO : Completed executing command(queryId=root_20240619122107_48e50dbd-9e12-4743-9652-f266165811bc); Time taken: 4.461 seconds
> Error: Error while compiling statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.mr.MapredLocalTask (state=08S01,code=1)
>
> ```
>
> I checked the log on the Hive server at /tmp/root/hive.log and found the
> following error information:
>
> ```
>
> 2024-06-19T11:22:56,454 INFO [NotificationEventPoll 0] HiveMetaStore.audit: ugi=root ip=unknown-ip-addr cmd=get_config_value: name=metastore.batch.retrieve.max defaultValue=50
> 2024-06-19T11:23:01,656 INFO [Scheduled Query Poller] HiveMetaStore.audit: ugi=root ip=unknown-ip-addr cmd=scheduled_query_poll
> 2024-06-19T11:23:56,375 INFO [NotificationEventPoll 0] HiveMetaStore.audit: ugi=root ip=unknown-ip-addr cmd=get_config_value: name=metastore.batch.retrieve.max defaultValue=50
> 2024-06-19T11:24:01,946 INFO [Scheduled Query Poller] HiveMetaStore.audit: ugi=root ip=unknown-ip-addr cmd=scheduled_query_poll
> 2024-06-19T11:24:56,373 INFO [NotificationEventPoll 0] HiveMetaStore.audit: ugi=root ip=unknown-ip-addr cmd=get_config_value: name=metastore.batch.retrieve.max defaultValue=50
> 2024-06-19T11:25:02,197 INFO [Scheduled Query Poller] HiveMetaStore.audit: ugi=root ip=unknown-ip-addr cmd=scheduled_query_poll
> 2024-06-19T11:25:56,376 INFO [NotificationEventPoll 0] HiveMetaStore.audit: ugi=root
ip=unknown-ip-addr cmd=get_config_value: > name=metast > ore.batch.retrieve.max defaultValue=50 2024-06-19T11:26:02,390 INFO > [Scheduled Query Poller] HiveMetaStore.audit: ugi=root ip=unknown-ip-addr > cmd=scheduled_query_poll > 2024-06-19T11:26:56,373 INFO [NotificationEventPoll 0] > HiveMetaStore.audit: ugi=root ip=unknown-ip-addr cmd=get_config_value: > name=metast > ore.batch.retrieve.max defaultValue=50 2024-06-19T11:27:02,660 INFO > [Scheduled Query Poller] HiveMetaStore.audit: ugi=root ip=unknown-ip-addr > cmd=scheduled_query_poll > 2024-06-19T11:27:48,645 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] session.HiveSessionImpl: executing > SELEC > T emp.emp_id, emp.emp_name, dept.dept_nameFROM emp > JOIN dept ON emp.dept_id = dept.dept_id > 2024-06-19T11:27:48,654 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] operation.OperationManager: Adding > opera > tion: OperationHandle [opType=EXECUTE_STATEMENT, > getHandleIdentifier()=9c5817e6-131a-4e60-bfe3-faa066a61119] SessionHandle > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0]2024-06-19T11:27:48,654 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > common.LogUtils: Thread context registra > tion is done.2024-06-19T11:27:48,654 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > operation.SQLOperation: [opType=EXECUTE_ > STATEMENT, > queryId=root_20240619112748_9448d222-0018-4dd7-ba8a-4c30eba4ddd8, > startTime=1718767668646, sessionId=00b77a5e-7cec-4e10-a0ae-f94450a56ab0, > createTime=1718766580032, userName=root, > ipAddress=192.168.179.6]2024-06-19T11:27:48,655 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > reexec.ReExecDriver: Compile #1 of query > 2024-06-19T11:27:48,655 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] ql.Driver: Compiling > command(queryId=roo > 
t_20240619112748_9448d222-0018-4dd7-ba8a-4c30eba4ddd8): SELECT emp.emp_id, > emp.emp_name, dept.dept_nameFROM emp > JOIN dept ON emp.dept_id = dept.dept_id > 2024-06-19T11:27:48,657 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] ql.QueryState: Query-level HMS cache > cre > ated for > root_20240619112748_9448d222-0018-4dd7-ba8a-4c30eba4ddd82024-06-19T11:27:48,657 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] parse.CalcitePlanner: Starting Semantic > Analysis2024-06-19T11:27:48,657 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] parse.CalcitePlanner: Completed phase > 1 > of Semantic Analysis2024-06-19T11:27:48,657 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > parse.CalcitePlanner: Get metadata for s > ource tables2024-06-19T11:27:48,658 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > HiveMetaStore.audit: ugi=root ip=unknow > n-ip-addr cmd=get_table : tbl=hive.default.emp 2024-06-19T11:27:48,712 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] metastore.MetastoreDefaultTransformer: S > tarting translation for processor HMSClient-@node1 on list > 12024-06-19T11:27:48,712 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] metastore.MetastoreDefaultTransformer: > T > able > emp,#bucket=-1,isBucketed:false,tableType=EXTERNAL_TABLE,tableCapabilities=null2024-06-19T11:27:48,713 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] metastore.MetastoreDefaultTransformer: T > ransformer return list of 12024-06-19T11:27:48,713 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > HiveMetaStore.audit: ugi=root ip=unknow > n-ip-addr cmd=get_table : tbl=hive.default.dept 2024-06-19T11:27:48,749 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] 
metastore.MetastoreDefaultTransformer: S > tarting translation for processor HMSClient-@node1 on list > 12024-06-19T11:27:48,749 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] metastore.MetastoreDefaultTransformer: > T > able > dept,#bucket=-1,isBucketed:false,tableType=EXTERNAL_TABLE,tableCapabilities=null2024-06-19T11:27:48,749 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] metastore.MetastoreDefaultTransformer: T > ransformer return list of 12024-06-19T11:27:48,749 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > parse.CalcitePlanner: Get metadata for s > ubqueries2024-06-19T11:27:48,749 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > parse.CalcitePlanner: Get metadata for d > estination tables2024-06-19T11:27:48,749 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > ql.Context: New scratch dir is hdfs://my > cluster/tmp/hive/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-12024-06-19T11:27:48,749 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] parse.CalcitePlanner: Completed getting > MetaData in Semantic Analysis2024-06-19T11:27:48,752 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > HiveMetaStore.audit: ugi=root ip=unknow > n-ip-addr cmd=get_all_table_constraints : tbl=hive.default.emp > 2024-06-19T11:27:48,792 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] HiveMetaStore.audit: ugi=root ip=unknow > n-ip-addr cmd=get_all_table_constraints : tbl=hive.default.dept > 2024-06-19T11:27:48,906 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] calcite.RelOptHiveTable: Calculating > col > umn statistics for default.emp, projIndxSet: [0, 1, 2], allowMissingStats: > true2024-06-19T11:27:48,907 INFO 
[00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] HiveMetaStore.audit: ugi=root ip=unknow > n-ip-addr cmd=get_table : tbl=hive.default.emp 2024-06-19T11:27:48,979 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] metastore.MetastoreDefaultTransformer: S > tarting translation for processor HMSClient-@node1 on list > 12024-06-19T11:27:48,980 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] metastore.MetastoreDefaultTransformer: > T > able > emp,#bucket=-1,isBucketed:false,tableType=EXTERNAL_TABLE,tableCapabilities=null2024-06-19T11:27:48,980 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] metastore.MetastoreDefaultTransformer: T > ransformer return list of 12024-06-19T11:27:48,980 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > HiveMetaStore.audit: ugi=root ip=unknow > n-ip-addr cmd=get_table_statistics_req: table=hive.default.emp > 2024-06-19T11:27:49,010 WARN [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] calcite.RelOptHiveTable: No Stats for > de > fault@emp, Columns: emp_name, dept_id, emp_id2024-06-19T11:27:49,010 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > SessionState: No Stats for default@emp, > Columns: emp_name, dept_id, emp_id2024-06-19T11:27:49,011 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > calcite.RelOptHiveTable: Calculating col > umn statistics for default.dept, projIndxSet: [0, 1], allowMissingStats: > true2024-06-19T11:27:49,011 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] HiveMetaStore.audit: ugi=root ip=unknow > n-ip-addr cmd=get_table : tbl=hive.default.dept 2024-06-19T11:27:49,080 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] metastore.MetastoreDefaultTransformer: S > tarting translation for processor 
HMSClient-@node1 on list > 12024-06-19T11:27:49,080 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] metastore.MetastoreDefaultTransformer: > T > able > dept,#bucket=-1,isBucketed:false,tableType=EXTERNAL_TABLE,tableCapabilities=null2024-06-19T11:27:49,083 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] metastore.MetastoreDefaultTransformer: T > ransformer return list of 12024-06-19T11:27:49,084 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > HiveMetaStore.audit: ugi=root ip=unknow > n-ip-addr cmd=get_table_statistics_req: table=hive.default.dept > 2024-06-19T11:27:49,121 WARN [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] calcite.RelOptHiveTable: No Stats for > de > fault@dept, Columns: dept_name, dept_id2024-06-19T11:27:49,121 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > SessionState: No Stats for default@dept, > Columns: dept_name, dept_id2024-06-19T11:27:49,221 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > common.FileUtils: Creating directory if > it doesn't exist: > hdfs://mycluster/tmp/hive/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-1/-mr-10001/.hive-staging_hive_2024-06-19_11-27-48_655_7337980361533661344-12024-06-19T11:27:49,215 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] parse.CalcitePlanner: Get metadata for s > ource tables2024-06-19T11:27:49,215 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > parse.CalcitePlanner: Get metadata for s > ubqueries2024-06-19T11:27:49,215 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > parse.CalcitePlanner: Get metadata for s > ource tables2024-06-19T11:27:49,215 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > 
parse.CalcitePlanner: Get metadata for s > ubqueries2024-06-19T11:27:49,215 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > parse.CalcitePlanner: Get metadata for d > estination tables2024-06-19T11:27:49,215 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > parse.CalcitePlanner: Get metadata for s > ource tables2024-06-19T11:27:49,216 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > parse.CalcitePlanner: Get metadata for s > ubqueries2024-06-19T11:27:49,216 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > parse.CalcitePlanner: Get metadata for d > estination tables2024-06-19T11:27:49,216 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > parse.CalcitePlanner: Get metadata for d > estination tables2024-06-19T11:27:49,216 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > ql.Context: New scratch dir is hdfs://my > cluster/tmp/hive/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-12024-06-19T11:27:49,301 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] parse.CalcitePlanner: CBO Succeeded; opt > imized logical plan.2024-06-19T11:27:49,305 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > optimizer.ColumnPrunerProcFactory: JOIN > 8 oldExprs: {0=[Column[VALUE._col0], Column[VALUE._col1], > Column[KEY.reducesinkkey0]], 1=[Column[KEY.reducesinkkey0], > Column[VALUE._col0]]}2024-06-19T11:27:49,305 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > optimizer.ColumnPrunerProcFactory: JOIN > 8 newExprs: {0=[Column[VALUE._col0], Column[VALUE._col1]], > 1=[Column[VALUE._col0]]}2024-06-19T11:27:49,305 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > optimizer.ColumnPrunerProcFactory: RS 6 > 
oldColExprMap: {VALUE._col0=Column[_col0], VALUE._col1=Column[_col1], > KEY.reducesinkkey0=Column[_col2]}2024-06-19T11:27:49,305 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > optimizer.ColumnPrunerProcFactory: RS 6 > newColExprMap: {VALUE._col0=Column[_col0], VALUE._col1=Column[_col1], > KEY.reducesinkkey0=Column[_col2]}2024-06-19T11:27:49,305 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > optimizer.ColumnPrunerProcFactory: RS 7 > oldColExprMap: {VALUE._col0=Column[_col1], > KEY.reducesinkkey0=Column[_col0]}2024-06-19T11:27:49,306 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > optimizer.ColumnPrunerProcFactory: RS 7 > newColExprMap: {VALUE._col0=Column[_col1], > KEY.reducesinkkey0=Column[_col0]}2024-06-19T11:27:49,307 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > optimizer.BucketVersionPopulator: not co > nsidering bucketingVersion for: TS[0] because it has -1<2 buckets > 2024-06-19T11:27:49,307 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] optimizer.BucketVersionPopulator: not > co > nsidering bucketingVersion for: TS[3] because it has -1<2 buckets > 2024-06-19T11:27:49,308 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] optimizer.SortedDynPartitionOptimizer: > S > orted dynamic partitioning optimization kicked in..2024-06-19T11:27:49,348 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] exec.Utilities: Not using thread pool fo > r getContentSummary2024-06-19T11:27:49,371 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > ql.Context: New scratch dir is hdfs://my > cluster/tmp/hive/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-12024-06-19T11:27:49,371 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] 
physical.LocalMapJoinProcFactory: Settin > g max memory usage to 0.9 for table sink not followed by group > by2024-06-19T11:27:49,372 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] physical.Vectorizer: Examining input > for > mat to see if vectorization is enabled.2024-06-19T11:27:49,372 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > physical.Vectorizer: Vectorization is en > abled for input format(s) > [org.apache.hadoop.mapred.TextInputFormat]2024-06-19T11:27:49,372 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > physical.Vectorizer: Validating and vect > orizing MapWork... (vectorizedVertexNum 0)2024-06-19T11:27:49,373 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > physical.Vectorizer: Map vectorization e > nabled: true2024-06-19T11:27:49,373 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > physical.Vectorizer: Map vectorized: tru > e2024-06-19T11:27:49,373 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] physical.Vectorizer: Map > vectorizedVerte > xNum: 02024-06-19T11:27:49,373 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] physical.Vectorizer: Map > enabledConditio > nsMet: [hive.vectorized.use.vector.serde.deserialize IS > true]2024-06-19T11:27:49,373 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] physical.Vectorizer: Map > inputFileFormat > ClassNameSet: > [org.apache.hadoop.mapred.TextInputFormat]2024-06-19T11:27:49,374 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > parse.CalcitePlanner: Completed plan gen > eration2024-06-19T11:27:49,374 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] ql.Driver: Semantic Analysis Completed > ( > retrial = false)2024-06-19T11:27:49,374 INFO > 
[00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > ql.Driver: Created Hive schema: Schema(f > ieldSchemas:[FieldSchema(name:emp.emp_id, type:int, comment:null), > FieldSchema(name:emp.emp_name, type:string, comment:null), > FieldSchema(name:dept.dept_name, type:string, comment:null)], > properties:null)2024-06-19T11:27:49,375 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > exec.ListSinkOperator: Initializing Oper > ator: LIST_SINK[21]2024-06-19T11:27:49,375 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > metadata.Hive: Dumping metastore api cal > l timing information for : compilation phase2024-06-19T11:27:49,375 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > metadata.Hive: Total time spent in each > metastore function (ms): {getTableColumnStatistics_(String, String, List, > String)=213, isCompatibleWith_(Configuration)=1, > getAllTableConstraints_(AllTableConstraintsRequest)=100, > getTable_(GetTableRequest)=92, flushCache_()=0}2024-06-19T11:27:49,375 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] ql.Driver: Completed compiling command(q > ueryId=root_20240619112748_9448d222-0018-4dd7-ba8a-4c30eba4ddd8); Time > taken: 0.72 seconds2024-06-19T11:27:49,375 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > common.LogUtils: Unregistered logging co > ntext.2024-06-19T11:27:49,377 INFO [HiveServer2-Background-Pool: > Thread-110] common.LogUtils: Thread context registration is done. 
> 2024-06-19T11:27:49,377 INFO [HiveServer2-Background-Pool: Thread-110] > reexec.ReExecDriver: Execution #1 of query > 2024-06-19T11:27:49,378 INFO [HiveServer2-Background-Pool: Thread-110] > ql.Driver: Concurrency mode is disabled, not creating a lock manager > 2024-06-19T11:27:49,378 INFO [HiveServer2-Background-Pool: Thread-110] > ql.Driver: Executing command(queryId=root_20240619112748_9448d222-0018-4d > d7-ba8a-4c30eba4ddd8): SELECT emp.emp_id, emp.emp_name, dept.dept_nameFROM > emp > JOIN dept ON emp.dept_id = dept.dept_id > 2024-06-19T11:27:49,378 WARN [HiveServer2-Background-Pool: Thread-110] > ql.Driver: Hive-on-MR is deprecated in Hive 2 and may not be available in > the future versions. Consider using a different execution engine (i.e. > tez) or using Hive 1.X releases.2024-06-19T11:27:49,385 INFO > [HiveServer2-Background-Pool: Thread-110] ql.Driver: Query ID = > root_20240619112748_9448d222-0018-4dd7-ba8a-4c30eba > 4ddd82024-06-19T11:27:49,385 INFO [HiveServer2-Background-Pool: > Thread-110] ql.Driver: Total jobs = 1 > 2024-06-19T11:27:49,386 INFO [HiveServer2-Background-Pool: Thread-110] > ql.Driver: Starting task [Stage-4:MAPREDLOCAL] in serial mode > 2024-06-19T11:27:49,388 INFO [HiveServer2-Background-Pool: Thread-110] > mr.MapredLocalTask: Generating plan file file:/tmp/root/00b77a5e-7cec-4e1 > 0-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-6/-local-10006/plan.xml2024-06-19T11:27:49,389 > INFO [HiveServer2-Background-Pool: Thread-110] > exec.SerializationUtilities: Serializing MapredLocalWork using kryo > 2024-06-19T11:27:49,466 INFO [HiveServer2-Background-Pool: Thread-110] > mr.MapredLocalTask: Executing: /software/hadoop-3.3.6//bin/hadoop jar /so > ftware/hive-4.0.0/lib/hive-common-4.0.0.jar > org.apache.hadoop.hive.ql.exec.mr.ExecDriver -localtask -plan > file:/tmp/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-6/-local-10006/plan.xml > -jobconffile > 
file:/tmp/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-6/-local-10007/jobconf.xml2024-06-19T11:27:50,442 > INFO [HiveServer2-Background-Pool: > Thread-110-LocalTask-MAPREDLOCAL-stderr] mr.MapredLocalTask: SLF4J: Class > path contai > ns multiple SLF4J bindings.2024-06-19T11:27:50,442 INFO > [HiveServer2-Background-Pool: Thread-110-LocalTask-MAPREDLOCAL-stderr] > mr.MapredLocalTask: SLF4J: Found binding in > [jar:file:/software/hive-4.0.0/lib/log4j-slf4j-impl-2.18.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]2024-06-19T11:27:50,442 > INFO [HiveServer2-Background-Pool: > Thread-110-LocalTask-MAPREDLOCAL-stderr] mr.MapredLocalTask: SLF4J: Found > binding in > [jar:file:/software/hadoop-3.3.6/share/hadoop/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]2024-06-19T11:27:50,442 > INFO [HiveServer2-Background-Pool: > Thread-110-LocalTask-MAPREDLOCAL-stderr] mr.MapredLocalTask: SLF4J: See > http://www.sl > f4j.org/codes.html#multiple_bindings for an > explanation.2024-06-19T11:27:50,637 INFO [HiveServer2-Background-Pool: > Thread-110-LocalTask-MAPREDLOCAL-stderr] mr.MapredLocalTask: SLF4J: Actual > binding is > of type > [org.apache.logging.slf4j.Log4jLoggerFactory]2024-06-19T11:27:52,921 INFO > [main] conf.HiveConf: Found configuration file > file:/software/hive-4.0.0/conf/hive-site.xml > 2024-06-19T11:27:52,951 INFO [main] common.LogUtils: Thread context > registration is done. 
> eServer2-Background-Pool: Thread-110-LocalTask-MAPREDLOCAL-stderr] > mr.MapredLocalTask: at > org.apache.hadoop.conf.Configuration.<init>(Configurat > ion.java:843)2024-06-19T11:27:54,865 INFO [HiveServer2-Background-Pool: > Thread-110-LocalTask-MAPREDLOCAL-stderr] mr.MapredLocalTask: at > org.apache.had > oop.mapred.JobConf.<init>(JobConf.java:458)2024-06-19T11:27:54,865 INFO > [HiveServer2-Background-Pool: Thread-110-LocalTask-MAPREDLOCAL-stderr] > mr.MapredLocalTask: at org.apache.had > oop.mapred.JobConf.<init>(JobConf.java:475)2024-06-19T11:27:54,865 INFO > [HiveServer2-Background-Pool: Thread-110-LocalTask-MAPREDLOCAL-stderr] > mr.MapredLocalTask: at org.apache.had > oop.hive.ql.exec.mr.MapredLocalTask.initialize(MapredLocalTask.java:136)2024-06-19T11:27:54,865 > INFO [HiveServer2-Background-Pool: > Thread-110-LocalTask-MAPREDLOCAL-stderr] mr.MapredLocalTask: at > org.apache.had > oop.hive.ql.exec.mr.ExecDriver.main(ExecDriver.java:766)2024-06-19T11:27:54,865 > INFO [HiveServer2-Background-Pool: > Thread-110-LocalTask-MAPREDLOCAL-stderr] mr.MapredLocalTask: at > sun.reflect.Na > tiveMethodAccessorImpl.invoke0(Native Method)2024-06-19T11:27:54,865 INFO > [HiveServer2-Background-Pool: Thread-110-LocalTask-MAPREDLOCAL-stderr] > mr.MapredLocalTask: at sun.reflect.Na > tiveMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)2024-06-19T11:27:54,865 > INFO [HiveServer2-Background-Pool: > Thread-110-LocalTask-MAPREDLOCAL-stderr] mr.MapredLocalTask: at > sun.reflect.De > legatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)2024-06-19T11:27:54,865 > INFO [HiveServer2-Background-Pool: > Thread-110-LocalTask-MAPREDLOCAL-stderr] mr.MapredLocalTask: at > java.lang.refl > ect.Method.invoke(Method.java:498)2024-06-19T11:27:54,865 INFO > [HiveServer2-Background-Pool: Thread-110-LocalTask-MAPREDLOCAL-stderr] > mr.MapredLocalTask: at org.apache.had > oop.util.RunJar.run(RunJar.java:328)2024-06-19T11:27:54,865 INFO > [HiveServer2-Background-Pool: 
Thread-110-LocalTask-MAPREDLOCAL-stderr] > mr.MapredLocalTask: at org.apache.had > oop.util.RunJar.main(RunJar.java:241)2024-06-19T11:27:54,945 ERROR > [HiveServer2-Background-Pool: Thread-110] exec.Task: Execution failed with > exit status: 1 > 2024-06-19T11:27:54,945 ERROR [HiveServer2-Background-Pool: Thread-110] > exec.Task: Obtaining error information > 2024-06-19T11:27:54,945 ERROR [HiveServer2-Background-Pool: Thread-110] > exec.Task: > Task failed! > Task ID: > Stage-4 > > Logs: > > 2024-06-19T11:27:54,945 ERROR [HiveServer2-Background-Pool: Thread-110] > exec.Task: /tmp/root/hive.log > 2024-06-19T11:27:54,945 ERROR [HiveServer2-Background-Pool: Thread-110] > mr.MapredLocalTask: Execution failed with exit status: 1 > 2024-06-19T11:27:54,945 INFO [HiveServer2-Background-Pool: Thread-110] > reexec.ReOptimizePlugin: ReOptimization: retryPossible: false > 2024-06-19T11:27:54,945 INFO [HiveServer2-Background-Pool: Thread-110] > reexec.ReExecuteLostAMQueryPlugin: Exception is not a TezRuntimeException > , no need to check further with > ReExecuteLostAMQueryPlugin2024-06-19T11:27:54,945 ERROR > [HiveServer2-Background-Pool: Thread-110] ql.Driver: FAILED: Execution > Error, return code 1 from org.apache.hadoop. 
> hive.ql.exec.mr.MapredLocalTask2024-06-19T11:27:54,946 INFO > [HiveServer2-Background-Pool: Thread-110] metadata.Hive: Dumping metastore > api call timing information for : execut > ion phase2024-06-19T11:27:54,946 INFO [HiveServer2-Background-Pool: > Thread-110] metadata.Hive: Total time spent in each metastore function > (ms): {} > 2024-06-19T11:27:54,946 INFO [HiveServer2-Background-Pool: Thread-110] > ql.Driver: Completed executing command(queryId=root_20240619112748_9448d2 > 22-0018-4dd7-ba8a-4c30eba4ddd8); Time taken: 5.567 > seconds2024-06-19T11:27:54,946 INFO [HiveServer2-Background-Pool: > Thread-110] reexec.ReExecDriver: Re-execution decision is made according > to: executio > nIndex: 1, maxExecutions: 2, shouldReExecute: false2024-06-19T11:27:54,947 > INFO [HiveServer2-Background-Pool: Thread-110] common.LogUtils: > Unregistered logging context. > 2024-06-19T11:27:54,947 ERROR [HiveServer2-Background-Pool: Thread-110] > operation.SQLOperation: Error running hive query > org.apache.hive.service.cli.HiveSQLException: Error while compiling > statement: FAILED: Execution Error, return code 1 from > org.apache.hadoop.hive > .ql.exec.mr.MapredLocalTask at > org.apache.hive.service.cli.operation.Operation.toSQLException(Operation.java:376) > ~[hive-service-4.0.0.jar:4.0.0] > at > org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:249) > ~[hive-service-4.0.0.jar:4.0.0] > at > org.apache.hive.service.cli.operation.SQLOperation.access$500(SQLOperation.java:90) > ~[hive-service-4.0.0.jar:4.0.0] > at > org.apache.hive.service.cli.operation.SQLOperation$BackgroundWork$1.run(SQLOperation.java:336) > ~[hive-service-4.0.0.jar:4.0.0] > at java.security.AccessController.doPrivileged(Native Method) ~[?:1.8.0_181] > at javax.security.auth.Subject.doAs(Subject.java:422) ~[?:1.8.0_181] > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1899) > ~[hadoop-common-3.3.6.jar:?] 
> at > org.apache.hive.service.cli.operation.SQLOperation$BackgroundWork.run(SQLOperation.java:356) > ~[hive-service-4.0.0.jar:4.0.0] > at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) > ~[?:1.8.0_181] > at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[?:1.8.0_181] > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) > ~[?:1.8.0_181] > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) > ~[?:1.8.0_181] > at java.lang.Thread.run(Thread.java:748) ~[?:1.8.0_181] > 2024-06-19T11:27:54,960 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] operation.OperationManager: Closing > oper > ation: OperationHandle [opType=EXECUTE_STATEMENT, > getHandleIdentifier()=9c5817e6-131a-4e60-bfe3-faa066a61119]2024-06-19T11:27:54,960 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] operation.OperationManager: Removed quer > yId: root_20240619112748_9448d222-0018-4dd7-ba8a-4c30eba4ddd8 corresponding > to operation: OperationHandle [opType=EXECUTE_STATEMENT, > getHandleIdentifier()=9c5817e6-131a-4e60-bfe3-faa066a61119] with tag: > null2024-06-19T11:27:54,961 INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 > HiveServer2-Handler-Pool: Thread-54] ql.Context: Deleting scratch dir: > hdfs:/ > /mycluster/tmp/hive/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-12024-06-19T11:27:54,961 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] cleanup.EventualCleanupService: Delete h > dfs://mycluster/tmp/hive/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-1 > operation was queued2024-06-19T11:27:54,961 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > ql.Context: Deleting scratch dir: file:/ > 
tmp/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-62024-06-19T11:27:54,961 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] cleanup.EventualCleanupService: Delete f > ile:/tmp/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-6 > operation was queued2024-06-19T11:27:54,961 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > ql.Context: Deleting scratch dir: file:/ > tmp/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-12024-06-19T11:27:54,961 > INFO [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: > Thread-54] cleanup.EventualCleanupService: Delete f > ile:/tmp/root/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-1 > operation was queued2024-06-19T11:27:54,961 INFO > [00b77a5e-7cec-4e10-a0ae-f94450a56ab0 HiveServer2-Handler-Pool: Thread-54] > operation.SQLOperation: Closing operatio > n log > /tmp/root/operation_logs/00b77a5e-7cec-4e10-a0ae-f94450a56ab0/root_20240619112748_9448d222-0018-4dd7-ba8a-4c30eba4ddd8 > without delay2024-06-19T11:27:54,970 INFO [EventualCleanupService thread > 0] cleanup.EventualCleanupService: Deleted > file:/tmp/root/00b77a5e-7cec-4e10-a0ae-f9 > 4450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-12024-06-19T11:27:54,971 > INFO [EventualCleanupService thread 3] cleanup.EventualCleanupService: > Deleted file:/tmp/root/00b77a5e-7cec-4e10-a0ae-f9 > 4450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-62024-06-19T11:27:54,989 > INFO [EventualCleanupService thread 9] cleanup.EventualCleanupService: > Deleted hdfs://mycluster/tmp/hive/root/00b77a5e-7 > cec-4e10-a0ae-f94450a56ab0/hive_2024-06-19_11-27-48_655_7337980361533661344-12024-06-19T11:27:56,432 > INFO [NotificationEventPoll 0] HiveMetaStore.audit: ugi=root > ip=unknown-ip-addr cmd=get_config_value: name=metast > 
ore.batch.retrieve.max defaultValue=50
> 2024-06-19T11:28:02,946 INFO [Scheduled Query Poller] HiveMetaStore.audit: ugi=root ip=unknown-ip-addr cmd=scheduled_query_poll
> 2024-06-19T11:28:56,373 INFO [NotificationEventPoll 0] HiveMetaStore.audit: ugi=root ip=unknown-ip-addr cmd=get_config_value: name=metastore.batch.retrieve.max defaultValue=50
>
> ```
>
> Through my research, I found some answers suggesting that setting
> hive.auto.convert.join=false could resolve the issue. I tried this, and it
> indeed worked. However, my question is: why does the default setting
> hive.auto.convert.join=true not work in Hive 4.0.0? I also tested the same
> query in Hive 3.x versions and did not encounter any issues. Therefore, I
> am unsure which parameters should be set to use mapjoin in Hive 4.0.0.
>
> Could you please assist me in resolving this issue? I am in urgent need of
> a solution.
>
> Thank you.
>
> Best regards,
> A loyal Hive user
>
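For reference, the workaround described in the report can be applied per session instead of globally. The sketch below uses only the property and query named in the report. The comment about the resulting plan is an expectation, not something the logs above confirm.

```sql
-- Disable automatic conversion of common joins to map joins for this session only.
SET hive.auto.convert.join=false;

-- Inspect the plan before running the query. With conversion disabled,
-- the plan should no longer contain the map-join local task that failed
-- in the report as Stage-4:MAPREDLOCAL.
EXPLAIN
SELECT emp.emp_id, emp.emp_name, dept.dept_name
FROM emp
JOIN dept ON emp.dept_id = dept.dept_id;
```

Comparing the EXPLAIN output with the property set to true and to false shows which stages the conversion adds. That may help narrow down why the local task exits with status 1.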