(hive) branch branch-4.0 updated (f4302068b6a -> 183f8cb41d3)

2024-03-21 Thread dkuzmenko
This is an automated email from the ASF dual-hosted git repository.

dkuzmenko pushed a change to branch branch-4.0
in repository https://gitbox.apache.org/repos/asf/hive.git


 discard f4302068b6a Updating RELEASE_NOTES, NOTICE, README.md for 4.0.0
 new 183f8cb41d3 Updating RELEASE_NOTES, NOTICE, README.md for 4.0.0

This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version.  This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:

 * -- * -- B -- O -- O -- O   (f4302068b6a)
\
 N -- N -- N   refs/heads/branch-4.0 (183f8cb41d3)

You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.

Any revisions marked "omit" are not gone; other references still
refer to them.  Any revisions marked "discard" are gone forever.

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)



(hive) 01/01: Set version to 4.0.0

2024-03-21 Thread dkuzmenko
This is an automated email from the ASF dual-hosted git repository.

dkuzmenko pushed a commit to branch branch-4.0
in repository https://gitbox.apache.org/repos/asf/hive.git

commit 258d0e98218f60b8e88b8d0f9909c6e17264c828
Author: Denys Kuzmenko 
AuthorDate: Thu Mar 21 14:47:15 2024 +0100

Set version to 4.0.0
---
 accumulo-handler/pom.xml  | 2 +-
 beeline/pom.xml   | 2 +-
 classification/pom.xml| 2 +-
 cli/pom.xml   | 2 +-
 common/pom.xml| 2 +-
 contrib/pom.xml   | 2 +-
 druid-handler/pom.xml | 2 +-
 hbase-handler/pom.xml | 2 +-
 hcatalog/core/pom.xml | 2 +-
 hcatalog/hcatalog-pig-adapter/pom.xml | 4 ++--
 hcatalog/pom.xml  | 4 ++--
 hcatalog/server-extensions/pom.xml| 2 +-
 hcatalog/webhcat/java-client/pom.xml  | 2 +-
 hcatalog/webhcat/svr/pom.xml  | 2 +-
 hplsql/pom.xml| 2 +-
 iceberg/iceberg-catalog/pom.xml   | 2 +-
 iceberg/iceberg-handler/pom.xml   | 2 +-
 iceberg/iceberg-shading/pom.xml   | 2 +-
 iceberg/patched-iceberg-api/pom.xml   | 2 +-
 iceberg/patched-iceberg-core/pom.xml  | 2 +-
 iceberg/pom.xml   | 4 ++--
 itests/custom-serde/pom.xml   | 2 +-
 itests/custom-udfs/pom.xml| 2 +-
 itests/custom-udfs/udf-classloader-udf1/pom.xml   | 2 +-
 itests/custom-udfs/udf-classloader-udf2/pom.xml   | 2 +-
 itests/custom-udfs/udf-classloader-util/pom.xml   | 2 +-
 itests/custom-udfs/udf-vectorized-badexample/pom.xml  | 2 +-
 itests/hcatalog-unit/pom.xml  | 2 +-
 itests/hive-blobstore/pom.xml | 2 +-
 itests/hive-jmh/pom.xml   | 2 +-
 itests/hive-minikdc/pom.xml   | 2 +-
 itests/hive-unit-hadoop2/pom.xml  | 2 +-
 itests/hive-unit/pom.xml  | 2 +-
 itests/pom.xml| 2 +-
 itests/qtest-accumulo/pom.xml | 2 +-
 itests/qtest-druid/pom.xml| 2 +-
 itests/qtest-iceberg/pom.xml  | 2 +-
 itests/qtest-kudu/pom.xml | 2 +-
 itests/qtest/pom.xml  | 2 +-
 itests/test-serde/pom.xml | 2 +-
 itests/util/pom.xml   | 2 +-
 jdbc-handler/pom.xml  | 2 +-
 jdbc/pom.xml  | 2 +-
 kafka-handler/pom.xml | 2 +-
 kudu-handler/pom.xml  | 2 +-
 llap-client/pom.xml   | 2 +-
 llap-common/pom.xml   | 2 +-
 llap-ext-client/pom.xml   | 2 +-
 llap-server/pom.xml   | 2 +-
 llap-tez/pom.xml  | 2 +-
 metastore/pom.xml | 2 +-
 packaging/pom.xml | 2 +-
 parser/pom.xml| 2 +-
 pom.xml   | 6 +++---
 ql/pom.xml| 2 +-
 serde/pom.xml | 2 +-
 service-rpc/pom.xml   | 2 +-
 service/pom.xml   | 2 +-
 shims/0.23/pom.xml| 2 +-
 shims/aggregator/pom.xml  | 2 +-
 shims/common/pom.xml  | 2 +-
 shims/pom.xml | 2 +-
 standalone-metastore/metastore-common/pom.xml | 2 +-
 

(hive) branch branch-4.0 updated (15ad9048e91 -> 258d0e98218)

2024-03-21 Thread dkuzmenko
This is an automated email from the ASF dual-hosted git repository.

dkuzmenko pushed a change to branch branch-4.0
in repository https://gitbox.apache.org/repos/asf/hive.git


 discard 15ad9048e91 Set version to 4.0.0
 new 258d0e98218 Set version to 4.0.0

This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version.  This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:

 * -- * -- B -- O -- O -- O   (15ad9048e91)
\
 N -- N -- N   refs/heads/branch-4.0 (258d0e98218)

You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.

Any revisions marked "omit" are not gone; other references still
refer to them.  Any revisions marked "discard" are gone forever.

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 hcatalog/hcatalog-pig-adapter/pom.xml | 2 +-
 hcatalog/pom.xml  | 2 +-
 pom.xml   | 4 ++--
 standalone-metastore/metastore-common/pom.xml | 2 +-
 standalone-metastore/metastore-server/pom.xml | 4 ++--
 standalone-metastore/metastore-tools/metastore-benchmarks/pom.xml | 2 +-
 standalone-metastore/metastore-tools/pom.xml  | 2 +-
 standalone-metastore/metastore-tools/tools-common/pom.xml | 2 +-
 standalone-metastore/pom.xml  | 6 +++---
 storage-api/pom.xml   | 2 +-
 streaming/pom.xml | 2 +-
 11 files changed, 15 insertions(+), 15 deletions(-)



(hive) branch branch-4.0 updated: Set version to 4.0.0

2024-03-21 Thread dkuzmenko
This is an automated email from the ASF dual-hosted git repository.

dkuzmenko pushed a commit to branch branch-4.0
in repository https://gitbox.apache.org/repos/asf/hive.git


The following commit(s) were added to refs/heads/branch-4.0 by this push:
 new 15ad9048e91 Set version to 4.0.0
15ad9048e91 is described below

commit 15ad9048e914b0787a9ae2b9cb963a2b1a7ce65d
Author: Denys Kuzmenko 
AuthorDate: Thu Mar 21 14:47:15 2024 +0100

Set version to 4.0.0
---
 accumulo-handler/pom.xml | 2 +-
 beeline/pom.xml  | 2 +-
 classification/pom.xml   | 2 +-
 cli/pom.xml  | 2 +-
 common/pom.xml   | 2 +-
 contrib/pom.xml  | 2 +-
 druid-handler/pom.xml| 2 +-
 hbase-handler/pom.xml| 2 +-
 hcatalog/core/pom.xml| 2 +-
 hcatalog/hcatalog-pig-adapter/pom.xml| 2 +-
 hcatalog/pom.xml | 2 +-
 hcatalog/server-extensions/pom.xml   | 2 +-
 hcatalog/webhcat/java-client/pom.xml | 2 +-
 hcatalog/webhcat/svr/pom.xml | 2 +-
 hplsql/pom.xml   | 2 +-
 iceberg/iceberg-catalog/pom.xml  | 2 +-
 iceberg/iceberg-handler/pom.xml  | 2 +-
 iceberg/iceberg-shading/pom.xml  | 2 +-
 iceberg/patched-iceberg-api/pom.xml  | 2 +-
 iceberg/patched-iceberg-core/pom.xml | 2 +-
 iceberg/pom.xml  | 4 ++--
 itests/custom-serde/pom.xml  | 2 +-
 itests/custom-udfs/pom.xml   | 2 +-
 itests/custom-udfs/udf-classloader-udf1/pom.xml  | 2 +-
 itests/custom-udfs/udf-classloader-udf2/pom.xml  | 2 +-
 itests/custom-udfs/udf-classloader-util/pom.xml  | 2 +-
 itests/custom-udfs/udf-vectorized-badexample/pom.xml | 2 +-
 itests/hcatalog-unit/pom.xml | 2 +-
 itests/hive-blobstore/pom.xml| 2 +-
 itests/hive-jmh/pom.xml  | 2 +-
 itests/hive-minikdc/pom.xml  | 2 +-
 itests/hive-unit-hadoop2/pom.xml | 2 +-
 itests/hive-unit/pom.xml | 2 +-
 itests/pom.xml   | 2 +-
 itests/qtest-accumulo/pom.xml| 2 +-
 itests/qtest-druid/pom.xml   | 2 +-
 itests/qtest-iceberg/pom.xml | 2 +-
 itests/qtest-kudu/pom.xml| 2 +-
 itests/qtest/pom.xml | 2 +-
 itests/test-serde/pom.xml| 2 +-
 itests/util/pom.xml  | 2 +-
 jdbc-handler/pom.xml | 2 +-
 jdbc/pom.xml | 2 +-
 kafka-handler/pom.xml| 2 +-
 kudu-handler/pom.xml | 2 +-
 llap-client/pom.xml  | 2 +-
 llap-common/pom.xml  | 2 +-
 llap-ext-client/pom.xml  | 2 +-
 llap-server/pom.xml  | 2 +-
 llap-tez/pom.xml | 2 +-
 metastore/pom.xml| 2 +-
 packaging/pom.xml| 2 +-
 parser/pom.xml   | 2 +-
 pom.xml  | 2 +-
 ql/pom.xml   | 2 +-
 serde/pom.xml| 2 +-
 service-rpc/pom.xml  | 2 +-
 service/pom.xml  | 2 +-
 shims/0.23/pom.xml   | 2 +-
 shims/aggregator/pom.xml | 2 +-
 shims/common/pom.xml | 2 +-
 shims/pom.xml| 2 +-
 standalone-metastore/metastore-server/pom.xml| 2 +-
 streaming/pom.xml| 2 +-
 testutils/pom.xml| 2 +-
 udf/pom.xml  | 2 +-
 vector-code-gen/pom.xml  | 2 +-
 67 files changed, 68 insertions(+), 68 deletions(-)

diff --git a/accumulo-handler/pom.xml b/accumulo-handler/pom.xml
index b68d2d39597..4e141a81e95 100644
--- a/accumulo-handler/pom.xml
+++ b/accumulo-handler/pom.xml
@@ -17,7 +17,7 @@
   
 org.apache.hive
 hive
-4.0.0-SNAPSHOT
+4.0.0
 ../pom.xml
   
   hive-accumulo-handler
diff --git a/beeline/pom.xml b/beeline/pom.xml
index b785a20e2b5..87f1740436b 100644
--- 

(hive) branch master updated (c1d592f4f98 -> 9a0ce4e1589)

2024-03-21 Thread zabetak
This is an automated email from the ASF dual-hosted git repository.

zabetak pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/hive.git


from c1d592f4f98 HIVE-28069: Iceberg: Implement Merge task functionality 
for Iceberg tables (#5076) (Sourabh Badhya reviewed by Denys Kuzmenko, 
Krisztian Kasa)
 add 9a0ce4e1589 HIVE-28140: Remove unnecessary shading of test jars in 
Kafka module to speedup build (Stamatis Zampetakis reviewed by Butao Zhang)

No new revisions were added by this update.

Summary of changes:
 kafka-handler/pom.xml | 1 -
 1 file changed, 1 deletion(-)



(hive) branch master updated: HIVE-28069: Iceberg: Implement Merge task functionality for Iceberg tables (#5076) (Sourabh Badhya reviewed by Denys Kuzmenko, Krisztian Kasa)

2024-03-21 Thread sbadhya
This is an automated email from the ASF dual-hosted git repository.

sbadhya pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/hive.git


The following commit(s) were added to refs/heads/master by this push:
 new c1d592f4f98 HIVE-28069: Iceberg: Implement Merge task functionality 
for Iceberg tables (#5076) (Sourabh Badhya reviewed by Denys Kuzmenko, 
Krisztian Kasa)
c1d592f4f98 is described below

commit c1d592f4f98e49614ffaed57813983b579b9147f
Author: Sourabh Badhya 
AuthorDate: Thu Mar 21 15:48:05 2024 +0530

HIVE-28069: Iceberg: Implement Merge task functionality for Iceberg tables 
(#5076) (Sourabh Badhya reviewed by Denys Kuzmenko, Krisztian Kasa)
---
 .../iceberg/mr/hive/HiveIcebergInputFormat.java|   2 +-
 .../mr/hive/HiveIcebergOutputCommitter.java|  66 +++
 .../iceberg/mr/hive/HiveIcebergStorageHandler.java |  27 ++
 .../mr/hive/IcebergMergeTaskProperties.java|  55 +++
 .../test/queries/positive/iceberg_merge_files.q|  95 +
 .../positive/llap/iceberg_merge_files.q.out| 458 +
 .../test/resources/testconfiguration.properties|   2 +
 .../hadoop/hive/ql/io/CombineHiveInputFormat.java  |  13 +-
 .../hive/ql/metadata/HiveStorageHandler.java   |  17 +
 .../hadoop/hive/ql/optimizer/GenMapRedUtils.java   |  41 ++
 .../hadoop/hive/ql/parse/SemanticAnalyzer.java |  13 +-
 .../ql/plan/ConditionalResolverMergeFiles.java | 132 --
 .../hadoop/hive/ql/plan/MergeTaskProperties.java   |  30 ++
 13 files changed, 921 insertions(+), 30 deletions(-)

diff --git 
a/iceberg/iceberg-handler/src/main/java/org/apache/iceberg/mr/hive/HiveIcebergInputFormat.java
 
b/iceberg/iceberg-handler/src/main/java/org/apache/iceberg/mr/hive/HiveIcebergInputFormat.java
index dd329c122aa..1ea78eeba54 100644
--- 
a/iceberg/iceberg-handler/src/main/java/org/apache/iceberg/mr/hive/HiveIcebergInputFormat.java
+++ 
b/iceberg/iceberg-handler/src/main/java/org/apache/iceberg/mr/hive/HiveIcebergInputFormat.java
@@ -191,7 +191,7 @@ public class HiveIcebergInputFormat extends 
MapredIcebergInputFormat
 
   @Override
   public boolean shouldSkipCombine(Path path, Configuration conf) {
-return true;
+return false;
   }
 
   @Override
diff --git 
a/iceberg/iceberg-handler/src/main/java/org/apache/iceberg/mr/hive/HiveIcebergOutputCommitter.java
 
b/iceberg/iceberg-handler/src/main/java/org/apache/iceberg/mr/hive/HiveIcebergOutputCommitter.java
index b4d5ce98f59..d9f3116ff84 100644
--- 
a/iceberg/iceberg-handler/src/main/java/org/apache/iceberg/mr/hive/HiveIcebergOutputCommitter.java
+++ 
b/iceberg/iceberg-handler/src/main/java/org/apache/iceberg/mr/hive/HiveIcebergOutputCommitter.java
@@ -45,7 +45,10 @@ import org.apache.hadoop.hive.conf.HiveConf;
 import org.apache.hadoop.hive.conf.HiveConf.ConfVars;
 import org.apache.hadoop.hive.ql.Context.Operation;
 import org.apache.hadoop.hive.ql.Context.RewritePolicy;
+import org.apache.hadoop.hive.ql.exec.Utilities;
+import org.apache.hadoop.hive.ql.io.CombineHiveInputFormat;
 import org.apache.hadoop.hive.ql.metadata.HiveUtils;
+import org.apache.hadoop.hive.ql.plan.MapWork;
 import org.apache.hadoop.hive.ql.session.SessionStateUtil;
 import org.apache.hadoop.mapred.JobConf;
 import org.apache.hadoop.mapred.JobContext;
@@ -167,6 +170,8 @@ public class HiveIcebergOutputCommitter extends 
OutputCommitter {
   LOG.info("CommitTask found no serialized table in config for 
table: {}.", output);
 }
   }, IOException.class);
+
+  cleanMergeTaskInputFiles(jobConf, tableExecutor, context);
 } finally {
   if (tableExecutor != null) {
 tableExecutor.shutdown();
@@ -741,4 +746,65 @@ public class HiveIcebergOutputCommitter extends 
OutputCommitter {
   throw new NotFoundException("Can not read or parse committed file: %s", 
fileForCommitLocation);
 }
   }
+
+  public List getOutputFiles(List jobContexts) throws 
IOException {
+List outputs = collectOutputs(jobContexts);
+ExecutorService fileExecutor = 
fileExecutor(jobContexts.get(0).getJobConf());
+ExecutorService tableExecutor = 
tableExecutor(jobContexts.get(0).getJobConf(), outputs.size());
+Collection dataFiles = new ConcurrentLinkedQueue<>();
+try {
+  Tasks.foreach(outputs.stream().flatMap(kv -> kv.jobContexts.stream()
+  .map(jobContext -> new SimpleImmutableEntry<>(kv.table, 
jobContext
+  .suppressFailureWhenFinished()
+  .executeWith(tableExecutor)
+  .onFailure((output, exc) -> LOG.warn("Failed to retrieve merge 
input file for the table {}", output, exc))
+  .run(output -> {
+JobContext jobContext = output.getValue();
+JobConf jobConf = jobContext.getJobConf();
+LOG.info("Cleaning job for jobID: {}, table: {}", 
jobContext.getJobID(), output);
+
+Table table = output.getKey();
+FileSystem fileSystem = 

(hive) branch master updated: HIVE-28111: FSCountersSummary is only printed when LLAP IO is enabled (#5133) (Laszlo Bodor reviewed by Ayush Saxena)

2024-03-21 Thread abstractdog
This is an automated email from the ASF dual-hosted git repository.

abstractdog pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/hive.git


The following commit(s) were added to refs/heads/master by this push:
 new 5c05cc76056 HIVE-28111: FSCountersSummary is only printed when LLAP IO 
is enabled (#5133) (Laszlo Bodor reviewed by Ayush Saxena)
5c05cc76056 is described below

commit 5c05cc760564e8f2c3ded576d93894bf4612956a
Author: Bodor Laszlo 
AuthorDate: Thu Mar 21 10:06:41 2024 +0100

HIVE-28111: FSCountersSummary is only printed when LLAP IO is enabled 
(#5133) (Laszlo Bodor reviewed by Ayush Saxena)
---
 .../org/apache/hadoop/hive/ql/exec/tez/monitoring/TezJobMonitor.java| 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git 
a/ql/src/java/org/apache/hadoop/hive/ql/exec/tez/monitoring/TezJobMonitor.java 
b/ql/src/java/org/apache/hadoop/hive/ql/exec/tez/monitoring/TezJobMonitor.java
index fbbbf189e48..e5c4f2e09c5 100644
--- 
a/ql/src/java/org/apache/hadoop/hive/ql/exec/tez/monitoring/TezJobMonitor.java
+++ 
b/ql/src/java/org/apache/hadoop/hive/ql/exec/tez/monitoring/TezJobMonitor.java
@@ -445,8 +445,8 @@ public class TezJobMonitor {
   //llap IO summary
   if (HiveConf.getBoolVar(hiveConf, HiveConf.ConfVars.LLAP_IO_ENABLED, 
false)) {
 new LLAPioSummary(progressMap, dagClient).print(console);
-new FSCountersSummary(progressMap, dagClient).print(console);
   }
+  new FSCountersSummary(progressMap, dagClient).print(console);
   String wmQueue = HiveConf.getVar(hiveConf, 
ConfVars.HIVE_SERVER2_TEZ_INTERACTIVE_QUEUE);
   if (wmQueue != null && !wmQueue.isEmpty()) {
 new LlapWmSummary(progressMap, dagClient).print(console);