(hudi) annotated tag release-1.0.0-beta1-rc1 updated (2bc5518c138 -> e97aa6ccb56)

2023-11-10 Thread yihua
This is an automated email from the ASF dual-hosted git repository.

yihua pushed a change to annotated tag release-1.0.0-beta1-rc1
in repository https://gitbox.apache.org/repos/asf/hudi.git


*** WARNING: tag release-1.0.0-beta1-rc1 was modified! ***

from 2bc5518c138 (commit)
  to e97aa6ccb56 (tag)
 tagging 2bc5518c138d55581e1fd3f7327fc049cab3e77c (commit)
 replaces hoodie-0.4.7
  by Y Ethan Guo
  on Fri Nov 10 23:59:08 2023 -0800

- Log -
release-1.0.0-beta1
-BEGIN PGP SIGNATURE-

iQIzBAABCAAdFiEEDE0xZCfsqnGiCtlma+HUVMkPXqUFAmVPNEwACgkQa+HUVMkP
XqVzHg//X4bjru5AdLmrhzmvXqer6LPi8kWBp+gqcmGQtAhYlciw92G838jG8xTn
7mndWF9rU612id869frTrEOt3CIAgdQ7m4Jn3PVt2l8QSBChVcJ5LgGKFVHiRQdc
YdIKNkrKu3vgw9ip6O2tAUrXCNLPyao2yNb2Z5niWlwa3sIxb3APXv9gE5IO6vGD
tT7zaPK0ZI+l1mGSpajwaETe1rSKBYLV98MtX5vr6FYdS7jASdtKxavyfMeCWpwc
hw3fE0vpAgB3hkXkyg39KOapsQNjqFVnWKxYf3tpgxpanIhPPMisoS6PZIAq8A36
7/KijkpL0TD9S0NJPz76HhomITceF9QAzDM/FtTwioMVq8bj+BM/E4OccAfWechh
fpAxQPVCURljNztgBlcj9kXmyV76nrK2xxajUoDNqaPJKsqtNJ8X6Rez8KHR1Oi4
oZceFKBp4wYucxrTrj5WSVSW3WyLq6EB7RBJ/v3g17X0GF16idENsGOo3h9mkWP9
F+QKKAq5W0hXJgpJ9zCaQhEMONwFCMmJ2KQJqXIKJWjHAEvB0cJxEdiKNYYarrz8
1tjezuMEUCv1X/ciKWHgcXvYNMgP3TCXrAwnzRHFK998aJF4OMLuMvIaS5D4pKNp
4zwrGeCLXrU3j+BxAL+EKJDKGoNgn91Cb5/V1o4VEyka6QiPXhM=
=XPzw
-END PGP SIGNATURE-
---


No new revisions were added by this update.

Summary of changes:



[jira] [Closed] (HUDI-7074) Flink incremental query for non-blocking concurrency control

2023-11-10 Thread Sagar Sumit (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-7074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sagar Sumit closed HUDI-7074.
-
Resolution: Fixed

> Flink incremental query for non-blocking concurrency control
> 
>
> Key: HUDI-7074
> URL: https://issues.apache.org/jira/browse/HUDI-7074
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: flink
>Reporter: Danny Chen
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HUDI-6787) Integrate FileGroupReader with HoodieMergeOnReadSnapshotReader and RealtimeCompactedRecordReader for Hive

2023-11-10 Thread Sagar Sumit (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-6787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sagar Sumit updated HUDI-6787:
--
Fix Version/s: 1.0.0
   (was: 1.0.0-beta1)

> Integrate FileGroupReader with HoodieMergeOnReadSnapshotReader and 
> RealtimeCompactedRecordReader for Hive
> -
>
> Key: HUDI-6787
> URL: https://issues.apache.org/jira/browse/HUDI-6787
> Project: Apache Hudi
>  Issue Type: New Feature
>Reporter: Ethan Guo
>Priority: Blocker
> Fix For: 1.0.0
>
>






Re: [PR] [HUDI-7074] Flink incremental query for non-blocking concurrency control [hudi]

2023-11-10 Thread via GitHub


danny0405 commented on code in PR #10053:
URL: https://github.com/apache/hudi/pull/10053#discussion_r1390137807


##
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/HoodieMergedReadHandle.java:
##
@@ -99,7 +99,7 @@ private Option getLatestFileSlice() {
         && hoodieTable.getMetaClient().getCommitsTimeline().filterCompletedInstants().lastInstant().isPresent()) {
       return Option.fromJavaOptional(hoodieTable
           .getHoodieView()
-          .getLatestMergedFileSlicesBeforeOrOn(partitionPathFileIDPair.getLeft(), instantTime)
+          .getLatestFileSlices(partitionPathFileIDPair.getLeft())

Review Comment:
   It won't. The method is intended to return the latest commit, but it invokes
`getLatestMergedFileSlicesBeforeOrOn`, which has some problems in NB-CC
scenarios.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



(hudi) branch master updated (c260bce03ec -> 796ac8697e7)

2023-11-10 Thread danny0405

danny0405 pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git


from c260bce03ec [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 (#9998)
 add 796ac8697e7 [HUDI-7074] Flink incremental query for non-blocking concurrency control (#10053)

No new revisions were added by this update.

Summary of changes:
 .../org/apache/hudi/io/HoodieMergedReadHandle.java |  2 +-
 .../hudi/client/TestJavaHoodieBackedMetadata.java  | 21 -
 .../functional/TestHoodieBackedMetadata.java   | 26 +--
 .../hudi/metadata/HoodieTableMetadataUtil.java |  4 +-
 .../hudi/sink/bootstrap/BootstrapOperator.java |  2 +-
 .../partitioner/profile/DeltaWriteProfile.java |  2 +-
 .../apache/hudi/source/IncrementalInputSplits.java | 32 +++---
 .../apache/hudi/table/format/TestInputFormat.java  | 50 ++
 .../test/java/org/apache/hudi/utils/TestUtils.java | 11 +
 9 files changed, 135 insertions(+), 15 deletions(-)



Re: [PR] [HUDI-7074] Flink incremental query for non-blocking concurrency control [hudi]

2023-11-10 Thread via GitHub


danny0405 merged PR #10053:
URL: https://github.com/apache/hudi/pull/10053





Re: [PR] [HUDI-7074] Flink incremental query for non-blocking concurrency control [hudi]

2023-11-10 Thread via GitHub


danny0405 commented on code in PR #10053:
URL: https://github.com/apache/hudi/pull/10053#discussion_r1390137560


##
hudi-common/src/main/java/org/apache/hudi/metadata/HoodieTableMetadataUtil.java:
##
@@ -1000,7 +1000,9 @@ private static List getPartitionFileSlices(HoodieTableMetaClient meta
     if (mergeFileSlices) {
       if (metaClient.getActiveTimeline().filterCompletedInstants().lastInstant().isPresent()) {
         fileSliceStream = fsView.getLatestMergedFileSlicesBeforeOrOn(
-            partition, metaClient.getActiveTimeline().filterCompletedInstants().lastInstant().get().getTimestamp());
+            // including pending compaction instant as the last instant so that the finished delta commits
+            // that start earlier than the compaction can be queried.
+            partition, metaClient.getActiveTimeline().filterCompletedAndCompactionInstants().lastInstant().get().getTimestamp());

Review Comment:
   Yes, this just additionally includes the pending compaction instants. Because
file slicing is now based on completion time, we should use the compaction
instant to filter the file slices when there are delta commits with a smaller
start time but a greater completion time than the compaction.
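
The reasoning in this comment can be illustrated with a toy model. The sketch below is not Hudi code: `TimelineInstant`, `lastCompleted`, `lastCompletedOrCompaction`, and `sliceVisible` are hypothetical names, and the rule that a slice is returned only when its base instant time is at or before the cutoff is an assumption made for the illustration. It models a delta commit that starts at time 1 and completes at time 3, with a pending compaction scheduled at time 2, and assumes the delta commit's log file lands in the slice whose base instant is 2.

```java
import java.util.List;
import java.util.OptionalLong;

// Toy model for illustration only; none of these types are Hudi classes.
final class CompletionTimeCutoffSketch {

  static final class TimelineInstant {
    final String action;
    final long startTime;
    final Long completionTime; // null while the instant is still pending

    TimelineInstant(String action, long startTime, Long completionTime) {
      this.action = action;
      this.startTime = startTime;
      this.completionTime = completionTime;
    }

    boolean isCompleted() {
      return completionTime != null;
    }

    boolean isCompaction() {
      return "compaction".equals(action);
    }
  }

  // Cutoff derived from completed instants only.
  static OptionalLong lastCompleted(List<TimelineInstant> timeline) {
    return timeline.stream()
        .filter(TimelineInstant::isCompleted)
        .mapToLong(i -> i.startTime)
        .max();
  }

  // Cutoff that also considers compaction instants, whether pending or completed.
  static OptionalLong lastCompletedOrCompaction(List<TimelineInstant> timeline) {
    return timeline.stream()
        .filter(i -> i.isCompleted() || i.isCompaction())
        .mapToLong(i -> i.startTime)
        .max();
  }

  // Assumed visibility rule: a slice is returned when its base instant time <= cutoff.
  static boolean sliceVisible(long sliceBaseInstantTime, long cutoff) {
    return sliceBaseInstantTime <= cutoff;
  }

  static List<TimelineInstant> exampleTimeline() {
    return List.of(
        new TimelineInstant("deltacommit", 1L, 3L),   // starts before the compaction, completes after it
        new TimelineInstant("compaction", 2L, null)); // still pending
  }

  public static void main(String[] args) {
    List<TimelineInstant> timeline = exampleTimeline();
    long sliceBase = 2L; // assumed base instant of the slice holding the delta commit's log file

    // Cutoff is 1, so the slice based on instant 2 is filtered out.
    System.out.println("completed only: "
        + sliceVisible(sliceBase, lastCompleted(timeline).getAsLong()));
    // Cutoff is 2, so the slice becomes visible.
    System.out.println("with compaction: "
        + sliceVisible(sliceBase, lastCompletedOrCompaction(timeline).getAsLong()));
  }
}
```

Under these assumptions the first line prints `false` and the second prints `true`, which matches the intent stated in the code comment of the diff: data from a finished delta commit that started before the compaction stays queryable.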






Re: [PR] [HUDI-7074] Flink incremental query for non-blocking concurrency control [hudi]

2023-11-10 Thread via GitHub


codope commented on code in PR #10053:
URL: https://github.com/apache/hudi/pull/10053#discussion_r1390135823


##
hudi-common/src/main/java/org/apache/hudi/metadata/HoodieTableMetadataUtil.java:
##
@@ -1000,7 +1000,9 @@ private static List getPartitionFileSlices(HoodieTableMetaClient meta
     if (mergeFileSlices) {
       if (metaClient.getActiveTimeline().filterCompletedInstants().lastInstant().isPresent()) {
         fileSliceStream = fsView.getLatestMergedFileSlicesBeforeOrOn(
-            partition, metaClient.getActiveTimeline().filterCompletedInstants().lastInstant().get().getTimestamp());
+            // including pending compaction instant as the last instant so that the finished delta commits
+            // that start earlier than the compaction can be queried.
+            partition, metaClient.getActiveTimeline().filterCompletedAndCompactionInstants().lastInstant().get().getTimestamp());

Review Comment:
   Do you mean to filter **completed** compaction instants rather than **pending**
ones? The comment above says pending compaction.



##
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/HoodieMergedReadHandle.java:
##
@@ -99,7 +99,7 @@ private Option getLatestFileSlice() {
         && hoodieTable.getMetaClient().getCommitsTimeline().filterCompletedInstants().lastInstant().isPresent()) {
       return Option.fromJavaOptional(hoodieTable
           .getHoodieView()
-          .getLatestMergedFileSlicesBeforeOrOn(partitionPathFileIDPair.getLeft(), instantTime)
+          .getLatestFileSlices(partitionPathFileIDPair.getLeft())

Review Comment:
   Can it return uncommitted file slices? I think it won't, but just to confirm.






Re: [PR] [HUDI-7082] Add Flink 1.14 and Spark 3.13 docker image script [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10066:
URL: https://github.com/apache/hudi/pull/10066#issuecomment-1806697318

   
   ## CI report:
   
   * 8c32f7bc02016c301288575fabc9db2eac46aaab Azure: 
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20850)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [HUDI-5936] Fix serialization problem when FileStatus is not serializable [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10065:
URL: https://github.com/apache/hudi/pull/10065#issuecomment-1806674537

   
   ## CI report:
   
   * dc9735093064e2f15277330d56012cf7835604da UNKNOWN
   * f7d0cf8adf8e80204fe9de5f231d062ae43e6787 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20848)
 
   
   



svn commit: r65210 - in /dev/hudi/hudi-1.0.0-beta1-rc1: ./ hudi-1.0.0-beta1-rc1.src.tgz hudi-1.0.0-beta1-rc1.src.tgz.asc hudi-1.0.0-beta1-rc1.src.tgz.sha512

2023-11-10 Thread yihua
Author: yihua
Date: Sat Nov 11 04:24:11 2023
New Revision: 65210

Log:
Add Apache Hudi 1.0.0-beta1 RC1 source release

Added:
dev/hudi/hudi-1.0.0-beta1-rc1/
dev/hudi/hudi-1.0.0-beta1-rc1/hudi-1.0.0-beta1-rc1.src.tgz   (with props)
dev/hudi/hudi-1.0.0-beta1-rc1/hudi-1.0.0-beta1-rc1.src.tgz.asc
dev/hudi/hudi-1.0.0-beta1-rc1/hudi-1.0.0-beta1-rc1.src.tgz.sha512

Added: dev/hudi/hudi-1.0.0-beta1-rc1/hudi-1.0.0-beta1-rc1.src.tgz
==
Binary file - no diff available.

Propchange: dev/hudi/hudi-1.0.0-beta1-rc1/hudi-1.0.0-beta1-rc1.src.tgz
--
svn:mime-type = application/octet-stream

Added: dev/hudi/hudi-1.0.0-beta1-rc1/hudi-1.0.0-beta1-rc1.src.tgz.asc
==
--- dev/hudi/hudi-1.0.0-beta1-rc1/hudi-1.0.0-beta1-rc1.src.tgz.asc (added)
+++ dev/hudi/hudi-1.0.0-beta1-rc1/hudi-1.0.0-beta1-rc1.src.tgz.asc Sat Nov 11 
04:24:11 2023
@@ -0,0 +1,16 @@
+-BEGIN PGP SIGNATURE-
+
+iQIzBAABCAAdFiEEiIqTQeYA64VQqs1e+xt1BPf3cMkFAmVPAB4ACgkQ+xt1BPf3
+cMmutg/+J4Ih8KoHPpAJp4CwXRjgSlRRK8mHYxA9w2xXW9qo8baD4Sw/vU+jG9//
+Howc+RD4NPasN+jBaxvYMAqIJoGFzHQY3mcu+axMF6mVSfS4JsBZgET1Tzqh9SSF
+fSxOHJQUxbhtl/NQkveWmNlIapUpFxgkgerXOvRibyrb29WZGl55ZAGQVG1Jl3zR
+YIRe2Rvtqxi4I7ahkm/zDiwJn11ldEVoTeKLQ0+n/95IICdRwpTiYyhcvU/YlF+9
+OoT7zAmhJmGC7aFcdh5veJr0w1aeFLYqyxmh3kEukAIrgQRRqnCBYLk0whsiG0r1
+NEZ7/RfnlV8T5yc8x8AcaxzZo+Qdv+iOcHc8UeNm7Eg14GOrkfFEQZPRkLuyZgUn
+yZmRh44KCowVLVGSPujjATLNmv5XT+7TimwhO7/DSMZ9Io652Uzz1uQ3H7bQ8GAk
+RHvFyjdHarX+evCHK5YbTzQ/0rAJuEcUS9xjJyWZKrb9CnyDxIOzDOVixOYRrxnv
+RJ/Dti3Qhf9LYeHvSKzP2pDWbDoRaC4va3JfuxI6LqOZVrAl3vBraCjyKXeZLIVU
+W3D7ygklpSa/yZ6X90SfqxNuJZLbJkA3gHsVBzNmKFtH6Rw1NOfc2x3PZiIMPnXw
+xuVhoM41nJwn2a5PQ1Qq3OSLihB8YCKKILFV/Ncv/IBpLkmrI7s=
+=BJNM
+-END PGP SIGNATURE-

Added: dev/hudi/hudi-1.0.0-beta1-rc1/hudi-1.0.0-beta1-rc1.src.tgz.sha512
==
--- dev/hudi/hudi-1.0.0-beta1-rc1/hudi-1.0.0-beta1-rc1.src.tgz.sha512 (added)
+++ dev/hudi/hudi-1.0.0-beta1-rc1/hudi-1.0.0-beta1-rc1.src.tgz.sha512 Sat Nov 
11 04:24:11 2023
@@ -0,0 +1 @@
+80f1894c5199b1ea251b5890b52cf66258f0a5398d0cf5a0773dcfd2b592f12d2e7fb8968675d5a936effc727335aee9f943c6c5a8f38413cc6718f11af28c18
  hudi-1.0.0-beta1-rc1.src.tgz




(hudi) branch master updated: [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 (#9998)

2023-11-10 Thread yihua

yihua pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git


The following commit(s) were added to refs/heads/master by this push:
 new c260bce03ec [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 (#9998)
c260bce03ec is described below

commit c260bce03ec0b1abbd14af1e3ef9617bbae9e80a
Author: Sagar Sumit 
AuthorDate: Sat Nov 11 09:12:47 2023 +0530

[HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 (#9998)

This commit enables the following new features by default through configs:
- Write record positions to MOR log data blocks (`hoodie.write.record.positions`)
- Enable partial updates when possible for Spark SQL MERGE INTO statement (`hoodie.spark.sql.merge.into.partial.updates`)
- Use new file group reader for MOR snapshot queries (`hoodie.file.group.reader.enabled`)
- Use new Hudi Spark parquet file format for various types of queries (`hoodie.datasource.read.use.new.parquet.file.format`)

-

Co-authored-by: Y Ethan Guo 
---
 .../org/apache/hudi/config/HoodieWriteConfig.java  |  2 +-
 .../hudi/common/config/HoodieReaderConfig.java |  2 +-
 .../scala/org/apache/hudi/DataSourceOptions.scala  |  4 +-
 .../functional/TestParquetColumnProjection.scala   | 51 --
 .../hudi/functional/TestSparkDataSource.scala  |  3 ++
 .../apache/spark/sql/hudi/TestInsertTable.scala|  4 +-
 .../hudi/TestNestedSchemaPruningOptimization.scala |  6 +--
 .../TestHoodiePruneFileSourcePartitions.scala  |  6 +--
 .../utilities/sources/TestHoodieIncrSource.java|  2 +
 9 files changed, 44 insertions(+), 36 deletions(-)
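
Since this commit flips four defaults, a job that needs the earlier behavior would presumably have to set the keys explicitly. The sketch below only builds a plain option map from the four keys quoted in the commit message. `BetaDefaultsOverride` and `pinAll` are hypothetical names, and the sketch deliberately leaves out how the map is handed to a reader or writer, because that depends on the engine and on whether a key is a read or a write option.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative helper, not part of Hudi: pins the four config keys named in the
// commit message for HUDI-7076 to one explicit value.
final class BetaDefaultsOverride {

  // Keys copied verbatim from the commit message.
  static final String[] KEYS = {
      "hoodie.write.record.positions",
      "hoodie.spark.sql.merge.into.partial.updates",
      "hoodie.file.group.reader.enabled",
      "hoodie.datasource.read.use.new.parquet.file.format"
  };

  // Returns options that set every key to the given value.
  // Passing false reproduces the defaults that were in place before this commit.
  static Map<String, String> pinAll(boolean enabled) {
    Map<String, String> opts = new LinkedHashMap<>();
    for (String key : KEYS) {
      opts.put(key, String.valueOf(enabled));
    }
    return opts;
  }

  public static void main(String[] args) {
    // Print the overrides as key=value pairs, one per line.
    pinAll(false).forEach((k, v) -> System.out.println(k + "=" + v));
  }
}
```

Stating the values explicitly, rather than relying on defaults, keeps a job's behavior stable across an upgrade that changes those defaults.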

diff --git a/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieWriteConfig.java b/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieWriteConfig.java
index cd8f9f6b629..6a36e5025bc 100644
--- a/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieWriteConfig.java
+++ b/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieWriteConfig.java
@@ -751,7 +751,7 @@ public class HoodieWriteConfig extends HoodieConfig {
 
   public static final ConfigProperty WRITE_RECORD_POSITIONS = ConfigProperty
       .key("hoodie.write.record.positions")
-      .defaultValue(false)
+      .defaultValue(true)
       .markAdvanced()
       .sinceVersion("1.0.0")
       .withDocumentation("Whether to write record positions to the block header for data blocks containing updates and delete blocks. "
diff --git a/hudi-common/src/main/java/org/apache/hudi/common/config/HoodieReaderConfig.java b/hudi-common/src/main/java/org/apache/hudi/common/config/HoodieReaderConfig.java
index c572cc21adc..20e745d7a9a 100644
--- a/hudi-common/src/main/java/org/apache/hudi/common/config/HoodieReaderConfig.java
+++ b/hudi-common/src/main/java/org/apache/hudi/common/config/HoodieReaderConfig.java
@@ -54,7 +54,7 @@ public class HoodieReaderConfig extends HoodieConfig {
 
   public static final ConfigProperty FILE_GROUP_READER_ENABLED = ConfigProperty
       .key("hoodie.file.group.reader.enabled")
-      .defaultValue(false)
+      .defaultValue(true)
       .markAdvanced()
       .sinceVersion("1.0.0")
       .withDocumentation("Use engine agnostic file group reader if enabled");
diff --git a/hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/DataSourceOptions.scala b/hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/DataSourceOptions.scala
index dc54825ac90..efa9c9e692f 100644
--- a/hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/DataSourceOptions.scala
+++ b/hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/DataSourceOptions.scala
@@ -88,7 +88,7 @@ object DataSourceReadOptions {
 
   val USE_NEW_HUDI_PARQUET_FILE_FORMAT: ConfigProperty[String] = ConfigProperty
     .key("hoodie.datasource.read.use.new.parquet.file.format")
-    .defaultValue("false")
+    .defaultValue("true")
     .markAdvanced()
     .sinceVersion("0.14.0")
     .withDocumentation("Read using the new Hudi parquet file format. The new Hudi parquet file format is " +
@@ -558,7 +558,7 @@ object DataSourceWriteOptions {
 
   val ENABLE_MERGE_INTO_PARTIAL_UPDATES: ConfigProperty[Boolean] = ConfigProperty
     .key("hoodie.spark.sql.merge.into.partial.updates")
-    .defaultValue(false)
+    .defaultValue(true)
     .markAdvanced()
     .sinceVersion("1.0.0")
     .withDocumentation("Whether to write partial updates to the data blocks containing updates "
diff --git a/hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/hudi/functional/TestParquetColumnProjection.scala b/hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/hudi/functional/TestParquetColumnProjection.scala
index ee1edbcccb2..6ff7e5681e6 100644
--- 

Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


yihua merged PR #9998:
URL: https://github.com/apache/hudi/pull/9998





Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


yihua commented on PR #9998:
URL: https://github.com/apache/hudi/pull/9998#issuecomment-1806641779

   The fourth task (`UT FT other modules`) in Azure CI passes in #10055 based on the same set of changes.
   Screenshot: https://github.com/apache/hudi/assets/2497195/74e488c8-aaa3-45b9-9f3c-9a3227dc81c9





Re: [PR] [HUDI-7082] Add Flink 1.14 and Spark 3.13 docker image script [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10066:
URL: https://github.com/apache/hudi/pull/10066#issuecomment-1806641536

   
   ## CI report:
   
   * 8c32f7bc02016c301288575fabc9db2eac46aaab Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20850)
 
   
   



Re: [PR] [MINOR] Fix testRenamePartition [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10064:
URL: https://github.com/apache/hudi/pull/10064#issuecomment-1806641517

   
   ## CI report:
   
   * 4ab080293d7d6c5d93e0353397fe516ec47dd9a5 Azure: 
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20843)
 
   
   



Re: [PR] [HUDI-5936] Fix serialization problem when FileStatus is not serializable [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10065:
URL: https://github.com/apache/hudi/pull/10065#issuecomment-1806616830

   
   ## CI report:
   
   * 2c42b4127fa90eb1f5de9948b5a6bec9ed27662e Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20846)
 
   * dc9735093064e2f15277330d56012cf7835604da UNKNOWN
   * f7d0cf8adf8e80204fe9de5f231d062ae43e6787 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20848)
 
   
   



Re: [PR] [HUDI-7082] Add Flink 1.14 and Spark 3.13 docker image script [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10066:
URL: https://github.com/apache/hudi/pull/10066#issuecomment-1806638295

   
   ## CI report:
   
   * 8c32f7bc02016c301288575fabc9db2eac46aaab UNKNOWN
   
   



Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #9998:
URL: https://github.com/apache/hudi/pull/9998#issuecomment-1806638210

   
   ## CI report:
   
   * d1f415b1400e0ff853e38b48fdaa7a9fe7aa44d3 UNKNOWN
   * f7f4a9e3342020e4a1472f5a3b59d25f9c1ec090 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20842)
 
   
   



Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806635356

   
   ## CI report:
   
   * bfebcd456acf943c525ca2830ab864098a802e9a Azure: 
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20845)
 
   
   



Re: [PR] [HUDI-7072] Remove support for Flink 1.13 [hudi]

2023-11-10 Thread via GitHub


danny0405 commented on PR #10052:
URL: https://github.com/apache/hudi/pull/10052#issuecomment-1806635003

   Can you reference PR https://github.com/apache/hudi/pull/5072? I think the
code that is redundant between 1.13 and 1.14 can be moved back to the
`hudi-flink` module.





[PR] [HUDI-7082] Add Flink 1.14 and Spark 3.13 docker image script [hudi]

2023-11-10 Thread via GitHub


danny0405 opened a new pull request, #10066:
URL: https://github.com/apache/hudi/pull/10066

   ### Change Logs
   
   We are going to remove support for Flink 1.13, so this PR updates the bundle
validation scripts.
   
   ### Impact
   
   none
   
   ### Risk level (write none, low medium or high below)
   
   none
   
   ### Documentation Update
   
   _Describe any necessary documentation update if there is any new feature, 
config, or user-facing change_
   
   - _The config description must be updated if new configs are added or the 
default value of the configs are changed_
   - _Any new feature or user-facing change requires updating the Hudi website. 
Please create a Jira ticket, attach the
 ticket number here and follow the 
[instruction](https://hudi.apache.org/contribute/developer-setup#website) to 
make
 changes to the website._
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   





[jira] [Updated] (HUDI-7082) Add Flink 1.14 and Spark 3.13 docker image script

2023-11-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-7082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HUDI-7082:
-
Labels: pull-request-available  (was: )

> Add Flink 1.14 and Spark 3.13 docker image script
> -
>
> Key: HUDI-7082
> URL: https://issues.apache.org/jira/browse/HUDI-7082
> Project: Apache Hudi
>  Issue Type: Improvement
>Reporter: Danny Chen
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>






[jira] [Created] (HUDI-7082) Add Flink 1.14 and Spark 3.13 docker image script

2023-11-10 Thread Danny Chen (Jira)
Danny Chen created HUDI-7082:


 Summary: Add Flink 1.14 and Spark 3.13 docker image script
 Key: HUDI-7082
 URL: https://issues.apache.org/jira/browse/HUDI-7082
 Project: Apache Hudi
  Issue Type: Improvement
Reporter: Danny Chen
 Fix For: 1.0.0








Re: [PR] [HUDI-5936] Fix serialization problem when FileStatus is not serializable [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10065:
URL: https://github.com/apache/hudi/pull/10065#issuecomment-1806615305

   
   ## CI report:
   
   * 2c42b4127fa90eb1f5de9948b5a6bec9ed27662e Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20846)
 
   * dc9735093064e2f15277330d56012cf7835604da UNKNOWN
   * f7d0cf8adf8e80204fe9de5f231d062ae43e6787 UNKNOWN
   
   



Re: [PR] [HUDI-5936] Fix serialization problem when FileStatus is not serializable [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10065:
URL: https://github.com/apache/hudi/pull/10065#issuecomment-1806604213

   
   ## CI report:
   
   * 2c42b4127fa90eb1f5de9948b5a6bec9ed27662e Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20846)
 
   * dc9735093064e2f15277330d56012cf7835604da UNKNOWN
   
   



(hudi) branch master updated: [MINOR] Disable reader for test with enum (#10061)

2023-11-10 Thread yihua

yihua pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git


The following commit(s) were added to refs/heads/master by this push:
 new 558281ed430 [MINOR] Disable reader for test with enum (#10061)
558281ed430 is described below

commit 558281ed4303756ad7a00331e1568dbb107f8571
Author: Jon Vexler 
AuthorDate: Fri Nov 10 19:42:45 2023 -0500

[MINOR] Disable reader for test with enum (#10061)

Co-authored-by: Jonathan Vexler <=>
Co-authored-by: Y Ethan Guo 
---
 .../hudi/utilities/sources/HoodieIncrSource.java | 20 
 .../hudi/utilities/sources/TestHoodieIncrSource.java |  3 +++
 2 files changed, 23 insertions(+)
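
The `HoodieIncrSource` change in this commit copies only an allow-listed subset of the source properties into the options passed to the Spark reader. A standalone sketch of that filtering step follows. It uses `java.util.Properties` in place of Hudi's `TypedProperties`, and `ReadOptAllowList` and `selectReadOpts` are hypothetical names. The two keys are the ones that appear in the diff, with the second written out as the literal string that `HoodieReaderConfig.FILE_GROUP_READER_ENABLED` is keyed on.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Properties;
import java.util.Set;

// Illustrative only: mirrors the allow-list idea from the commit without depending on Hudi.
final class ReadOptAllowList {

  // Keys that may be forwarded to the reader.
  static final Set<String> READ_OPT_KEYS = Set.of(
      "hoodie.datasource.read.use.new.parquet.file.format",
      "hoodie.file.group.reader.enabled");

  // Copies only the allow-listed keys from props into a new map.
  static Map<String, String> selectReadOpts(Properties props) {
    Map<String, String> readOpts = new HashMap<>();
    for (String key : props.stringPropertyNames()) {
      if (READ_OPT_KEYS.contains(key)) {
        readOpts.put(key, props.getProperty(key));
      }
    }
    return readOpts;
  }

  public static void main(String[] args) {
    Properties props = new Properties();
    props.setProperty("hoodie.file.group.reader.enabled", "false");
    props.setProperty("example.unrelated.key", "ignored"); // hypothetical key, not forwarded
    System.out.println(selectReadOpts(props));
  }
}
```

An explicit allow-list keeps unrelated source properties from leaking into the reader options, at the cost of having to extend the set whenever another reader option should be configurable.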

diff --git a/hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/HoodieIncrSource.java b/hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/HoodieIncrSource.java
index fa316cf806f..aafd4c9e3b5 100644
--- a/hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/HoodieIncrSource.java
+++ b/hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/HoodieIncrSource.java
@@ -19,9 +19,11 @@
 package org.apache.hudi.utilities.sources;
 
 import org.apache.hudi.DataSourceReadOptions;
+import org.apache.hudi.common.config.HoodieReaderConfig;
 import org.apache.hudi.common.config.TypedProperties;
 import org.apache.hudi.common.model.HoodieRecord;
 import org.apache.hudi.common.table.timeline.TimelineUtils.HollowCommitHandling;
+import org.apache.hudi.common.util.CollectionUtils;
 import org.apache.hudi.common.util.Option;
 import org.apache.hudi.common.util.ReflectionUtils;
 import org.apache.hudi.common.util.collection.Pair;
@@ -38,6 +40,9 @@ import org.slf4j.Logger;
 import org.slf4j.LoggerFactory;
 
 import java.util.Collections;
+import java.util.HashMap;
+import java.util.Map;
+import java.util.Set;
 
 import static org.apache.hudi.DataSourceReadOptions.BEGIN_INSTANTTIME;
 import static org.apache.hudi.DataSourceReadOptions.END_INSTANTTIME;
@@ -58,6 +63,10 @@ import static 
org.apache.hudi.utilities.sources.helpers.IncrSourceHelper.getHoll
 public class HoodieIncrSource extends RowSource {
 
   private static final Logger LOG = 
LoggerFactory.getLogger(HoodieIncrSource.class);
+  public static final Set<String> HOODIE_INCR_SOURCE_READ_OPT_KEYS =
+  CollectionUtils.createImmutableSet(
+  "hoodie.datasource.read.use.new.parquet.file.format",
+  HoodieReaderConfig.FILE_GROUP_READER_ENABLED.key());
   private final Option<SnapshotLoadQuerySplitter> snapshotLoadQuerySplitter;
 
   public static class Config {
@@ -128,10 +137,19 @@ public class HoodieIncrSource extends RowSource {
 
HoodieIncrSourceConfig.HOODIE_DROP_ALL_META_FIELDS_FROM_SOURCE.defaultValue();
   }
 
+  private final Map<String, String> readOpts = new HashMap<>();
+
   public HoodieIncrSource(TypedProperties props, JavaSparkContext 
sparkContext, SparkSession sparkSession,
   SchemaProvider schemaProvider) {
 super(props, sparkContext, sparkSession, schemaProvider);
 
+for (Object key : props.keySet()) {
+  String keyString = key.toString();
+  if (HOODIE_INCR_SOURCE_READ_OPT_KEYS.contains(keyString)) {
+readOpts.put(keyString, props.getString(key.toString()));
+  }
+}
+
 this.snapshotLoadQuerySplitter = 
Option.ofNullable(props.getString(SNAPSHOT_LOAD_QUERY_SPLITTER_CLASS_NAME, 
null))
 .map(className -> (SnapshotLoadQuerySplitter) 
ReflectionUtils.loadClass(className,
 new Class[] {TypedProperties.class}, props));
@@ -181,6 +199,7 @@ public class HoodieIncrSource extends RowSource {
 // Do Incr pull. Set end instant if available
 if (queryInfo.isIncremental()) {
   source = sparkSession.read().format("org.apache.hudi")
+  .options(readOpts)
   .option(QUERY_TYPE().key(), QUERY_TYPE_INCREMENTAL_OPT_VAL())
   .option(BEGIN_INSTANTTIME().key(), queryInfo.getStartInstant())
   .option(END_INSTANTTIME().key(), queryInfo.getEndInstant())
@@ -192,6 +211,7 @@ public class HoodieIncrSource extends RowSource {
 } else {
   // if checkpoint is missing from source table, and if strategy is set to 
READ_UPTO_LATEST_COMMIT, we have to issue snapshot query
   Dataset snapshot = sparkSession.read().format("org.apache.hudi")
+  .options(readOpts)
   .option(DataSourceReadOptions.QUERY_TYPE().key(), 
DataSourceReadOptions.QUERY_TYPE_SNAPSHOT_OPT_VAL())
   .load(srcPath);
   if (snapshotLoadQuerySplitter.isPresent()) {
diff --git 
a/hudi-utilities/src/test/java/org/apache/hudi/utilities/sources/TestHoodieIncrSource.java
 
b/hudi-utilities/src/test/java/org/apache/hudi/utilities/sources/TestHoodieIncrSource.java
index d35041592aa..1b534c22c7e 100644
--- 
a/hudi-utilities/src/test/java/org/apache/hudi/utilities/sources/TestHoodieIncrSource.java
+++ 

Re: [PR] [MINOR] Disable reader for test with enum [hudi]

2023-11-10 Thread via GitHub


yihua merged PR #10061:
URL: https://github.com/apache/hudi/pull/10061


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.




Re: [PR] [HUDI-5936] Fix serialization problem when FileStatus is not serializable [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10065:
URL: https://github.com/apache/hudi/pull/10065#issuecomment-1806600513

   
   ## CI report:
   
   * 2c42b4127fa90eb1f5de9948b5a6bec9ed27662e Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20846)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [MINOR] Disable reader for test with enum [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10061:
URL: https://github.com/apache/hudi/pull/10061#issuecomment-1806600476

   
   ## CI report:
   
   * 1b2a9288f4d10a0c68336eaf97770b165a3bc773 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20839)
 
   * f26cf775abaed2874441043af23e2b95d1514855 UNKNOWN
   
   



Re: [PR] [HUDI-5936] Fix serialization problem when FileStatus is not serializable [hudi]

2023-11-10 Thread via GitHub


CTTY commented on PR #10065:
URL: https://github.com/apache/hudi/pull/10065#issuecomment-1806598734

   Reopened this because we are seeing another exception, triggered on a 
different method, which is not fixed.





Re: [PR] [HUDI-5936] Fix serialization problem when FileStatus is not serializable [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10065:
URL: https://github.com/apache/hudi/pull/10065#issuecomment-1806596299

   
   ## CI report:
   
   * 2c42b4127fa90eb1f5de9948b5a6bec9ed27662e UNKNOWN
   
   



Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806596253

   
   ## CI report:
   
   *  Unknown: [CANCELED](TBD) 
   * bfebcd456acf943c525ca2830ab864098a802e9a Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20845)
 
   
   



Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #9998:
URL: https://github.com/apache/hudi/pull/9998#issuecomment-1806596160

   
   ## CI report:
   
   * d1f415b1400e0ff853e38b48fdaa7a9fe7aa44d3 UNKNOWN
   * 4cabc36a8ca585b4b1f4ff845145618a70f67174 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20836)
 
   * f7f4a9e3342020e4a1472f5a3b59d25f9c1ec090 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20842)
 
   
   



Re: [PR] [MINOR] disable reader for test with enum [hudi]

2023-11-10 Thread via GitHub


yihua commented on code in PR #10061:
URL: https://github.com/apache/hudi/pull/10061#discussion_r1390059295


##
hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/HoodieIncrSource.java:
##
@@ -128,10 +130,16 @@ public static class Config {
 
HoodieIncrSourceConfig.HOODIE_DROP_ALL_META_FIELDS_FROM_SOURCE.defaultValue();
   }
 
+  private final Map<String, String> readOpts = new HashMap<>();
+
   public HoodieIncrSource(TypedProperties props, JavaSparkContext 
sparkContext, SparkSession sparkSession,
   SchemaProvider schemaProvider) {
 super(props, sparkContext, sparkSession, schemaProvider);
 
+for (Object key : props.keySet()) {
+  readOpts.put(key.toString(), props.getString(key.toString()));

Review Comment:
   Fixed.



##
hudi-utilities/src/test/java/org/apache/hudi/utilities/sources/TestHoodieIncrSource.java:
##
@@ -213,10 +215,9 @@ public void 
testHoodieIncrSourceInflightCommitBeforeCompletedCommit(HoodieTableT
 }
   }
 
-  @ParameterizedTest
-  @EnumSource(HoodieTableType.class)
-  public void testHoodieIncrSourceWithPendingTableServices(HoodieTableType 
tableType) throws IOException {
-this.tableType = tableType;
+  @Test
+  public void testHoodieIncrSourceWithPendingTableServices() throws 
IOException {
+this.tableType = MERGE_ON_READ;

Review Comment:
   Fixed.






Re: [PR] [HUDI-6658] inject filters for incremental query [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10063:
URL: https://github.com/apache/hudi/pull/10063#issuecomment-1806592047

   
   ## CI report:
   
   * dad8fa7cc5f3c3ff65fd0cd9024866083a67db61 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20841)
 
   
   



Re: [PR] [MINOR][Testing] Test hudi-cli module [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10060:
URL: https://github.com/apache/hudi/pull/10060#issuecomment-1806592026

   
   ## CI report:
   
   * dded663cc76f356a4b37941edeaf9dba3adf5c37 Azure: 
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20844)
 
   
   



Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806591999

   
   ## CI report:
   
   *  Unknown: [CANCELED](TBD) 
   * bfebcd456acf943c525ca2830ab864098a802e9a UNKNOWN
   
   



Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #9998:
URL: https://github.com/apache/hudi/pull/9998#issuecomment-1806591889

   
   ## CI report:
   
   * d1f415b1400e0ff853e38b48fdaa7a9fe7aa44d3 UNKNOWN
   * 647c783c527a1b18e47feaee3fb7335b37fe5f26 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20834)
 
   * 4cabc36a8ca585b4b1f4ff845145618a70f67174 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20836)
 
   * f7f4a9e3342020e4a1472f5a3b59d25f9c1ec090 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20842)
 
   
   



Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


yihua commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806582292

   @hudi-bot run azure





[PR] [HUDI-5936] Fix serialization problem when FileStatus is not serializable [hudi]

2023-11-10 Thread via GitHub


CTTY opened a new pull request, #10065:
URL: https://github.com/apache/hudi/pull/10065

   ### Change Logs
   
   Original PR: #8190 
   
   ### Impact
   
   _Describe any public API or user-facing feature change or any performance 
impact._
   
   ### Risk level (write none, low medium or high below)
   
   _If medium or high, explain what verification was done to mitigate the 
risks._
   
   ### Documentation Update
   
   _Describe any necessary documentation update if there is any new feature, 
config, or user-facing change_
   
   - _The config description must be updated if new configs are added or the 
default value of the configs are changed_
   - _Any new feature or user-facing change requires updating the Hudi website. 
Please create a Jira ticket, attach the
 ticket number here and follow the 
[instruction](https://hudi.apache.org/contribute/developer-setup#website) to 
make
 changes to the website._
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   





Re: [PR] [MINOR] Fix testRenamePartition [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10064:
URL: https://github.com/apache/hudi/pull/10064#issuecomment-1806565749

   
   ## CI report:
   
   * 4ab080293d7d6c5d93e0353397fe516ec47dd9a5 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20843)
 
   
   



Re: [PR] [MINOR][Testing] Test hudi-cli module [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10060:
URL: https://github.com/apache/hudi/pull/10060#issuecomment-1806565674

   
   ## CI report:
   
   * f70767fb0cf87e383474adc127779f6d31990742 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20838)
 
   * dded663cc76f356a4b37941edeaf9dba3adf5c37 UNKNOWN
   
   



Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #9998:
URL: https://github.com/apache/hudi/pull/9998#issuecomment-1806565428

   
   ## CI report:
   
   * d1f415b1400e0ff853e38b48fdaa7a9fe7aa44d3 UNKNOWN
   * 647c783c527a1b18e47feaee3fb7335b37fe5f26 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20834)
 
   * 4cabc36a8ca585b4b1f4ff845145618a70f67174 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20836)
 
   * f7f4a9e3342020e4a1472f5a3b59d25f9c1ec090 UNKNOWN
   
   



Re: [PR] [MINOR] Fix testRenamePartition [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10064:
URL: https://github.com/apache/hudi/pull/10064#issuecomment-1806561197

   
   ## CI report:
   
   * 4ab080293d7d6c5d93e0353397fe516ec47dd9a5 UNKNOWN
   
   



Re: [PR] [MINOR] disable reader for test with enum [hudi]

2023-11-10 Thread via GitHub


yihua commented on code in PR #10061:
URL: https://github.com/apache/hudi/pull/10061#discussion_r1390017284


##
hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/HoodieIncrSource.java:
##
@@ -128,10 +130,16 @@ public static class Config {
 
HoodieIncrSourceConfig.HOODIE_DROP_ALL_META_FIELDS_FROM_SOURCE.defaultValue();
   }
 
+  private final Map<String, String> readOpts = new HashMap<>();
+
   public HoodieIncrSource(TypedProperties props, JavaSparkContext 
sparkContext, SparkSession sparkSession,
   SchemaProvider schemaProvider) {
 super(props, sparkContext, sparkSession, schemaProvider);
 
+for (Object key : props.keySet()) {
+  readOpts.put(key.toString(), props.getString(key.toString()));

Review Comment:
   Instead of putting every property here, extract only 
"FILE_GROUP_READER_ENABLED" and "USE_NEW_HUDI_PARQUET_FILE_FORMAT"?
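
The suggestion above amounts to filtering the incoming properties against a small allow-list before handing them to the reader. A minimal sketch of that idea follows, using plain `java.util.Properties` rather than Hudi's `TypedProperties`. The class and method names (`ReadOptsFilter`, `extractReadOpts`) are hypothetical, and the key string `hoodie.file.group.reader.enabled` is an assumption standing in for `HoodieReaderConfig.FILE_GROUP_READER_ENABLED.key()`; only the parquet-format key appears literally in the merged diff.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Properties;
import java.util.Set;

// Hypothetical helper, not part of Hudi: keeps only allow-listed read options.
class ReadOptsFilter {

  // Allow-list of keys to forward to the reader.
  // The second key string is an assumption about the config's actual name.
  static final Set<String> READ_OPT_KEYS = Collections.unmodifiableSet(
      new HashSet<>(Arrays.asList(
          "hoodie.datasource.read.use.new.parquet.file.format",
          "hoodie.file.group.reader.enabled")));

  // Copy only the allow-listed entries out of props; everything else is ignored.
  static Map<String, String> extractReadOpts(Properties props) {
    Map<String, String> readOpts = new HashMap<>();
    for (String key : props.stringPropertyNames()) {
      if (READ_OPT_KEYS.contains(key)) {
        readOpts.put(key, props.getProperty(key));
      }
    }
    return readOpts;
  }

  public static void main(String[] args) {
    Properties props = new Properties();
    props.setProperty("hoodie.file.group.reader.enabled", "false");
    props.setProperty("some.unrelated.key", "ignored");
    // Only the allow-listed key survives the filter.
    System.out.println(extractReadOpts(props));
  }
}
```

Keeping the allow-list in one immutable set means a test can disable the new reader through properties without every unrelated property leaking into the read options.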



##
hudi-utilities/src/test/java/org/apache/hudi/utilities/sources/TestHoodieIncrSource.java:
##
@@ -213,10 +215,9 @@ public void 
testHoodieIncrSourceInflightCommitBeforeCompletedCommit(HoodieTableT
 }
   }
 
-  @ParameterizedTest
-  @EnumSource(HoodieTableType.class)
-  public void testHoodieIncrSourceWithPendingTableServices(HoodieTableType 
tableType) throws IOException {
-this.tableType = tableType;
+  @Test
+  public void testHoodieIncrSourceWithPendingTableServices() throws 
IOException {
+this.tableType = MERGE_ON_READ;

Review Comment:
   No need to change this?






Re: [PR] [HUDI-6613] implement inmemory file index to allow for glob paths [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10062:
URL: https://github.com/apache/hudi/pull/10062#issuecomment-1806556765

   
   ## CI report:
   
   * 6be57a2edc36106e71c267ad111fa1db4e9bf32a Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20840)
 
   
   



Re: [PR] [MINOR][Testing] Test hudi-cli module [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10060:
URL: https://github.com/apache/hudi/pull/10060#issuecomment-1806556736

   
   ## CI report:
   
   * f70767fb0cf87e383474adc127779f6d31990742 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20838)
 
   
   



Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806556723

   
   ## CI report:
   
   * f70767fb0cf87e383474adc127779f6d31990742 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20837)
 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20838)
 
   
   



[PR] [MINOR] Fix testRenamePartition [hudi]

2023-11-10 Thread via GitHub


linliu-code opened a new pull request, #10064:
URL: https://github.com/apache/hudi/pull/10064

   
   
   ### Change Logs
   Fix unit test: testRenamePartition
   
   ### Impact
   
   Fix the unit test.
   
   ### Risk level (write none, low medium or high below)
   
   Low.
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   





Re: [PR] [HUDI-6658] inject filters for incremental query [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10063:
URL: https://github.com/apache/hudi/pull/10063#issuecomment-1806528256

   
   ## CI report:
   
   * dad8fa7cc5f3c3ff65fd0cd9024866083a67db61 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20841)
 
   
   



Re: [PR] [HUDI-6613] implement inmemory file index to allow for glob paths [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10062:
URL: https://github.com/apache/hudi/pull/10062#issuecomment-1806528229

   
   ## CI report:
   
   * 6be57a2edc36106e71c267ad111fa1db4e9bf32a Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20840)
 
   
   



Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #9998:
URL: https://github.com/apache/hudi/pull/9998#issuecomment-1806527945

   
   ## CI report:
   
   * d1f415b1400e0ff853e38b48fdaa7a9fe7aa44d3 UNKNOWN
   * 647c783c527a1b18e47feaee3fb7335b37fe5f26 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20834)
 
   * 4cabc36a8ca585b4b1f4ff845145618a70f67174 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20836)
 
   
   



Re: [PR] [HUDI-6658] inject filters for incremental query [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10063:
URL: https://github.com/apache/hudi/pull/10063#issuecomment-1806523593

   
   ## CI report:
   
   * dad8fa7cc5f3c3ff65fd0cd9024866083a67db61 UNKNOWN
   
   



Re: [PR] [HUDI-6613] implement inmemory file index to allow for glob paths [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10062:
URL: https://github.com/apache/hudi/pull/10062#issuecomment-1806523558

   
   ## CI report:
   
   * 6be57a2edc36106e71c267ad111fa1db4e9bf32a UNKNOWN
   
   



Re: [PR] [MINOR] disable reader for test with enum [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10061:
URL: https://github.com/apache/hudi/pull/10061#issuecomment-1806518111

   
   ## CI report:
   
   * 1b2a9288f4d10a0c68336eaf97770b165a3bc773 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20839)
 
   
   



[jira] [Updated] (HUDI-6658) Implement MOR Incremental for new file format

2023-11-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-6658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HUDI-6658:
-
Labels: pull-request-available  (was: )

> Implement MOR Incremental for new file format
> -
>
> Key: HUDI-6658
> URL: https://issues.apache.org/jira/browse/HUDI-6658
> Project: Apache Hudi
>  Issue Type: Improvement
>Reporter: Jonathan Vexler
>Assignee: Jonathan Vexler
>Priority: Major
>  Labels: pull-request-available
>
> Implement MOR Incremental for new file format



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[PR] [HUDI-6658] inject filters for incremental query [hudi]

2023-11-10 Thread via GitHub


jonvex opened a new pull request, #10063:
URL: https://github.com/apache/hudi/pull/10063

   ### Change Logs
   
   Only Spark 3.3 is covered currently. I want to see how CI runs.
   
   ### Impact
   
   Incremental queries should work much better; in particular, df.count() 
should now be correct.
   
   ### Risk level (write none, low medium or high below)
   
   low
   
   ### Documentation Update
   
   N/A
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   





[jira] [Assigned] (HUDI-6658) Implement MOR Incremental for new file format

2023-11-10 Thread Jonathan Vexler (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-6658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Vexler reassigned HUDI-6658:
-

Assignee: Jonathan Vexler

> Implement MOR Incremental for new file format
> -
>
> Key: HUDI-6658
> URL: https://issues.apache.org/jira/browse/HUDI-6658
> Project: Apache Hudi
>  Issue Type: Improvement
>Reporter: Jonathan Vexler
>Assignee: Jonathan Vexler
>Priority: Major
>
> Implement MOR Incremental for new file format





Re: [PR] [HUDI-0000] do not merge. Testing batch reading [hudi]

2023-11-10 Thread via GitHub


jonvex closed pull request #10039: [HUDI-0000] do not merge. Testing batch 
reading
URL: https://github.com/apache/hudi/pull/10039





[PR] [HUDI-6613] implement inmemory file index to allow for glob paths [hudi]

2023-11-10 Thread via GitHub


jonvex opened a new pull request, #10062:
URL: https://github.com/apache/hudi/pull/10062

   ### Change Logs
   
   Implement an in-memory file index to allow queries with glob paths.
   
   ### Impact
   
   improve new reader
   
   ### Risk level (write none, low medium or high below)
   
   low
   
   ### Documentation Update
   
   N/A
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   





Re: [PR] [HUDI-0000] DO NOT MERGE Fix incr errors new reader [hudi]

2023-11-10 Thread via GitHub


jonvex closed pull request #9954: [HUDI-0000] DO NOT MERGE Fix incr errors new 
reader
URL: https://github.com/apache/hudi/pull/9954





[jira] [Updated] (HUDI-6613) New file format does not work with in memory index

2023-11-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-6613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HUDI-6613:
-
Labels: pull-request-available  (was: )

> New file format does not work with in memory index
> --
>
> Key: HUDI-6613
> URL: https://issues.apache.org/jira/browse/HUDI-6613
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: spark
>Reporter: Jonathan Vexler
>Assignee: Jonathan Vexler
>Priority: Major
>  Labels: pull-request-available
>
> When the index type is set with
> .option(HoodieIndexConfig.INDEX_TYPE.key, IndexType.INMEMORY.toString),
> the file index is empty, so we need to get the files from the glob paths.
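
A minimal sketch of listing files from a glob when no index is available to supply them. It uses `java.nio.file` on the local file system as a stand-in; the class name, method names, and directory layout are hypothetical and are not Hudi's implementation.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.FileSystems;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.PathMatcher;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical illustration only; not Hudi's file index.
final class GlobListingSketch {

    // Walks the tree under root and keeps regular files whose path, relative to root, matches the glob.
    static List<Path> listByGlob(Path root, String glob) {
        PathMatcher matcher = FileSystems.getDefault().getPathMatcher("glob:" + glob);
        try (Stream<Path> paths = Files.walk(root)) {
            return paths
                .filter(Files::isRegularFile)
                .filter(p -> matcher.matches(root.relativize(p)))
                .sorted()
                .collect(Collectors.toList());
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Builds a throwaway one-partition layout and returns how many files the glob selects.
    static int demo() {
        try {
            Path root = Files.createTempDirectory("glob-sketch");
            Path partition = Files.createDirectories(root.resolve("2023-11-10"));
            Files.createFile(partition.resolve("base-file.parquet"));
            Files.createFile(partition.resolve("delta.log"));
            return listByGlob(root, "*/*.parquet").size();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

In this glob syntax `*` does not cross directory boundaries, so `*/*.parquet` selects Parquet files exactly one directory below the root.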





[jira] [Assigned] (HUDI-6613) New file format does not work with in memory index

2023-11-10 Thread Jonathan Vexler (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-6613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Vexler reassigned HUDI-6613:
-

Assignee: Jonathan Vexler

> New file format does not work with in memory index
> --
>
> Key: HUDI-6613
> URL: https://issues.apache.org/jira/browse/HUDI-6613
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: spark
>Reporter: Jonathan Vexler
>Assignee: Jonathan Vexler
>Priority: Major
>
> When the index type is set with
> .option(HoodieIndexConfig.INDEX_TYPE.key, IndexType.INMEMORY.toString),
> the file index is empty, so we need to get the files from the glob paths.





[jira] [Updated] (HUDI-6613) New file format does not work with in memory index

2023-11-10 Thread Jonathan Vexler (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-6613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Vexler updated HUDI-6613:
--
Status: In Progress  (was: Open)

> New file format does not work with in memory index
> --
>
> Key: HUDI-6613
> URL: https://issues.apache.org/jira/browse/HUDI-6613
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: spark
>Reporter: Jonathan Vexler
>Assignee: Jonathan Vexler
>Priority: Major
>
> When the index type is set with
> .option(HoodieIndexConfig.INDEX_TYPE.key, IndexType.INMEMORY.toString),
> the file index is empty, so we need to get the files from the glob paths.





Re: [PR] [MINOR][Testing] Test hudi-cli module [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10060:
URL: https://github.com/apache/hudi/pull/10060#issuecomment-1806479127

   
   ## CI report:
   
   * f70767fb0cf87e383474adc127779f6d31990742 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20838)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [MINOR] disable reader for test with enum [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10061:
URL: https://github.com/apache/hudi/pull/10061#issuecomment-1806479164

   
   ## CI report:
   
   * 1b2a9288f4d10a0c68336eaf97770b165a3bc773 UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [MINOR][Testing] Test hudi-cli module [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10060:
URL: https://github.com/apache/hudi/pull/10060#issuecomment-1806468044

   
   ## CI report:
   
   * f70767fb0cf87e383474adc127779f6d31990742 UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #9998:
URL: https://github.com/apache/hudi/pull/9998#issuecomment-1806467419

   
   ## CI report:
   
   * d1f415b1400e0ff853e38b48fdaa7a9fe7aa44d3 UNKNOWN
   * 3b32f34ca7b86153850a6dea50b3de2c488b1b61 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20825)
 
   * 647c783c527a1b18e47feaee3fb7335b37fe5f26 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20834)
 
   * 4cabc36a8ca585b4b1f4ff845145618a70f67174 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20836)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806467914

   
   ## CI report:
   
   * 3fff661e4b61f2eec4148b3eb93110c8cdc8c8e1 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20833)
 
   * f70767fb0cf87e383474adc127779f6d31990742 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20837)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





[PR] [MINOR] disable reader for test with enum [hudi]

2023-11-10 Thread via GitHub


jonvex opened a new pull request, #10061:
URL: https://github.com/apache/hudi/pull/10061

   ### Change Logs
   
   _Describe context and summary for this change. Highlight if any code was 
copied._
   
   ### Impact
   
   _Describe any public API or user-facing feature change or any performance 
impact._
   
   ### Risk level (write none, low medium or high below)
   
   _If medium or high, explain what verification was done to mitigate the 
risks._
   
   ### Documentation Update
   
   _Describe any necessary documentation update if there is any new feature, 
config, or user-facing change_
   
   - _The config description must be updated if new configs are added or the 
default value of the configs are changed_
   - _Any new feature or user-facing change requires updating the Hudi website. 
Please create a Jira ticket, attach the
 ticket number here and follow the 
[instruction](https://hudi.apache.org/contribute/developer-setup#website) to 
make
 changes to the website._
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   





Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806460194

   
   ## CI report:
   
   * 3fff661e4b61f2eec4148b3eb93110c8cdc8c8e1 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20833)
 
   * f70767fb0cf87e383474adc127779f6d31990742 UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #9998:
URL: https://github.com/apache/hudi/pull/9998#issuecomment-1806459953

   
   ## CI report:
   
   * d1f415b1400e0ff853e38b48fdaa7a9fe7aa44d3 UNKNOWN
   * 3b32f34ca7b86153850a6dea50b3de2c488b1b61 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20825)
 
   * 647c783c527a1b18e47feaee3fb7335b37fe5f26 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20834)
 
   * 4cabc36a8ca585b4b1f4ff845145618a70f67174 UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





[jira] [Created] (HUDI-7081) Hoodie FilegroupReader cannot read Enums from MOR avro log blocks

2023-11-10 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7081:
-

 Summary: Hoodie FilegroupReader cannot read Enums from MOR avro 
log blocks
 Key: HUDI-7081
 URL: https://issues.apache.org/jira/browse/HUDI-7081
 Project: Apache Hudi
  Issue Type: Bug
Reporter: Jonathan Vexler


{code:java}
buildFileGroupIterator(
  Option.empty[PartitionedFile => Iterator[InternalRow]],
  partitionValues,
  Option.empty[HoodieBaseFile],
  getLogFilesFromSlice(fileSlice),
  requiredSchemaWithMandatory,
  outputSchema,
  partitionSchema,
  broadcastedHadoopConf.value.value,
  -1,
  -1,
  shouldUseRecordPosition
) {code}
This call only takes StructType schemas and converts them to Avro, so it 
treats enums as strings.
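
A toy model of why a round trip through a Spark schema loses the enum. Spark SQL has no enum type, so an Avro enum is represented as a string on the Spark side, and converting that string type back to Avro yields a plain string. The two Java enums below are hypothetical stand-ins for the Avro and Spark type systems, not the real converter classes.

```java
// Hypothetical illustration only; these enums stand in for Avro and Spark types.
final class EnumRoundTripSketch {

    // Simplified Avro field kinds.
    enum AvroKind { ENUM, STRING, INT }

    // Simplified Spark SQL types; note there is no enum counterpart.
    enum SparkKind { STRING_TYPE, INTEGER_TYPE }

    // Avro to Spark: ENUM and STRING both collapse to STRING_TYPE.
    static SparkKind toSpark(AvroKind kind) {
        switch (kind) {
            case ENUM:
            case STRING:
                return SparkKind.STRING_TYPE;
            default:
                return SparkKind.INTEGER_TYPE;
        }
    }

    // Spark to Avro: STRING_TYPE can only map back to STRING.
    static AvroKind toAvro(SparkKind kind) {
        return kind == SparkKind.STRING_TYPE ? AvroKind.STRING : AvroKind.INT;
    }

    // Converts an Avro kind to Spark and back again.
    static AvroKind roundTrip(AvroKind kind) {
        return toAvro(toSpark(kind));
    }

    public static void main(String[] args) {
        for (AvroKind kind : AvroKind.values()) {
            System.out.println(kind + " -> " + roundTrip(kind));
        }
    }
}
```

In this model the only way to keep ENUM is to carry the original Avro schema alongside the StructType rather than rebuilding it from the Spark types.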





[PR] [MINOR][Testing] Test hudi-cli module [hudi]

2023-11-10 Thread via GitHub


yihua opened a new pull request, #10060:
URL: https://github.com/apache/hudi/pull/10060

   ### Change Logs
   
   As above
   ### Impact
   
   none
   ### Risk level
   
   none
   ### Documentation Update
   
   N/A
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   





Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806397370

   
   ## CI report:
   
   * 3fff661e4b61f2eec4148b3eb93110c8cdc8c8e1 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20833)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





(hudi) branch master updated (a91bdece2c1 -> 256bbe9d455)

2023-11-10 Thread yihua
This is an automated email from the ASF dual-hosted git repository.

yihua pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git


from a91bdece2c1 [HUDI-7079] Disable new file format for metadata table (#10058)
 add 256bbe9d455 [MINOR] Disable reader in schema evolution tests with Streamer (#10059)

No new revisions were added by this update.

Summary of changes:
 .../TestHoodieDeltaStreamerSchemaEvolutionBase.java | 13 +
 .../TestHoodieDeltaStreamerSchemaEvolutionExtensive.java|  6 +++---
 .../TestHoodieDeltaStreamerSchemaEvolutionQuick.java|  2 +-
 3 files changed, 17 insertions(+), 4 deletions(-)



Re: [PR] [MINOR] disable reader in schema evo test [hudi]

2023-11-10 Thread via GitHub


yihua merged PR #10059:
URL: https://github.com/apache/hudi/pull/10059





[PR] [MINOR] disable reader in schema evo test [hudi]

2023-11-10 Thread via GitHub


jonvex opened a new pull request, #10059:
URL: https://github.com/apache/hudi/pull/10059

   ### Change Logs
   
   Disable the new reader explicitly.
   
   ### Impact
   
   The new reader is not supported in these tests currently, so with it disabled explicitly they will pass even if the reader is enabled by default.
   
   ### Risk level (write none, low medium or high below)
   
   none
   
   ### Documentation Update
   
   N/A
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   





Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806338086

   
   ## CI report:
   
   * 6dc1594a35d786c6efbf944f36c41b6e51e4acf1 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20830)
 
   * 3fff661e4b61f2eec4148b3eb93110c8cdc8c8e1 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20833)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #9998:
URL: https://github.com/apache/hudi/pull/9998#issuecomment-1806337863

   
   ## CI report:
   
   * d1f415b1400e0ff853e38b48fdaa7a9fe7aa44d3 UNKNOWN
   * 3b32f34ca7b86153850a6dea50b3de2c488b1b61 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20825)
 
   * 647c783c527a1b18e47feaee3fb7335b37fe5f26 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20834)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #9998:
URL: https://github.com/apache/hudi/pull/9998#issuecomment-1806329969

   
   ## CI report:
   
   * d1f415b1400e0ff853e38b48fdaa7a9fe7aa44d3 UNKNOWN
   * 3b32f34ca7b86153850a6dea50b3de2c488b1b61 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20825)
 
   * 647c783c527a1b18e47feaee3fb7335b37fe5f26 UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





[jira] [Updated] (HUDI-7080) Re-enable a few tests with new Spark file format and file group reader

2023-11-10 Thread Ethan Guo (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-7080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-7080:

Fix Version/s: 1.0.0

> Re-enable a few tests with new Spark file format and file group reader
> --
>
> Key: HUDI-7080
> URL: https://issues.apache.org/jira/browse/HUDI-7080
> Project: Apache Hudi
>  Issue Type: Improvement
>Reporter: Ethan Guo
>Priority: Major
> Fix For: 1.0.0
>
>






[jira] [Created] (HUDI-7080) Re-enable a few tests with new Spark file format and file group reader

2023-11-10 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7080:
---

 Summary: Re-enable a few tests with new Spark file format and file 
group reader
 Key: HUDI-7080
 URL: https://issues.apache.org/jira/browse/HUDI-7080
 Project: Apache Hudi
  Issue Type: Improvement
Reporter: Ethan Guo








Re: [PR] [MINOR][TESTING] Enable new configs by default for 1.0.0-beta - 2 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10054:
URL: https://github.com/apache/hudi/pull/10054#issuecomment-1806257277

   
   ## CI report:
   
   * 8f2e512d3591980d75d06bfb1086ac854b12745c UNKNOWN
   * 321ee11654d9e2627e6677179b3c9c26e233c538 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20829)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806239544

   
   ## CI report:
   
   * 021fbdb5dfe7b04a087815adf6afa51a27686523 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20826)
 
   * 6dc1594a35d786c6efbf944f36c41b6e51e4acf1 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20830)
 
   * 3fff661e4b61f2eec4148b3eb93110c8cdc8c8e1 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20833)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806231902

   
   ## CI report:
   
   * 021fbdb5dfe7b04a087815adf6afa51a27686523 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20826)
 
   * 6dc1594a35d786c6efbf944f36c41b6e51e4acf1 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20830)
 
   * 3fff661e4b61f2eec4148b3eb93110c8cdc8c8e1 UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [HUDI-7076] Turn on new features by default through configs for 1.0.0-beta1 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #9998:
URL: https://github.com/apache/hudi/pull/9998#issuecomment-1806231530

   
   ## CI report:
   
   * d1f415b1400e0ff853e38b48fdaa7a9fe7aa44d3 UNKNOWN
   * 3b32f34ca7b86153850a6dea50b3de2c488b1b61 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20825)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [HUDI-7074] Flink incremental query for non-blocking concurrency control [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10053:
URL: https://github.com/apache/hudi/pull/10053#issuecomment-1806209456

   
   ## CI report:
   
   * 240ab68dc60b1d30983a6b1c7a497e9d5b4d4e16 Azure: 
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20822)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   





Re: [PR] [HUDI-7079] disable new file format for metadata table [hudi]

2023-11-10 Thread via GitHub


yihua merged PR #10058:
URL: https://github.com/apache/hudi/pull/10058





(hudi) branch master updated: [HUDI-7079] Disable new file format for metadata table (#10058)

2023-11-10 Thread yihua
This is an automated email from the ASF dual-hosted git repository.

yihua pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git


The following commit(s) were added to refs/heads/master by this push:
 new a91bdece2c1 [HUDI-7079] Disable new file format for metadata table (#10058)
a91bdece2c1 is described below

commit a91bdece2c1782410009915881ff6488fdfbf973
Author: Jon Vexler 
AuthorDate: Fri Nov 10 13:04:42 2023 -0500

[HUDI-7079] Disable new file format for metadata table (#10058)

Disable the new file format for the metadata table because we haven't 
implemented HFile base files yet. Also added a new method to replace duplicated code.

Co-authored-by: Jonathan Vexler <=>
---
 .../java/org/apache/hudi/client/BaseHoodieTableServiceClient.java| 2 +-
 .../src/main/java/org/apache/hudi/table/HoodieTable.java | 2 +-
 .../java/org/apache/hudi/table/action/compact/CompactHelpers.java| 2 +-
 .../org/apache/hudi/common/bootstrap/index/HFileBootstrapIndex.java  | 4 +---
 .../java/org/apache/hudi/common/table/HoodieTableMetaClient.java | 5 +
 .../src/main/scala/org/apache/hudi/DefaultSource.scala   | 4 ++--
 6 files changed, 11 insertions(+), 8 deletions(-)

diff --git 
a/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/BaseHoodieTableServiceClient.java
 
b/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/BaseHoodieTableServiceClient.java
index 8b39a46cb07..112b8111004 100644
--- 
a/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/BaseHoodieTableServiceClient.java
+++ 
b/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/BaseHoodieTableServiceClient.java
@@ -952,7 +952,7 @@ public abstract class BaseHoodieTableServiceClient 
extends BaseHoodieCl
   // from the async indexer (`HoodieIndexer`).
   // TODO(HUDI-5733): This should be cleaned up once the proper fix of 
rollbacks in the
   //  metadata table is landed.
-  if (isMetadataTable(metaClient.getBasePathV2().toString())) {
+  if (metaClient.isMetadataTable()) {
 return 
inflightInstantsStream.map(HoodieInstant::getTimestamp).filter(entry -> {
   if (curInstantTime.isPresent()) {
 return !entry.equals(curInstantTime.get());
diff --git 
a/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/HoodieTable.java
 
b/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/HoodieTable.java
index c44d3b0f4cb..aa521fcc3e5 100644
--- 
a/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/HoodieTable.java
+++ 
b/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/HoodieTable.java
@@ -1010,7 +1010,7 @@ public abstract class HoodieTable implements 
Serializable {
 // Only execute metadata table deletion when all the following conditions 
are met
 // (1) This is data table
 // (2) Metadata table is disabled in HoodieWriteConfig for the writer
-return !HoodieTableMetadata.isMetadataTable(metaClient.getBasePath())
+return !metaClient.isMetadataTable()
 && !config.isMetadataTableEnabled();
   }
 
diff --git a/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/compact/CompactHelpers.java b/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/compact/CompactHelpers.java
index 6a1e3f14062..78ea56fd9ad 100644
--- a/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/compact/CompactHelpers.java
+++ b/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/compact/CompactHelpers.java
@@ -106,7 +106,7 @@ public class CompactHelpers {
   }
 
   public Option getInstantRange(HoodieTableMetaClient metaClient) {
-return HoodieTableMetadata.isMetadataTable(metaClient.getBasePathV2().toString())
+return metaClient.isMetadataTable()
 ? Option.of(getMetadataLogReaderInstantRange(metaClient)) : Option.empty();
   }
 
diff --git a/hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/HFileBootstrapIndex.java b/hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/HFileBootstrapIndex.java
index 27314f150dc..744f3dc3a2d 100644
--- a/hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/HFileBootstrapIndex.java
+++ b/hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/HFileBootstrapIndex.java
@@ -34,7 +34,6 @@ import org.apache.hudi.common.util.collection.Pair;
 import org.apache.hudi.exception.HoodieException;
 import org.apache.hudi.exception.HoodieIOException;
 import org.apache.hudi.io.storage.HoodieHFileUtils;
-import org.apache.hudi.metadata.HoodieTableMetadata;
 
 import org.apache.hadoop.conf.Configuration;
 import org.apache.hadoop.fs.FileSystem;
@@ -106,8 +105,7 @@ public class HFileBootstrapIndex extends BootstrapIndex {
   FileSystem fs = metaClient.getFs();
   // The metadata table is never bootstrapped, so the bootstrap index 

Re: [PR] [HUDI-7079] disable new file format for metadata table [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10058:
URL: https://github.com/apache/hudi/pull/10058#issuecomment-1806157600

   
   ## CI report:
   
   * 2227bae3742151691a638a70c94c9d44118be13a Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20831)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806157555

   
   ## CI report:
   
   * 021fbdb5dfe7b04a087815adf6afa51a27686523 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20826)
 
   * 6dc1594a35d786c6efbf944f36c41b6e51e4acf1 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20830)
 
   
   



Re: [PR] [MINOR][TESTING] Enable new configs by default for 1.0.0-beta - 2 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10054:
URL: https://github.com/apache/hudi/pull/10054#issuecomment-1806157511

   
   ## CI report:
   
   * 8f2e512d3591980d75d06bfb1086ac854b12745c UNKNOWN
   * 7be659422f59706308364bdab2ce3e78a2f08812 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20823)
 
   * 321ee11654d9e2627e6677179b3c9c26e233c538 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20829)
 
   
   



Re: [PR] [HUDI-7079] disable new file format for metadata table [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10058:
URL: https://github.com/apache/hudi/pull/10058#issuecomment-1806147191

   
   ## CI report:
   
   * 2227bae3742151691a638a70c94c9d44118be13a UNKNOWN
   
   



Re: [PR] [MINOR][TESTING] Enable new configs by default for 1.0.0-beta - 2 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10054:
URL: https://github.com/apache/hudi/pull/10054#issuecomment-1806147053

   
   ## CI report:
   
   * 8f2e512d3591980d75d06bfb1086ac854b12745c UNKNOWN
   * 7be659422f59706308364bdab2ce3e78a2f08812 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20823)
 
   * 321ee11654d9e2627e6677179b3c9c26e233c538 UNKNOWN
   
   



Re: [PR] [MINOR][TESTING] Enable configs testing 3 [hudi]

2023-11-10 Thread via GitHub


hudi-bot commented on PR #10055:
URL: https://github.com/apache/hudi/pull/10055#issuecomment-1806147122

   
   ## CI report:
   
   * 021fbdb5dfe7b04a087815adf6afa51a27686523 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=20826)
 
   * 6dc1594a35d786c6efbf944f36c41b6e51e4acf1 UNKNOWN
   
   



[jira] [Updated] (HUDI-7079) Disable new file reader for metadata table

2023-11-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-7079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HUDI-7079:
-
Labels: pull-request-available  (was: )

> Disable new file reader for metadata table
> --
>
> Key: HUDI-7079
> URL: https://issues.apache.org/jira/browse/HUDI-7079
> Project: Apache Hudi
>  Issue Type: Improvement
>Reporter: Jonathan Vexler
>Assignee: Jonathan Vexler
>Priority: Major
>  Labels: pull-request-available
>
> HFile base files are not yet implemented, so the metadata table should use
> the hoodie relations.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[PR] [HUDI-7079] disable new file format for metadata table [hudi]

2023-11-10 Thread via GitHub


jonvex opened a new pull request, #10058:
URL: https://github.com/apache/hudi/pull/10058

   ### Change Logs
   
   Disable the new file format for the metadata table because HFile base files are not implemented yet. Also extracted the duplicated code into a new method.
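
The gating described above can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: the class `FileFormatGate` and both method names are hypothetical, and the check assumes the metadata table lives under `<base path>/.hoodie/metadata`.

```java
// Hypothetical helper, for illustration only; not part of the Hudi codebase.
final class FileFormatGate {
    // Assumption: the metadata table's base path ends with ".hoodie/metadata".
    private static final String METADATA_TABLE_SUFFIX = "/.hoodie/metadata";

    private FileFormatGate() {
    }

    // Returns true when the given base path looks like a metadata table path.
    static boolean isMetadataTablePath(String basePath) {
        String path = basePath;
        // Ignore trailing slashes so "/t/.hoodie/metadata/" is also recognized.
        while (path.endsWith("/")) {
            path = path.substring(0, path.length() - 1);
        }
        return path.endsWith(METADATA_TABLE_SUFFIX);
    }

    // The new file format is used only when it is enabled AND the table is a
    // data table; metadata tables always fall back to the existing read path.
    static boolean useNewFileFormat(boolean newFormatEnabled, String basePath) {
        return newFormatEnabled && !isMetadataTablePath(basePath);
    }
}
```

Keeping the decision in one method is the point of the "new method for duplicated code" remark: every caller asks the same question and gets the same answer, so the metadata-table exception cannot be forgotten in one of the call sites.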
   
   ### Impact
   
   The metadata table will continue to work.
   
   ### Risk level (write none, low medium or high below)
   
   low
   
   ### Documentation Update
   
   N/A
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   





[jira] [Created] (HUDI-7079) Disable new file reader for metadata table

2023-11-10 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7079:
-

 Summary: Disable new file reader for metadata table
 Key: HUDI-7079
 URL: https://issues.apache.org/jira/browse/HUDI-7079
 Project: Apache Hudi
  Issue Type: Improvement
Reporter: Jonathan Vexler
Assignee: Jonathan Vexler


HFile base files are not yet implemented, so the metadata table should use the
hoodie relations.




