[GitHub] [hudi] nsivabalan commented on a diff in pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
nsivabalan commented on code in PR #5427: URL: https://github.com/apache/hudi/pull/5427#discussion_r858305310 ## hudi-common/src/main/java/org/apache/hudi/internal/schema/utils/InternalSchemaUtils.java: ## @@ -54,29 +58,75 @@ private InternalSchemaUtils() { */ public

[GitHub] [hudi] hudi-bot commented on pull request #5432: [HUDI-3977] Flink hudi table with date type partition path throws Hoo…

2022-04-25 Thread GitBox
hudi-bot commented on PR #5432: URL: https://github.com/apache/hudi/pull/5432#issuecomment-1109376729 ## CI report: * 1a53ea2b021079025b6a3fe6ebb1184d26a3aa64 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5432: [HUDI-3977] Flink hudi table with date type partition path throws Hoo…

2022-04-25 Thread GitBox
hudi-bot commented on PR #5432: URL: https://github.com/apache/hudi/pull/5432#issuecomment-1109375020 ## CI report: * 1a53ea2b021079025b6a3fe6ebb1184d26a3aa64 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5432: [HUDI-3977] Flink hudi table with date type partition path throws Hoo…

2022-04-25 Thread GitBox
hudi-bot commented on PR #5432: URL: https://github.com/apache/hudi/pull/5432#issuecomment-1109371861 ## CI report: * 1a53ea2b021079025b6a3fe6ebb1184d26a3aa64 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
hudi-bot commented on PR #5427: URL: https://github.com/apache/hudi/pull/5427#issuecomment-1109371819 ## CI report: * c71805c763f244e9e59832b9d67f48d74f1e9c64 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109371737 ## CI report: * 8c6f6e19940ce7ac04dfcfce52da3ccdaf3a8b0f UNKNOWN * dd2fea49a3161ed270b3f8f7e598beb6800178d8 Azure:

[GitHub] [hudi] sharathkola commented on issue #5223: [SUPPORT] - HUDI clustering - read issues

2022-04-25 Thread GitBox
sharathkola commented on issue #5223: URL: https://github.com/apache/hudi/issues/5223#issuecomment-1109352692 @suryaprasanna Can you please verify the commit_files.zip that I have attached above (it has 20220404094047.commit and 20220404094203.replacecommit files) to confirm if it has

[GitHub] [hudi] hudi-bot commented on pull request #5432: [HUDI-3977] Flink hudi table with date type partition path throws Hoo…

2022-04-25 Thread GitBox
hudi-bot commented on PR #5432: URL: https://github.com/apache/hudi/pull/5432#issuecomment-1109352618 ## CI report: * 1a53ea2b021079025b6a3fe6ebb1184d26a3aa64 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[jira] [Updated] (HUDI-3977) Flink hudi table with date type partition path throws HoodieNotSupportedException

2022-04-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3977: - Labels: pull-request-available (was: ) > Flink hudi table with date type partition path throws

[GitHub] [hudi] danny0405 opened a new pull request, #5432: [HUDI-3977] Flink hudi table with date type partition path throws Hoo…

2022-04-25 Thread GitBox
danny0405 opened a new pull request, #5432: URL: https://github.com/apache/hudi/pull/5432 …dieNotSupportedException ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109347481 ## CI report: * 8c6f6e19940ce7ac04dfcfce52da3ccdaf3a8b0f UNKNOWN * dd2fea49a3161ed270b3f8f7e598beb6800178d8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109345634 ## CI report: * 56068124025de8998ffd1c87b65ca67e80f2d62b Azure:

[GitHub] [hudi] rahil-c commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
rahil-c commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109344571 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109343824 ## CI report: * 56068124025de8998ffd1c87b65ca67e80f2d62b Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109342212 ## CI report: * 56068124025de8998ffd1c87b65ca67e80f2d62b Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109340628 ## CI report: * 56068124025de8998ffd1c87b65ca67e80f2d62b Azure:

[GitHub] [hudi] rahil-c commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
rahil-c commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109330190 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[GitHub] [hudi] chaplinthink commented on issue #3657: [SUPPORT] Failed to insert data by flink-sql

2022-04-25 Thread GitBox
chaplinthink commented on issue #3657: URL: https://github.com/apache/hudi/issues/3657#issuecomment-1109324024 @danny0405 Hi, I tried Hudi 0.10.0 with Flink 1.12.2 and 1.13.1. The problem is still there. I am testing Flink CDC to Hudi, but it does not work. -- This

[GitHub] [hudi] alexeykudinkin commented on pull request #5430: [WIP][Stacked on 5428] Optimize out mandatory columns when no merging is performed

2022-04-25 Thread GitBox
alexeykudinkin commented on PR #5430: URL: https://github.com/apache/hudi/pull/5430#issuecomment-1109310149 @nsivabalan this is for 0.12 not for 0.11 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109305034 ## CI report: * 56068124025de8998ffd1c87b65ca67e80f2d62b Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109303294 ## CI report: * aeb42e6848d1d5b53700e92f44c95fd18283bb14 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5428: [WIP][HUDI-3896] Porting Nested Schema Pruning optimization for Hudi's custom Relations

2022-04-25 Thread GitBox
hudi-bot commented on PR #5428: URL: https://github.com/apache/hudi/pull/5428#issuecomment-1109301959 ## CI report: * fd9570efbb7448e73976aaa8a14771f2e4daf67a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
hudi-bot commented on PR #5427: URL: https://github.com/apache/hudi/pull/5427#issuecomment-1109301948 ## CI report: * fe6cc9d4d51c6a8a6f2b8cbd969a06d835a4b8e0 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
hudi-bot commented on PR #5427: URL: https://github.com/apache/hudi/pull/5427#issuecomment-1109299948 ## CI report: * fe6cc9d4d51c6a8a6f2b8cbd969a06d835a4b8e0 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109299801 ## CI report: * aeb42e6848d1d5b53700e92f44c95fd18283bb14 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5430: [WIP][Stacked on 5428] Optimize out mandatory columns when no merging is performed

2022-04-25 Thread GitBox
hudi-bot commented on PR #5430: URL: https://github.com/apache/hudi/pull/5430#issuecomment-1109296967 ## CI report: * 968ca518a9b54ecf294387e4b3d3d761c8f8a3cd Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109296580 ## CI report: * aeb42e6848d1d5b53700e92f44c95fd18283bb14 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5428: [WIP][HUDI-3896] Porting Nested Schema Pruning optimization for Hudi's custom Relations

2022-04-25 Thread GitBox
hudi-bot commented on PR #5428: URL: https://github.com/apache/hudi/pull/5428#issuecomment-1109254506 ## CI report: * fd9570efbb7448e73976aaa8a14771f2e4daf67a Azure:

[jira] [Created] (HUDI-3977) Flink hudi table with date type partition path throws HoodieNotSupportedException

2022-04-25 Thread Danny Chen (Jira)
Danny Chen created HUDI-3977: Summary: Flink hudi table with date type partition path throws HoodieNotSupportedException Key: HUDI-3977 URL: https://issues.apache.org/jira/browse/HUDI-3977 Project:

[hudi] branch master updated (f2ba0fead2 -> 762623a15c)

2022-04-25 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from f2ba0fead2 [HUDI-3085] Improve bulk insert partitioner abstraction (#4441) add 762623a15c [HUDI-3972] Fixing

[GitHub] [hudi] nsivabalan merged pull request #5424: [HUDI-3972] Fixing hoodie.properties/tableConfig for no preCombine field with writes

2022-04-25 Thread GitBox
nsivabalan merged PR #5424: URL: https://github.com/apache/hudi/pull/5424 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] YuangZhang opened a new issue, #5431: [SUPPORT] Flink Date type as partition field

2022-04-25 Thread GitBox
YuangZhang opened a new issue, #5431: URL: https://github.com/apache/hudi/issues/5431 flink sql can't use date as partition field `create TABLE hudi_sink( role_id string, log_id string, origin_json string, origin_log string, ts timestamp(3), ds

[GitHub] [hudi] hudi-bot commented on pull request #5428: [WIP][HUDI-3896] Porting Nested Schema Pruning optimization for Hudi's custom Relations

2022-04-25 Thread GitBox
hudi-bot commented on PR #5428: URL: https://github.com/apache/hudi/pull/5428#issuecomment-1109249115 ## CI report: * fd9570efbb7448e73976aaa8a14771f2e4daf67a Azure:

[GitHub] [hudi] nsivabalan commented on a diff in pull request #5430: [WIP][Stacked on 5428] Optimize out mandatory columns when no merging is performed

2022-04-25 Thread GitBox
nsivabalan commented on code in PR #5430: URL: https://github.com/apache/hudi/pull/5430#discussion_r858193317 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieMergeOnReadRDD.scala: ## @@ -127,9 +130,9 @@ class HoodieMergeOnReadRDD(@transient sc:

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109249051 ## CI report: * 81356d0c5251f745dff71ea22bd4a4ad29f07561 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5428: [WIP][HUDI-3896] Porting Nested Schema Pruning optimization for Hudi's custom Relations

2022-04-25 Thread GitBox
hudi-bot commented on PR #5428: URL: https://github.com/apache/hudi/pull/5428#issuecomment-1109247528 ## CI report: * 1e47f0288921b821bbf29d3b72b1156e82aefd5c Azure:

[GitHub] [hudi] nsivabalan commented on a diff in pull request #5304: [DOCS] Add faq for async compaction options

2022-04-25 Thread GitBox
nsivabalan commented on code in PR #5304: URL: https://github.com/apache/hudi/pull/5304#discussion_r858192316 ## website/learn/faq.md: ## @@ -253,6 +253,25 @@ Simplest way to run compaction on MOR dataset is to run the [compaction inline]( That said, for obvious reasons of

[GitHub] [hudi] hudi-bot commented on pull request #5430: [WIP][Stacked on 5428] Optimize out mandatory columns when no merging is performed

2022-04-25 Thread GitBox
hudi-bot commented on PR #5430: URL: https://github.com/apache/hudi/pull/5430#issuecomment-1109246021 ## CI report: * e494e1f8865a09ff4be7fe5390cc6d348671be09 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5428: [WIP][HUDI-3896] Porting Nested Schema Pruning optimization for Hudi's custom Relations

2022-04-25 Thread GitBox
hudi-bot commented on PR #5428: URL: https://github.com/apache/hudi/pull/5428#issuecomment-1109246005 ## CI report: * 1e47f0288921b821bbf29d3b72b1156e82aefd5c Azure:

[GitHub] [hudi] suryaprasanna commented on issue #5223: [SUPPORT] - HUDI clustering - read issues

2022-04-25 Thread GitBox
suryaprasanna commented on issue #5223: URL: https://github.com/apache/hudi/issues/5223#issuecomment-1109245586 @nsivabalan I tried out both the 0.8.0 and 0.10.1 versions. My job does not return duplicates and considers only the latest files. I tried on both partitioned and

[GitHub] [hudi] hudi-bot commented on pull request #5430: [WIP][Stacked on 5428] Optimize out mandatory columns when no merging is performed

2022-04-25 Thread GitBox
hudi-bot commented on PR #5430: URL: https://github.com/apache/hudi/pull/5430#issuecomment-1109244114 ## CI report: * e494e1f8865a09ff4be7fe5390cc6d348671be09 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5424: [HUDI-3972] Fixing hoodie.properties/tableConfig for no preCombine field with writes

2022-04-25 Thread GitBox
hudi-bot commented on PR #5424: URL: https://github.com/apache/hudi/pull/5424#issuecomment-1109244099 ## CI report: * db866f91efcb820f77f86962f6263c80bebb7db8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5419: [WIP][HUDI-3088] Use Spark 3.2 as default Spark version

2022-04-25 Thread GitBox
hudi-bot commented on PR #5419: URL: https://github.com/apache/hudi/pull/5419#issuecomment-1109244061 ## CI report: * 901cf10311e7b2f0cba88c71bf1d8c6998bbd953 UNKNOWN * b8529d91bd8c7eae03c3c6c41374fa6625aadfc0 UNKNOWN * 4c42f0c2d4fc7af4be3d7247faf5dc087a54fbac Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109244009 ## CI report: * 81356d0c5251f745dff71ea22bd4a4ad29f07561 Azure:

[GitHub] [hudi] yihua commented on a diff in pull request #5430: [WIP][Stacked on 5428] Optimize out mandatory columns when no merging is performed

2022-04-25 Thread GitBox
yihua commented on code in PR #5430: URL: https://github.com/apache/hudi/pull/5430#discussion_r858186088 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/MergeOnReadSnapshotRelation.scala: ## @@ -144,6 +186,15 @@ class

[GitHub] [hudi] hudi-bot commented on pull request #5430: [WIP][Stacked on 5428] Optimize out mandatory columns when no merging is performed

2022-04-25 Thread GitBox
hudi-bot commented on PR #5430: URL: https://github.com/apache/hudi/pull/5430#issuecomment-1109223888 ## CI report: * e494e1f8865a09ff4be7fe5390cc6d348671be09 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109222178 ## CI report: * 65774002326a060b49e294793f4414fe2f31d812 Azure:

[GitHub] [hudi] alexeykudinkin opened a new pull request, #5430: [WIP] Optimize out mandatory columns when no merging is performed

2022-04-25 Thread GitBox
alexeykudinkin opened a new pull request, #5430: URL: https://github.com/apache/hudi/pull/5430 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the

[jira] [Updated] (HUDI-3582) Introduce Secondary Index to Improve HUDI Query Performance

2022-04-25 Thread shibei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shibei updated HUDI-3582: - Summary: Introduce Secondary Index to Improve HUDI Query Performance (was: Support record level index based on

[jira] [Updated] (HUDI-3907) RFC for Introduce Secondary Index to Improve HUDI Query Performance

2022-04-25 Thread shibei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shibei updated HUDI-3907: - Summary: RFC for Introduce Secondary Index to Improve HUDI Query Performance (was: RFC for lucene based record

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109216635 ## CI report: * 65774002326a060b49e294793f4414fe2f31d812 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-04-25 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1109215084 ## CI report: * 65774002326a060b49e294793f4414fe2f31d812 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
hudi-bot commented on PR #5427: URL: https://github.com/apache/hudi/pull/5427#issuecomment-1109213490 ## CI report: * fe6cc9d4d51c6a8a6f2b8cbd969a06d835a4b8e0 Azure:

[GitHub] [hudi] yihua commented on a diff in pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
yihua commented on code in PR #5427: URL: https://github.com/apache/hudi/pull/5427#discussion_r858156982 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/HoodieSparkUtils.scala: ## @@ -324,7 +326,14 @@ object HoodieSparkUtils extends SparkAdapterSupport {

[GitHub] [hudi] yihua commented on a diff in pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
yihua commented on code in PR #5427: URL: https://github.com/apache/hudi/pull/5427#discussion_r858155285 ## hudi-common/src/main/java/org/apache/hudi/internal/schema/utils/InternalSchemaUtils.java: ## @@ -54,13 +58,16 @@ private InternalSchemaUtils() { */ public static

[GitHub] [hudi] hudi-bot commented on pull request #5419: [WIP][HUDI-3088] Use Spark 3.2 as default Spark version

2022-04-25 Thread GitBox
hudi-bot commented on PR #5419: URL: https://github.com/apache/hudi/pull/5419#issuecomment-1109194738 ## CI report: * 901cf10311e7b2f0cba88c71bf1d8c6998bbd953 UNKNOWN * b8529d91bd8c7eae03c3c6c41374fa6625aadfc0 UNKNOWN * 96e73e9bea606cc38a9ef65896bfebfc24164a50 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5419: [WIP][HUDI-3088] Use Spark 3.2 as default Spark version

2022-04-25 Thread GitBox
hudi-bot commented on PR #5419: URL: https://github.com/apache/hudi/pull/5419#issuecomment-1109191313 ## CI report: * 901cf10311e7b2f0cba88c71bf1d8c6998bbd953 UNKNOWN * b8529d91bd8c7eae03c3c6c41374fa6625aadfc0 UNKNOWN * 96e73e9bea606cc38a9ef65896bfebfc24164a50 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5419: [WIP][HUDI-3088] Use Spark 3.2 as default Spark version

2022-04-25 Thread GitBox
hudi-bot commented on PR #5419: URL: https://github.com/apache/hudi/pull/5419#issuecomment-1109189434 ## CI report: * 901cf10311e7b2f0cba88c71bf1d8c6998bbd953 UNKNOWN * b8529d91bd8c7eae03c3c6c41374fa6625aadfc0 UNKNOWN * 39f75efcfc24c5004491c7971475a3c757c9ae61 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5428: [WIP][HUDI-3896] Porting Nested Schema Pruning optimization for Hudi's custom Relations

2022-04-25 Thread GitBox
hudi-bot commented on PR #5428: URL: https://github.com/apache/hudi/pull/5428#issuecomment-1109187860 ## CI report: * 1e47f0288921b821bbf29d3b72b1156e82aefd5c Azure:

[GitHub] [hudi] nsivabalan commented on a diff in pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
nsivabalan commented on code in PR #5427: URL: https://github.com/apache/hudi/pull/5427#discussion_r858134674 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/HoodieSparkUtils.scala: ## @@ -324,7 +326,14 @@ object HoodieSparkUtils extends SparkAdapterSupport {

[GitHub] [hudi] hudi-bot commented on pull request #5424: [HUDI-3972] Fixing hoodie.properties/tableConfig for no preCombine field with writes

2022-04-25 Thread GitBox
hudi-bot commented on PR #5424: URL: https://github.com/apache/hudi/pull/5424#issuecomment-1109184324 ## CI report: * c141846af3b5785d284d2a628eb77397df03cf52 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5424: [HUDI-3972] Fixing hoodie.properties/tableConfig for no preCombine field with writes

2022-04-25 Thread GitBox
hudi-bot commented on PR #5424: URL: https://github.com/apache/hudi/pull/5424#issuecomment-1109182229 ## CI report: * c141846af3b5785d284d2a628eb77397df03cf52 Azure:

[GitHub] [hudi] nsivabalan commented on a diff in pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
nsivabalan commented on code in PR #5427: URL: https://github.com/apache/hudi/pull/5427#discussion_r858131177 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/HoodieSparkUtils.scala: ## @@ -324,7 +326,14 @@ object HoodieSparkUtils extends SparkAdapterSupport {

[GitHub] [hudi] zhqu1148980644 commented on issue #5371: [SUPPORT] Hudi Compaction

2022-04-25 Thread GitBox
zhqu1148980644 commented on issue #5371: URL: https://github.com/apache/hudi/issues/5371#issuecomment-1109179429 I also have a question relating to async compaction. I found that the `org.apache.hudi.sink.compact.HoodieFlinkCompactor` job is a flink batch job, does this mean I have to run

[jira] [Updated] (HUDI-3928) Add savepoint and restore to CLI guide page

2022-04-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3928: - Labels: pull-request-available (was: ) > Add savepoint and restore to CLI guide page >

[GitHub] [hudi] nsivabalan commented on pull request #5429: [HUDI-3928][HUDI-3932] Adding docs for 0.11 release (savepoint restore to CLI, pulsar commit callback, hive schema provider)

2022-04-25 Thread GitBox
nsivabalan commented on PR #5429: URL: https://github.com/apache/hudi/pull/5429#issuecomment-1109172491 https://user-images.githubusercontent.com/513218/165196198-a6f23859-4070-49e4-be9a-29e8e9ffd0e5.png

[GitHub] [hudi] nsivabalan opened a new pull request, #5429: [HUDI-3928][HUDI-3932] Adding docs for 0.11 release (savepoint restore to CLI, pulsar commit callback, hive schema provider)

2022-04-25 Thread GitBox
nsivabalan opened a new pull request, #5429: URL: https://github.com/apache/hudi/pull/5429 ## What is the purpose of the pull request Adding docs for various features for 0.11. - Savepoint restore to CLI - Pulsar commit callback - Hive schema provider. ## Brief

[jira] [Commented] (HUDI-3976) Newly introduced HiveSyncConfig config, syncAsSparkDataSourceTable is defaulted as true

2022-04-25 Thread Surya Prasanna Yalla (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527819#comment-17527819 ] Surya Prasanna Yalla commented on HUDI-3976: CC [~pwason] [~shivnarayan] > Newly introduced

[jira] [Created] (HUDI-3976) Newly introduced HiveSyncConfig config, syncAsSparkDataSourceTable is defaulted as true

2022-04-25 Thread Surya Prasanna Yalla (Jira)
Surya Prasanna Yalla created HUDI-3976: -- Summary: Newly introduced HiveSyncConfig config, syncAsSparkDataSourceTable is defaulted as true Key: HUDI-3976 URL: https://issues.apache.org/jira/browse/HUDI-3976

[jira] [Closed] (HUDI-3081) Revisiting Read Path Infra across Query Engines

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-3081. Resolution: Done > Revisiting Read Path Infra across Query Engines >

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
alexeykudinkin commented on code in PR #5427: URL: https://github.com/apache/hudi/pull/5427#discussion_r858121011 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/HoodieSparkUtils.scala: ## @@ -324,7 +326,14 @@ object HoodieSparkUtils extends SparkAdapterSupport

[jira] [Updated] (HUDI-3247) Support incremental queries in AbstractHoodieTableFileIndex

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3247: - Epic Link: HUDI-2749 (was: HUDI-3081) > Support incremental queries in AbstractHoodieTableFileIndex >

[GitHub] [hudi] hudi-bot commented on pull request #5419: [WIP][HUDI-3088] Use Spark 3.2 as default Spark version

2022-04-25 Thread GitBox
hudi-bot commented on PR #5419: URL: https://github.com/apache/hudi/pull/5419#issuecomment-1109159873 ## CI report: * 901cf10311e7b2f0cba88c71bf1d8c6998bbd953 UNKNOWN * b8529d91bd8c7eae03c3c6c41374fa6625aadfc0 UNKNOWN * 39f75efcfc24c5004491c7971475a3c757c9ae61 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
hudi-bot commented on PR #5427: URL: https://github.com/apache/hudi/pull/5427#issuecomment-1109150765 ## CI report: * fe6cc9d4d51c6a8a6f2b8cbd969a06d835a4b8e0 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5428: [WIP][HUDI-3896] Porting Nested Schema Pruning optimization for Hudi's custom Relations

2022-04-25 Thread GitBox
hudi-bot commented on PR #5428: URL: https://github.com/apache/hudi/pull/5428#issuecomment-1109150785 ## CI report: * 1e47f0288921b821bbf29d3b72b1156e82aefd5c Azure:

[jira] [Assigned] (HUDI-3088) Make Spark 3 the default profile for build and test

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-3088: Assignee: Rahil Chertara (was: Forward Xu) > Make Spark 3 the default profile for build and test

[jira] [Updated] (HUDI-3088) Make Spark 3 the default profile for build and test

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3088: - Epic Link: HUDI-3431 (was: HUDI-1297) > Make Spark 3 the default profile for build and test >

[GitHub] [hudi] hudi-bot commented on pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
hudi-bot commented on PR #5427: URL: https://github.com/apache/hudi/pull/5427#issuecomment-1109149306 ## CI report: * fe6cc9d4d51c6a8a6f2b8cbd969a06d835a4b8e0 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #5428: [WIP][HUDI-3896] Porting Nested Schema Pruning optimization for Hudi's custom Relations

2022-04-25 Thread GitBox
hudi-bot commented on PR #5428: URL: https://github.com/apache/hudi/pull/5428#issuecomment-1109149326 ## CI report: * 1e47f0288921b821bbf29d3b72b1156e82aefd5c UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #5419: [WIP][HUDI-3088] Use Spark 3.2 as default Spark version

2022-04-25 Thread GitBox
hudi-bot commented on PR #5419: URL: https://github.com/apache/hudi/pull/5419#issuecomment-1109149279 ## CI report: * 901cf10311e7b2f0cba88c71bf1d8c6998bbd953 UNKNOWN * b8529d91bd8c7eae03c3c6c41374fa6625aadfc0 UNKNOWN * 39f75efcfc24c5004491c7971475a3c757c9ae61 Azure:

[jira] [Updated] (HUDI-3853) Integ Tests running against Spark3

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3853: - Epic Link: HUDI-3431 > Integ Tests running against Spark3 > -- > >

[jira] [Updated] (HUDI-3853) Integ Tests running against Spark3

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3853: - Issue Type: Improvement (was: Epic) > Integ Tests running against Spark3 >

[jira] [Updated] (HUDI-3853) Integ Tests running against Spark3

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3853: - Fix Version/s: 0.12.0 > Integ Tests running against Spark3 > -- > >

[jira] [Updated] (HUDI-3431) Certify Hudi against Spark3 Hive3 Hadoop3

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3431: - Fix Version/s: 0.12.0 > Certify Hudi against Spark3 Hive3 Hadoop3 >

[jira] [Updated] (HUDI-3414) Table data encryption

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3414: - Fix Version/s: 1.0.0 > Table data encryption > - > > Key: HUDI-3414 >

[jira] [Updated] (HUDI-3414) Table data encryption

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3414: - Fix Version/s: 0.11.0 > Table data encryption > - > > Key: HUDI-3414

[GitHub] [hudi] hudi-bot commented on pull request #5424: [HUDI-3972] Fixing hoodie.properties/tableConfig for no preCombine field with writes

2022-04-25 Thread GitBox
hudi-bot commented on PR #5424: URL: https://github.com/apache/hudi/pull/5424#issuecomment-1109147860 ## CI report: * c141846af3b5785d284d2a628eb77397df03cf52 Azure:

[jira] [Updated] (HUDI-3896) Support Spark optimizations for `HadoopFsRelation`

2022-04-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3896: - Labels: pull-request-available (was: ) > Support Spark optimizations for `HadoopFsRelation` >

[GitHub] [hudi] alexeykudinkin opened a new pull request, #5428: [HUDI-3896] Porting Nested Schema Pruning optimization for Hudi's custom Relations

2022-04-25 Thread GitBox
alexeykudinkin opened a new pull request, #5428: URL: https://github.com/apache/hudi/pull/5428

[jira] [Updated] (HUDI-3049) Use flink table name as default synced hive table name

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3049: - Fix Version/s: 0.12.0 (was: 0.11.0) > Use flink table name as default synced hive

[jira] [Updated] (HUDI-992) For hive-style partitioned source data, partition columns synced with Hive will always have String type

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-992: Fix Version/s: (was: 0.11.0) > For hive-style partitioned source data, partition columns synced with

[GitHub] [hudi] nsivabalan commented on a diff in pull request #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
nsivabalan commented on code in PR #5427: URL: https://github.com/apache/hudi/pull/5427#discussion_r858106552 ## hudi-common/src/main/java/org/apache/hudi/internal/schema/utils/InternalSchemaUtils.java: ## @@ -54,13 +58,16 @@ private InternalSchemaUtils() { */ public

[jira] [Updated] (HUDI-3970) Close testing gap in e2e test

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3970: - Fix Version/s: 0.11.1 > Close testing gap in e2e test > - > >

[jira] [Updated] (HUDI-3974) Fix upgrade step wrt precombine field

2022-04-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3974: - Labels: pull-request-available (was: ) > Fix upgrade step wrt precombine field >

[GitHub] [hudi] yihua opened a new pull request, #5427: [HUDI-3974] Fix schema projection to skip non-existent preCombine field

2022-04-25 Thread GitBox
yihua opened a new pull request, #5427: URL: https://github.com/apache/hudi/pull/5427

[jira] [Updated] (HUDI-512) Support Logical Partitioning

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-512: Epic Name: Logical partitioning > Support Logical Partitioning > > >

[jira] [Updated] (HUDI-3908) Profile MOR snapshot query flow

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3908: - Fix Version/s: 0.12.0 > Profile MOR snapshot query flow > --- > >

[jira] [Updated] (HUDI-3410) Revisit Record-reading Abstractions

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3410: - Fix Version/s: 0.12.0 > Revisit Record-reading Abstractions > --- > >

[GitHub] [hudi] alexeykudinkin commented on pull request #5424: [HUDI-3972] Fixing hoodie.properties/tableConfig for no preCombine field with writes

2022-04-25 Thread GitBox
alexeykudinkin commented on PR #5424: URL: https://github.com/apache/hudi/pull/5424#issuecomment-1109132603 @hudi-bot run azure

[jira] [Updated] (HUDI-538) [UMBRELLA] Restructuring hudi client module for multi engine support

2022-04-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-538: Fix Version/s: 0.12.0 > [UMBRELLA] Restructuring hudi client module for multi engine support >
