[GitHub] [hudi] jonvex commented on a diff in pull request #9083: [HUDI-6464] Spark SQL Merge Into for pkless tables

2023-07-03 Thread via GitHub
jonvex commented on code in PR #9083: URL: https://github.com/apache/hudi/pull/9083#discussion_r1251115598 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestMergeIntoTable2.scala: ## @@ -373,6 +373,7 @@ class TestMergeIntoTable2 extends

[GitHub] [hudi] jonvex commented on a diff in pull request #9083: [HUDI-6464] Spark SQL Merge Into for pkless tables

2023-07-03 Thread via GitHub
jonvex commented on code in PR #9083: URL: https://github.com/apache/hudi/pull/9083#discussion_r1251109286 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieCreateRecordUtils.scala: ## @@ -243,27 +261,27 @@ object HoodieCreateRecordUtils { def

[GitHub] [hudi] jonvex commented on a diff in pull request #9083: [HUDI-6464] Spark SQL Merge Into for pkless tables

2023-07-03 Thread via GitHub
jonvex commented on code in PR #9083: URL: https://github.com/apache/hudi/pull/9083#discussion_r1251109002 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieCreateRecordUtils.scala: ## @@ -66,6 +82,8 @@ object HoodieCreateRecordUtils {

[GitHub] [hudi] hudi-bot commented on pull request #9116: [HUDI-6470] Add spark sql conf in AlterTableCommand

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9116: URL: https://github.com/apache/hudi/pull/9116#issuecomment-1618870184 ## CI report: * 80b8747dfec3e4cac7719796a46e52e9846081f5 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9083: [HUDI-6464] Spark SQL Merge Into for pkless tables

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9083: URL: https://github.com/apache/hudi/pull/9083#issuecomment-1618869877 ## CI report: * 3a0bfb88049cf2c0f8afe5c925dbd76fa6f7cd89 UNKNOWN * b402e3b9b1090857748665ff0628492781758b8a Azure:

[GitHub] [hudi] jonvex commented on a diff in pull request #9083: [HUDI-6464] Spark SQL Merge Into for pkless tables

2023-07-03 Thread via GitHub
jonvex commented on code in PR #9083: URL: https://github.com/apache/hudi/pull/9083#discussion_r1251101868 ## hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/index/SparkHoodieIndexFactory.java: ## @@ -42,7 +42,13 @@ * A factory to generate Spark {@link

[GitHub] [hudi] jonvex commented on a diff in pull request #9083: [HUDI-6464] Spark SQL Merge Into for pkless tables

2023-07-03 Thread via GitHub
jonvex commented on code in PR #9083: URL: https://github.com/apache/hudi/pull/9083#discussion_r1251099307 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/index/HoodieInternalProxyIndex.java: ## @@ -0,0 +1,70 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] [hudi] hudi-bot commented on pull request #9116: [HUDI-6470] Add spark sql conf in AlterTableCommand

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9116: URL: https://github.com/apache/hudi/pull/9116#issuecomment-1618854202 ## CI report: * 80b8747dfec3e4cac7719796a46e52e9846081f5 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #9083: [HUDI-6464] Spark SQL Merge Into for pkless tables

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9083: URL: https://github.com/apache/hudi/pull/9083#issuecomment-1618853799 ## CI report: * 3a0bfb88049cf2c0f8afe5c925dbd76fa6f7cd89 UNKNOWN * b402e3b9b1090857748665ff0628492781758b8a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9115: [HUDI-6469] Revert HUDI-6311

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9115: URL: https://github.com/apache/hudi/pull/9115#issuecomment-1618836501 ## CI report: * 2a046240c1e7c0a18f9b57c0845298ea65b72951 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9038: [HUDI-6423] Incremental cleaning should consider inflight compaction instant

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9038: URL: https://github.com/apache/hudi/pull/9038#issuecomment-1618835690 ## CI report: * a65a29c0cf1c8feb9f39e168ba80c99ebcae1c5d UNKNOWN * f7bbb54e37438e281310671b8e7755b7b54f2de8 Azure:

[GitHub] [hudi] jonvex commented on a diff in pull request #9083: [HUDI-6464] Spark SQL Merge Into for pkless tables

2023-07-03 Thread via GitHub
jonvex commented on code in PR #9083: URL: https://github.com/apache/hudi/pull/9083#discussion_r1251061344 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/spark/sql/hudi/command/MergeIntoKeyGenerator.scala: ## @@ -0,0 +1,85 @@ +/* + * Licensed to the Apache

[GitHub] [hudi] jonvex commented on a diff in pull request #9083: [HUDI-6464] Spark SQL Merge Into for pkless tables

2023-07-03 Thread via GitHub
jonvex commented on code in PR #9083: URL: https://github.com/apache/hudi/pull/9083#discussion_r1251060950 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieCreateRecordUtils.scala: ## @@ -243,27 +261,27 @@ object HoodieCreateRecordUtils { def

[GitHub] [hudi] jonvex commented on a diff in pull request #9083: [HUDI-6464] Spark SQL Merge Into for pkless tables

2023-07-03 Thread via GitHub
jonvex commented on code in PR #9083: URL: https://github.com/apache/hudi/pull/9083#discussion_r1251058735 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieCreateRecordUtils.scala: ## @@ -66,6 +82,8 @@ object HoodieCreateRecordUtils {

[jira] [Updated] (HUDI-6470) Add spark sql conf in AlterTableCommand

2023-07-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6470: - Labels: pull-request-available (was: ) > Add spark sql conf in AlterTableCommand >

[GitHub] [hudi] jonvex commented on a diff in pull request #9083: [HUDI-6464] Spark SQL Merge Into for pkless tables

2023-07-03 Thread via GitHub
jonvex commented on code in PR #9083: URL: https://github.com/apache/hudi/pull/9083#discussion_r1251058371 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieCreateRecordUtils.scala: ## @@ -46,16 +47,31 @@ import

[GitHub] [hudi] Zouxxyy opened a new pull request, #9116: [HUDI-6470] Add spark sql conf in AlterTableCommand

2023-07-03 Thread via GitHub
Zouxxyy opened a new pull request, #9116: URL: https://github.com/apache/hudi/pull/9116 ### Change Logs Add spark sql conf in AlterTableCommand ### Impact Add spark sql conf in AlterTableCommand ### Risk level (write none, low medium or high below) low

[jira] [Updated] (HUDI-6470) Add spark sql conf in AlterTableCommand

2023-07-03 Thread zouxxyy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zouxxyy updated HUDI-6470: -- Summary: Add spark sql conf in AlterTableCommand (was: Add spark sql conf in AlterTableCommond) > Add spark

[GitHub] [hudi] hudi-bot commented on pull request #9115: [HUDI-6469] Revert HUDI-6311

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9115: URL: https://github.com/apache/hudi/pull/9115#issuecomment-1618718335 ## CI report: * 2a046240c1e7c0a18f9b57c0845298ea65b72951 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #9038: [HUDI-6423] Incremental cleaning should consider inflight compaction instant

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9038: URL: https://github.com/apache/hudi/pull/9038#issuecomment-1618717281 ## CI report: * a65a29c0cf1c8feb9f39e168ba80c99ebcae1c5d UNKNOWN * b58c9f0078da328bee1ceb8f178cfad15a25b271 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8668: [HUDI-3639] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-07-03 Thread via GitHub
hudi-bot commented on PR #8668: URL: https://github.com/apache/hudi/pull/8668#issuecomment-1618714185 ## CI report: * 138705f83063d40e3abc5d33df1562041947b7d2 Azure:

[jira] [Created] (HUDI-6470) Add spark sql conf in AlterTableCommond

2023-07-03 Thread zouxxyy (Jira)
zouxxyy created HUDI-6470: - Summary: Add spark sql conf in AlterTableCommond Key: HUDI-6470 URL: https://issues.apache.org/jira/browse/HUDI-6470 Project: Apache Hudi Issue Type: Bug

[GitHub] [hudi] hudi-bot commented on pull request #9038: [HUDI-6423] Incremental cleaning should consider inflight compaction instant

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9038: URL: https://github.com/apache/hudi/pull/9038#issuecomment-1618678548 ## CI report: * a65a29c0cf1c8feb9f39e168ba80c99ebcae1c5d UNKNOWN * b58c9f0078da328bee1ceb8f178cfad15a25b271 Azure:

[GitHub] [hudi] ad1happy2go commented on issue #8965: [SUPPORT]NoSuchMethodError: org.apache.curator.CuratorZookeeperClient.startAdvancedTrace

2023-07-03 Thread via GitHub
ad1happy2go commented on issue #8965: URL: https://github.com/apache/hudi/issues/8965#issuecomment-1618626587 Thanks @nb for the insight. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[jira] [Updated] (HUDI-6469) Revert HUDI-6311

2023-07-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6469: - Labels: pull-request-available (was: ) > Revert HUDI-6311 > > >

[GitHub] [hudi] jonvex opened a new pull request, #9115: [HUDI-6469] Revert HUDI-6311

2023-07-03 Thread via GitHub
jonvex opened a new pull request, #9115: URL: https://github.com/apache/hudi/pull/9115 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any performance

[jira] [Created] (HUDI-6469) Revert HUDI-6311

2023-07-03 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-6469: - Summary: Revert HUDI-6311 Key: HUDI-6469 URL: https://issues.apache.org/jira/browse/HUDI-6469 Project: Apache Hudi Issue Type: Bug Components:

[GitHub] [hudi] hudi-bot commented on pull request #9038: [HUDI-6423] Incremental cleaning should consider inflight compaction instant

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9038: URL: https://github.com/apache/hudi/pull/9038#issuecomment-1618514570 ## CI report: * a65a29c0cf1c8feb9f39e168ba80c99ebcae1c5d UNKNOWN * b58c9f0078da328bee1ceb8f178cfad15a25b271 Azure:

[GitHub] [hudi] nsivabalan commented on a diff in pull request #9114: [HUDI-6467] Fix deletes handling in rli when partition path is updated

2023-07-03 Thread via GitHub
nsivabalan commented on code in PR #9114: URL: https://github.com/apache/hudi/pull/9114#discussion_r1250983854 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java: ## @@ -1159,10 +1161,30 @@ protected boolean

[GitHub] [hudi] nsivabalan commented on pull request #8913: [HUDI-6343] Fixing fileId format for all mdt partitions

2023-07-03 Thread via GitHub
nsivabalan commented on PR #8913: URL: https://github.com/apache/hudi/pull/8913#issuecomment-1618464344 this is already fixed in master -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] nsivabalan closed pull request #8913: [HUDI-6343] Fixing fileId format for all mdt partitions

2023-07-03 Thread via GitHub
nsivabalan closed pull request #8913: [HUDI-6343] Fixing fileId format for all mdt partitions URL: https://github.com/apache/hudi/pull/8913 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] nsivabalan commented on pull request #8610: [HUDI-6156] Prevent leaving tmp file in timeline when multi process t…

2023-07-03 Thread via GitHub
nsivabalan commented on PR #8610: URL: https://github.com/apache/hudi/pull/8610#issuecomment-1618459143 hey @hbgstc123 @danny0405 : Is this very critical? We have few days before which we might need to land this if we want to get this in. Can you guys try to gauge the risk and ROI on

[GitHub] [hudi] hudi-bot commented on pull request #9114: [HUDI-6467] Fix deletes handling in rli when partition path is updated

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9114: URL: https://github.com/apache/hudi/pull/9114#issuecomment-1618457281 ## CI report: * 02a297b2e0cdfd364b2dccffdad4ff6df9adf564 Azure:

[GitHub] [hudi] nsivabalan commented on pull request #7469: [HUDI-5386] Cleaning conflicts when write concurrency mode is OCC

2023-07-03 Thread via GitHub
nsivabalan commented on PR #7469: URL: https://github.com/apache/hudi/pull/7469#issuecomment-1618411933 hey @LinMingQiang : Do you think you can get the patch in ready state in the next day or two. If the patch is ready and has tests, we can try to get it in for 0.14.0. -- This is an

[GitHub] [hudi] nsivabalan commented on pull request #6868: [Hudi-4882] Multiple ordering fields and null value update for partial update to handle out-of-order events

2023-07-03 Thread via GitHub
nsivabalan commented on PR #6868: URL: https://github.com/apache/hudi/pull/6868#issuecomment-1618403905 I don't have full context on cur status of this patch. But if you think we can review and land this safely by this Friday, we can take it in for 0.14.0. I will let you guys

[GitHub] [hudi] parisni commented on pull request #9071: [HUDI-6453] Cascade Glue schema changes to partitions

2023-07-03 Thread via GitHub
parisni commented on PR #9071: URL: https://github.com/apache/hudi/pull/9071#issuecomment-1618276927 Yes will review this asap, likely in two days or so -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [hudi] hudi-bot commented on pull request #8668: [HUDI-3639] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-07-03 Thread via GitHub
hudi-bot commented on PR #8668: URL: https://github.com/apache/hudi/pull/8668#issuecomment-1618201778 ## CI report: * b9129f7a7b65f57c629d2adeb198377bb32d3b73 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8668: [HUDI-3639] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-07-03 Thread via GitHub
hudi-bot commented on PR #8668: URL: https://github.com/apache/hudi/pull/8668#issuecomment-1618185094 ## CI report: * 2046a130a24fa1865cc4d62a23f329617b837391 Azure:

[jira] [Created] (HUDI-6468) Support SQL hints to parse hoodie configuration

2023-07-03 Thread Aditya Goenka (Jira)
Aditya Goenka created HUDI-6468: --- Summary: Support SQL hints to parse hoodie configuration Key: HUDI-6468 URL: https://issues.apache.org/jira/browse/HUDI-6468 Project: Apache Hudi Issue Type:

[GitHub] [hudi] hudi-bot commented on pull request #9114: [HUDI-6467] Fix deletes handling in rli when partition path is updated

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9114: URL: https://github.com/apache/hudi/pull/9114#issuecomment-1618063323 ## CI report: * 02a297b2e0cdfd364b2dccffdad4ff6df9adf564 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8668: [HUDI-3639] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-07-03 Thread via GitHub
hudi-bot commented on PR #8668: URL: https://github.com/apache/hudi/pull/8668#issuecomment-1618060536 ## CI report: * 2046a130a24fa1865cc4d62a23f329617b837391 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9112: [HUDI-6465] Fix data skipping support BIGINT

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9112: URL: https://github.com/apache/hudi/pull/9112#issuecomment-1618063095 ## CI report: * 45bcbc09dacb95a4f7e2c66fba71dd29e13c620d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9113: [HUDI-6466] Fix spark insert overwrite partitioned table with dynamic partition

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9113: URL: https://github.com/apache/hudi/pull/9113#issuecomment-1618063223 ## CI report: * c6127a02ea8c3f4e8819559dbb4efa9f64bc040f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9114: [HUDI-6467] Fix deletes handling in rli when partition path is updated

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9114: URL: https://github.com/apache/hudi/pull/9114#issuecomment-1618042449 ## CI report: * 02a297b2e0cdfd364b2dccffdad4ff6df9adf564 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #8668: [HUDI-3639] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-07-03 Thread via GitHub
hudi-bot commented on PR #8668: URL: https://github.com/apache/hudi/pull/8668#issuecomment-1618039337 ## CI report: * 2046a130a24fa1865cc4d62a23f329617b837391 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8668: [HUDI-3639] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-07-03 Thread via GitHub
hudi-bot commented on PR #8668: URL: https://github.com/apache/hudi/pull/8668#issuecomment-1618015491 ## CI report: * 2046a130a24fa1865cc4d62a23f329617b837391 Azure:

[jira] [Updated] (HUDI-6467) Fix handling of deletes in record level index when partition path is updated

2023-07-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6467: - Labels: pull-request-available (was: ) > Fix handling of deletes in record level index when

[GitHub] [hudi] codope opened a new pull request, #9114: [HUDI-6467] Fix deletes handling in rli when partition path is updated

2023-07-03 Thread via GitHub
codope opened a new pull request, #9114: URL: https://github.com/apache/hudi/pull/9114 ### Change Logs #9058 added support for deletes in RLI. However, it missed a corner case when there are deletes with partition path updates. As a result, `TestGlobalIndexEnableUpdatePartitions`

[jira] [Created] (HUDI-6467) Fix handling of deletes in record level index when partition path is updated

2023-07-03 Thread Sagar Sumit (Jira)
Sagar Sumit created HUDI-6467: - Summary: Fix handling of deletes in record level index when partition path is updated Key: HUDI-6467 URL: https://issues.apache.org/jira/browse/HUDI-6467 Project: Apache

[GitHub] [hudi] boneanxs commented on a diff in pull request #6468: [HUDI-4731] Shutdown CloudWatch reporter when query completes

2023-07-03 Thread via GitHub
boneanxs commented on code in PR #6468: URL: https://github.com/apache/hudi/pull/6468#discussion_r1250340527 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metrics/Metrics.java: ## @@ -53,14 +53,15 @@ private Metrics(HoodieWriteConfig metricConfig) { }

[GitHub] [hudi] codope commented on a diff in pull request #8881: [HUDI-6316] Adding corrupted and rollback log blocks metrics

2023-07-03 Thread via GitHub
codope commented on code in PR #8881: URL: https://github.com/apache/hudi/pull/8881#discussion_r1250666784 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metrics/HoodieMetrics.java: ## @@ -168,24 +185,24 @@ public void updateMetricsForEmptyData(String

[GitHub] [hudi] codope commented on a diff in pull request #8881: [HUDI-6316] Adding corrupted and rollback log blocks metrics

2023-07-03 Thread via GitHub
codope commented on code in PR #8881: URL: https://github.com/apache/hudi/pull/8881#discussion_r1250666784 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metrics/HoodieMetrics.java: ## @@ -168,24 +185,24 @@ public void updateMetricsForEmptyData(String

[GitHub] [hudi] hudi-bot commented on pull request #9113: [HUDI-6466] Fix spark insert overwrite partitioned table with dynamic partition

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9113: URL: https://github.com/apache/hudi/pull/9113#issuecomment-1617853783 ## CI report: * c6127a02ea8c3f4e8819559dbb4efa9f64bc040f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9111: Add CodeQL workflow

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9111: URL: https://github.com/apache/hudi/pull/9111#issuecomment-1617853648 ## CI report: * 4c02398fceda917fd902966ff4ad4598394a370a Azure:

[GitHub] [hudi] codope merged pull request #9098: [MINOR] Reverting disabled tests for multiwriter archival

2023-07-03 Thread via GitHub
codope merged PR #9098: URL: https://github.com/apache/hudi/pull/9098 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[hudi] branch master updated (b5f749ccbf1 -> c3e9276809a)

2023-07-03 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from b5f749ccbf1 [HUDI-6463] Fix deluge loggings of HoodieBackedTableMetadataWriter#getMetadataPartitionsToUpdate (#9107)

[GitHub] [hudi] hudi-bot commented on pull request #9112: [HUDI-6465] Fix data skipping support BIGINT

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9112: URL: https://github.com/apache/hudi/pull/9112#issuecomment-1617746528 ## CI report: * 45bcbc09dacb95a4f7e2c66fba71dd29e13c620d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9113: [HUDI-6466] Fix spark insert overwrite partitioned table with dynamic partition

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9113: URL: https://github.com/apache/hudi/pull/9113#issuecomment-1617746603 ## CI report: * c6127a02ea8c3f4e8819559dbb4efa9f64bc040f UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #9112: [HUDI-6465] Fix data skipping support BIGINT

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9112: URL: https://github.com/apache/hudi/pull/9112#issuecomment-1617730308 ## CI report: * 45bcbc09dacb95a4f7e2c66fba71dd29e13c620d UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[jira] [Updated] (HUDI-6466) Spark's capcity of insert overwrite partitioned table with dynamic partition lost

2023-07-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6466: - Labels: pull-request-available (was: ) > Spark's capcity of insert overwrite partitioned table

[GitHub] [hudi] flashJd opened a new pull request, #9113: [HUDI-6466] Fix spark insert overwrite partitioned table with dynamic partition

2023-07-03 Thread via GitHub
flashJd opened a new pull request, #9113: URL: https://github.com/apache/hudi/pull/9113 ### Change Logs When upgrade hudi from 0.12.2->0.13.1, I found spark's capcity of insert overwrite partitioned table with dynamic partition lost, see

[jira] [Created] (HUDI-6466) Spark's capcity of insert overwrite partitioned table with dynamic partition lost

2023-07-03 Thread yonghua jian (Jira)
yonghua jian created HUDI-6466: -- Summary: Spark's capcity of insert overwrite partitioned table with dynamic partition lost Key: HUDI-6466 URL: https://issues.apache.org/jira/browse/HUDI-6466 Project:

[GitHub] [hudi] KnightChess commented on issue #9101: [SUPPORT] Transaction and spark job final state inconsistency in batch processing

2023-07-03 Thread via GitHub
KnightChess commented on issue #9101: URL: https://github.com/apache/hudi/issues/9101#issuecomment-1617677280 > Is there anyway the cleaning and archive does not trigger recomputation of the whole job? timeline: 0.commit 1.commit I think it is possible. The

[GitHub] [hudi] Alowator opened a new pull request, #9112: [HUDI-6465] Fix data skipping support BIGINT

2023-07-03 Thread via GitHub
Alowator opened a new pull request, #9112: URL: https://github.com/apache/hudi/pull/9112 ### Change Logs Add support of BIGINT type in DataPruner ### Impact This PR fixes bug: **DataPruner doesnt support BIGINT type** ### Risk level (write none, low medium or high

[GitHub] [hudi] hudi-bot commented on pull request #8668: [HUDI-3639] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-07-03 Thread via GitHub
hudi-bot commented on PR #8668: URL: https://github.com/apache/hudi/pull/8668#issuecomment-1617645826 ## CI report: * 7e2708c1868584fc1828945063c160a004628b7e Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8668: [HUDI-3639] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-07-03 Thread via GitHub
hudi-bot commented on PR #8668: URL: https://github.com/apache/hudi/pull/8668#issuecomment-1617632541 ## CI report: * 7e2708c1868584fc1828945063c160a004628b7e Azure:

[GitHub] [hudi] danny0405 commented on pull request #9071: [HUDI-6453] Cascade Glue schema changes to partitions

2023-07-03 Thread via GitHub
danny0405 commented on PR #9071: URL: https://github.com/apache/hudi/pull/9071#issuecomment-1617594288 @parisni , do you have intreast to take a look of this PR? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [hudi] danny0405 commented on issue #9101: [SUPPORT] Transaction and spark job final state inconsistency in batch processing

2023-07-03 Thread via GitHub
danny0405 commented on issue #9101: URL: https://github.com/apache/hudi/issues/9101#issuecomment-1617582944 Is there anyway the cleaning and archive does not trigger recomputation of the whole job? -- This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [hudi] danny0405 commented on a diff in pull request #9087: [HUDI-6329] Aadjust the partitioner automatically for flink consistent hashing index

2023-07-03 Thread via GitHub
danny0405 commented on code in PR #9087: URL: https://github.com/apache/hudi/pull/9087#discussion_r1250442036 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/sink/clustering/update/strategy/FlinkConsistentBucketUpdateStrategy.java: ## @@ -0,0 +1,147 @@ +/* + *

[jira] [Updated] (HUDI-6465) Fix data skipping support BIGINT

2023-07-03 Thread Mark Bukhner (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Bukhner updated HUDI-6465: --- Fix Version/s: 0.14.0 > Fix data skipping support BIGINT > > >

[GitHub] [hudi] danny0405 commented on pull request #9087: [HUDI-6329] Aadjust the partitioner automatically for flink consistent hashing index

2023-07-03 Thread via GitHub
danny0405 commented on PR #9087: URL: https://github.com/apache/hudi/pull/9087#issuecomment-1617579258 [6329.patch.zip](https://github.com/apache/hudi/files/11934542/6329.patch.zip) Thanks for the contribution, I have reviewed and applied a patch. -- This is an automated message from

[jira] [Created] (HUDI-6465) Fix data skipping support BIGINT

2023-07-03 Thread Mark Bukhner (Jira)
Mark Bukhner created HUDI-6465: -- Summary: Fix data skipping support BIGINT Key: HUDI-6465 URL: https://issues.apache.org/jira/browse/HUDI-6465 Project: Apache Hudi Issue Type: Bug

[GitHub] [hudi] boneanxs commented on a diff in pull request #6468: [HUDI-4731] Shutdown CloudWatch reporter when query completes

2023-07-03 Thread via GitHub
boneanxs commented on code in PR #6468: URL: https://github.com/apache/hudi/pull/6468#discussion_r1250435457 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieSparkSqlWriter.scala: ## @@ -594,6 +595,10 @@ object HoodieSparkSqlWriter {

[GitHub] [hudi] KnightChess commented on issue #9101: [SUPPORT] Transaction and spark job final state inconsistency in batch processing

2023-07-03 Thread via GitHub
KnightChess commented on issue #9101: URL: https://github.com/apache/hudi/issues/9101#issuecomment-1617563560 @danny0405 yes, it will recompute again, there will be multiple commit instances in a offline task cycle like daily cycle, in this day, there should be one commit instance in

[GitHub] [hudi] cxzl25 commented on a diff in pull request #8668: [HUDI-3639] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-07-03 Thread via GitHub
cxzl25 commented on code in PR #8668: URL: https://github.com/apache/hudi/pull/8668#discussion_r1250421775 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/hudi/functional/TestParquetColumnProjection.scala: ## @@ -330,6 +329,41 @@ class TestParquetColumnProjection

[GitHub] [hudi] hudi-bot commented on pull request #9111: Add CodeQL workflow

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9111: URL: https://github.com/apache/hudi/pull/9111#issuecomment-1617552468 ## CI report: * 4c02398fceda917fd902966ff4ad4598394a370a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9111: Add CodeQL workflow

2023-07-03 Thread via GitHub
hudi-bot commented on PR #9111: URL: https://github.com/apache/hudi/pull/9111#issuecomment-1617538490 ## CI report: * 4c02398fceda917fd902966ff4ad4598394a370a UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] danny0405 commented on issue #9101: [SUPPORT] Transaction and spark job final state inconsistency in batch processing

2023-07-03 Thread via GitHub
danny0405 commented on issue #9101: URL: https://github.com/apache/hudi/issues/9101#issuecomment-1617520203 > when we trigger mayBeCleanAndArchive, it throw Exception, and make this job failed, it will retry in job level. But this commit has commit. So the final result is: instance

[GitHub] [hudi] danny0405 commented on a diff in pull request #6468: [HUDI-4731] Shutdown CloudWatch reporter when query completes

2023-07-03 Thread via GitHub
danny0405 commented on code in PR #6468: URL: https://github.com/apache/hudi/pull/6468#discussion_r1250381027 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieSparkSqlWriter.scala: ## @@ -594,6 +595,10 @@ object HoodieSparkSqlWriter {

[GitHub] [hudi] jorgectf opened a new pull request, #9111: Add CodeQL workflow

2023-07-03 Thread via GitHub
jorgectf opened a new pull request, #9111: URL: https://github.com/apache/hudi/pull/9111 Hello from [GitHub Security Lab](https://securitylab.github.com/)! Your repository is critical to the security of the Open Source Software (OSS) ecosystem, and as part of our mission to make OSS

[GitHub] [hudi] danny0405 commented on a diff in pull request #8668: [HUDI-3639] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-07-03 Thread via GitHub
danny0405 commented on code in PR #8668: URL: https://github.com/apache/hudi/pull/8668#discussion_r1250377895 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/hudi/functional/TestParquetColumnProjection.scala: ## @@ -330,6 +329,41 @@ class

[GitHub] [hudi] danny0405 commented on pull request #8668: [HUDI-3639] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-07-03 Thread via GitHub
danny0405 commented on PR #8668: URL: https://github.com/apache/hudi/pull/8668#issuecomment-1617493404 This is known to be flaky:

[GitHub] [hudi] sunneebaby commented on issue #7115: [SUPPORT]flink insert hudi cow table ,but hive query is null。

2023-07-03 Thread via GitHub
sunneebaby commented on issue #7115: URL: https://github.com/apache/hudi/issues/7115#issuecomment-1617492345 I also face this case. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] boneanxs commented on a diff in pull request #6468: [HUDI-4731] Shutdown CloudWatch reporter when query completes

2023-07-03 Thread via GitHub
boneanxs commented on code in PR #6468: URL: https://github.com/apache/hudi/pull/6468#discussion_r1250338414 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieSparkSqlWriter.scala: ## @@ -594,6 +595,10 @@ object HoodieSparkSqlWriter {

[GitHub] [hudi] nbeeee commented on issue #8965: [SUPPORT]NoSuchMethodError: org.apache.curator.CuratorZookeeperClient.startAdvancedTrace

2023-07-03 Thread via GitHub
nb commented on issue #8965: URL: https://github.com/apache/hudi/issues/8965#issuecomment-1617456302 I added the zookeeper dependency to the hudi-spark-bundle/pom.xml file and repackaged it, which is now working properly > @nb Also it this a deltastreamer job or spark

<    1   2