[GitHub] [hudi] xushiyan merged pull request #9017: [HUDI-6393] Enable MOR support for Record index with functional test cases

2023-06-29 Thread via GitHub
xushiyan merged PR #9017: URL: https://github.com/apache/hudi/pull/9017 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] xushiyan commented on pull request #9017: [HUDI-6393] Enable MOR support for Record index with functional test cases

2023-06-29 Thread via GitHub
xushiyan commented on PR #9017: URL: https://github.com/apache/hudi/pull/9017#issuecomment-1613510452 CI is timing out as expected. The newly added testcase is passing. will land this now. -- This is an automated message from the Apache Git Service. To respond to the message, please log

[GitHub] [hudi] nsivabalan merged pull request #9089: [MINOR] Increase timeout for Azure CI: UT spark-datasource to 240 minutes

2023-06-29 Thread via GitHub
nsivabalan merged PR #9089: URL: https://github.com/apache/hudi/pull/9089 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[hudi] branch master updated (8def3e68ae5 -> 05435bb0344)

2023-06-29 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 8def3e68ae5 [MINOR] Improve CollectionUtils helper methods (#9088) add 05435bb0344 [MINOR] Increase timeout for

[GitHub] [hudi] hudi-bot commented on pull request #9083: PKLess Merge Into

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9083: URL: https://github.com/apache/hudi/pull/9083#issuecomment-1613500714 ## CI report: * be6801e9ca41f00576a511c7d3ffe144e90717ee Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9058: [HUDI-6376] Support for deletes in HUDI Indexes including metadata table record index.

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9058: URL: https://github.com/apache/hudi/pull/9058#issuecomment-1613500487 ## CI report: * 1697d1bfa095ca16a9361e3728a77331d3a28037 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9083: PKLess Merge Into

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9083: URL: https://github.com/apache/hudi/pull/9083#issuecomment-1613490095 ## CI report: * be6801e9ca41f00576a511c7d3ffe144e90717ee Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9063: [HUDI-6448] Improve upgrade/downgrade for table ver. 6

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9063: URL: https://github.com/apache/hudi/pull/9063#issuecomment-1613489865 ## CI report: * 4775dce07f2f3237b32f22b360f3423b1eafce85 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9063: [HUDI-6448] Improve upgrade/downgrade for table ver. 6

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9063: URL: https://github.com/apache/hudi/pull/9063#issuecomment-1613476228 ## CI report: * 4775dce07f2f3237b32f22b360f3423b1eafce85 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9083: PKLess Merge Into

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9083: URL: https://github.com/apache/hudi/pull/9083#issuecomment-1613476471 ## CI report: * be6801e9ca41f00576a511c7d3ffe144e90717ee Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9017: [HUDI-6393] Enable MOR support for Record index with functional test cases

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9017: URL: https://github.com/apache/hudi/pull/9017#issuecomment-1613475988 ## CI report: * ceffe7d8146f48e1c6c083613646463c1404a77f Azure:

[GitHub] [hudi] BBency opened a new issue, #9094: Async Clustering failing with errors for MOR table

2023-06-29 Thread via GitHub
BBency opened a new issue, #9094: URL: https://github.com/apache/hudi/issues/9094 **Problem Description** We have a MOR table which is partitioned by yearmonth(MM). We would like to trigger async clustering after doing the compaction at the end of the day so that we can stitch

[GitHub] [hudi] xushiyan commented on pull request #9063: [HUDI-6448] Improve upgrade/downgrade for table ver. 6

2023-06-29 Thread via GitHub
xushiyan commented on PR #9063: URL: https://github.com/apache/hudi/pull/9063#issuecomment-1613418110 manually verified the flow 0.13.1 -> 0.14.0-SNAPSHOT (this PR) before upgrade ``` hoodie.table.version=5 hoodie.table.metadata.partitions=files ``` upgrade

[GitHub] [hudi] nsivabalan commented on issue #9079: [SUPPORT] Hudi delete not working when using UuidKeyGenerator

2023-06-29 Thread via GitHub
nsivabalan commented on issue #9079: URL: https://github.com/apache/hudi/issues/9079#issuecomment-1613408548 this is a known limitation of UUID Key generator. This key gen is generally meant to be used only for immutable data. with 0.14.0, we are adding pk less(primary key less) table,

[GitHub] [hudi] noahtaite commented on issue #9067: [SUPPORT] Manual Glue sync for large, highly partitioned table failing

2023-06-29 Thread via GitHub
noahtaite commented on issue #9067: URL: https://github.com/apache/hudi/issues/9067#issuecomment-1613377080 Hello @danny0405 @ad1happy2go I can confirm 0.13.1 works nicely as the HMS sync mode now supports batching and boolean values (conditional sync). thank you for the support -- This

[GitHub] [hudi] noahtaite closed issue #9067: [SUPPORT] Manual Glue sync for large, highly partitioned table failing

2023-06-29 Thread via GitHub
noahtaite closed issue #9067: [SUPPORT] Manual Glue sync for large, highly partitioned table failing URL: https://github.com/apache/hudi/issues/9067 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] gamblewin opened a new issue, #9093: [SUPPORT] Is it allowed using Flink Table API sqlQuery() to read data from hudi tables?

2023-06-29 Thread via GitHub
gamblewin opened a new issue, #9093: URL: https://github.com/apache/hudi/issues/9093 **Describe the problem you faced** I'm trying to use flink table api sqlQuery to read data from hudi table but not working, so am i doing it wrong or hudi doesn't support this way to query data.

[GitHub] [hudi] hudi-bot commented on pull request #9038: [HUDI-6423] Incremental cleaning should consider inflight compaction instant

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9038: URL: https://github.com/apache/hudi/pull/9038#issuecomment-1613357939 ## CI report: * a65a29c0cf1c8feb9f39e168ba80c99ebcae1c5d UNKNOWN * 34f8823f48712c57058bc37c8936a276c1457557 Azure:

[GitHub] [hudi] ad1happy2go commented on issue #9086: [SUPPORT]How to build with scala 2.11 for spark and scala2.12 for flink

2023-06-29 Thread via GitHub
ad1happy2go commented on issue #9086: URL: https://github.com/apache/hudi/issues/9086#issuecomment-1613339717 @bigdata-spec I dont think we can build with different scala version in a single build. You may need to build it twice and then use the spark and flink jars from separate

[GitHub] [hudi] ad1happy2go commented on issue #9091: [BUG] Use NonpartitionedKeyGenerator WriteOperationType BULK_INSERT and UPSERT get different _hoodie_record_key format

2023-06-29 Thread via GitHub
ad1happy2go commented on issue #9091: URL: https://github.com/apache/hudi/issues/9091#issuecomment-1613328306 @lipusheng Known Issue which got fixed in hudi 0.13.X. Refer this GitHub issue - https://github.com/apache/hudi/issues/8981 -- This is an automated message from the Apache

[GitHub] [hudi] hudi-bot commented on pull request #9082: [HUDI-6445] Distribute spark ds func tests

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9082: URL: https://github.com/apache/hudi/pull/9082#issuecomment-1613268792 ## CI report: * c529c624afdca331514a2bdfb78cc6e18ab9f57a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9082: [HUDI-6445] Distribute spark ds func tests

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9082: URL: https://github.com/apache/hudi/pull/9082#issuecomment-1613254158 ## CI report: * c529c624afdca331514a2bdfb78cc6e18ab9f57a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9041: [HUDI-6431] Support update partition path in record-level index

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9041: URL: https://github.com/apache/hudi/pull/9041#issuecomment-1613253794 ## CI report: * b681df04a7ad0febbcd9235622c2ee7f98759cf9 Azure:

[GitHub] [hudi] KenjiFujima commented on pull request #8933: [HUDI-5329] Spark reads table error when Flink creates table without record key and primary key

2023-06-29 Thread via GitHub
KenjiFujima commented on PR #8933: URL: https://github.com/apache/hudi/pull/8933#issuecomment-1613251280 @danny0405, I have addressed above comments. PTAL. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [hudi] xushiyan commented on a diff in pull request #9041: [HUDI-6431] Support update partition path in record-level index

2023-06-29 Thread via GitHub
xushiyan commented on code in PR #9041: URL: https://github.com/apache/hudi/pull/9041#discussion_r1246658153 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java: ## @@ -459,11 +459,6 @@ private Pair>

[GitHub] [hudi] xushiyan commented on a diff in pull request #9041: [HUDI-6431] Support update partition path in record-level index

2023-06-29 Thread via GitHub
xushiyan commented on code in PR #9041: URL: https://github.com/apache/hudi/pull/9041#discussion_r1246655960 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/index/HoodieIndexUtils.java: ## @@ -310,6 +312,56 @@ public static HoodieData>

[GitHub] [hudi] codope commented on a diff in pull request #9041: [HUDI-6431] Support update partition path in record-level index

2023-06-29 Thread via GitHub
codope commented on code in PR #9041: URL: https://github.com/apache/hudi/pull/9041#discussion_r1246655286 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/index/HoodieIndexUtils.java: ## @@ -310,6 +312,56 @@ public static HoodieData> mergeForPartitionUpdates(

[GitHub] [hudi] xushiyan commented on a diff in pull request #9041: [HUDI-6431] Support update partition path in record-level index

2023-06-29 Thread via GitHub
xushiyan commented on code in PR #9041: URL: https://github.com/apache/hudi/pull/9041#discussion_r1241059669 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/index/HoodieIndexUtils.java: ## @@ -310,6 +312,56 @@ public static HoodieData>

[GitHub] [hudi] xushiyan commented on a diff in pull request #9041: [HUDI-6431] Support update partition path in record-level index

2023-06-29 Thread via GitHub
xushiyan commented on code in PR #9041: URL: https://github.com/apache/hudi/pull/9041#discussion_r1241059710 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/index/HoodieIndexUtils.java: ## @@ -310,6 +312,56 @@ public static HoodieData>

[GitHub] [hudi] codope commented on a diff in pull request #9041: [HUDI-6431] Support update partition path in record-level index

2023-06-29 Thread via GitHub
codope commented on code in PR #9041: URL: https://github.com/apache/hudi/pull/9041#discussion_r1246648546 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/index/simple/HoodieGlobalSimpleIndex.java: ## @@ -72,85 +68,37 @@ public HoodieData> tagLocation(

[GitHub] [hudi] hudi-bot commented on pull request #9063: [HUDI-6448] Improve upgrade/downgrade for table ver. 6

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9063: URL: https://github.com/apache/hudi/pull/9063#issuecomment-1613184716 ## CI report: * 4775dce07f2f3237b32f22b360f3423b1eafce85 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9041: [HUDI-6431] Support update partition path in record-level index

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9041: URL: https://github.com/apache/hudi/pull/9041#issuecomment-1613184534 ## CI report: * b681df04a7ad0febbcd9235622c2ee7f98759cf9 Azure:

[jira] [Updated] (HUDI-6459) Add Rollback test for Record Level Index

2023-06-29 Thread Lokesh Jain (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lokesh Jain updated HUDI-6459: -- Summary: Add Rollback test for Record Level Index (was: Add Rollback validation for Record Level

[jira] [Created] (HUDI-6459) Add Rollback validation for Record Level Index

2023-06-29 Thread Lokesh Jain (Jira)
Lokesh Jain created HUDI-6459: - Summary: Add Rollback validation for Record Level Index Key: HUDI-6459 URL: https://issues.apache.org/jira/browse/HUDI-6459 Project: Apache Hudi Issue Type: Bug

[GitHub] [hudi] hudi-bot commented on pull request #9064: [HUDI-6450] Fix null strings handling in convertRowToJsonString

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9064: URL: https://github.com/apache/hudi/pull/9064#issuecomment-1613172562 ## CI report: * 2b572a55998c0e1c4eca7970e8f63ed79254161c Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9063: [HUDI-6448] Improve upgrade/downgrade for table ver. 6

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9063: URL: https://github.com/apache/hudi/pull/9063#issuecomment-1613172489 ## CI report: * 4775dce07f2f3237b32f22b360f3423b1eafce85 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #9041: [HUDI-6431] Support update partition path in record-level index

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9041: URL: https://github.com/apache/hudi/pull/9041#issuecomment-1613172359 ## CI report: * b681df04a7ad0febbcd9235622c2ee7f98759cf9 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9092: [MINOR] Enable log compaction by default for MDT

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9092: URL: https://github.com/apache/hudi/pull/9092#issuecomment-1613159780 ## CI report: * 408e9f946e0a0647b0fc9f8e220d55ad2fbde62d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9089: [MINOR] Increase timeout for Azure CI: UT spark-datasource to 240 minutes

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9089: URL: https://github.com/apache/hudi/pull/9089#issuecomment-1613159726 ## CI report: * 4d2e8926188ce5aa2342054aeb99bf1d31eaf0e3 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9064: [HUDI-6450] Fix null strings handling in convertRowToJsonString

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9064: URL: https://github.com/apache/hudi/pull/9064#issuecomment-1613159516 ## CI report: * 2b572a55998c0e1c4eca7970e8f63ed79254161c Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9063: [HUDI-6448] Improve upgrade/downgrade for table ver. 6

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9063: URL: https://github.com/apache/hudi/pull/9063#issuecomment-1613159448 ## CI report: * 4775dce07f2f3237b32f22b360f3423b1eafce85 Azure:

[GitHub] [hudi] codope commented on a diff in pull request #9058: [HUDI-6376] Support for deletes in HUDI Indexes including metadata table record index.

2023-06-29 Thread via GitHub
codope commented on code in PR #9058: URL: https://github.com/apache/hudi/pull/9058#discussion_r1246592430 ## hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataPayload.java: ## @@ -209,9 +211,10 @@ public class HoodieMetadataPayload implements

[GitHub] [hudi] xushiyan commented on a diff in pull request #9041: [HUDI-6431] Support update partition path in record-level index

2023-06-29 Thread via GitHub
xushiyan commented on code in PR #9041: URL: https://github.com/apache/hudi/pull/9041#discussion_r1246579974 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/index/HoodieIndexUtils.java: ## @@ -310,6 +312,56 @@ public static HoodieData>

[GitHub] [hudi] xushiyan commented on a diff in pull request #9041: [HUDI-6431] Support update partition path in record-level index

2023-06-29 Thread via GitHub
xushiyan commented on code in PR #9041: URL: https://github.com/apache/hudi/pull/9041#discussion_r1246579590 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/index/HoodieIndexUtils.java: ## @@ -310,6 +312,56 @@ public static HoodieData>

[hudi] branch master updated: [MINOR] Improve CollectionUtils helper methods (#9088)

2023-06-29 Thread xushiyan
This is an automated email from the ASF dual-hosted git repository. xushiyan pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 8def3e68ae5 [MINOR] Improve CollectionUtils

[GitHub] [hudi] xushiyan merged pull request #9088: [MINOR] Improve CollectionUtils helper methods

2023-06-29 Thread via GitHub
xushiyan merged PR #9088: URL: https://github.com/apache/hudi/pull/9088 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] hudi-bot commented on pull request #9092: [MINOR] Enable log compaction by default for MDT

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9092: URL: https://github.com/apache/hudi/pull/9092#issuecomment-1613076306 ## CI report: * 408e9f946e0a0647b0fc9f8e220d55ad2fbde62d UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #9064: [HUDI-6450] Fix null strings handling in convertRowToJsonString

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9064: URL: https://github.com/apache/hudi/pull/9064#issuecomment-1613075951 ## CI report: * 2b572a55998c0e1c4eca7970e8f63ed79254161c Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8609: [HUDI-6154] Introduced retry while reading hoodie.properties to deal with parallel updates.

2023-06-29 Thread via GitHub
hudi-bot commented on PR #8609: URL: https://github.com/apache/hudi/pull/8609#issuecomment-1613056925 ## CI report: * e14bd41edf6cc961d77087eea67f755f23590834 Azure:

[jira] [Created] (HUDI-6458) Scheduling jobs should not fail when there is no completed commits

2023-06-29 Thread kwang (Jira)
kwang created HUDI-6458: --- Summary: Scheduling jobs should not fail when there is no completed commits Key: HUDI-6458 URL: https://issues.apache.org/jira/browse/HUDI-6458 Project: Apache Hudi Issue

[GitHub] [hudi] zaza commented on a diff in pull request #9064: [HUDI-6450] Fix null strings handling in convertRowToJsonString

2023-06-29 Thread via GitHub
zaza commented on code in PR #9064: URL: https://github.com/apache/hudi/pull/9064#discussion_r1246538265 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/cdc/HoodieCDCRDD.scala: ## @@ -561,7 +561,7 @@ class HoodieCDCRDD(

[GitHub] [hudi] codope opened a new pull request, #9092: [MINOR] Enable log compaction by default for MDT

2023-06-29 Thread via GitHub
codope opened a new pull request, #9092: URL: https://github.com/apache/hudi/pull/9092 ### Change Logs Enable log compaction on metadata table by default. ### Impact Will compact log blocks to produce another log file every 5 log blocks. ### Risk level (write

[GitHub] [hudi] hudi-bot commented on pull request #9088: [MINOR] Improve CollectionUtils helper methods

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9088: URL: https://github.com/apache/hudi/pull/9088#issuecomment-1613041272 ## CI report: * fb282b7602962846c4f561cd101033fca41e43d6 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8609: [HUDI-6154] Introduced retry while reading hoodie.properties to deal with parallel updates.

2023-06-29 Thread via GitHub
hudi-bot commented on PR #8609: URL: https://github.com/apache/hudi/pull/8609#issuecomment-1613038827 ## CI report: * e14bd41edf6cc961d77087eea67f755f23590834 Azure:

[jira] [Created] (HUDI-6457) Keep JavaSizeBasedClusteringPlanStrategy and SparkSizeBasedClusteringPlanStrategy aligned

2023-06-29 Thread kwang (Jira)
kwang created HUDI-6457: --- Summary: Keep JavaSizeBasedClusteringPlanStrategy and SparkSizeBasedClusteringPlanStrategy aligned Key: HUDI-6457 URL: https://issues.apache.org/jira/browse/HUDI-6457 Project: Apache

[GitHub] [hudi] zaza commented on a diff in pull request #9064: [HUDI-6450] Fix null strings handling in convertRowToJsonString

2023-06-29 Thread via GitHub
zaza commented on code in PR #9064: URL: https://github.com/apache/hudi/pull/9064#discussion_r1246504222 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/cdc/HoodieCDCRDD.scala: ## @@ -561,7 +561,7 @@ class HoodieCDCRDD(

[GitHub] [hudi] lipusheng opened a new issue, #9091: [SUPPORT]

2023-06-29 Thread via GitHub
lipusheng opened a new issue, #9091: URL: https://github.com/apache/hudi/issues/9091 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? - Join the mailing list to engage in conversations and get faster support at

[GitHub] [hudi] codope commented on a diff in pull request #8609: [HUDI-6154] Introduced retry while reading hoodie.properties to deal with parallel updates.

2023-06-29 Thread via GitHub
codope commented on code in PR #8609: URL: https://github.com/apache/hudi/pull/8609#discussion_r1246489239 ## hudi-common/src/main/java/org/apache/hudi/common/table/HoodieTableConfig.java: ## @@ -334,22 +337,43 @@ public HoodieTableConfig() { super(); } - private

[GitHub] [hudi] hudi-bot commented on pull request #9082: [HUDI-6445] Distribute spark ds func tests

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9082: URL: https://github.com/apache/hudi/pull/9082#issuecomment-1612915683 ## CI report: * c529c624afdca331514a2bdfb78cc6e18ab9f57a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9007: [HUDI-6405] Fix incremental file sync for clustering and logcompaction

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9007: URL: https://github.com/apache/hudi/pull/9007#issuecomment-1612915077 ## CI report: * 3b6d13a83efdae5e46eebe9ae168ba7e0d8e9f34 Azure:

[GitHub] [hudi] LINGQ1991 commented on issue #8903: [SUPPORT] aws spark3.2.1 & hudi 0.13.1 with java.lang.NoSuchMethodError: org.apache.spark.sql.execution.datasources.PartitionedFile

2023-06-29 Thread via GitHub
LINGQ1991 commented on issue #8903: URL: https://github.com/apache/hudi/issues/8903#issuecomment-1612912367 > @ad1happy2go I use emr-6.5.0. It's error with " java.lang.NoSuchMethodError: org.apache.spark.sql.execution.datasources.PartitionedFile". > > But i have package with oss

[GitHub] [hudi] flashJd commented on pull request #9048: [HUDI-6434] Fix illegalArgumentException when do read_optimized read in Flink

2023-06-29 Thread via GitHub
flashJd commented on PR #9048: URL: https://github.com/apache/hudi/pull/9048#issuecomment-1612907539 > The `DeltaCommitWriteHandleFactory` can be tweaked for the purpose, I'm wondering what's the engine conflicts you are talking about? sry to reply late ## engine conflicts:

[GitHub] [hudi] flashJd commented on pull request #9048: [HUDI-6434] Fix illegalArgumentException when do read_optimized read in Flink

2023-06-29 Thread via GitHub
flashJd commented on PR #9048: URL: https://github.com/apache/hudi/pull/9048#issuecomment-1612904526 > sry to reply late ## engine conflicts: v0.12.2 when spark insert overwrite a partition after flink write the log files only bucket in this partition,

[GitHub] [hudi] beyond1920 opened a new issue, #9090: [SUPPORT]

2023-06-29 Thread via GitHub
beyond1920 opened a new issue, #9090: URL: https://github.com/apache/hudi/issues/9090 I cherry pick [HUDI-1517](https://issues.apache.org/jira/browse/HUDI-1517) into internal HUDI version. And find a FileNotFoundException during read latest snapshot of a MOR table.

[GitHub] [hudi] beyond1920 commented on pull request #4913: [HUDI-1517] create marker file for every log file

2023-06-29 Thread via GitHub
beyond1920 commented on PR #4913: URL: https://github.com/apache/hudi/pull/4913#issuecomment-1612808066 I cherrypick this PR to the internal HUDI. And find a `FileNotFoundException` during read latest snapshot of a mor table.

[GitHub] [hudi] hudi-bot commented on pull request #9066: [HUDI-6452] Add MOR snapshot reader to integrate with query engines without using Hadoop APIs

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9066: URL: https://github.com/apache/hudi/pull/9066#issuecomment-1612807150 ## CI report: * 8662958e8ccb7203d320dc33445f9f2dbc28fb0c Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8933: [HUDI-5329] Spark reads table error when Flink creates table without record key and primary key

2023-06-29 Thread via GitHub
hudi-bot commented on PR #8933: URL: https://github.com/apache/hudi/pull/8933#issuecomment-1612806333 ## CI report: * d1564f421664fd2dee15dfdbdae4dec07baedf92 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9066: [HUDI-6452] Add MOR snapshot reader to integrate with query engines without using Hadoop APIs

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9066: URL: https://github.com/apache/hudi/pull/9066#issuecomment-1612791679 ## CI report: * 8662958e8ccb7203d320dc33445f9f2dbc28fb0c Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9058: [HUDI-6376] Support for deletes in HUDI Indexes including metadata table record index.

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9058: URL: https://github.com/apache/hudi/pull/9058#issuecomment-1612791490 ## CI report: * 345482ba6529fc3bf0ac9f50ce0c1d79a3accd37 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9058: [HUDI-6376] Support for deletes in HUDI Indexes including metadata table record index.

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9058: URL: https://github.com/apache/hudi/pull/9058#issuecomment-1612774450 ## CI report: * 345482ba6529fc3bf0ac9f50ce0c1d79a3accd37 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9017: [HUDI-6393] Add functional tests for RecordLevelIndex

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9017: URL: https://github.com/apache/hudi/pull/9017#issuecomment-1612701307 ## CI report: * a3c1d99e2266ec68d9082fe4c76c4bf62070f5a9 Azure:

[GitHub] [hudi] xushiyan commented on a diff in pull request #9058: [HUDI-6376] Support for deletes in HUDI Indexes including metadata table record index.

2023-06-29 Thread via GitHub
xushiyan commented on code in PR #9058: URL: https://github.com/apache/hudi/pull/9058#discussion_r1246371700 ## hudi-client/hudi-spark-client/src/test/java/org/apache/hudi/client/functional/TestHoodieIndex.java: ## @@ -749,6 +749,67 @@ public void

[GitHub] [hudi] hudi-bot commented on pull request #9088: [MINOR] Improve CollectionUtils helper methods

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9088: URL: https://github.com/apache/hudi/pull/9088#issuecomment-1612690821 ## CI report: * fb282b7602962846c4f561cd101033fca41e43d6 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9038: [HUDI-6423] Incremental cleaning should consider inflight compaction instant

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9038: URL: https://github.com/apache/hudi/pull/9038#issuecomment-1612690558 ## CI report: * a65a29c0cf1c8feb9f39e168ba80c99ebcae1c5d UNKNOWN * 34f8823f48712c57058bc37c8936a276c1457557 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9017: [HUDI-6393] Add functional tests for RecordLevelIndex

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9017: URL: https://github.com/apache/hudi/pull/9017#issuecomment-1612690440 ## CI report: * d0b2f2457cf648b1b631c75bd64cc1320af69393 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9088: [MINOR] Improve CollectionUtils helper methods

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9088: URL: https://github.com/apache/hudi/pull/9088#issuecomment-1612678874 ## CI report: * fb282b7602962846c4f561cd101033fca41e43d6 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #9063: [HUDI-6448] Improve upgrade/downgrade for table ver. 6

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9063: URL: https://github.com/apache/hudi/pull/9063#issuecomment-1612678677 ## CI report: * 69b2bb853be0f79845efd56f68b934b9f69ae22a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9038: [HUDI-6423] Incremental cleaning should consider inflight compaction instant

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9038: URL: https://github.com/apache/hudi/pull/9038#issuecomment-1612678539 ## CI report: * a65a29c0cf1c8feb9f39e168ba80c99ebcae1c5d UNKNOWN * 34f8823f48712c57058bc37c8936a276c1457557 Azure:

[jira] [Closed] (HUDI-6151) Rollback previously applied commits to MDT when operations are retried.

2023-06-29 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen closed HUDI-6151. Resolution: Fixed Fixed via master branch: b95248e011931f4748a7a9fbb8298cbbb71bda88 > Rollback previously

[hudi] branch master updated: [HUDI-6151] Rollback previously applied commits to MDT when operations are retried (#8604)

2023-06-29 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new b95248e0119 [HUDI-6151] Rollback previously

[GitHub] [hudi] danny0405 merged pull request #8604: [HUDI-6151] Rollback previously applied commits to MDT when operations are retried.

2023-06-29 Thread via GitHub
danny0405 merged PR #8604: URL: https://github.com/apache/hudi/pull/8604 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] lokeshj1703 commented on a diff in pull request #9017: [HUDI-6393] Add functional tests for RecordLevelIndex

2023-06-29 Thread via GitHub
lokeshj1703 commented on code in PR #9017: URL: https://github.com/apache/hudi/pull/9017#discussion_r1246314270 ## pom.xml: ## @@ -175,7 +175,7 @@ 2.12.10 ${scala12.version} 2.8.1 -2.12 +2.11 Review Comment: Sorry! Forgot to remove this change. This

[GitHub] [hudi] xushiyan commented on a diff in pull request #9017: [HUDI-6393] Add functional tests for RecordLevelIndex

2023-06-29 Thread via GitHub
xushiyan commented on code in PR #9017: URL: https://github.com/apache/hudi/pull/9017#discussion_r1246304418 ## pom.xml: ## @@ -175,7 +175,7 @@ 2.12.10 ${scala12.version} 2.8.1 -2.12 +2.11 Review Comment: this is the default value which should be 2.12

[GitHub] [hudi] hudi-bot commented on pull request #9063: [HUDI-6448] Improve upgrade/downgrade for table ver. 6

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9063: URL: https://github.com/apache/hudi/pull/9063#issuecomment-1612621031 ## CI report: * 69b2bb853be0f79845efd56f68b934b9f69ae22a Azure:

[jira] [Commented] (HUDI-5608) Support decimals w/ precision > 30 in Column Stats

2023-06-29 Thread Jira
[ https://issues.apache.org/jira/browse/HUDI-5608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17738431#comment-17738431 ] 赵富午 commented on HUDI-5608: --- Is there any new progress? > Support decimals w/ precision > 30 in Column Stats >

[GitHub] [hudi] hudi-bot commented on pull request #8604: [HUDI-6151] Rollback previously applied commits to MDT when operations are retried.

2023-06-29 Thread via GitHub
hudi-bot commented on PR #8604: URL: https://github.com/apache/hudi/pull/8604#issuecomment-1612619567 ## CI report: * eb39bc7559945e199e43a2a3d51e1ab15b4e3e2f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9087: [HUDI-6329] Write pipelines for table with consistent bucket index would detect whether clustering service occurs and automatically adjust the

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9087: URL: https://github.com/apache/hudi/pull/9087#issuecomment-1612610932 ## CI report: * 1bc4ea70966fd2c2cbd7cea126f4fd6b5c875077 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9088: [MINOR] Improve CollectionUtils helper methods

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9088: URL: https://github.com/apache/hudi/pull/9088#issuecomment-1612610988 ## CI report: * fb282b7602962846c4f561cd101033fca41e43d6 Azure:

[GitHub] [hudi] liudu2326526 commented on issue #6297: [SUPPORT] Flink SQL client cow table query error "org/apache/parquet/column/ColumnDescriptor" (but mor table query normal)

2023-06-29 Thread via GitHub
liudu2326526 commented on issue #6297: URL: https://github.com/apache/hudi/issues/6297#issuecomment-1612608779 I also encountered this problem when reading hudi tables.I was able to run it locally, but failed to run it on the cluster. Caused by: java.lang.LinkageError: loader

[GitHub] [hudi] codope commented on a diff in pull request #8526: [HUDI-6116] Optimize log block reading by removing seeks to check corrupted blocks.

2023-06-29 Thread via GitHub
codope commented on code in PR #8526: URL: https://github.com/apache/hudi/pull/8526#discussion_r1246115508 ## hudi-common/src/main/java/org/apache/hudi/common/table/log/block/HoodieLogBlock.java: ## @@ -264,8 +267,9 @@ public static Option tryReadContent(FSDataInputStream

[GitHub] [hudi] hudi-bot commented on pull request #9089: [MINOR] Increase timeout for Azure CI: UT spark-datasource to 240 minutes

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9089: URL: https://github.com/apache/hudi/pull/9089#issuecomment-1612600328 ## CI report: * 4d2e8926188ce5aa2342054aeb99bf1d31eaf0e3 Azure:

[GitHub] [hudi] thomasg19930417 commented on issue #9084: [SUPPORT] Historical Clean and RollBack commits are not archived

2023-06-29 Thread via GitHub
thomasg19930417 commented on issue #9084: URL: https://github.com/apache/hudi/issues/9084#issuecomment-1612598176 @danny0405 Are there any parameters to control this, or are there any instructions in the documentation -- This is an automated message from the Apache Git Service. To

[GitHub] [hudi] hudi-bot commented on pull request #9007: [HUDI-6405] Fix incremental file sync for clustering and logcompaction

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9007: URL: https://github.com/apache/hudi/pull/9007#issuecomment-1612553846 ## CI report: * 3360fa18333a0097fa762824f02eb9cd6c4bad5d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9089: [MINOR] Increase timeout for Azure CI: UT spark-datasource to 240 minutes

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9089: URL: https://github.com/apache/hudi/pull/9089#issuecomment-1612545698 ## CI report: * 4d2e8926188ce5aa2342054aeb99bf1d31eaf0e3 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #9038: [HUDI-6423] Incremental cleaning should consider inflight compaction instant

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9038: URL: https://github.com/apache/hudi/pull/9038#issuecomment-1612545458 ## CI report: * a65a29c0cf1c8feb9f39e168ba80c99ebcae1c5d UNKNOWN * 34f8823f48712c57058bc37c8936a276c1457557 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9007: [HUDI-6405] Fix incremental file sync for clustering and logcompaction

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9007: URL: https://github.com/apache/hudi/pull/9007#issuecomment-1612545219 ## CI report: * 3360fa18333a0097fa762824f02eb9cd6c4bad5d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #9038: [HUDI-6423] Incremental cleaning should consider inflight compaction instant

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9038: URL: https://github.com/apache/hudi/pull/9038#issuecomment-1612536986 ## CI report: * 1f61b83797a35d3d960f4bee865b14772931a4d2 Azure:

[GitHub] [hudi] lokeshj1703 opened a new pull request, #9089: [MINOR] Increase timeout for Azure CI: UT spark-datasource to 240 minutes

2023-06-29 Thread via GitHub
lokeshj1703 opened a new pull request, #9089: URL: https://github.com/apache/hudi/pull/9089 ### Change Logs Azure CI: UT spark-datasource job times out frequently after 3 hours duration. This PR increases the timeout to 4 hours.

[GitHub] [hudi] hudi-bot commented on pull request #9038: [HUDI-6423] Incremental cleaning should consider inflight compaction instant

2023-06-29 Thread via GitHub
hudi-bot commented on PR #9038: URL: https://github.com/apache/hudi/pull/9038#issuecomment-1612497554 ## CI report: * 1f61b83797a35d3d960f4bee865b14772931a4d2 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8933: [HUDI-5329] Spark reads table error when Flink creates table without record key and primary key

2023-06-29 Thread via GitHub
hudi-bot commented on PR #8933: URL: https://github.com/apache/hudi/pull/8933#issuecomment-1612497373 ## CI report: * 9ab390d9f29c63cdd7a07da37ce1899cb43ce330 Azure:

<    1   2   3   >