[GitHub] [hudi] hbgstc123 commented on pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on PR #7903: URL: https://github.com/apache/hudi/pull/7903#issuecomment-1423760246 > thanks for review and advice -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the s

[GitHub] [hudi] voonhous commented on pull request #6868: [Hudi-4882] Multiple ordering fields and null value update for partial update to handle out-of-order events

2023-02-08 Thread via GitHub
voonhous commented on PR #6868: URL: https://github.com/apache/hudi/pull/6868#issuecomment-1423752841 Commenting for visibility -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific commen

[GitHub] [hudi] hbgstc123 commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101069152 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering()

[GitHub] [hudi] hbgstc123 commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101048291 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering()

[GitHub] [hudi] hudi-bot commented on pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7903: URL: https://github.com/apache/hudi/pull/7903#issuecomment-1423742547 ## CI report: * ee465d312a5953c8b8337d7fa4f6d7dbc97142a2 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=150

[GitHub] [hudi] hudi-bot commented on pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7903: URL: https://github.com/apache/hudi/pull/7903#issuecomment-1423736291 ## CI report: * ee465d312a5953c8b8337d7fa4f6d7dbc97142a2 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1504

[GitHub] [hudi] hudi-bot commented on pull request #7752: [MINOR] De-duplicating Iterator implementations

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7752: URL: https://github.com/apache/hudi/pull/7752#issuecomment-1423735518 ## CI report: * dec6178b4b835160cc59964bdd25ad7fb1fdd41e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1504

[GitHub] [hudi] hbgstc123 commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101048291 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering()

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101042060 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101042060 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101042060 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering

[GitHub] [hudi] Zouxxyy commented on pull request #7876: [MINOR] Improve RunClusteringProcedure with partition selected

2023-02-08 Thread via GitHub
Zouxxyy commented on PR #7876: URL: https://github.com/apache/hudi/pull/7876#issuecomment-1423718585 Hi, In fact, `RunClusteringProcedure with partition selected` has been supported ![image](https://user-images.githubusercontent.com/37108074/217738829-70be53b0-78db-487b-ba81-cfe2337b

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101042060 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering

[GitHub] [hudi] hbgstc123 commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101029083 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering()

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101028481 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101028481 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering

[GitHub] [hudi] hbgstc123 commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101028220 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering()

[GitHub] [hudi] hbgstc123 commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101028069 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/table/HoodieTableSource.java: ## @@ -380,7 +380,8 @@ private List buildFileIndex() { .path

[GitHub] [hudi] hudi-bot commented on pull request #7904: [HUDI-5735] Flink-hudi write time format data UTC time zone problem

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7904: URL: https://github.com/apache/hudi/pull/7904#issuecomment-1423697590 ## CI report: * ac26a880833f1f19aea723f17a13c6efbd86f5ca Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1505

[GitHub] [hudi] hudi-bot commented on pull request #7895: [HUDI-5736] De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7895: URL: https://github.com/apache/hudi/pull/7895#issuecomment-1423692951 ## CI report: * 3b6dbf0bcc7059a7dc4a5132bf45d0d1451d7fb0 UNKNOWN * 192a62704a96fe3c67e5017d624e456b6722f02f Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #7904: [HUDI-5735] Flink-hudi write time format data UTC time zone problem

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7904: URL: https://github.com/apache/hudi/pull/7904#issuecomment-1423693027 ## CI report: * ac26a880833f1f19aea723f17a13c6efbd86f5ca UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #7895: [HUDI-5736] De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7895: URL: https://github.com/apache/hudi/pull/7895#issuecomment-1423683310 ## CI report: * 3b6dbf0bcc7059a7dc4a5132bf45d0d1451d7fb0 UNKNOWN * 192a62704a96fe3c67e5017d624e456b6722f02f UNKNOWN Bot commands @hudi-bot supports the following

[GitHub] [hudi] hudi-bot commented on pull request #7885: [MINOR] Make sure FTs are run in GH CI

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7885: URL: https://github.com/apache/hudi/pull/7885#issuecomment-1423683208 ## CI report: * c3d027696958b320912712447bcf41c3f2d28221 UNKNOWN * fe84e4662e1853b8ab23484e0c3a679e52a9d1cb Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[jira] [Updated] (HUDI-5735) Fix: Flink-hudi write time format data UTC time zone problem

2023-02-08 Thread luckily (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] luckily updated HUDI-5735: -- Issue Type: Improvement (was: Bug) > Fix: Flink-hudi write time format data UTC time zone problem > ---

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101012578 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering

[jira] [Updated] (HUDI-5735) Fix: Flink-hudi write time format data UTC time zone problem

2023-02-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5735: - Labels: pull-request-available (was: ) > Fix: Flink-hudi write time format data UTC time zone pro

[GitHub] [hudi] liaotian1005 opened a new pull request, #7904: [HUDI-5735] Flink-hudi write time format data UTC time zone problem

2023-02-08 Thread via GitHub
liaotian1005 opened a new pull request, #7904: URL: https://github.com/apache/hudi/pull/7904 link-hudi write data type of timestamp format UTC time zone problem. 1. The time zone written by flink is local, but the time zone read is not local 2.flink writes timestamp data, but

[GitHub] [hudi] koochiswathiTR commented on issue #3739: Hoodie clean is not deleting old files

2023-02-08 Thread via GitHub
koochiswathiTR commented on issue #3739: URL: https://github.com/apache/hudi/issues/3739#issuecomment-1423673376 can we change and schedule clean up so that cleanup runs only in one batch ? rest other batches processing time would be faster. @nsivabalan -- This is an automated message

[jira] [Updated] (HUDI-5736) De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread Alexander Trushev (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Trushev updated HUDI-5736: Description: Fix https://issues.apache.org/jira/browse/HUDI-5704 for Flink engine (was: Fix

[jira] [Updated] (HUDI-5736) De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread Alexander Trushev (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Trushev updated HUDI-5736: Component/s: flink writer-core > De-coupling column drop flag and schema va

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100996687 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100996687 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering

[jira] [Updated] (HUDI-5736) De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5736: - Labels: pull-request-available (was: ) > De-coupling column drop flag and schema validation flag

[GitHub] [hudi] hudi-bot commented on pull request #7895: [HUDI-5736] De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7895: URL: https://github.com/apache/hudi/pull/7895#issuecomment-1423652843 ## CI report: * 3b6dbf0bcc7059a7dc4a5132bf45d0d1451d7fb0 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #7872: [HUDI-5716] Cleaning up `Partitioner`s hierarchy

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7872: URL: https://github.com/apache/hudi/pull/7872#issuecomment-1423652742 ## CI report: * 8f42e8c18690c8ae76121c714c2c0cda21841264 UNKNOWN * bb3bd527c1c20fb046c23cd4d34e218fb7a06f82 UNKNOWN * 20cba2df6bd792c5173b2ef7780ca093e2fac2b5 Azure: [SUCCES

[GitHub] [hudi] pan3793 commented on pull request #7900: [HUDI-5731] Cleaning up unnecessary relocation for com.google.common packages

2023-02-08 Thread via GitHub
pan3793 commented on PR #7900: URL: https://github.com/apache/hudi/pull/7900#issuecomment-1423650689 Thanks for fixing this issue. And I think curator should be relocated/removed as well. The issue happens on Kyuubi IT because 1. Kyuubi invokes the curator to access ZK 2. Du

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100991761 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void testAppendWriteReadSkippingClustering

[GitHub] [hudi] veenaypatil commented on issue #6014: [SUPPORT] High runtime for a batch in SparkWriteHelper stage

2023-02-08 Thread via GitHub
veenaypatil commented on issue #6014: URL: https://github.com/apache/hudi/issues/6014#issuecomment-1423646531 @nsivabalan sorry for late response on this issue, I am not seeing this issue as of now, we were only seeing this issue when we killed the job and restarted it. > I see

[jira] [Created] (HUDI-5736) De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread Alexander Trushev (Jira)
Alexander Trushev created HUDI-5736: --- Summary: De-coupling column drop flag and schema validation flag in Flink Key: HUDI-5736 URL: https://issues.apache.org/jira/browse/HUDI-5736 Project: Apache Hu

[GitHub] [hudi] pramodbiligiri commented on a diff in pull request #7864: [HUDI-5688] Small workaround that can prevent NPE of EmptyRelation.schema

2023-02-08 Thread via GitHub
pramodbiligiri commented on code in PR #7864: URL: https://github.com/apache/hudi/pull/7864#discussion_r1100984391 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/DefaultSource.scala: ## @@ -241,7 +241,12 @@ object DefaultSource { } if (meta

[jira] [Updated] (HUDI-5672) Lockless multi writer support

2023-02-08 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-5672: - Summary: Lockless multi writer support (was: Flink multi writer support) > Lockless multi writer support

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100978930 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/table/HoodieTableSource.java: ## @@ -380,7 +380,8 @@ private List buildFileIndex() { .pa

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100978930 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/table/HoodieTableSource.java: ## @@ -380,7 +380,8 @@ private List buildFileIndex() { .pa

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100978401 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/table/HoodieTableSource.java: ## @@ -380,7 +380,8 @@ private List buildFileIndex() { .pa

[GitHub] [hudi] koochiswathiTR commented on issue #3739: Hoodie clean is not deleting old files

2023-02-08 Thread via GitHub
koochiswathiTR commented on issue #3739: URL: https://github.com/apache/hudi/issues/3739#issuecomment-1423611752 @nsivabalan,@vinothchandar , @bhasudha , @bvaradar , @n3nash Cleanup tirggers after compaction? or Cleanup runs when an upsert on hudi dataset ? or cleanup triggers when config

[GitHub] [hudi] hudi-bot commented on pull request #7860: [HUDI-5673] Support multi writer for bucket index with guarded lock

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7860: URL: https://github.com/apache/hudi/pull/7860#issuecomment-1423605075 ## CI report: * e72f988f68e3021f857b43b14b8721be3f988df5 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1502

[GitHub] [hudi] hudi-bot commented on pull request #7808: [MINOR] use ExecutorFactory in BootstrapHandler

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7808: URL: https://github.com/apache/hudi/pull/7808#issuecomment-1423604986 ## CI report: * dbedf67bd39cf8ff13b7dbe1294be86bc5f9718f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1502

[GitHub] [hudi] hudi-bot commented on pull request #7804: [HUDI-915][HUDI-5656] Rebased `HoodieBootstrapRelation` onto `HoodieBaseRelation`

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7804: URL: https://github.com/apache/hudi/pull/7804#issuecomment-1423604915 ## CI report: * 214938fa79f087400977256140ef633dace60663 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1481

[GitHub] [hudi] hudi-bot commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423604711 ## CI report: * 8b89f3d81e3df42d79d5e1a55672bb9beefee0a9 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=150

[GitHub] [hudi] KnightChess commented on pull request #7808: [MINOR] use ExecutorFactory in BootstrapHandler

2023-02-08 Thread via GitHub
KnightChess commented on PR #7808: URL: https://github.com/apache/hudi/pull/7808#issuecomment-1423602752 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[jira] [Updated] (HUDI-915) Partition Columns missing in files upserted after Metadata Bootstrap

2023-02-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-915: Labels: pull-request-available (was: ) > Partition Columns missing in files upserted after Metadata

[GitHub] [hudi] hudi-bot commented on pull request #7868: [HUDI-1593] Add support for "show restores" and "show restore" commands in hudi-cli

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7868: URL: https://github.com/apache/hudi/pull/7868#issuecomment-1423600470 ## CI report: * 5b6f539ecdc4ba84b7b509b43bf4c3836c575dca Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1503

[GitHub] [hudi] hudi-bot commented on pull request #7860: [HUDI-5673] Support multi writer for bucket index with guarded lock

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7860: URL: https://github.com/apache/hudi/pull/7860#issuecomment-1423600420 ## CI report: * e72f988f68e3021f857b43b14b8721be3f988df5 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1502

[GitHub] [hudi] hudi-bot commented on pull request #7804: [HUDI-915][HUDI-5656] Rebased `HoodieBootstrapRelation` onto `HoodieBaseRelation`

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7804: URL: https://github.com/apache/hudi/pull/7804#issuecomment-1423600291 ## CI report: * 214938fa79f087400977256140ef633dace60663 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1481

[GitHub] [hudi] hudi-bot commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423600051 ## CI report: * a3e25d91fe89abb52b2019c5f5a68f28a321a1f8 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1486

[jira] [Created] (HUDI-5735) Fix: Flink-hudi write time format data UTC time zone problem

2023-02-08 Thread luckily (Jira)
luckily created HUDI-5735: - Summary: Fix: Flink-hudi write time format data UTC time zone problem Key: HUDI-5735 URL: https://issues.apache.org/jira/browse/HUDI-5735 Project: Apache Hudi Issue Type:

[GitHub] [hudi] liaotian1005 commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
liaotian1005 commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423572273 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[GitHub] [hudi] hudi-bot commented on pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7903: URL: https://github.com/apache/hudi/pull/7903#issuecomment-1423568635 ## CI report: * ee465d312a5953c8b8337d7fa4f6d7dbc97142a2 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1504

[GitHub] [hudi] hudi-bot commented on pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7903: URL: https://github.com/apache/hudi/pull/7903#issuecomment-1423564644 ## CI report: * ee465d312a5953c8b8337d7fa4f6d7dbc97142a2 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #7894: [HUDI-5729]Fix RowDataKeyGen method getRecordKey

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7894: URL: https://github.com/apache/hudi/pull/7894#issuecomment-1423564610 ## CI report: * f5abcc66d670acbf7543915f127c96bd7622e01e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1503

[GitHub] [hudi] wuzhenhua01 commented on pull request #7678: [HUDI-5562] Add maven wrapper

2023-02-08 Thread via GitHub
wuzhenhua01 commented on PR #7678: URL: https://github.com/apache/hudi/pull/7678#issuecomment-1423537795 cc @alexeykudinkin -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[jira] [Updated] (HUDI-5562) Add maven wrapper

2023-02-08 Thread wuzhenhua (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wuzhenhua updated HUDI-5562: Description: In a project that uses Maven and often changes the required version, it might be easier to use

[jira] [Updated] (HUDI-5734) Fix flink batch read skip clustering data lost

2023-02-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5734: - Labels: pull-request-available (was: ) > Fix flink batch read skip clustering data lost > ---

[GitHub] [hudi] hbgstc123 opened a new pull request, #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 opened a new pull request, #7903: URL: https://github.com/apache/hudi/pull/7903 ### Change Logs When flink incremental batch read, disable skip_clustering config. Because skip_clustering could lost data when old commits are cleaned. ### Impact no ### R

[GitHub] [hudi] nbeeee opened a new issue, #7902: [SUPPORT].UnresolvedUnionException: Not in union exception occurred when writing data through spark

2023-02-08 Thread via GitHub
nb opened a new issue, #7902: URL: https://github.com/apache/hudi/issues/7902 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? - Join the mailing list to engage in conversations and get faster support at dev-subscr...

[jira] [Updated] (HUDI-5734) Fix flink batch read skip clustering data lost

2023-02-08 Thread HBG (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] HBG updated HUDI-5734: -- Summary: Fix flink batch read skip clustering data lost (was: Fix data lost because skip clustering when incremental ba

[GitHub] [hudi] hudi-bot commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423523385 ## CI report: * a3e25d91fe89abb52b2019c5f5a68f28a321a1f8 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1486

[jira] [Created] (HUDI-5734) Fix data lost because skip clustering when incremental batch read in flink

2023-02-08 Thread HBG (Jira)
HBG created HUDI-5734: - Summary: Fix data lost because skip clustering when incremental batch read in flink Key: HUDI-5734 URL: https://issues.apache.org/jira/browse/HUDI-5734 Project: Apache Hudi Issue

[GitHub] [hudi] hudi-bot commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423518841 ## CI report: * a3e25d91fe89abb52b2019c5f5a68f28a321a1f8 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1486

[GitHub] [hudi] hudi-bot commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423507868 ## CI report: * a3e25d91fe89abb52b2019c5f5a68f28a321a1f8 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1486

[GitHub] [hudi] liaotian1005 commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
liaotian1005 commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423484954 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[jira] [Created] (HUDI-5733) TestHoodieDeltaStreamer.testHoodieIndexer failure

2023-02-08 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-5733: - Summary: TestHoodieDeltaStreamer.testHoodieIndexer failure Key: HUDI-5733 URL: https://issues.apache.org/jira/browse/HUDI-5733 Project: Apache Hudi Issue T

[GitHub] [hudi] hudi-bot commented on pull request #7901: [HUDI-5665] Adding support to re-use table configs

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7901: URL: https://github.com/apache/hudi/pull/7901#issuecomment-1423472994 ## CI report: * 4f81cc10efc5863beb8f9656c05fc2e03ce6c7ee Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1504

[GitHub] [hudi] hudi-bot commented on pull request #7901: [HUDI-5665] Adding support to re-use table configs

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7901: URL: https://github.com/apache/hudi/pull/7901#issuecomment-1423467799 ## CI report: * 4f81cc10efc5863beb8f9656c05fc2e03ce6c7ee UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] nsivabalan opened a new pull request, #7901: [HUDI-5665] Adding support to re-use table configs

2023-02-08 Thread via GitHub
nsivabalan opened a new pull request, #7901: URL: https://github.com/apache/hudi/pull/7901 ### Change Logs - As of now, we expect users to set some of the mandatory fields in every write. For eg, record keys, partition path etc. These cannot change for a given table and gets serializ

[jira] [Updated] (HUDI-5665) Re-use table configs for subsequent writes

2023-02-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5665: - Labels: pull-request-available (was: ) > Re-use table configs for subsequent writes > ---

[GitHub] [hudi] hudi-bot commented on pull request #7886: [HUDI-5726]Fix timestamp field is 8 hours longer than the time

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7886: URL: https://github.com/apache/hudi/pull/7886#issuecomment-1423402514 ## CI report: * 69c39e941d6ee3cc21512b9d41b6fe048a91cc56 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1503

[GitHub] [hudi] alexeykudinkin commented on pull request #7898: [HUDI-5731] Add guava dependency to Spark and MR bundle

2023-02-08 Thread via GitHub
alexeykudinkin commented on PR #7898: URL: https://github.com/apache/hudi/pull/7898#issuecomment-1423397463 #7900 is addressing this properly by removing unnecessary relocations -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [hudi] hudi-bot commented on pull request #7900: [HUDI-5731] Cleaning up unnecessary relocation for com.google.common packages

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7900: URL: https://github.com/apache/hudi/pull/7900#issuecomment-1423367486 ## CI report: * a7c7f17108423f5d6f563faec66eb715d1a8f539 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1504

[GitHub] [hudi] hudi-bot commented on pull request #7847: [HUDI-5697] Revisiting refreshing of Hudi relations after write operations on the tables

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7847: URL: https://github.com/apache/hudi/pull/7847#issuecomment-1423367248 ## CI report: * 28ce832318206166f9d72f58510d46b50ba652d2 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1491

[GitHub] [hudi] hudi-bot commented on pull request #7752: [MINOR] De-duplicating Iterator implementations

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7752: URL: https://github.com/apache/hudi/pull/7752#issuecomment-1423367110 ## CI report: * 3ebe28ff5e180f1322cbd7621e57daa1234eb1dd Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1464

[GitHub] [hudi] hudi-bot commented on pull request #7900: [HUDI-5731] Cleaning up unnecessary relocation for com.google.common packages

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7900: URL: https://github.com/apache/hudi/pull/7900#issuecomment-1423362413 ## CI report: * a7c7f17108423f5d6f563faec66eb715d1a8f539 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #7891: [HUDI-5728] HoodieTimelineArchiver archives the latest instant before inflight replacecommit

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7891: URL: https://github.com/apache/hudi/pull/7891#issuecomment-1423362343 ## CI report: * 7b6cf690564944cfeacf6d2e29e029f86fddec51 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1503

[GitHub] [hudi] hudi-bot commented on pull request #7847: [HUDI-5697] Revisiting refreshing of Hudi relations after write operations on the tables

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7847: URL: https://github.com/apache/hudi/pull/7847#issuecomment-1423362089 ## CI report: * 28ce832318206166f9d72f58510d46b50ba652d2 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1491

[GitHub] [hudi] hudi-bot commented on pull request #7752: [MINOR] De-duplicating Iterator implementations

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7752: URL: https://github.com/apache/hudi/pull/7752#issuecomment-1423361929 ## CI report: * 3ebe28ff5e180f1322cbd7621e57daa1234eb1dd Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1464

[GitHub] [hudi] hudi-bot commented on pull request #7890: [MINOR] bot.yml ignore more filetype.

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7890: URL: https://github.com/apache/hudi/pull/7890#issuecomment-1423355534 ## CI report: * e97e505b8697f2dfa6cd0d4d42e018204c08215f Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1503

[GitHub] [hudi] hudi-bot commented on pull request #7885: [MINOR] Make sure FTs are run in GH CI

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7885: URL: https://github.com/apache/hudi/pull/7885#issuecomment-1423355449 ## CI report: * c3d027696958b320912712447bcf41c3f2d28221 UNKNOWN * b132eda24a8e705f210d1116dab543632fa09b0c Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #7872: [HUDI-5716] Cleaning up `Partitioner`s hierarchy

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7872: URL: https://github.com/apache/hudi/pull/7872#issuecomment-1423355371 ## CI report: * 8f42e8c18690c8ae76121c714c2c0cda21841264 UNKNOWN * bb3bd527c1c20fb046c23cd4d34e218fb7a06f82 UNKNOWN * de96182a55e0574c96d2b384734a4808c8ba6399 Azure: [FAILUR

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #7752: [MINOR] De-duplicating Iterator implementations

2023-02-08 Thread via GitHub
alexeykudinkin commented on code in PR #7752: URL: https://github.com/apache/hudi/pull/7752#discussion_r1100759052 ## hudi-common/src/main/java/org/apache/hudi/common/util/ClosableIterator.java: ## @@ -24,8 +24,29 @@ * An iterator that give a chance to release resources. *

[jira] [Updated] (HUDI-5731) Fix com.google.common classes still being relocated in Hudi Spark bundle

2023-02-08 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-5731: -- Description: As originally reported in: [https://github.com/apache/hudi/pull/6240#issuecomment-

[jira] [Updated] (HUDI-5731) Fix com.google.common classes still being relocated in Hudi Spark bundle

2023-02-08 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-5731: -- Fix Version/s: 0.13.1 > Fix com.google.common classes still being relocated in Hudi Spark bundle

[GitHub] [hudi] alexeykudinkin opened a new pull request, #7900: [MINOR] Cleaning up unnecessary relocation for com.google.common packages

2023-02-08 Thread via GitHub
alexeykudinkin opened a new pull request, #7900: URL: https://github.com/apache/hudi/pull/7900 ### Change Logs TBA ### Impact _Describe any public API or user-facing feature change or any performance impact._ ### Risk level (write none, low medium or high below)

[jira] [Updated] (HUDI-5731) Fix com.google.common classes still being relocated in Hudi Spark bundle

2023-02-08 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-5731: -- Summary: Fix com.google.common classes still being relocated in Hudi Spark bundle (was: Add gua

[jira] [Assigned] (HUDI-5731) Fix com.google.common classes still being relocated in Hudi Spark bundle

2023-02-08 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reassigned HUDI-5731: - Assignee: Alexey Kudinkin > Fix com.google.common classes still being relocated in Hudi S

[GitHub] [hudi] hudi-bot commented on pull request #7872: [HUDI-5716] Cleaning up `Partitioner`s hierarchy

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7872: URL: https://github.com/apache/hudi/pull/7872#issuecomment-1423318544 ## CI report: * 8f42e8c18690c8ae76121c714c2c0cda21841264 UNKNOWN * bb3bd527c1c20fb046c23cd4d34e218fb7a06f82 UNKNOWN * de96182a55e0574c96d2b384734a4808c8ba6399 Azure: [FAILUR

[GitHub] [hudi] hudi-bot commented on pull request #7885: [MINOR] Make sure FTs are run in GH CI

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7885: URL: https://github.com/apache/hudi/pull/7885#issuecomment-1423312566 ## CI report: * c3d027696958b320912712447bcf41c3f2d28221 UNKNOWN * b132eda24a8e705f210d1116dab543632fa09b0c Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[jira] [Updated] (HUDI-5732) Launching hudi-spark3.3 in EMR cluster w/ OSS spark fails due to timeline server (spark) NoClassDefFound

2023-02-08 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5732: -- Description: I am trying to use hudi-spark3.3. bundle in EMR cluster using OSS spark. 

[jira] [Updated] (HUDI-5732) Launching hudi-spark3.3 in EMR cluster w/ OSS spark fails due to timeline server (spark) NoClassDefFound

2023-02-08 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5732: -- Description: I am trying to use hudi-spark3.3. bundle in EMR cluster using OSS spark. 

[jira] [Created] (HUDI-5732) Launching hudi-spark3.3 in EMR cluster w/ OSS spark fails due to timeline server (spark) NoClassDefFound

2023-02-08 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5732: - Summary: Launching hudi-spark3.3 in EMR cluster w/ OSS spark fails due to timeline server (spark) NoClassDefFound Key: HUDI-5732 URL: https://issues.apache.org/jira/brow

[GitHub] [hudi] hudi-bot commented on pull request #7898: [HUDI-5731] Add guava dependency to Spark and MR bundle

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7898: URL: https://github.com/apache/hudi/pull/7898#issuecomment-1423255409 ## CI report: * e10025521ff7a24978b9f22b1180bce55e2238fe Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1504

  1   2   >