[GitHub] [hudi] lokeshj1703 commented on pull request #7869: [HUDI-5713] created essential property for configs

2023-03-09 Thread via GitHub
lokeshj1703 commented on PR #7869: URL: https://github.com/apache/hudi/pull/7869#issuecomment-1461597006 @jonvex Can you mark this PR as ready? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[jira] [Created] (HUDI-5911) SimpleTransactionDirectMarkerBasedDetectionStrategy can't work with none-partitioned table

2023-03-09 Thread xi chaomin (Jira)
xi chaomin created HUDI-5911: Summary: SimpleTransactionDirectMarkerBasedDetectionStrategy can't work with none-partitioned table Key: HUDI-5911 URL: https://issues.apache.org/jira/browse/HUDI-5911 Projec

[jira] [Created] (HUDI-5912) Update snapshot_exporter to reflect the corrent jar name.md

2023-03-09 Thread Danny Chen (Jira)
Danny Chen created HUDI-5912: Summary: Update snapshot_exporter to reflect the corrent jar name.md Key: HUDI-5912 URL: https://issues.apache.org/jira/browse/HUDI-5912 Project: Apache Hudi Issue

[GitHub] [hudi] hudi-bot commented on pull request #8125: [HUDI-5900] Clean up unused metadata configs

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8125: URL: https://github.com/apache/hudi/pull/8125#issuecomment-1461617238 ## CI report: * 5c1deb1c2e910c41d8396ecaf3961a63444583a7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1561

[GitHub] [hudi] hudi-bot commented on pull request #8128: [HUDI-5782] Tweak defaults and remove unnecessary configs after config review

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8128: URL: https://github.com/apache/hudi/pull/8128#issuecomment-1461617350 ## CI report: * 894861b03430217482771663639c9e413b0dca3b UNKNOWN * 8c39806bc06d180eb8c07b0879bc9fad3c9cc170 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] xicm opened a new pull request, #8143: [HUDI-5911] SimpleTransactionDirectMarkerBasedDetectionStrategy can't work with none-partitioned table

2023-03-09 Thread via GitHub
xicm opened a new pull request, #8143: URL: https://github.com/apache/hudi/pull/8143 ### Change Logs lock_key is `hoodie.write.lock.zookeeper.base_path` + partition + '/' + fileId in SimpleTransactionDirectMarkerBasedDetectionStrategy. If the table is a none partition table, the path

[GitHub] [hudi] danny0405 commented on issue #8087: [SUPPORT] split_reader don't checkpoint before consuming all splits

2023-03-09 Thread via GitHub
danny0405 commented on issue #8087: URL: https://github.com/apache/hudi/issues/8087#issuecomment-1461618237 Yes, you are right. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific commen

[jira] [Updated] (HUDI-5911) SimpleTransactionDirectMarkerBasedDetectionStrategy can't work with none-partitioned table

2023-03-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5911: - Labels: pull-request-available (was: ) > SimpleTransactionDirectMarkerBasedDetectionStrategy can'

[GitHub] [hudi] danny0405 commented on issue #8087: [SUPPORT] split_reader don't checkpoint before consuming all splits

2023-03-09 Thread via GitHub
danny0405 commented on issue #8087: URL: https://github.com/apache/hudi/issues/8087#issuecomment-1461619274 > I will fire a fix. But what I don't understand is why MailboxExecutor doesn't work as expected Needs to dig into the backround before we fire a fix. -- This is an automate

[GitHub] [hudi] danny0405 commented on issue #8141: [SUPPORT] how to use hudi in cdp 7.1.7

2023-03-09 Thread via GitHub
danny0405 commented on issue #8141: URL: https://github.com/apache/hudi/issues/8141#issuecomment-1461622117 > Multiple sources found for hudi (org.apache.hudi.Spark3DefaultSource, org.apache.hudi.Spark32PlusDefaultSource), please specify the fully qualified class name. Seems it is

[GitHub] [hudi] hudi-bot commented on pull request #8143: [HUDI-5911] SimpleTransactionDirectMarkerBasedDetectionStrategy can't work with none-partitioned table

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8143: URL: https://github.com/apache/hudi/pull/8143#issuecomment-1461630070 ## CI report: * 0bd1545bc12e727c30a08e689c05fcc59c1a UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #8125: [HUDI-5900] Clean up unused metadata configs

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8125: URL: https://github.com/apache/hudi/pull/8125#issuecomment-1461629887 ## CI report: * 5c1deb1c2e910c41d8396ecaf3961a63444583a7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1561

[GitHub] [hudi] xuzifu666 commented on a diff in pull request #8133: [HUDI-5904] support more than one update actions in merge into table

2023-03-09 Thread via GitHub
xuzifu666 commented on code in PR #8133: URL: https://github.com/apache/hudi/pull/8133#discussion_r1130710788 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestMergeIntoTable.scala: ## @@ -115,6 +116,72 @@ class TestMergeIntoTable extends HoodieSpa

[GitHub] [hudi] hudi-bot commented on pull request #8143: [HUDI-5911] SimpleTransactionDirectMarkerBasedDetectionStrategy can't work with none-partitioned table

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8143: URL: https://github.com/apache/hudi/pull/8143#issuecomment-1461650211 ## CI report: * 0bd1545bc12e727c30a08e689c05fcc59c1a Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[jira] [Created] (HUDI-5913) Table can not read correctly when computed column is in the midst

2023-03-09 Thread Danny Chen (Jira)
Danny Chen created HUDI-5913: Summary: Table can not read correctly when computed column is in the midst Key: HUDI-5913 URL: https://issues.apache.org/jira/browse/HUDI-5913 Project: Apache Hudi

[GitHub] [hudi] danny0405 commented on pull request #8098: [HUDI-5913] Table can not read correctly when computed column is in the midst

2023-03-09 Thread via GitHub
danny0405 commented on PR #8098: URL: https://github.com/apache/hudi/pull/8098#issuecomment-1461669960 Thanks for the feedback, you are right, the test case passed because the computed column is the last column within the schema, I have created a patch and attach it here: [5913.patch.zi

[jira] [Updated] (HUDI-5913) Table can not read correctly when computed column is in the midst

2023-03-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5913: - Labels: pull-request-available (was: ) > Table can not read correctly when computed column is in

[GitHub] [hudi] danny0405 commented on a diff in pull request #8133: [HUDI-5904] support more than one update actions in merge into table

2023-03-09 Thread via GitHub
danny0405 commented on code in PR #8133: URL: https://github.com/apache/hudi/pull/8133#discussion_r1130727610 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestMergeIntoTable.scala: ## @@ -115,6 +116,72 @@ class TestMergeIntoTable extends HoodieSpa

[GitHub] [hudi] danny0405 commented on pull request #8139: [HUDI-5909] Reuse hive client if possible

2023-03-09 Thread via GitHub
danny0405 commented on PR #8139: URL: https://github.com/apache/hudi/pull/8139#issuecomment-1461676523 > > We try to close the hive meta sync connection after each meta sync, does that logic still work after your change? > > We don't close `HiveClient`(it lives with the lifetime of th

[GitHub] [hudi] hudi-bot commented on pull request #8072: [HUDI-5857] Insert overwrite into bucket table would generate new file group id

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8072: URL: https://github.com/apache/hudi/pull/8072#issuecomment-1461725586 ## CI report: * a76dc55ea3de9fc9b5f886dc5e5162c29b651a7a Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1562

[GitHub] [hudi] hudi-bot commented on pull request #8072: [HUDI-5857] Insert overwrite into bucket table would generate new file group id

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8072: URL: https://github.com/apache/hudi/pull/8072#issuecomment-1461739031 ## CI report: * a76dc55ea3de9fc9b5f886dc5e5162c29b651a7a Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1562

[GitHub] [hudi] hudi-bot commented on pull request #8051: [HUDI-5851] Improvement of data skipping, only converts expressions to evaluators once

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8051: URL: https://github.com/apache/hudi/pull/8051#issuecomment-1461820992 ## CI report: * 70822885c9cd8df3e8e540d3febffd4a4d1dfe32 UNKNOWN * 2b559390d0b7bed0926f4f536106ac7e3741003f Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #8125: [HUDI-5900] Clean up unused metadata configs

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8125: URL: https://github.com/apache/hudi/pull/8125#issuecomment-1461821429 ## CI report: * 2468b8707960fbd4c4f9fd1df74d99a6273033c6 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[GitHub] [hudi] hudi-bot commented on pull request #8051: [HUDI-5851] Improvement of data skipping, only converts expressions to evaluators once

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8051: URL: https://github.com/apache/hudi/pull/8051#issuecomment-1461849915 ## CI report: * 70822885c9cd8df3e8e540d3febffd4a4d1dfe32 UNKNOWN * 2b559390d0b7bed0926f4f536106ac7e3741003f Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #8125: [HUDI-5900] Clean up unused metadata configs

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8125: URL: https://github.com/apache/hudi/pull/8125#issuecomment-1461850286 ## CI report: * 2468b8707960fbd4c4f9fd1df74d99a6273033c6 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[GitHub] [hudi] MarlboroBoy commented on issue #8141: [SUPPORT] how to use hudi in cdp 7.1.7

2023-03-09 Thread via GitHub
MarlboroBoy commented on issue #8141: URL: https://github.com/apache/hudi/issues/8141#issuecomment-1461855123 ![Uploading WX20230309-193057.png…]() -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

[GitHub] [hudi] hudi-bot commented on pull request #8125: [HUDI-5900] Clean up unused metadata configs

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8125: URL: https://github.com/apache/hudi/pull/8125#issuecomment-1461914519 ## CI report: * 2468b8707960fbd4c4f9fd1df74d99a6273033c6 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[GitHub] [hudi] hudi-bot commented on pull request #8143: [HUDI-5911] SimpleTransactionDirectMarkerBasedDetectionStrategy can't work with none-partitioned table

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8143: URL: https://github.com/apache/hudi/pull/8143#issuecomment-1461914734 ## CI report: * 0bd1545bc12e727c30a08e689c05fcc59c1a Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[GitHub] [hudi] bithw1 commented on issue #7994: [SUPPORT]How to get back the historic commit time information in my scenario

2023-03-09 Thread via GitHub
bithw1 commented on issue #7994: URL: https://github.com/apache/hudi/issues/7994#issuecomment-1461918269 Thanks @Zouxxyy for the helpful answer! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to th

[GitHub] [hudi] bithw1 closed issue #7994: [SUPPORT]How to get back the historic commit time information in my scenario

2023-03-09 Thread via GitHub
bithw1 closed issue #7994: [SUPPORT]How to get back the historic commit time information in my scenario URL: https://github.com/apache/hudi/issues/7994 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

[GitHub] [hudi] hudi-bot commented on pull request #8133: [HUDI-5904] support more than one update actions in merge into table

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8133: URL: https://github.com/apache/hudi/pull/8133#issuecomment-1461928335 ## CI report: * d9df3394e7f023685edf606626268a625fc9b9cc Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1562

[GitHub] [hudi] hudi-bot commented on pull request #8133: [HUDI-5904] support more than one update actions in merge into table

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8133: URL: https://github.com/apache/hudi/pull/8133#issuecomment-1461944225 ## CI report: * d9df3394e7f023685edf606626268a625fc9b9cc Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1562

[GitHub] [hudi] hudi-bot commented on pull request #8088: [HUDI-5873] The pending compactions of dataset table should not block…

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8088: URL: https://github.com/apache/hudi/pull/8088#issuecomment-1461943991 ## CI report: * c65842899078697c5c5ff647e89f7cf918531f8d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1554

[GitHub] [hudi] beyond1920 commented on pull request #8072: [HUDI-5857] Insert overwrite into bucket table would generate new file group id

2023-03-09 Thread via GitHub
beyond1920 commented on PR #8072: URL: https://github.com/apache/hudi/pull/8072#issuecomment-1461972444 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[GitHub] [hudi] hudi-bot commented on pull request #8072: [HUDI-5857] Insert overwrite into bucket table would generate new file group id

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8072: URL: https://github.com/apache/hudi/pull/8072#issuecomment-1462026938 ## CI report: * f5503443fd6080f0ec93a9a21e14a33f0fa432c7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[GitHub] [hudi] hudi-bot commented on pull request #8088: [HUDI-5873] The pending compactions of dataset table should not block…

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8088: URL: https://github.com/apache/hudi/pull/8088#issuecomment-1462027120 ## CI report: * c65842899078697c5c5ff647e89f7cf918531f8d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1554

[GitHub] [hudi] hudi-bot commented on pull request #8072: [HUDI-5857] Insert overwrite into bucket table would generate new file group id

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8072: URL: https://github.com/apache/hudi/pull/8072#issuecomment-1462039805 ## CI report: * a76dc55ea3de9fc9b5f886dc5e5162c29b651a7a Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1562

[GitHub] [hudi] hudi-bot commented on pull request #8133: [HUDI-5904] support more than one update actions in merge into table

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8133: URL: https://github.com/apache/hudi/pull/8133#issuecomment-1462040249 ## CI report: * 53a6b317d65baadbb12b440f45b39fecca8bba9d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[GitHub] [hudi] leesf commented on a diff in pull request #8133: [HUDI-5904] support more than one update actions in merge into table

2023-03-09 Thread via GitHub
leesf commented on code in PR #8133: URL: https://github.com/apache/hudi/pull/8133#discussion_r1131055256 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestMergeIntoTable.scala: ## @@ -115,6 +116,65 @@ class TestMergeIntoTable extends HoodieSparkSq

[GitHub] [hudi] xuzifu666 commented on a diff in pull request #8133: [HUDI-5904] support more than one update actions in merge into table

2023-03-09 Thread via GitHub
xuzifu666 commented on code in PR #8133: URL: https://github.com/apache/hudi/pull/8133#discussion_r1131058033 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestMergeIntoTable.scala: ## @@ -115,6 +116,65 @@ class TestMergeIntoTable extends HoodieSpa

[GitHub] [hudi] hudi-bot commented on pull request #8133: [HUDI-5904] support more than one update actions in merge into table

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8133: URL: https://github.com/apache/hudi/pull/8133#issuecomment-1462124893 ## CI report: * 53a6b317d65baadbb12b440f45b39fecca8bba9d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[GitHub] [hudi] hudi-bot commented on pull request #8051: [HUDI-5851] Improvement of data skipping, only converts expressions to evaluators once

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8051: URL: https://github.com/apache/hudi/pull/8051#issuecomment-1462124426 ## CI report: * 70822885c9cd8df3e8e540d3febffd4a4d1dfe32 UNKNOWN * cd1480e99f1a380e052f848370b84b1d4f4018d4 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #8072: [HUDI-5857] Insert overwrite into bucket table would generate new file group id

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8072: URL: https://github.com/apache/hudi/pull/8072#issuecomment-1462136496 ## CI report: * a76dc55ea3de9fc9b5f886dc5e5162c29b651a7a Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1562

[GitHub] [hudi] hudi-bot commented on pull request #8133: [HUDI-5904] support more than one update actions in merge into table

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8133: URL: https://github.com/apache/hudi/pull/8133#issuecomment-1462136912 ## CI report: * 53a6b317d65baadbb12b440f45b39fecca8bba9d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[GitHub] [hudi] hudi-bot commented on pull request #8125: [HUDI-5900] Clean up unused metadata configs

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8125: URL: https://github.com/apache/hudi/pull/8125#issuecomment-1462136785 ## CI report: * 451c5e463186f1346b601822a7f010205f68f040 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[GitHub] [hudi] vinothchandar commented on issue #8018: [SUPPORT] why is the schema evolution done while not setting hoodie.schema.on.read.enable

2023-03-09 Thread via GitHub
vinothchandar commented on issue #8018: URL: https://github.com/apache/hudi/issues/8018#issuecomment-1462141331 +1 on @kazdy 's notes above on ASR. Hudi has always supported some automatic schema evolution to deal with streaming data similar to what Kafka/Schema registry model achieves. The

[GitHub] [hudi] vinothchandar commented on issue #8018: [SUPPORT] why is the schema evolution done while not setting hoodie.schema.on.read.enable

2023-03-09 Thread via GitHub
vinothchandar commented on issue #8018: URL: https://github.com/apache/hudi/issues/8018#issuecomment-1462143271 @menna224 For your original issue on not adding the new column, it's not something that has come up before. So we would need to provide some way to alter behavior to ignore the ex

[GitHub] [hudi] hudi-bot commented on pull request #8072: [HUDI-5857] Insert overwrite into bucket table would generate new file group id

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8072: URL: https://github.com/apache/hudi/pull/8072#issuecomment-1462148944 ## CI report: * a76dc55ea3de9fc9b5f886dc5e5162c29b651a7a Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1562

[GitHub] [hudi] hudi-bot commented on pull request #8133: [HUDI-5904] support more than one update actions in merge into table

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8133: URL: https://github.com/apache/hudi/pull/8133#issuecomment-1462149339 ## CI report: * 53a6b317d65baadbb12b440f45b39fecca8bba9d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[GitHub] [hudi] KnightChess commented on pull request #7956: [HUDI-5797] fix use bulk insert error as row

2023-03-09 Thread via GitHub
KnightChess commented on PR #7956: URL: https://github.com/apache/hudi/pull/7956#issuecomment-1462149633 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[GitHub] [hudi] hudi-bot commented on pull request #8088: [HUDI-5873] The pending compactions of dataset table should not block…

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8088: URL: https://github.com/apache/hudi/pull/8088#issuecomment-1462209986 ## CI report: * 10777af559d8be0a0c421ebfb98f001501638aa5 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1563

[GitHub] [hudi] peter-mccabe opened a new issue, #8144: [SUPPORT]Unable to connect to an s3 hudi table

2023-03-09 Thread via GitHub
peter-mccabe opened a new issue, #8144: URL: https://github.com/apache/hudi/issues/8144 I am unable to connect to an s3 hudi table using the hudi client. I keep getting an error: Loading HoodieTableMetaClient from s3://test-datalake/datasets//test_table Failed to get instance of or

[GitHub] [hudi] hudi-bot commented on pull request #7956: [HUDI-5797] fix use bulk insert error as row

2023-03-09 Thread via GitHub
hudi-bot commented on PR #7956: URL: https://github.com/apache/hudi/pull/7956#issuecomment-1462209379 ## CI report: * 5bd4d5c4de8fc54bf93fb7fd252b6e61fda85373 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1519

[GitHub] [hudi] hudi-bot commented on pull request #8072: [HUDI-5857] Insert overwrite into bucket table would generate new file group id

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8072: URL: https://github.com/apache/hudi/pull/8072#issuecomment-1462426218 ## CI report: * 7d6319dd8880f2f5af3ba0ea1c38058499585337 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1564

[GitHub] [hudi] hudi-bot commented on pull request #8133: [HUDI-5904] support more than one update actions in merge into table

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8133: URL: https://github.com/apache/hudi/pull/8133#issuecomment-1462426618 ## CI report: * 8e3fad5fa9e9c64e7e345a317865f6fe6a9a7620 UNKNOWN * 5b8a43f4b2f18352738b6e9c9a183a1bde5c4540 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[jira] [Updated] (HUDI-5688) schema field of EmptyRelation subtype of BaseRelation should not be null

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5688: - Fix Version/s: 0.13.1 0.12.3 > schema field of EmptyRelation subtype of BaseRelation sh

[GitHub] [hudi] kkrugler commented on issue #8136: [SUPPORT] Wrong type returned by ParquetColumnarRowSplitReader in hudi-flink1.16.x code

2023-03-09 Thread via GitHub
kkrugler commented on issue #8136: URL: https://github.com/apache/hudi/issues/8136#issuecomment-1462510643 Hi @BruceKellan, > Hi kkrugler, I have seen your project code. > > Hudi-flink is not directly open to users. > > You rely on hudi-flink in your code, so you indirect

[GitHub] [hudi] hudi-bot commented on pull request #7956: [HUDI-5797] fix use bulk insert error as row

2023-03-09 Thread via GitHub
hudi-bot commented on PR #7956: URL: https://github.com/apache/hudi/pull/7956#issuecomment-1462533868 ## CI report: * 5bd4d5c4de8fc54bf93fb7fd252b6e61fda85373 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1519

[GitHub] [hudi] kkrugler opened a new pull request, #8145: Fix for class cast exception

2023-03-09 Thread via GitHub
kkrugler opened a new pull request, #8145: URL: https://github.com/apache/hudi/pull/8145 ### Change Logs Change return type of ParquetColumnarRowSplitReader (in hudi-flink1.16.x code base) to RowData, was ColumnarRowData See https://github.com/apache/hudi/issues/8136 ###

[GitHub] [hudi] kkrugler commented on issue #8136: [SUPPORT] Wrong type returned by ParquetColumnarRowSplitReader in hudi-flink1.16.x code

2023-03-09 Thread via GitHub
kkrugler commented on issue #8136: URL: https://github.com/apache/hudi/issues/8136#issuecomment-1462545592 Hi @danny0405 - see https://github.com/apache/hudi/pull/8145 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use th

[GitHub] [hudi] sydneyhoran commented on issue #6316: [SUPPORT] Running `--continuous` mode with HoodieMultiTableDeltaStreamer seems to only ingest first table

2023-03-09 Thread via GitHub
sydneyhoran commented on issue #6316: URL: https://github.com/apache/hudi/issues/6316#issuecomment-1462564388 After more testing, I believe one more [code change](https://github.com/sydneyhoran/hudi/blob/bde3719226bade5bce204cdc0d16fb3874123e0d/hudi-utilities/src/main/java/org/apache/hudi/ut

[GitHub] [hudi] hudi-bot commented on pull request #8145: Fix for class cast exception

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8145: URL: https://github.com/apache/hudi/pull/8145#issuecomment-1462565861 ## CI report: * ce0d21a20373e72ec6a5aa89c61bc538a075ccc2 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #8145: Fix for class cast exception

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8145: URL: https://github.com/apache/hudi/pull/8145#issuecomment-1462614966 ## CI report: * ce0d21a20373e72ec6a5aa89c61bc538a075ccc2 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1564

[GitHub] [hudi] sydneyhoran opened a new issue, #8146: [SUPPORT] Running `--continuous` mode with MultiTable and PostWriteTerminationStrategy seems to leave Spark job hanging

2023-03-09 Thread via GitHub
sydneyhoran opened a new issue, #8146: URL: https://github.com/apache/hudi/issues/8146 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? - Join the mailing list to engage in conversations and get faster support at dev-subs

[GitHub] [hudi] yihua commented on a diff in pull request #7912: [HUDI-5739] Review Hudi Config Defaults

2023-03-09 Thread via GitHub
yihua commented on code in PR #7912: URL: https://github.com/apache/hudi/pull/7912#discussion_r1131559389 ## hudi-common/src/main/java/org/apache/hudi/common/config/HoodieStorageConfig.java: ## @@ -108,6 +111,7 @@ public class HoodieStorageConfig extends HoodieConfig { .d

[GitHub] [hudi] hudi-bot commented on pull request #8145: Fix for class cast exception

2023-03-09 Thread via GitHub
hudi-bot commented on PR #8145: URL: https://github.com/apache/hudi/pull/8145#issuecomment-1462817486 ## CI report: * ce0d21a20373e72ec6a5aa89c61bc538a075ccc2 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1564

[GitHub] [hudi] kkrugler opened a new issue, #8147: [SUPPORT] Missing dependency on hive-exec (core)

2023-03-09 Thread via GitHub
kkrugler opened a new issue, #8147: URL: https://github.com/apache/hudi/issues/8147 **Describe the problem you faced** When using Flink to do an incremental query read from a table, using the 0.12.2 and Flink 1.15, I occasionally get a ClassNotFoundException for `org.apache.hadoop.hi

[GitHub] [hudi] kkrugler opened a new issue, #8148: [SUPPORT]

2023-03-09 Thread via GitHub
kkrugler opened a new issue, #8148: URL: https://github.com/apache/hudi/issues/8148 **Describe the problem you faced** When running a Flink workflow that writes to a Hudi table, metaspace is leaked whenever the job restarts from a checkpoint. Additionally, if a persistent (not

[GitHub] [hudi] phani482 commented on issue #7800: [SUPPORT] "java.lang.OutOfMemoryError: Requested array size exceeds VM limit" while writing to Hudi COW table

2023-03-09 Thread via GitHub
phani482 commented on issue #7800: URL: https://github.com/apache/hudi/issues/7800#issuecomment-1463030125 Thanks! @nsivabalan Will try it out and see if this will fix our issue. Although this could take some time for us to implement in prod. Will post here whenever we do the upgrad

[jira] [Updated] (HUDI-5520) Fail MDT when list of log files grows unboundedly

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5520: - Fix Version/s: 0.12.3 > Fail MDT when list of log files grows unboundedly > --

[jira] [Updated] (HUDI-5507) SparkSQL can not read the latest change data without execute "refresh table xxx"

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5507: - Fix Version/s: 0.12.3 > SparkSQL can not read the latest change data without execute "refresh table > xxx

[jira] [Updated] (HUDI-5500) Short-circuit upsert operation when we know that the table is empty

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5500: - Issue Type: Improvement (was: Bug) > Short-circuit upsert operation when we know that the table is empty

[jira] [Updated] (HUDI-5498) Update docs for reading Hudi tables on Databricks runtime

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5498: - Fix Version/s: 0.14.0 (was: 0.13.1) > Update docs for reading Hudi tables on Databr

[jira] [Updated] (HUDI-5498) Update docs for reading Hudi tables on Databricks runtime

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5498: - Fix Version/s: 0.13.1 0.12.3 (was: 0.14.0) > Update docs for rea

[jira] [Updated] (HUDI-5500) Short-circuit upsert operation when we know that the table is empty

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5500: - Priority: Critical (was: Blocker) > Short-circuit upsert operation when we know that the table is empty >

[jira] [Updated] (HUDI-5450) Test Record level index as default w/ azure CI

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5450: - Fix Version/s: 0.14.0 (was: 0.13.1) > Test Record level index as default w/ azure C

[jira] [Updated] (HUDI-5446) Add support to write record level index to MDT

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5446: - Fix Version/s: 0.14.0 (was: 0.13.1) > Add support to write record level index to MD

[jira] [Updated] (HUDI-5444) FileNotFound issue w/ metadata enabled

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5444: - Fix Version/s: 0.12.3 > FileNotFound issue w/ metadata enabled > -- >

[jira] [Updated] (HUDI-5447) Add support for Record level index read from MDT

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5447: - Fix Version/s: 0.14.0 (was: 0.13.1) > Add support for Record level index read from

[jira] [Updated] (HUDI-619) Investigate and implement mechanism to have hive/presto/sparksql queries avoid stitching and return null values for hoodie columns

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-619: Fix Version/s: 0.14.0 (was: 0.13.1) > Investigate and implement mechanism to have hive

[jira] [Updated] (HUDI-992) For hive-style partitioned source data, partition columns synced with Hive will always have String type

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-992: Fix Version/s: 0.12.3 > For hive-style partitioned source data, partition columns synced with Hive > will al

[jira] [Updated] (HUDI-1369) Bootstrap datasource jobs from hanging via spark-submit

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1369: - Fix Version/s: 0.12.3 > Bootstrap datasource jobs from hanging via spark-submit >

[jira] [Updated] (HUDI-2431) Reimplement BufferedWriter in streaming fashion

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2431: - Fix Version/s: 0.14.0 (was: 0.13.1) > Reimplement BufferedWriter in streaming fashi

[jira] [Updated] (HUDI-1779) Fail to bootstrap/upsert a table which contains timestamp column

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1779: - Fix Version/s: 0.12.3 > Fail to bootstrap/upsert a table which contains timestamp column > ---

[jira] [Updated] (HUDI-3114) Kafka Connect can not connect Hive by jdbc

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3114: - Component/s: dependencies > Kafka Connect can not connect Hive by jdbc > -

[jira] [Updated] (HUDI-2458) Relax compaction in metadata being fenced based on inflight requests in data table

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2458: - Fix Version/s: 0.12.3 > Relax compaction in metadata being fenced based on inflight requests in data > ta

[jira] [Updated] (HUDI-3113) Kafka Connect create Multiple Embedded Timeline Services

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3113: - Fix Version/s: 0.14.0 (was: 0.13.1) > Kafka Connect create Multiple Embedded Timeli

[GitHub] [hudi] nsivabalan commented on a diff in pull request #8107: [HUDI-5514] Adding auto generation of record keys support to Hudi

2023-03-09 Thread via GitHub
nsivabalan commented on code in PR #8107: URL: https://github.com/apache/hudi/pull/8107#discussion_r1131831306 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/HoodieDatasetBulkInsertHelper.scala: ## @@ -82,9 +85,19 @@ object HoodieDatasetBulkInsertHelper

[jira] [Updated] (HUDI-3114) Kafka Connect can not connect Hive by jdbc

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3114: - Fix Version/s: 0.12.3 > Kafka Connect can not connect Hive by jdbc > -

[jira] [Updated] (HUDI-3114) Kafka Connect can not connect Hive by jdbc

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3114: - Priority: Critical (was: Blocker) > Kafka Connect can not connect Hive by jdbc >

[GitHub] [hudi] nsivabalan commented on a diff in pull request #8107: [HUDI-5514] Adding auto generation of record keys support to Hudi

2023-03-09 Thread via GitHub
nsivabalan commented on code in PR #8107: URL: https://github.com/apache/hudi/pull/8107#discussion_r1131831680 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/HoodieDatasetBulkInsertHelper.scala: ## @@ -82,9 +85,19 @@ object HoodieDatasetBulkInsertHelper

[GitHub] [hudi] nsivabalan commented on a diff in pull request #8107: [HUDI-5514] Adding auto generation of record keys support to Hudi

2023-03-09 Thread via GitHub
nsivabalan commented on code in PR #8107: URL: https://github.com/apache/hudi/pull/8107#discussion_r1131832606 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/HoodieDatasetBulkInsertHelper.scala: ## @@ -82,9 +85,19 @@ object HoodieDatasetBulkInsertHelper

[jira] [Updated] (HUDI-3674) Remove unnecessary HBase-related dependencies from bundles if there is any

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3674: - Component/s: dependencies > Remove unnecessary HBase-related dependencies from bundles if there is any > -

[jira] [Updated] (HUDI-3517) Unicode in partition path causes it to be resolved wrongly

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3517: - Fix Version/s: 0.12.3 > Unicode in partition path causes it to be resolved wrongly > -

[jira] [Updated] (HUDI-3674) Remove unnecessary HBase-related dependencies from bundles if there is any

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3674: - Fix Version/s: 0.12.3 > Remove unnecessary HBase-related dependencies from bundles if there is any > -

[jira] [Updated] (HUDI-3411) Incorrect Record Key Field property Handling

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3411: - Fix Version/s: 0.14.0 (was: 0.13.1) > Incorrect Record Key Field property Handling

[jira] [Updated] (HUDI-3853) Integ Tests running against Spark3

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3853: - Fix Version/s: 0.14.0 (was: 0.13.1) > Integ Tests running against Spark3 >

[jira] [Updated] (HUDI-3879) Suppress exceptions that are not fatal in HoodieMetadataTableValidator

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3879: - Component/s: metadata > Suppress exceptions that are not fatal in HoodieMetadataTableValidator > -

[jira] [Updated] (HUDI-3879) Suppress exceptions that are not fatal in HoodieMetadataTableValidator

2023-03-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3879: - Fix Version/s: 0.12.3 > Suppress exceptions that are not fatal in HoodieMetadataTableValidator > -

[GitHub] [hudi] nsivabalan commented on a diff in pull request #8107: [HUDI-5514] Adding auto generation of record keys support to Hudi

2023-03-09 Thread via GitHub
nsivabalan commented on code in PR #8107: URL: https://github.com/apache/hudi/pull/8107#discussion_r1131833317 ## hudi-common/src/main/java/org/apache/hudi/common/table/HoodieTableConfig.java: ## @@ -260,6 +260,18 @@ public class HoodieTableConfig extends HoodieConfig { .

  1   2   3   >