[GitHub] [hudi] hudi-bot commented on pull request #5639: [HUDI-2516] Upgrade JUnit to 5.8.2

2022-05-22 Thread GitBox
hudi-bot commented on PR #5639: URL: https://github.com/apache/hudi/pull/5639#issuecomment-1134253359 ## CI report: * a82f303254c75fa7f1c690ee9113a637b84d399c Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8846

[GitHub] [hudi] hudi-bot commented on pull request #5639: [HUDI-2516] Upgrade JUnit to 5.8.2

2022-05-22 Thread GitBox
hudi-bot commented on PR #5639: URL: https://github.com/apache/hudi/pull/5639#issuecomment-1134247939 ## CI report: * d77015ef99eedcfa824fa84ece205ca4448265ad Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8844

[GitHub] [hudi] hudi-bot commented on pull request #5639: [HUDI-2516] Upgrade JUnit to 5.8.2

2022-05-22 Thread GitBox
hudi-bot commented on PR #5639: URL: https://github.com/apache/hudi/pull/5639#issuecomment-1134245355 ## CI report: * d77015ef99eedcfa824fa84ece205ca4448265ad Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8844

[GitHub] [hudi] hudi-bot commented on pull request #5659: [HUDI-4137] SnowflakeSyncTool MVP implementation to integrate with Snowflake

2022-05-22 Thread GitBox
hudi-bot commented on PR #5659: URL: https://github.com/apache/hudi/pull/5659#issuecomment-1134242794 ## CI report: * 223068cbc3deeac3f39b5592f3812d5b4643ad2d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8845

[GitHub] [hudi] hudi-bot commented on pull request #5567: [RFC-53] Use Lock-Free Message Queue Disruptor Improving Hoodie Writing Efficiency

2022-05-22 Thread GitBox
hudi-bot commented on PR #5567: URL: https://github.com/apache/hudi/pull/5567#issuecomment-1134242594 ## CI report: * 6edd9582998823a2c3dfd181c55f2f380c248e87 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8843

[GitHub] [hudi] jinxing64 commented on pull request #5534: [HUDI-4056] Refine partition matching in when drop / truncate partition.

2022-05-22 Thread GitBox
jinxing64 commented on PR #5534: URL: https://github.com/apache/hudi/pull/5534#issuecomment-1134228759 @XuQianJin-Stars Mind to take another look :) ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above t

[GitHub] [hudi] jinxing64 commented on a diff in pull request #5534: [HUDI-4056] Refine partition matching in when drop / truncate partition.

2022-05-22 Thread GitBox
jinxing64 commented on code in PR #5534: URL: https://github.com/apache/hudi/pull/5534#discussion_r879059238 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestAlterTableDropPartition.scala: ## @@ -80,111 +80,92 @@ class TestAlterTableDropPartition

[GitHub] [hudi] jinxing64 commented on a diff in pull request #5534: [HUDI-4056] Refine partition matching in when drop / truncate partition.

2022-05-22 Thread GitBox
jinxing64 commented on code in PR #5534: URL: https://github.com/apache/hudi/pull/5534#discussion_r879058082 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/spark/sql/hudi/command/SqlKeyGenerator.scala: ## @@ -53,8 +55,17 @@ class SqlKeyGenerator(props: Type

[GitHub] [hudi] jinxing64 commented on a diff in pull request #5534: [HUDI-4056] Refine partition matching in when drop / truncate partition.

2022-05-22 Thread GitBox
jinxing64 commented on code in PR #5534: URL: https://github.com/apache/hudi/pull/5534#discussion_r879057203 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/spark/sql/hudi/ProvidesHoodieConfig.scala: ## @@ -124,7 +124,7 @@ trait ProvidesHoodieConfig extends

[GitHub] [hudi] jinxing64 commented on a diff in pull request #5534: [HUDI-4056] Refine partition matching in when drop / truncate partition.

2022-05-22 Thread GitBox
jinxing64 commented on code in PR #5534: URL: https://github.com/apache/hudi/pull/5534#discussion_r879055918 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/spark/sql/catalyst/catalog/HoodieCatalogTable.scala: ## @@ -274,15 +274,15 @@ class HoodieCatalogTabl

[GitHub] [hudi] jinxing64 commented on a diff in pull request #5534: [HUDI-4056] Refine partition matching in when drop / truncate partition.

2022-05-22 Thread GitBox
jinxing64 commented on code in PR #5534: URL: https://github.com/apache/hudi/pull/5534#discussion_r879053177 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieWriterUtils.scala: ## @@ -113,7 +113,10 @@ object HoodieWriterUtils { def getOriginKeyG

[GitHub] [hudi] zhangyue19921010 commented on pull request #4459: [HUDI-3116]Add a new HoodieDropPartitionsTool to let users drop table partitions through a standalone job.

2022-05-22 Thread GitBox
zhangyue19921010 commented on PR #4459: URL: https://github.com/apache/hudi/pull/4459#issuecomment-1134217584 Hi @nsivabalan Sorry to bother you. Is there anything i can do? Or could we land this PR :) -- This is an automated message from the Apache Git Service. To respond to the message

[GitHub] [hudi] kumudkumartirupati commented on pull request #5598: [HUDI-4107] Updated documentation for 0.11.0 - DeltaStreamer

2022-05-22 Thread GitBox
kumudkumartirupati commented on PR #5598: URL: https://github.com/apache/hudi/pull/5598#issuecomment-1134216851 > > @xushiyan I have updated the docs for 0.10.0 and 0.10.1 as well for the `--enable-sync` flag if this is what you meant. Please check and let me know. > > @kumudkumartiru

[GitHub] [hudi] hudi-bot commented on pull request #5659: [HUDI-4137] SnowflakeSyncTool MVP implementation to integrate with Snowflake

2022-05-22 Thread GitBox
hudi-bot commented on PR #5659: URL: https://github.com/apache/hudi/pull/5659#issuecomment-1134214629 ## CI report: * 223068cbc3deeac3f39b5592f3812d5b4643ad2d Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8845

[GitHub] [hudi] amit-ranjan-de commented on issue #5599: [SUPPORT] File names in S3 do not match the file names in the latest .commit file

2022-05-22 Thread GitBox
amit-ranjan-de commented on issue #5599: URL: https://github.com/apache/hudi/issues/5599#issuecomment-1134213324 > Is this something reproducible consistently in your environment? No @xushiyan 😞 , it is random and we couldn't find any pattern or consistency in the failure.

[GitHub] [hudi] hudi-bot commented on pull request #5659: [HUDI-4137] SnowflakeSyncTool MVP implementation to integrate with Snowflake

2022-05-22 Thread GitBox
hudi-bot commented on PR #5659: URL: https://github.com/apache/hudi/pull/5659#issuecomment-1134212712 ## CI report: * 223068cbc3deeac3f39b5592f3812d5b4643ad2d UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[jira] [Updated] (HUDI-4137) Implement SnowflakeSyncTool to support Hudi to Snowflake Integration

2022-05-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4137: - Labels: integration pull-request-available (was: integration) > Implement SnowflakeSyncTool to su

[GitHub] [hudi] vingov opened a new pull request, #5659: [HUDI-4137] SnowflakeSyncTool MVP implementation to integrate with Snowflake

2022-05-22 Thread GitBox
vingov opened a new pull request, #5659: URL: https://github.com/apache/hudi/pull/5659 ## What is the purpose of the pull request This pull request adds the Snowflake Sync feature, this is a requirement to read Hudi tables on the Snowflake warehouse. ## Brief change log

[jira] [Updated] (HUDI-4023) de-couple spark from utilities bundle

2022-05-22 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-4023: -- Status: In Progress (was: Open) > de-couple spark from utilities bundle > -

[jira] [Updated] (HUDI-4023) de-couple spark from utilities bundle

2022-05-22 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-4023: -- Status: Patch Available (was: In Progress) > de-couple spark from utilities bundle > --

[jira] [Updated] (HUDI-2832) [Umbrella] [RFC-40] Integrated Hudi with Snowflake

2022-05-22 Thread Vinoth Govindarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Govindarajan updated HUDI-2832: -- Summary: [Umbrella] [RFC-40] Integrated Hudi with Snowflake (was: [Umbrella] [RFC-40]

[jira] [Created] (HUDI-4137) Implement SnowflakeSyncTool to support Hudi to Snowflake Integration

2022-05-22 Thread Vinoth Govindarajan (Jira)
Vinoth Govindarajan created HUDI-4137: - Summary: Implement SnowflakeSyncTool to support Hudi to Snowflake Integration Key: HUDI-4137 URL: https://issues.apache.org/jira/browse/HUDI-4137 Project: A

[GitHub] [hudi] hudi-bot commented on pull request #5639: [HUDI-2516] Upgrade JUnit to 5.8.1

2022-05-22 Thread GitBox
hudi-bot commented on PR #5639: URL: https://github.com/apache/hudi/pull/5639#issuecomment-1134203878 ## CI report: * d77015ef99eedcfa824fa84ece205ca4448265ad Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8844

[GitHub] [hudi] hudi-bot commented on pull request #5443: [HUDI-3982] Comprehensive schema evolution in flink when read/batch/cow/snapshot

2022-05-22 Thread GitBox
hudi-bot commented on PR #5443: URL: https://github.com/apache/hudi/pull/5443#issuecomment-1134203702 ## CI report: * b5a6de35bc5a797f45773fc6fed2dda5cf015499 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8842

[GitHub] [hudi] yuzhaojing closed pull request #3599: [HUDI-2207] Support independent flink hudi clustering function

2022-05-22 Thread GitBox
yuzhaojing closed pull request #3599: [HUDI-2207] Support independent flink hudi clustering function URL: https://github.com/apache/hudi/pull/3599 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] hudi-bot commented on pull request #5627: [HUDI-3350][HUDI-3351] Rebase Record combining semantic into `HoodieRecordCombiningEngine`

2022-05-22 Thread GitBox
hudi-bot commented on PR #5627: URL: https://github.com/apache/hudi/pull/5627#issuecomment-1134147164 ## CI report: * d7ed9f43b3c10763e02169cd4b48fcde17afa8b2 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8840

[GitHub] [hudi] santoshsb commented on issue #5452: Schema Evolution: Missing column for previous records when new entry does not have the same while upsert.

2022-05-22 Thread GitBox
santoshsb commented on issue #5452: URL: https://github.com/apache/hudi/issues/5452#issuecomment-1134144289 thanks @xiarixiaoyao, our schema for storing data as defined by FHIR standards https://www.hl7.org/fhir/patient.schema.json.html seams to be complicated, as most of the fields here ar

[GitHub] [hudi] danny0405 commented on issue #5553: [SUPPORT] java.io.EOFException: Invalid position - exceeds the bounds of the stream

2022-05-22 Thread GitBox
danny0405 commented on issue #5553: URL: https://github.com/apache/hudi/issues/5553#issuecomment-1134143994 And i believe this exception is very probably caused by this PR: https://github.com/apache/hudi/commit/ed106f671e929d2a6277ae189242201a02a7b58f cc @codope -- This is an automate

[GitHub] [hudi] hudi-bot commented on pull request #5639: [HUDI-2516] Upgrade JUnit to 5.8.1

2022-05-22 Thread GitBox
hudi-bot commented on PR #5639: URL: https://github.com/apache/hudi/pull/5639#issuecomment-1134141335 ## CI report: * 34437a2b046578c1fc9f35c77554f97c52ca0e03 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8775

[GitHub] [hudi] hudi-bot commented on pull request #5567: [RFC-53] Use Lock-Free Message Queue Disruptor Improving Hoodie Writing Efficiency

2022-05-22 Thread GitBox
hudi-bot commented on PR #5567: URL: https://github.com/apache/hudi/pull/5567#issuecomment-1134141242 ## CI report: * 765c7c9d031deb7bdebead97032a61b4b9e5ad4a Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8694

[GitHub] [hudi] hudi-bot commented on pull request #5639: [HUDI-2516] Upgrade JUnit to 5.8.1

2022-05-22 Thread GitBox
hudi-bot commented on PR #5639: URL: https://github.com/apache/hudi/pull/5639#issuecomment-1134139844 ## CI report: * 34437a2b046578c1fc9f35c77554f97c52ca0e03 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8775

[GitHub] [hudi] hudi-bot commented on pull request #5567: [RFC-53] Use Lock-Free Message Queue Disruptor Improving Hoodie Writing Efficiency

2022-05-22 Thread GitBox
hudi-bot commented on PR #5567: URL: https://github.com/apache/hudi/pull/5567#issuecomment-1134139744 ## CI report: * 765c7c9d031deb7bdebead97032a61b4b9e5ad4a Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8694

[GitHub] [hudi] hudi-bot commented on pull request #5443: [HUDI-3982] Comprehensive schema evolution in flink when read/batch/cow/snapshot

2022-05-22 Thread GitBox
hudi-bot commented on PR #5443: URL: https://github.com/apache/hudi/pull/5443#issuecomment-1134139655 ## CI report: * b48c32759295f8aabe8beb2edb963caded743d10 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8701

[GitHub] [hudi] hudi-bot commented on pull request #5534: [HUDI-4056] Refine partition matching in when drop / truncate partition.

2022-05-22 Thread GitBox
hudi-bot commented on PR #5534: URL: https://github.com/apache/hudi/pull/5534#issuecomment-1134138326 ## CI report: * d834cab7033af3caa93e0baeb4975eab5258b31b Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8839

[GitHub] [hudi] hudi-bot commented on pull request #5443: [HUDI-3982] Comprehensive schema evolution in flink when read/batch/cow/snapshot

2022-05-22 Thread GitBox
hudi-bot commented on PR #5443: URL: https://github.com/apache/hudi/pull/5443#issuecomment-1134138264 ## CI report: * b48c32759295f8aabe8beb2edb963caded743d10 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8701

[GitHub] [hudi] santoshsb commented on issue #5388: Error in query: Cannot partition by nested column: meta.source / java.lang.IllegalArgumentException: Can't find preCombineKey `meta.lastUpdated` in

2022-05-22 Thread GitBox
santoshsb commented on issue #5388: URL: https://github.com/apache/hudi/issues/5388#issuecomment-1134135780 Thank you. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

[GitHub] [hudi] zhangyue19921010 commented on pull request #5567: [RFC-53] Use Lock-Free Message Queue Disruptor Improving Hoodie Writing Efficiency

2022-05-22 Thread GitBox
zhangyue19921010 commented on PR #5567: URL: https://github.com/apache/hudi/pull/5567#issuecomment-1134135189 Hi @leesf. Comments are addressed. PTAL. Thanks :) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL a

[GitHub] [hudi] yangzhiyue opened a new issue, #5658: [SUPPORT]

2022-05-22 Thread GitBox
yangzhiyue opened a new issue, #5658: URL: https://github.com/apache/hudi/issues/5658 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? yes - Join the mailing list to engage in conversations and get faster support at dev-s

[GitHub] [hudi] liujinhui1994 commented on a diff in pull request #5639: [HUDI-2516] Upgrade JUnit to 5.8.2

2022-05-22 Thread GitBox
liujinhui1994 commented on code in PR #5639: URL: https://github.com/apache/hudi/pull/5639#discussion_r878976922 ## pom.xml: ## @@ -100,9 +100,9 @@ 5.3.4 2.17 1.10.1 -5.7.0-M1 -5.7.0-M1 -1.7.0-M1 +5.8.1 Review Comment: Yes it will be 5.8.2 here

[GitHub] [hudi] fengjian428 commented on pull request #4676: [HUDI-3304] support partial update on mor table

2022-05-22 Thread GitBox
fengjian428 commented on PR #4676: URL: https://github.com/apache/hudi/pull/4676#issuecomment-1134117007 already added ut in TestHoodieDeltaStreamer and TestHoodieSparkMergeOnReadTableCompaction -- This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [hudi] sleep1661 commented on issue #5503: [SUPPORT] Hive sync action failed when using spark-sql

2022-05-22 Thread GitBox
sleep1661 commented on issue #5503: URL: https://github.com/apache/hudi/issues/5503#issuecomment-1134116981 > 1. I was using hudi bundle jars 2. I only test with hadoop-2.6.5. Maybe this issue cause only in my custom environment, I will test more. -- This is an automate

[GitHub] [hudi] danny0405 commented on a diff in pull request #5572: [HUDI-3654] Preparations for hudi metastore.

2022-05-22 Thread GitBox
danny0405 commented on code in PR #5572: URL: https://github.com/apache/hudi/pull/5572#discussion_r878986268 ## hudi-common/src/main/java/org/apache/hudi/common/table/HoodieTableMetaClient.java: ## @@ -98,21 +100,22 @@ public class HoodieTableMetaClient implements Serializable

[GitHub] [hudi] hudi-bot commented on pull request #5635: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-22 Thread GitBox
hudi-bot commented on PR #5635: URL: https://github.com/apache/hudi/pull/5635#issuecomment-1134110141 ## CI report: * c1b55456c2289f5b12ba5b94e68dc28a4a69ae32 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8780

[GitHub] [hudi] hudi-bot commented on pull request #3599: [HUDI-2207] Support independent flink hudi clustering function

2022-05-22 Thread GitBox
hudi-bot commented on PR #3599: URL: https://github.com/apache/hudi/pull/3599#issuecomment-1134109453 ## CI report: * c3405300e9bc97445637c7251536ec0f0d6fbbd1 UNKNOWN * e1689c454142ed2daa6d548e272cdbc5b353731d UNKNOWN * eb878d43a82656bdaeb2946fc9c950fd591ce95f UNKNOWN * 9f

[GitHub] [hudi] hudi-bot commented on pull request #5635: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-22 Thread GitBox
hudi-bot commented on PR #5635: URL: https://github.com/apache/hudi/pull/5635#issuecomment-1134108404 ## CI report: * c1b55456c2289f5b12ba5b94e68dc28a4a69ae32 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8780

[GitHub] [hudi] hudi-bot commented on pull request #3599: [HUDI-2207] Support independent flink hudi clustering function

2022-05-22 Thread GitBox
hudi-bot commented on PR #3599: URL: https://github.com/apache/hudi/pull/3599#issuecomment-1134107591 ## CI report: * c3405300e9bc97445637c7251536ec0f0d6fbbd1 UNKNOWN * e1689c454142ed2daa6d548e272cdbc5b353731d UNKNOWN * eb878d43a82656bdaeb2946fc9c950fd591ce95f UNKNOWN * 9f

[jira] [Updated] (HUDI-4136) Run snapshot query in hive throw ' IOException: java.lang.IllegalArgumentException: HoodieRealtimeRecordReader can only work on RealtimeSplit and not with a empty file'.

2022-05-22 Thread yuehanwang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuehanwang updated HUDI-4136: - Description: In Merge-On-Read table execut a snapshot query inΒ  > Run snapshot query in hive throw ' IOExc

[GitHub] [hudi] danny0405 commented on issue #5553: [SUPPORT] java.io.EOFException: Invalid position - exceeds the bounds of the stream

2022-05-22 Thread GitBox
danny0405 commented on issue #5553: URL: https://github.com/apache/hudi/issues/5553#issuecomment-1134102712 cc @alexeykudinkin -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific commen

[GitHub] [hudi] danny0405 commented on issue #5553: [SUPPORT] java.io.EOFException: Invalid position - exceeds the bounds of the stream

2022-05-22 Thread GitBox
danny0405 commented on issue #5553: URL: https://github.com/apache/hudi/issues/5553#issuecomment-1134102604 My insight was that there are some concurrency conflict when multiple threads try to read and modify the hoodie.properties file. After the refactoring from: https://github.com/apache/

[jira] [Updated] (HUDI-4136) Run snapshot query in hive throw ' IOException: java.lang.IllegalArgumentException: HoodieRealtimeRecordReader can only work on RealtimeSplit and not with a empty file'.

2022-05-22 Thread yuehanwang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuehanwang updated HUDI-4136: - Summary: Run snapshot query in hive throw ' IOException: java.lang.IllegalArgumentException: HoodieRealtim

[jira] [Created] (HUDI-4136) Run snapshot query in hive throw ' IOException: java.lang.IllegalArgumentException: HoodieRealtimeRecordReader can only work on RealtimeSplit and not with a empty file' wh

2022-05-22 Thread yuehanwang (Jira)
yuehanwang created HUDI-4136: Summary: Run snapshot query in hive throw ' IOException: java.lang.IllegalArgumentException: HoodieRealtimeRecordReader can only work on RealtimeSplit and not with a empty file' when not execut compaction plan

[GitHub] [hudi] yuzhaojing commented on pull request #3599: [HUDI-2207] Support independent flink hudi clustering function

2022-05-22 Thread GitBox
yuzhaojing commented on PR #3599: URL: https://github.com/apache/hudi/pull/3599#issuecomment-1134098983 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[GitHub] [hudi] liujinhui1994 commented on a diff in pull request #5639: [HUDI-2516] Upgrade JUnit to 5.8.2

2022-05-22 Thread GitBox
liujinhui1994 commented on code in PR #5639: URL: https://github.com/apache/hudi/pull/5639#discussion_r878976922 ## pom.xml: ## @@ -100,9 +100,9 @@ 5.3.4 2.17 1.10.1 -5.7.0-M1 -5.7.0-M1 -1.7.0-M1 +5.8.1 Review Comment: Yes it will be 5.8.2 here

[GitHub] [hudi] liujinhui1994 commented on pull request #5635: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-22 Thread GitBox
liujinhui1994 commented on PR #5635: URL: https://github.com/apache/hudi/pull/5635#issuecomment-1134091196 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

[GitHub] [hudi] danny0405 commented on a diff in pull request #5393: [HUDI-3921] Improve rewriteRecordWithNewSchema and refactor code

2022-05-22 Thread GitBox
danny0405 commented on code in PR #5393: URL: https://github.com/apache/hudi/pull/5393#discussion_r878971642 ## hudi-common/src/main/java/org/apache/hudi/avro/HoodieAvroUtils.java: ## @@ -826,7 +826,7 @@ private static Object rewriteRecordWithNewSchema(Object oldRecord, Schema

[GitHub] [hudi] xushiyan commented on issue #5599: [SUPPORT] File names in S3 do not match the file names in the latest .commit file

2022-05-22 Thread GitBox
xushiyan commented on issue #5599: URL: https://github.com/apache/hudi/issues/5599#issuecomment-1134089463 @amit-ranjan-de it's only possible troubleshoot this kind of data issue if we can reproduce it. Is this something reproducible consistently in your environment? also since this is on a

[jira] [Commented] (HUDI-4129) Initializes a new fs view for WriteProfile#reload

2022-05-22 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17540731#comment-17540731 ] Danny Chen commented on HUDI-4129: -- Fixed via master branch: 3ef137d1565dd1aa21456d06c52b

[jira] [Resolved] (HUDI-4129) Initializes a new fs view for WriteProfile#reload

2022-05-22 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen resolved HUDI-4129. -- > Initializes a new fs view for WriteProfile#reload > - > >

[GitHub] [hudi] hudi-bot commented on pull request #5627: [HUDI-3350][HUDI-3351] Rebase Record combining semantic into `HoodieRecordCombiningEngine`

2022-05-22 Thread GitBox
hudi-bot commented on PR #5627: URL: https://github.com/apache/hudi/pull/5627#issuecomment-1134084482 ## CI report: * Unknown: [CANCELED](TBD) * d7ed9f43b3c10763e02169cd4b48fcde17afa8b2 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-05-22 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1134084295 ## CI report: * 8c6f6e19940ce7ac04dfcfce52da3ccdaf3a8b0f UNKNOWN * c4799803cff8adffef56e889a5cd4d52599fcf73 UNKNOWN * ec5e2c706f3e7c96e7116eb8195176ee649fa161 Azure: [FAILUR

[hudi] branch master updated (271d1a79c0 -> 3ef137d156)

2022-05-22 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 271d1a79c0 [HUDI-4051] Allow nested field as primary key and preCombineField in spark sql (#5517) add 3ef137d15

[GitHub] [hudi] danny0405 merged pull request #5640: [HUDI-4129] Initializes a new fs view for WriteProfile#reload

2022-05-22 Thread GitBox
danny0405 merged PR #5640: URL: https://github.com/apache/hudi/pull/5640 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache

[GitHub] [hudi] hudi-bot commented on pull request #5627: [HUDI-3350][HUDI-3351] Rebase Record combining semantic into `HoodieRecordCombiningEngine`

2022-05-22 Thread GitBox
hudi-bot commented on PR #5627: URL: https://github.com/apache/hudi/pull/5627#issuecomment-1134082800 ## CI report: * Unknown: [CANCELED](TBD) * d7ed9f43b3c10763e02169cd4b48fcde17afa8b2 UNKNOWN Bot commands @hudi-bot supports the following commands: - `

[GitHub] [hudi] wulei0302 commented on pull request #5627: [HUDI-3350][HUDI-3351] Rebase Record combining semantic into `HoodieRecordCombiningEngine`

2022-05-22 Thread GitBox
wulei0302 commented on PR #5627: URL: https://github.com/apache/hudi/pull/5627#issuecomment-1134082070 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. T

[jira] [Updated] (HUDI-3601) Support multi-arch builds in docker setup

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3601: - Fix Version/s: 0.12.0 > Support multi-arch builds in docker setup > --

[jira] [Updated] (HUDI-3601) Support multi-arch builds in docker setup

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3601: - Component/s: dependencies > Support multi-arch builds in docker setup > --

[jira] [Updated] (HUDI-3601) Support multi-arch builds in docker setup

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3601: - Priority: Critical (was: Major) > Support multi-arch builds in docker setup > ---

[GitHub] [hudi] hudi-bot commented on pull request #5534: [HUDI-4056] Refine partition matching in when drop / truncate partition.

2022-05-22 Thread GitBox
hudi-bot commented on PR #5534: URL: https://github.com/apache/hudi/pull/5534#issuecomment-1134076751 ## CI report: * 0024d0306e8671655004ff995c7754a99de6c59d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8829

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-05-22 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1134076607 ## CI report: * 8c6f6e19940ce7ac04dfcfce52da3ccdaf3a8b0f UNKNOWN * c4799803cff8adffef56e889a5cd4d52599fcf73 UNKNOWN * ba68f3ba139064ca92f121b12ec88e5a63886781 Azure: [FAILUR

[jira] [Updated] (HUDI-3602) Document steps for manually building arm64 images

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3602: - Parent: (was: HUDI-3601) Issue Type: Task (was: Sub-task) > Document steps for manually build

[jira] [Reopened] (HUDI-2786) Failed to connect to namenode in Docker Demo on Apple M1 chip

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reopened HUDI-2786: -- > Failed to connect to namenode in Docker Demo on Apple M1 chip > --

[GitHub] [hudi] xushiyan commented on issue #4985: [SUPPORT]The current Docker demo only for x86 system ,arm64 system can't run success

2022-05-22 Thread GitBox
xushiyan commented on issue #4985: URL: https://github.com/apache/hudi/issues/4985#issuecomment-1134075085 tracking the work and fix in https://issues.apache.org/jira/browse/HUDI-3601 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to G

[GitHub] [hudi] hudi-bot commented on pull request #5534: [HUDI-4056] Refine partition matching in when drop / truncate partition.

2022-05-22 Thread GitBox
hudi-bot commented on PR #5534: URL: https://github.com/apache/hudi/pull/5534#issuecomment-1134074714 ## CI report: * 0024d0306e8671655004ff995c7754a99de6c59d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8829

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-05-22 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1134074580 ## CI report: * 8c6f6e19940ce7ac04dfcfce52da3ccdaf3a8b0f UNKNOWN * c4799803cff8adffef56e889a5cd4d52599fcf73 UNKNOWN * ba68f3ba139064ca92f121b12ec88e5a63886781 Azure: [FAILUR

[GitHub] [hudi] xushiyan commented on issue #5280: [SUPPORT] Docker Demo: Failed to Connect to namenode

2022-05-22 Thread GitBox
xushiyan commented on issue #5280: URL: https://github.com/apache/hudi/issues/5280#issuecomment-1134074287 tracking the work and fix in https://issues.apache.org/jira/browse/HUDI-3601 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

[jira] [Closed] (HUDI-2786) Failed to connect to namenode in Docker Demo on Apple M1 chip

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-2786. Resolution: Staged > Failed to connect to namenode in Docker Demo on Apple M1 chip > ---

[GitHub] [hudi] hudi-bot commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-05-22 Thread GitBox
hudi-bot commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1134072670 ## CI report: * 8c6f6e19940ce7ac04dfcfce52da3ccdaf3a8b0f UNKNOWN * c4799803cff8adffef56e889a5cd4d52599fcf73 UNKNOWN * ba68f3ba139064ca92f121b12ec88e5a63886781 Azure: [FAILUR

[jira] [Closed] (HUDI-28) Assign ownership of Hudi docker images to private@

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-28?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-28. -- Resolution: Fixed > Assign ownership of Hudi docker images to private@ > -

[jira] [Commented] (HUDI-2202) Add Trino to Docker Demo

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17540722#comment-17540722 ] Raymond Xu commented on HUDI-2202: -- [~codope] can we close this? > Add Trino to Docker D

[jira] [Updated] (HUDI-2786) Failed to connect to namenode in Docker Demo on Apple M1 chip

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2786: - Component/s: dependencies dev-experience > Failed to connect to namenode in Docker Demo o

[GitHub] [hudi] jinxing64 commented on pull request #5534: [HUDI-4056] Refine partition matching in when drop / truncate partition.

2022-05-22 Thread GitBox
jinxing64 commented on PR #5534: URL: https://github.com/apache/hudi/pull/5534#issuecomment-1134068120 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific com

[GitHub] [hudi] rahil-c commented on pull request #5402: [WIP] Support Hadoop 3.x Hive 3.x and Spark 3.2.x default

2022-05-22 Thread GitBox
rahil-c commented on PR #5402: URL: https://github.com/apache/hudi/pull/5402#issuecomment-1134053086 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[GitHub] [hudi] xushiyan commented on pull request #5393: [HUDI-3921] Improve rewriteRecordWithNewSchema and refactor code

2022-05-22 Thread GitBox
xushiyan commented on PR #5393: URL: https://github.com/apache/hudi/pull/5393#issuecomment-1134038076 @xiarixiaoyao can you please rebase and verify this patch via some testing? sorry about the delay. let's try to land this soon. -- This is an automated message from the Apache Git Servic

[jira] [Updated] (HUDI-3921) Reconcile schema evolution logic with base file re-writing

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3921: - Priority: Blocker (was: Major) > Reconcile schema evolution logic with base file re-writing > ---

[jira] [Updated] (HUDI-3921) Reconcile schema evolution logic with base file re-writing

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3921: - Summary: Reconcile schema evolution logic with base file re-writing (was: Fixed schema evolution cannot w

[jira] [Assigned] (HUDI-3921) Reconcile schema evolution logic with base file re-writing

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-3921: Assignee: Tao Meng > Reconcile schema evolution logic with base file re-writing > -

[GitHub] [hudi] xushiyan commented on pull request #5647: Update the docs for Distributed File System (DFS) section on Hudi Delta Streamer page

2022-05-22 Thread GitBox
xushiyan commented on PR #5647: URL: https://github.com/apache/hudi/pull/5647#issuecomment-1134028467 > @xushiyan thanks for the review and suggestions πŸ‘πŸΎ Do you think it makes sense to add this to the versioned docs too, since the same behavior is expected? At least since from 0.8.0 which

[GitHub] [hudi] bkosuru commented on issue #5569: [SUPPORT] Issues with URL_ENCODE_PARTITIONING_OPT_KEY in hudi 0.11.0

2022-05-22 Thread GitBox
bkosuru commented on issue #5569: URL: https://github.com/apache/hudi/issues/5569#issuecomment-1133983759 @xushiyan If you are not going fix this issue, we will have to do (1) One-time migration of the data because the current partition has ```<``` and ```>``` encoded. (2) Change the

[GitHub] [hudi] xccui commented on issue #5553: [SUPPORT] java.io.EOFException: Invalid position - exceeds the bounds of the stream

2022-05-22 Thread GitBox
xccui commented on issue #5553: URL: https://github.com/apache/hudi/issues/5553#issuecomment-1133965682 It seems that Hudi tries to use the new length to read data from the old file stream, even though the file has been updated in S3. -- This is an automated message from the Apache Git Se

[GitHub] [hudi] hudi-bot commented on pull request #5656: [MINOR] Fix a potential NPE and some finer points of hudi cli

2022-05-22 Thread GitBox
hudi-bot commented on PR #5656: URL: https://github.com/apache/hudi/pull/5656#issuecomment-1133950850 ## CI report: * de9b2ae93752f146eaa8c2094b94da6b7a2f78a1 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8837

[GitHub] [hudi] hudi-bot commented on pull request #5656: [MINOR] Fix a potential NPE and some finer points of hudi cli

2022-05-22 Thread GitBox
hudi-bot commented on PR #5656: URL: https://github.com/apache/hudi/pull/5656#issuecomment-1133931839 ## CI report: * de9b2ae93752f146eaa8c2094b94da6b7a2f78a1 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8837

[GitHub] [hudi] hudi-bot commented on pull request #4676: [HUDI-3304] support partial update on mor table

2022-05-22 Thread GitBox
hudi-bot commented on PR #4676: URL: https://github.com/apache/hudi/pull/4676#issuecomment-1133931660 ## CI report: * e0f586459cf8757da7f814046af56f4028cc8148 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8836

[GitHub] [hudi] hudi-bot commented on pull request #3391: [HUDI-83] Fix Timestamp/Date type read by Hive3

2022-05-22 Thread GitBox
hudi-bot commented on PR #3391: URL: https://github.com/apache/hudi/pull/3391#issuecomment-1133931596 ## CI report: * c580319cd4d3ee58c428e240b141114bcb10de64 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8835

[GitHub] [hudi] hudi-bot commented on pull request #5656: [MINOR] Fix a potential NPE and some finer points of hudi cli

2022-05-22 Thread GitBox
hudi-bot commented on PR #5656: URL: https://github.com/apache/hudi/pull/5656#issuecomment-1133931329 ## CI report: * de9b2ae93752f146eaa8c2094b94da6b7a2f78a1 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] xushiyan commented on issue #5569: [SUPPORT] Issues with URL_ENCODE_PARTITIONING_OPT_KEY in hudi 0.11.0

2022-05-22 Thread GitBox
xushiyan commented on issue #5569: URL: https://github.com/apache/hudi/issues/5569#issuecomment-1133928928 @bkosuru You're right that users shouldn't worry about the encoding. This was changed in 0.9.0 release where `<` and `>` won't be encoded, and the behavior stayed the same afterwards.

[GitHub] [hudi] luoyajun10 opened a new pull request, #5656: [MINOR] Fix a potential NPE and some finer points of hudi cli

2022-05-22 Thread GitBox
luoyajun10 opened a new pull request, #5656: URL: https://github.com/apache/hudi/pull/5656 ## What is the purpose of the pull request This pull request fixes a potential NPE and some finer points of hudi cli. ## Brief change log - Modify the initLauncher code of SparkU

[jira] [Commented] (HUDI-4125) Add integration tests around bootstrapped Hudi table

2022-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17540613#comment-17540613 ] Raymond Xu commented on HUDI-4125: -- [~guoyihua] do we have the ticket for the actual fix?

[GitHub] [hudi] leobiscassi commented on pull request #5647: Update the docs for Distributed File System (DFS) section on Hudi Delta Streamer page

2022-05-22 Thread GitBox
leobiscassi commented on PR #5647: URL: https://github.com/apache/hudi/pull/5647#issuecomment-1133914368 @xushiyan thanks for the review and suggestions πŸ‘πŸΎ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [hudi] xushiyan closed issue #5389: [SUPPORT] - AWS EMR and Glue Catalog

2022-05-22 Thread GitBox
xushiyan closed issue #5389: [SUPPORT] - AWS EMR and Glue Catalog URL: https://github.com/apache/hudi/issues/5389 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe

[GitHub] [hudi] xiarixiaoyao commented on a diff in pull request #5064: [HUDI-3654] Initialize hudi metastore module.

2022-05-22 Thread GitBox
xiarixiaoyao commented on code in PR #5064: URL: https://github.com/apache/hudi/pull/5064#discussion_r878879827 ## hudi-metastore/src/main/java/org/apache/hudi/common/table/timeline/HoodieMetastoreBasedTimeline.java: ## @@ -0,0 +1,77 @@ +/* + * Licensed to the Apache Software Fo

  1   2   >