[jira] [Updated] (HUDI-4754) Add compliance check in GH actions

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4754: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Add compliance check in GH actions >

[jira] [Updated] (HUDI-4760) Clustering results in repeated triggers of clustering execution

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4760: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Clustering results in repeated triggers of clustering

[jira] [Updated] (HUDI-4758) Enhance validations for hudi-examples quick start for spark and pyspark

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4758: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Enhance validations for hudi-examples quick start for

[jira] [Updated] (HUDI-4465) Optimizing file-listing path in MT

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4465: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Optimizing file-listing path in MT >

[jira] [Updated] (HUDI-4749) PartitionsForFullCleaning in CleanPlanner is using FileSystemBasedListing

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4749: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > PartitionsForFullCleaning in CleanPlanner is using

[jira] [Updated] (HUDI-4341) HoodieHFileReader is not compatible with Hadoop 3

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4341: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > HoodieHFileReader is not compatible with Hadoop 3 >

[jira] [Updated] (HUDI-4261) OOM in bulk-insert when using "NONE" sort-mode for table w/ large # of partitions

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4261: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > OOM in bulk-insert when using "NONE" sort-mode for

[jira] [Updated] (HUDI-4652) Test COW: Deltastreamer writing with non-Hudi partitions

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4652: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Test COW: Deltastreamer writing with non-Hudi

[jira] [Updated] (HUDI-4691) Deduplicate Spark 3.2 and Spark 3.3 integrations

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4691: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Deduplicate Spark 3.2 and Spark 3.3 integrations >

[jira] [Updated] (HUDI-3861) 'path' in CatalogTable#properties failed to be updated when renaming table

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3861: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > 'path' in CatalogTable#properties failed to be

[jira] [Updated] (HUDI-4757) Enhance hudi-examples to add pyspark examples

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4757: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Enhance hudi-examples to add pyspark examples >

[jira] [Updated] (HUDI-4626) Partitioning table by `_hoodie_partition_path` fails

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4626: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Partitioning table by `_hoodie_partition_path` fails

[jira] [Updated] (HUDI-4674) change the default value of inputFormat for the MOR table

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4674: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > change the default value of inputFormat for the MOR

[jira] [Updated] (HUDI-3529) Improve dependency management and bundling

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3529: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Improve dependency management and bundling >

[jira] [Updated] (HUDI-4651) Test COW: Spark datasource writing with non-Hudi partitions

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4651: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Test COW: Spark datasource writing with non-Hudi

[jira] [Updated] (HUDI-4615) Fix empty commits being made by deltastreamer with S3EventsSource when there is no data in SQS on starting a new pipeline

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4615: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Fix empty commits being made by deltastreamer with

[jira] [Updated] (HUDI-1574) Trim existing unit tests to finish in much shorter amount of time

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1574: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Trim existing unit tests to finish in much shorter

[jira] [Updated] (HUDI-3207) Hudi Trino connector PR review

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3207: - Sprint: Hudi-Sprint-Jan-10, Hudi-Sprint-Jan-18, Hudi-Sprint-Jan-24, Hudi-Sprint-Jan-31,

[jira] [Updated] (HUDI-3967) Automatic savepoint in Hudi

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3967: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Automatic savepoint in Hudi >

[jira] [Updated] (HUDI-3391) presto and hive beeline fails to read MOR table w/ 2 or more array fields

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3391: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > presto and hive beeline fails to read MOR table w/ 2

[jira] [Updated] (HUDI-1779) Fail to bootstrap/upsert a table which contains timestamp column

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1779: - Sprint: 2022/08/22, 2022/09/05 (was: 2022/08/22) > Fail to bootstrap/upsert a table which contains

[jira] [Updated] (HUDI-4585) Optimize query performance on Presto Hudi connector

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4585: - Sprint: 2022/08/08, 2022/08/22, 2022/09/05 (was: 2022/08/08, 2022/08/22) > Optimize query performance

[jira] [Updated] (HUDI-4586) Address S3 timeouts in Bloom Index with metadata table

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4586: - Sprint: 2022/08/08, 2022/08/22, 2022/09/05 (was: 2022/08/08, 2022/08/22) > Address S3 timeouts in Bloom

[jira] [Updated] (HUDI-2754) Performance improvement for IncrementalRelation

2022-09-07 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-2754: Story Points: 0 (was: 0.5) > Performance improvement for IncrementalRelation >

[jira] [Assigned] (HUDI-3122) presto query failed for bootstrap tables

2022-09-07 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-3122: --- Assignee: Sagar Sumit (was: Ethan Guo) > presto query failed for bootstrap tables >

[jira] [Assigned] (HUDI-955) Test MOR : Presto Read Optimized Query with metadata bootstrap

2022-09-07 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-955: -- Assignee: Sagar Sumit (was: Ethan Guo) > Test MOR : Presto Read Optimized Query with metadata

[jira] [Assigned] (HUDI-956) Test MOR : Presto Realtime Query with metadata bootstrap

2022-09-07 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-956: -- Assignee: Sagar Sumit (was: Ethan Guo) > Test MOR : Presto Realtime Query with metadata bootstrap >

[jira] [Assigned] (HUDI-621) Presto Integration for supporting Bootstrapped table

2022-09-07 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-621: -- Assignee: Sagar Sumit (was: Ethan Guo) > Presto Integration for supporting Bootstrapped table >

[jira] [Assigned] (HUDI-619) Investigate and implement mechanism to have hive/presto/sparksql queries avoid stitching and return null values for hoodie columns

2022-09-07 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-619: -- Assignee: Sagar Sumit (was: Ethan Guo) > Investigate and implement mechanism to have

[GitHub] [hudi] slachiewicz commented on pull request #5784: [HUDI-4193] Upgrade Protobuf to 3.21.1

2022-09-07 Thread GitBox
slachiewicz commented on PR #5784: URL: https://github.com/apache/hudi/pull/5784#issuecomment-1239530386 #6535 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[GitHub] [hudi] slachiewicz commented on pull request #6535: [HUDI-4193] change protoc version so it compiles on m1 mac

2022-09-07 Thread GitBox
slachiewicz commented on PR #6535: URL: https://github.com/apache/hudi/pull/6535#issuecomment-1239528565 Please do not use different versions of protoc depending on the environment, because the build will not be reproducible. That's why I proposed to bump protoc to one of latest

[jira] [Assigned] (HUDI-4803) Community support - Sagar

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-4803: Assignee: Sagar Sumit (was: Ethan Guo) > Community support - Sagar > - >

[jira] [Created] (HUDI-4804) Community support - Alexey

2022-09-07 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-4804: Summary: Community support - Alexey Key: HUDI-4804 URL: https://issues.apache.org/jira/browse/HUDI-4804 Project: Apache Hudi Issue Type: Task Reporter:

[jira] [Assigned] (HUDI-4804) Community support - Alexey

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-4804: Assignee: Alexey Kudinkin (was: Ethan Guo) > Community support - Alexey >

[hudi] branch master updated (dbb044b751 -> e8aee84c7c)

2022-09-07 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from dbb044b751 [HUDI-4731] Shutdown CloudWatch reporter when query completes (#6468) add e8aee84c7c [HUDI-4793]

[jira] [Created] (HUDI-4803) Community support - Sagar

2022-09-07 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-4803: Summary: Community support - Sagar Key: HUDI-4803 URL: https://issues.apache.org/jira/browse/HUDI-4803 Project: Apache Hudi Issue Type: Task Reporter:

[jira] [Assigned] (HUDI-4802) Community support - Ethan

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-4802: Assignee: Ethan Guo (was: sivabalan narayanan) > Community support - Ethan >

[GitHub] [hudi] nsivabalan merged pull request #6617: [HUDI-4793] Fixing ScalaTest tests to properly respect Log4j2 configs

2022-09-07 Thread GitBox
nsivabalan merged PR #6617: URL: https://github.com/apache/hudi/pull/6617

[jira] [Created] (HUDI-4802) Community support - Ethan

2022-09-07 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-4802: Summary: Community support - Ethan Key: HUDI-4802 URL: https://issues.apache.org/jira/browse/HUDI-4802 Project: Apache Hudi Issue Type: Task Reporter:

[jira] [Created] (HUDI-4801) Community support - Siva

2022-09-07 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-4801: Summary: Community support - Siva Key: HUDI-4801 URL: https://issues.apache.org/jira/browse/HUDI-4801 Project: Apache Hudi Issue Type: Task Reporter:

[jira] [Assigned] (HUDI-4801) Community support - Siva

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-4801: Assignee: sivabalan narayanan (was: Raymond Xu) > Community support - Siva >

[jira] [Created] (HUDI-4800) Community support - Raymond

2022-09-07 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-4800: Summary: Community support - Raymond Key: HUDI-4800 URL: https://issues.apache.org/jira/browse/HUDI-4800 Project: Apache Hudi Issue Type: Task Reporter:

[jira] [Updated] (HUDI-4800) Community support - Raymond

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4800: - Sprint: 2022/09/05 > Community support - Raymond > --- > > Key:

[jira] [Updated] (HUDI-2786) Failed to connect to namenode in Docker Demo on Apple M1 chip

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2786: - Reviewers: sivabalan narayanan > Failed to connect to namenode in Docker Demo on Apple M1 chip >

[jira] [Updated] (HUDI-4193) Fail to compile in osx aarch_64 environment

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4193: - Sprint: 2022/09/05 > Fail to compile in osx aarch_64 environment >

[jira] [Updated] (HUDI-4193) Fail to compile in osx aarch_64 environment

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4193: - Reviewers: sivabalan narayanan > Fail to compile in osx aarch_64 environment >

[jira] [Updated] (HUDI-4193) Fail to compile in osx aarch_64 environment

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4193: - Story Points: 2 > Fail to compile in osx aarch_64 environment >

[jira] [Assigned] (HUDI-4193) Fail to compile in osx aarch_64 environment

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-4193: Assignee: Jonathan Vexler > Fail to compile in osx aarch_64 environment >

[jira] [Assigned] (HUDI-2786) Failed to connect to namenode in Docker Demo on Apple M1 chip

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-2786: Assignee: Jonathan Vexler > Failed to connect to namenode in Docker Demo on Apple M1 chip >

[jira] [Updated] (HUDI-2786) Failed to connect to namenode in Docker Demo on Apple M1 chip

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2786: - Sprint: 2022/09/05 (was: 2022/09/19) > Failed to connect to namenode in Docker Demo on Apple M1 chip >

[jira] [Updated] (HUDI-4193) Fail to compile in osx aarch_64 environment

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4193: - Priority: Major (was: Minor) > Fail to compile in osx aarch_64 environment >

[GitHub] [hudi] hudi-bot commented on pull request #5478: [HUDI-3998] Fix getCommitsSinceLastCleaning failed when async cleaning

2022-09-07 Thread GitBox
hudi-bot commented on PR #5478: URL: https://github.com/apache/hudi/pull/5478#issuecomment-1239482130 ## CI report: * 7a9f87cb94043c2447da84ff07ff93009c891174 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6625: [HUDI-4799] improve analyzer exception tip when can not resolve expre…

2022-09-07 Thread GitBox
hudi-bot commented on PR #6625: URL: https://github.com/apache/hudi/pull/6625#issuecomment-1239476976 ## CI report: * 5f385a174df1fa344b87a3a4ada3f3f6d61f1d76 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #6624: [HUDI-4518] add unit for reentrant lock in diff lockProvider

2022-09-07 Thread GitBox
hudi-bot commented on PR #6624: URL: https://github.com/apache/hudi/pull/6624#issuecomment-1239476880 ## CI report: * 5bc514fe8aa680df5066dc0d1bcad3fc950afdf8 Azure:

[jira] [Updated] (HUDI-4697) Support show_invalid_parquet command based on Call Produce Command

2022-09-07 Thread jimmyz (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jimmyz updated HUDI-4697: - Status: Open (was: In Progress) > Support show_invalid_parquet command based on Call Produce Command >

[jira] [Resolved] (HUDI-4697) Support show_invalid_parquet command based on Call Produce Command

2022-09-07 Thread jimmyz (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jimmyz resolved HUDI-4697. -- > Support show_invalid_parquet command based on Call Produce Command >

[jira] [Updated] (HUDI-4697) Support show_invalid_parquet command based on Call Produce Command

2022-09-07 Thread jimmyz (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jimmyz updated HUDI-4697: - Status: In Progress (was: Open) > Support show_invalid_parquet command based on Call Produce Command >

[jira] [Updated] (HUDI-4797) Merge Into Table Failed when Source Table Has Different Column Order

2022-09-07 Thread jimmyz (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jimmyz updated HUDI-4797: - Status: In Progress (was: Open) > Merge Into Table Failed when Source Table Has Different Column Order >

[jira] [Updated] (HUDI-4797) Merge Into Table Failed when Source Table Has Different Column Order

2022-09-07 Thread jimmyz (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jimmyz updated HUDI-4797: - Status: Patch Available (was: In Progress) > Merge Into Table Failed when Source Table Has Different Column

[GitHub] [hudi] hudi-bot commented on pull request #6624: [HUDI-4518] add unit for reentrant lock in diff lockProvider

2022-09-07 Thread GitBox
hudi-bot commented on PR #6624: URL: https://github.com/apache/hudi/pull/6624#issuecomment-1239469704 ## CI report: * 5bc514fe8aa680df5066dc0d1bcad3fc950afdf8 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #6619: [HUDI-4796] MetricsReporter stop bug

2022-09-07 Thread GitBox
hudi-bot commented on PR #6619: URL: https://github.com/apache/hudi/pull/6619#issuecomment-1239469606 ## CI report: * 461a755d6938132f17243987fb7ab5e69a883f1e Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6384: [HUDI-4613] Avoid the use of regex expressions when call hoodieFileGroup#addLogFile function

2022-09-07 Thread GitBox
hudi-bot commented on PR #6384: URL: https://github.com/apache/hudi/pull/6384#issuecomment-1239468814 ## CI report: * ecc4f73ee21eac826979c427414a8560036ceceb Azure:

[jira] [Updated] (HUDI-4799) improve analyzer exception tip when can not resolve expression

2022-09-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4799: - Labels: pull-request-available (was: ) > improve analyzer exception tip when can not resolve

[GitHub] [hudi] KnightChess opened a new pull request, #6625: [HUDI-4799] improve analyzer exception tip when can not resolve expre…

2022-09-07 Thread GitBox
KnightChess opened a new pull request, #6625: URL: https://github.com/apache/hudi/pull/6625 …ssion ### Change Logs in merge into, and source table is unresolved, the Expression is doubtful when can not resolve expression ### Impact _Describe any public API or

[GitHub] [hudi] KnightChess opened a new pull request, #6624: [HUDI-4518] add unit for reentrant lock in diff lockProvider

2022-09-07 Thread GitBox
KnightChess opened a new pull request, #6624: URL: https://github.com/apache/hudi/pull/6624 ### Change Logs add ut for #6272 ### Impact _Describe any public API or user-facing feature change or any performance impact._ **Risk level: none | low | medium | high**

[jira] [Updated] (HUDI-4799) improve analyzer exception tip when can not resolve expression

2022-09-07 Thread KnightChess (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] KnightChess updated HUDI-4799: -- Description: sql: merge into hudi_mor_pk_cbfield_tbl3 as target  using (select id, dt from

[GitHub] [hudi] hudi-bot commented on pull request #6619: [HUDI-4796] MetricsReporter stop bug

2022-09-07 Thread GitBox
hudi-bot commented on PR #6619: URL: https://github.com/apache/hudi/pull/6619#issuecomment-1239462385 ## CI report: * 461a755d6938132f17243987fb7ab5e69a883f1e Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6384: [HUDI-4613] Avoid the use of regex expressions when call hoodieFileGroup#addLogFile function

2022-09-07 Thread GitBox
hudi-bot commented on PR #6384: URL: https://github.com/apache/hudi/pull/6384#issuecomment-1239461414 ## CI report: * ecc4f73ee21eac826979c427414a8560036ceceb Azure:

[jira] [Updated] (HUDI-4799) improve analyzer exception tip when can not resolve expression

2022-09-07 Thread KnightChess (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] KnightChess updated HUDI-4799: -- Description: sql: merge into hudi_mor_pk_cbfield_tbl3 as target  using (select id, dt from

[jira] [Assigned] (HUDI-4799) improve analyzer exception tip when can not resolve expression

2022-09-07 Thread KnightChess (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] KnightChess reassigned HUDI-4799: - Assignee: KnightChess > improve analyzer exception tip when can not resolve expression >

[jira] [Created] (HUDI-4799) improve analyzer exception tip when can not resolve expression

2022-09-07 Thread KnightChess (Jira)
KnightChess created HUDI-4799: - Summary: improve analyzer exception tip when can not resolve expression Key: HUDI-4799 URL: https://issues.apache.org/jira/browse/HUDI-4799 Project: Apache Hudi

[GitHub] [hudi] hudi-bot commented on pull request #6619: [HUDI-4796] MetricsReporter stop bug

2022-09-07 Thread GitBox
hudi-bot commented on PR #6619: URL: https://github.com/apache/hudi/pull/6619#issuecomment-1239454983 ## CI report: * 461a755d6938132f17243987fb7ab5e69a883f1e Azure:

[GitHub] [hudi] yuzhaojing commented on a diff in pull request #4309: [HUDI-3016][RFC-43] Proposal to implement Table Management Service

2022-09-07 Thread GitBox
yuzhaojing commented on code in PR #4309: URL: https://github.com/apache/hudi/pull/4309#discussion_r964884802 ## rfc/rfc-43/rfc-43.md: ## @@ -0,0 +1,316 @@ + + +# RFC-43: Implement Table Management ServiceTable Management Service for Hudi + +## Proposers + +- @yuzhaojing + +##

[GitHub] [hudi] santoshsb commented on issue #5452: Schema Evolution: Missing column for previous records when new entry does not have the same while upsert.

2022-09-07 Thread GitBox
santoshsb commented on issue #5452: URL: https://github.com/apache/hudi/issues/5452#issuecomment-1239394929 @codope here is the output without the above mentioned config, have also added the code which am using for testing the fix. --ERROR `22/09/07

[GitHub] [hudi] flashJd commented on pull request #6429: [HUDI-4636] Output preCombine fields of delete records when changelog disabled

2022-09-07 Thread GitBox
flashJd commented on PR #6429: URL: https://github.com/apache/hudi/pull/6429#issuecomment-1239369070 > > We need the preCombine and partition fields also, so pull this request. > > Can you explain why we need this then, do you want to write to another hudi table using these records ?

[GitHub] [hudi] hudi-bot commented on pull request #6384: [HUDI-4613] Avoid the use of regex expressions when call hoodieFileGroup#addLogFile function

2022-09-07 Thread GitBox
hudi-bot commented on PR #6384: URL: https://github.com/apache/hudi/pull/6384#issuecomment-1239360812 ## CI report: * ecc4f73ee21eac826979c427414a8560036ceceb Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6566: [HUDI-4766] Strengthen flink clustering job

2022-09-07 Thread GitBox
hudi-bot commented on PR #6566: URL: https://github.com/apache/hudi/pull/6566#issuecomment-1239296785 ## CI report: * b10c9d062f03c2c2675866c6f4bf6346dc03ea49 UNKNOWN * a2dcd81f74603e88c4db895900d43eee6702a6da UNKNOWN * c404647afc6d26bc0e69a7a8ef93f378b397bb96 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #6574: Keep a clustering running at the same time.#6573

2022-09-07 Thread GitBox
hudi-bot commented on PR #6574: URL: https://github.com/apache/hudi/pull/6574#issuecomment-1239291320 ## CI report: * 7ced8cc1e89594e2a074a546a165ce3ef744841f Azure:

[GitHub] [hudi] praveenkmr opened a new issue, #6623: java.lang.ClassNotFoundException: Class org.apache.hadoop.hbase.client.ClusterStatusListener$MulticastListener with HBase Index [SUPPORT]

2022-09-07 Thread GitBox
praveenkmr opened a new issue, #6623: URL: https://github.com/apache/hudi/issues/6623 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? - Join the mailing list to engage in conversations and get faster support at

[GitHub] [hudi] hudi-bot commented on pull request #6366: [HUDI-4794] add an option of the log file block size

2022-09-07 Thread GitBox
hudi-bot commented on PR #6366: URL: https://github.com/apache/hudi/pull/6366#issuecomment-1239214183 ## CI report: * 9e8e5113a5dd1419282a3b0aa17b796b74b7f886 Azure:

[GitHub] [hudi] codope commented on issue #5281: [SUPPORT] .hoodie/hoodie.properties file can be deleted due to retention settings of cloud providers

2022-09-07 Thread GitBox
codope commented on issue #5281: URL: https://github.com/apache/hudi/issues/5281#issuecomment-1239195738 Closing. I've a patch to add a note in the docs. #6622

[GitHub] [hudi] codope closed issue #5281: [SUPPORT] .hoodie/hoodie.properties file can be deleted due to retention settings of cloud providers

2022-09-07 Thread GitBox
codope closed issue #5281: [SUPPORT] .hoodie/hoodie.properties file can be deleted due to retention settings of cloud providers URL: https://github.com/apache/hudi/issues/5281

[jira] [Updated] (HUDI-3893) Add support to refresh hoodie.properties at regular intervals

2022-09-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3893: - Labels: pull-request-available (was: ) > Add support to refresh hoodie.properties at regular

[GitHub] [hudi] codope opened a new pull request, #6622: [HUDI-3893][DOCS] Add a note about lifecycle policies

2022-09-07 Thread GitBox
codope opened a new pull request, #6622: URL: https://github.com/apache/hudi/pull/6622 ### Change Logs Added a note regarding lifecycle policy under cloud documentation. See issue #5281 ### Impact None. Documentation change. **Risk level: none | low | medium |

[GitHub] [hudi] codope commented on issue #6024: [SUPPORT] DELETE_PARTITION causes AWS Athena Query failure

2022-09-07 Thread GitBox
codope commented on issue #6024: URL: https://github.com/apache/hudi/issues/6024#issuecomment-1239185807 @Gatsby-Lee Gentle reminder. Can we close this issue?

[jira] [Updated] (HUDI-4798) Need to upgrade parquet after nested fields fixed

2022-09-07 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-4798: -- Fix Version/s: 1.0.0 > Need to upgrade parquet after nested fields fixed >

[jira] [Created] (HUDI-4798) Need to upgrade parquet after nested fields fixed

2022-09-07 Thread Sagar Sumit (Jira)
Sagar Sumit created HUDI-4798: - Summary: Need to upgrade parquet after nested fields fixed Key: HUDI-4798 URL: https://issues.apache.org/jira/browse/HUDI-4798 Project: Apache Hudi Issue Type:

[GitHub] [hudi] codope commented on issue #5701: [SUPPORT]hudi how to upsert a non null array data to a existing column with array of nulls,optional binary. java.lang.ClassCastException: optional bina

2022-09-07 Thread GitBox
codope commented on issue #5701: URL: https://github.com/apache/hudi/issues/5701#issuecomment-1239183364 We need to upgrade parquet-avro once the above issues are fixed. Closing this as it is not related to Hudi. Created HUDI-4798 to track parquet upgrade.

[GitHub] [hudi] codope closed issue #5701: [SUPPORT]hudi how to upsert a non null array data to a existing column with array of nulls,optional binary. java.lang.ClassCastException: optional binary ele

2022-09-07 Thread GitBox
codope closed issue #5701: [SUPPORT]hudi how to upsert a non null array data to a existing column with array of nulls,optional binary. java.lang.ClassCastException: optional binary element (UTF8) is not a group URL: https://github.com/apache/hudi/issues/5701 -- This is an automated message

[GitHub] [hudi] codope commented on issue #2509: [SUPPORT] Hudi Spark DataSource saves TimestampType as bigInt

2022-09-07 Thread GitBox
codope commented on issue #2509: URL: https://github.com/apache/hudi/issues/2509#issuecomment-1239174536 @zuyanton Did you get a chance to try out the suggested patch? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[GitHub] [hudi] codope commented on issue #5843: [SUPPORT] Hoodie can request and complete commits far in the future on its timeline

2022-09-07 Thread GitBox
codope commented on issue #5843: URL: https://github.com/apache/hudi/issues/5843#issuecomment-1239171918 @kasured Did you hear back from AWS wrt the patched version? Would it be easier to upgrade to latest EMR 6.x version which has Hudi 0.11.1 with this fix? -- This is an automated

[GitHub] [hudi] codope commented on issue #5511: [SUPPORT] Inremental query from the beginning of time

2022-09-07 Thread GitBox
codope commented on issue #5511: URL: https://github.com/apache/hudi/issues/5511#issuecomment-1239165659 Closing as the feature is implemented. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] codope closed issue #5511: [SUPPORT] Inremental query from the beginning of time

2022-09-07 Thread GitBox
codope closed issue #5511: [SUPPORT] Inremental query from the beginning of time URL: https://github.com/apache/hudi/issues/5511 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[GitHub] [hudi] codope commented on issue #4887: [SUPPORT] Unexpected behaviour with partitioned hudi tables with impala as query engine

2022-09-07 Thread GitBox
codope commented on issue #4887: URL: https://github.com/apache/hudi/issues/4887#issuecomment-1239164148 @garyli1019 Do we have a timeline when we can upgrade Hudi in Impala? We should close this issue and share the patch with user, if possible. -- This is an automated message from the

[GitHub] [hudi] hudi-bot commented on pull request #6566: [HUDI-4766] Strengthen flink clustering job

2022-09-07 Thread GitBox
hudi-bot commented on PR #6566: URL: https://github.com/apache/hudi/pull/6566#issuecomment-1239148326 ## CI report: * b10c9d062f03c2c2675866c6f4bf6346dc03ea49 UNKNOWN * a2dcd81f74603e88c4db895900d43eee6702a6da UNKNOWN * c404647afc6d26bc0e69a7a8ef93f378b397bb96 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #5091: [HUDI-3453] Fix HoodieBackedTableMetadata concurrent reading issue

2022-09-07 Thread GitBox
hudi-bot commented on PR #5091: URL: https://github.com/apache/hudi/pull/5091#issuecomment-1239146013 ## CI report: * 5694cdd9336488bb255f461da20ce2d71609c0d1 Azure:

[jira] [Updated] (HUDI-4692) Clean up HoodieSparkSqlWriter

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4692: - Sprint: 2022/09/19 (was: 2022/09/05) > Clean up HoodieSparkSqlWriter > - > >

[GitHub] [hudi] dik111 opened a new issue, #6621: [SUPPORT]com.esotericsoftware.kryo.KryoException: Encountered unregistered class ID: 36

2022-09-07 Thread GitBox
dik111 opened a new issue, #6621: URL: https://github.com/apache/hudi/issues/6621 **Describe the problem you faced** I used Flink to update data in realtime, and used spark to read data, it throws an error `com.esotericsoftware.kryo.KryoException: Encountered unregistered class ID:

[GitHub] [hudi] codope commented on issue #4784: [SUPPORT] Partition column not appearing in spark dataframe

2022-09-07 Thread GitBox
codope commented on issue #4784: URL: https://github.com/apache/hudi/issues/4784#issuecomment-1239133491 @yesemsanthoshkumar Did you get a chance to test with the given patch above? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

[jira] [Updated] (HUDI-3646) The Hudi update syntax should not modify the nullability attribute of a column

2022-09-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3646: - Sprint: (was: 2022/09/05) > The Hudi update syntax should not modify the nullability attribute of a
