[jira] [Updated] (HUDI-4944) The encoded slash (%2F) in partition path is not properly decoded during Spark read

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4944: Fix Version/s: 0.14.0 (was: 0.13.1) > The encoded slash (%2F) in partition path is no

[jira] [Updated] (HUDI-4937) Fix HoodieTable injecting HoodieBackedTableMetadata not reusing underlying MT readers

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4937: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix HoodieTable injecting HoodieBackedTableMeta

[jira] [Updated] (HUDI-4738) [MOR] Bloom Index missing new records inserted into Log files

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4738: Fix Version/s: 0.14.0 (was: 0.13.1) > [MOR] Bloom Index missing new records inserted

[jira] [Updated] (HUDI-5080) UnpersistRdds unpersist all rdds in the spark context

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5080: Fix Version/s: (was: 0.13.1) > UnpersistRdds unpersist all rdds in the spark context > -

[jira] [Updated] (HUDI-4947) Missing .hoodie/hoodie.properties in Hudi table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4947: Fix Version/s: 0.14.0 (was: 0.13.1) > Missing .hoodie/hoodie.properties in Hudi table

[jira] [Updated] (HUDI-4922) Presto query of bootstrapped data returns null

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4922: Fix Version/s: 0.14.0 (was: 0.13.1) > Presto query of bootstrapped data returns null

[jira] [Updated] (HUDI-5092) Querying Hudi table throws NoSuchMethodError in Databricks runtime

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5092: Fix Version/s: 0.14.0 (was: 0.13.1) > Querying Hudi table throws NoSuchMethodError in

[jira] [Updated] (HUDI-5015) Cleaner does not work properly when metadata table is enabled

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5015: Fix Version/s: 0.14.0 (was: 0.13.1) > Cleaner does not work properly when metadata ta

[jira] [Updated] (HUDI-4958) Provide accurate numDeletes in commit metadata

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4958: Fix Version/s: 0.14.0 (was: 0.13.1) > Provide accurate numDeletes in commit metadata

[jira] [Updated] (HUDI-5229) Add flink avro version entry in root pom

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5229: Fix Version/s: 0.14.0 (was: 0.13.1) > Add flink avro version entry in root pom >

[jira] [Updated] (HUDI-4777) Flink gen bucket index of mor table not consistent with spark lead to duplicate bucket issue

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4777: Fix Version/s: 0.14.0 (was: 0.13.1) > Flink gen bucket index of mor table not consist

[jira] [Updated] (HUDI-4921) Fix last completed commit in CleanPlanner

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4921: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix last completed commit in CleanPlanner > ---

[jira] [Updated] (HUDI-4854) Deltastreamer does not respect partition selector regex for metadata-only bootstrap

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4854: Fix Version/s: 0.14.0 (was: 0.13.1) > Deltastreamer does not respect partition select

[jira] [Updated] (HUDI-4852) Incremental sync not updating pending file groups under clustering

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4852: Fix Version/s: 0.14.0 (was: 0.13.1) > Incremental sync not updating pending file grou

[jira] [Updated] (HUDI-4818) Using CustomKeyGenerator fails w/ SparkHoodieTableFileIndex

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4818: Fix Version/s: 0.14.0 (was: 0.13.1) > Using CustomKeyGenerator fails w/ SparkHoodieTa

[GitHub] [hudi] bvaradar commented on a diff in pull request #8452: [HUDI-6077] Add more partition push down filters

2023-05-22 Thread via GitHub
bvaradar commented on code in PR #8452: URL: https://github.com/apache/hudi/pull/8452#discussion_r1201504641 ## hudi-common/src/main/java/org/apache/hudi/metadata/FileSystemBackedTableMetadata.java: ## @@ -96,11 +109,32 @@ public List getPartitionPathWithPathPrefixes(List relat

[jira] [Updated] (HUDI-4632) Remove the force active property for flink1.14 profile

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4632: Fix Version/s: 0.14.0 (was: 0.13.1) > Remove the force active property for flink1.14

[jira] [Updated] (HUDI-4643) MergeInto syntax WHEN MATCHED is optional but must be set

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4643: Fix Version/s: 0.14.0 (was: 0.13.1) > MergeInto syntax WHEN MATCHED is optional but m

[jira] [Updated] (HUDI-4704) bulk insert overwrite table will delete the table and then recreate a table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4704: Fix Version/s: 0.14.0 (was: 0.13.1) > bulk insert overwrite table will delete the tab

[jira] [Updated] (HUDI-4542) Flink streaming query fails with ClassNotFoundException

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4542: Fix Version/s: 0.14.0 (was: 0.13.1) > Flink streaming query fails with ClassNotFoundE

[jira] [Updated] (HUDI-4573) Fix HoodieMultiTableDeltaStreamer to write all tables in continuous mode

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4573: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix HoodieMultiTableDeltaStreamer to write all

[jira] [Updated] (HUDI-4541) Flink job fails with column stats enabled in metadata table due to NotSerializableException

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4541: Fix Version/s: 0.14.0 (was: 0.13.1) > Flink job fails with column stats enabled in me

[jira] [Updated] (HUDI-4457) Make sure IT docker test return code non-zero when failed

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4457: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure IT docker test return code non-zero w

[jira] [Updated] (HUDI-4430) Incorrect type casting while reading HUDI table created with CustomKeyGenerator and unixtimestamp paritioning field

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4430: Fix Version/s: 0.14.0 (was: 0.13.1) > Incorrect type casting while reading HUDI table

[jira] [Updated] (HUDI-4539) Make Hudi's CLI API consistent

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4539: Fix Version/s: 0.14.0 (was: 0.13.1) > Make Hudi's CLI API consistent > --

[jira] [Updated] (HUDI-4330) NPE when trying to upsert into a dataset with no Meta Fields

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4330: Fix Version/s: 0.14.0 (was: 0.13.1) > NPE when trying to upsert into a dataset with n

[jira] [Updated] (HUDI-4266) Flink streaming reader can not work when there are multiple partition fields

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4266: Fix Version/s: 0.14.0 (was: 0.13.1) > Flink streaming reader can not work when there

[jira] [Updated] (HUDI-4321) Fix Hudi to not write in Parquet legacy format

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4321: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix Hudi to not write in Parquet legacy format

[jira] [Updated] (HUDI-4184) Creating external table in Spark SQL modifies "hoodie.properties"

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4184: Fix Version/s: 0.14.0 (was: 0.13.1) > Creating external table in Spark SQL modifies "

[jira] [Updated] (HUDI-4369) Hudi Kafka Connect Sink writing to GCS bucket

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4369: Fix Version/s: 0.14.0 (was: 0.13.1) > Hudi Kafka Connect Sink writing to GCS bucket >

[jira] [Updated] (HUDI-4341) HoodieHFileReader is not compatible with Hadoop 3

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4341: Fix Version/s: 0.14.0 (was: 0.13.1) > HoodieHFileReader is not compatible with Hadoop

[jira] [Updated] (HUDI-4185) Evaluate alternatives to using "hoodie.properties" as state store for Metadata Table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4185: Fix Version/s: 0.14.0 (was: 0.13.1) > Evaluate alternatives to using "hoodie.properti

[jira] [Updated] (HUDI-4306) ComplexKeyGenerator and ComplexAvroKeyGenerator support non-partitioned table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4306: Fix Version/s: 0.14.0 (was: 0.13.1) > ComplexKeyGenerator and ComplexAvroKeyGenerator

[jira] [Updated] (HUDI-3940) Lock manager does not increment retry count upon exception

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3940: Fix Version/s: (was: 0.13.1) > Lock manager does not increment retry count upon exception >

[jira] [Updated] (HUDI-3976) Newly introduced HiveSyncConfig config, syncAsSparkDataSourceTable is defaulted as true

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3976: Fix Version/s: 0.14.0 (was: 0.13.1) > Newly introduced HiveSyncConfig config, syncAsS

[jira] [Updated] (HUDI-4112) Relax constraint in metadata table that rollback of a commit that got archived in MDT throws exception

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4112: Fix Version/s: 0.14.0 (was: 0.13.1) > Relax constraint in metadata table that rollbac

[jira] [Updated] (HUDI-3342) MOR Delta Block Rollbacks not applied if Lazy Block reading is disabled

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3342: Fix Version/s: 0.14.0 (was: 0.13.1) > MOR Delta Block Rollbacks not applied if Lazy B

[jira] [Updated] (HUDI-4154) Unable to write HUDI Tables to S3 via Flink SQL

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4154: Fix Version/s: 0.14.0 (was: 0.13.1) > Unable to write HUDI Tables to S3 via Flink SQL

[jira] [Updated] (HUDI-3646) The Hudi update syntax should not modify the nullability attribute of a column

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3646: Fix Version/s: 0.14.0 (was: 0.13.1) > The Hudi update syntax should not modify the nu

[jira] [Updated] (HUDI-3786) how to deduce what MDT partitions to update on the write path w/ async indeing

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3786: Fix Version/s: 0.14.0 (was: 0.13.1) > how to deduce what MDT partitions to update on

[jira] [Updated] (HUDI-3683) Support evolved schema for HFile Reader

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3683: Fix Version/s: 0.14.0 (was: 0.13.1) > Support evolved schema for HFile Reader > -

[jira] [Updated] (HUDI-3626) Refactor TableSchemaResolver to remove `includeMetadataFields` flags

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3626: Fix Version/s: 0.14.0 (was: 0.13.1) > Refactor TableSchemaResolver to remove `include

[jira] [Updated] (HUDI-3603) Support read DateType for hive2/hive3

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3603: Fix Version/s: 0.14.0 (was: 0.13.1) > Support read DateType for hive2/hive3 > -

[jira] [Updated] (HUDI-3639) [Incremental] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3639: Fix Version/s: 0.14.0 (was: 0.13.1) > [Incremental] Add Proper Incremental Records FI

[jira] [Updated] (HUDI-3887) Spark query can not read the data changes which written by flink after the spark query connection created

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3887: Fix Version/s: 0.14.0 (was: 0.13.1) > Spark query can not read the data changes which

[jira] [Updated] (HUDI-3636) Clustering fails due to marker creation failure

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3636: Fix Version/s: 0.14.0 (was: 0.13.1) > Clustering fails due to marker creation failure

[jira] [Updated] (HUDI-3668) Fix failing unit tests in hudi-integ-test

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3668: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix failing unit tests in hudi-integ-test > ---

[jira] [Updated] (HUDI-3818) hudi doesn't support bytes column as primary key

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3818: Fix Version/s: 0.14.0 (was: 0.13.1) > hudi doesn't support bytes column as primary ke

[jira] [Updated] (HUDI-3648) Failed to execute rollback due to HoodieIOException: Could not delete instant

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3648: Fix Version/s: 0.14.0 (was: 0.13.1) > Failed to execute rollback due to HoodieIOExcep

[jira] [Updated] (HUDI-3407) Make sure Restore operation is Not Concurrent w/ Writes in Multi-Writer scenario

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3407: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure Restore operation is Not Concurrent w

[jira] [Updated] (HUDI-3487) The global index is enabled regardless of changlog

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3487: Fix Version/s: 0.14.0 (was: 0.13.1) > The global index is enabled regardless of chang

[jira] [Updated] (HUDI-3467) Check shutdown logic with async compaction in Spark Structured Streaming

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3467: Fix Version/s: 0.14.0 (was: 0.13.1) > Check shutdown logic with async compaction in S

[jira] [Updated] (HUDI-3517) Unicode in partition path causes it to be resolved wrongly

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3517: Fix Version/s: 0.14.0 (was: 0.13.1) > Unicode in partition path causes it to be resol

[jira] [Updated] (HUDI-3300) Timeline server FSViewManager should avoid point lookup for metadata file partition

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3300: Fix Version/s: 0.14.0 (was: 0.13.1) > Timeline server FSViewManager should avoid poin

[jira] [Updated] (HUDI-3067) "Table already exists" error with multiple writers and dynamodb

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3067: Fix Version/s: 0.14.0 (was: 0.13.1) > "Table already exists" error with multiple writ

[jira] [Updated] (HUDI-1748) Read operation will possibility fail on mor table rt view when a write operations is concurrency running

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1748: Fix Version/s: 0.14.0 (was: 0.13.1) > Read operation will possibility fail on mor tab

[jira] [Updated] (HUDI-3117) Kafka Connect can not clearly distinguish every task log

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3117: Fix Version/s: 0.14.0 (was: 0.13.1) > Kafka Connect can not clearly distinguish every

[jira] [Updated] (HUDI-3057) Instants should be generated strictly under locks

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3057: Fix Version/s: 0.14.0 (was: 0.13.1) > Instants should be generated strictly under loc

[jira] [Updated] (HUDI-3023) Fix order of tests

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3023: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix order of tests > -- > >

[jira] [Updated] (HUDI-3055) Make sure that Compression Codec configuration is respected across the board

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3055: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure that Compression Codec configuration

[jira] [Updated] (HUDI-1779) Fail to bootstrap/upsert a table which contains timestamp column

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1779: Fix Version/s: 0.14.0 (was: 0.13.1) > Fail to bootstrap/upsert a table which contains

[jira] [Updated] (HUDI-3114) Kafka Connect can not connect Hive by jdbc

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3114: Fix Version/s: 0.14.0 (was: 0.13.1) > Kafka Connect can not connect Hive by jdbc > --

[jira] [Updated] (HUDI-2930) Rollbacks are not archived when metadata table is enabled

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2930: Fix Version/s: 0.14.0 (was: 0.13.1) > Rollbacks are not archived when metadata table

[jira] [Updated] (HUDI-3019) Upserts with Dataype promotion only to a subset of partition fails

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3019: Fix Version/s: 0.14.0 (was: 0.13.1) > Upserts with Dataype promotion only to a subset

[jira] [Updated] (HUDI-2782) Fix marker based strategy for structured streaming

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2782: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix marker based strategy for structured stream

[jira] [Updated] (HUDI-2910) Hudi CLI "commits showarchived" throws NPE

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2910: Fix Version/s: 0.14.0 (was: 0.13.1) > Hudi CLI "commits showarchived" throws NPE > --

[jira] [Updated] (HUDI-2745) Record count does not match input after compaction is scheduled when running Hudi Kafka Connect sink

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2745: Fix Version/s: 0.14.0 (was: 0.13.1) > Record count does not match input after compact

[jira] [Updated] (HUDI-2528) Flaky test: MERGE_ON_READ testTableOperationsWithRestore

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2528: Fix Version/s: 0.14.0 (was: 0.13.1) > Flaky test: MERGE_ON_READ testTableOperationsWi

[jira] [Updated] (HUDI-1889) Support partition path in a nested field in HoodieFileIndex

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1889: Fix Version/s: 0.14.0 (was: 0.13.1) > Support partition path in a nested field in Hoo

[GitHub] [hudi] hudi-bot commented on pull request #8783: Archival enhancements

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8783: URL: https://github.com/apache/hudi/pull/8783#issuecomment-1558495944 ## CI report: * 9dbf3aa2367d1d78221a90ee1555188838424909 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1728

[jira] [Updated] (HUDI-1380) Async cleaning does not work with Timeline Server

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1380: Fix Version/s: 0.14.0 (was: 0.13.1) > Async cleaning does not work with Timeline Serv

[jira] [Updated] (HUDI-1369) Bootstrap datasource jobs from hanging via spark-submit

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1369: Fix Version/s: 0.14.0 (was: 0.13.1) > Bootstrap datasource jobs from hanging via spar

[jira] [Updated] (HUDI-1117) Add tdunning json library to spark and utilities bundle

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1117: Fix Version/s: 0.14.0 (was: 0.13.1) > Add tdunning json library to spark and utilitie

[jira] [Updated] (HUDI-1158) Optimizations in parallelized listing behaviour for markers and bootstrap source files

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1158: Fix Version/s: 0.14.0 (was: 0.13.1) > Optimizations in parallelized listing behaviour

[jira] [Updated] (HUDI-1036) HoodieCombineHiveInputFormat not picking up HoodieRealtimeFileSplit

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1036: Fix Version/s: 0.14.0 (was: 0.13.1) > HoodieCombineHiveInputFormat not picking up Hoo

[jira] [Updated] (HUDI-1145) Debug if Insert operation calls upsert in case of small file handling path.

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1145: Fix Version/s: 0.14.0 (was: 0.13.1) > Debug if Insert operation calls upsert in case

[jira] [Updated] (HUDI-1286) Merge On Read queries (_rt) fails on docker demo for test suite

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1286: Fix Version/s: 0.14.0 (was: 0.13.1) > Merge On Read queries (_rt) fails on docker dem

[jira] [Updated] (HUDI-234) Graceful degradation of ObjectSizeCalculator for non hotspot jvms

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-234: --- Fix Version/s: 0.14.0 (was: 0.13.1) > Graceful degradation of ObjectSizeCalculator for n

[jira] [Updated] (HUDI-992) For hive-style partitioned source data, partition columns synced with Hive will always have String type

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-992: --- Fix Version/s: 0.14.0 (was: 0.13.1) > For hive-style partitioned source data, partition

[jira] [Updated] (HUDI-83) Map Timestamp type in spark to corresponding Timestamp type in Hive during Hive sync

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-83?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-83: -- Fix Version/s: 0.14.0 (was: 0.13.1) > Map Timestamp type in spark to corresponding Timestam

[GitHub] [hudi] hudi-bot commented on pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8076: URL: https://github.com/apache/hudi/pull/8076#issuecomment-1558455680 ## CI report: * 6a239ada8998fd440f19c0082b26d206ed589870 UNKNOWN * 1fadedfb975375bba6571e7ecf51de55d7e8dae2 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8076: URL: https://github.com/apache/hudi/pull/8076#issuecomment-1558451056 ## CI report: * 6a239ada8998fd440f19c0082b26d206ed589870 UNKNOWN * 1fadedfb975375bba6571e7ecf51de55d7e8dae2 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #8771: [HUDI-6245] Automatically downgrade table version of metadata table

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8771: URL: https://github.com/apache/hudi/pull/8771#issuecomment-1558443009 ## CI report: * da61d395636ceb467f3ae534ecd34edd109ed7a4 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1728

[GitHub] [hudi] hudi-bot commented on pull request #8452: [HUDI-6077] Add more partition push down filters

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8452: URL: https://github.com/apache/hudi/pull/8452#issuecomment-1558440341 ## CI report: * 8082df232089396b2a9f9be2b915e51b3645f172 UNKNOWN * 197d58ce002e65cbe5969b2193fb0e8dffe7eac2 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] boneanxs commented on pull request #8452: [HUDI-6077] Add more partition push down filters

2023-05-22 Thread via GitHub
boneanxs commented on PR #8452: URL: https://github.com/apache/hudi/pull/8452#issuecomment-1558420147 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[GitHub] [hudi] boneanxs commented on a diff in pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
boneanxs commented on code in PR #8076: URL: https://github.com/apache/hudi/pull/8076#discussion_r1201426260 ## hudi-spark-datasource/hudi-spark-common/src/main/java/org/apache/hudi/commit/BaseDatasetBulkInsertCommitActionExecutor.java: ## @@ -0,0 +1,124 @@ +/* + * Licensed to t

[GitHub] [hudi] boneanxs commented on a diff in pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
boneanxs commented on code in PR #8076: URL: https://github.com/apache/hudi/pull/8076#discussion_r1201423428 ## hudi-spark-datasource/hudi-spark3.2plus-common/src/main/scala/org/apache/spark/sql/hudi/catalog/HoodieInternalV2Table.scala: ## @@ -106,8 +106,14 @@ private class Hood

[GitHub] [hudi] boneanxs commented on a diff in pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
boneanxs commented on code in PR #8076: URL: https://github.com/apache/hudi/pull/8076#discussion_r1201417934 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestInsertTable.scala: ## @@ -599,138 +582,250 @@ class TestInsertTable extends HoodieSparkSq

[GitHub] [hudi] c-f-cooper opened a new issue, #8784: [SUPPORT]Heartbeat for instant 20230523041356892 has expired, last heartbeat 0

2023-05-22 Thread via GitHub
c-f-cooper opened a new issue, #8784: URL: https://github.com/apache/hudi/issues/8784 **Describe the problem you faced** When i use flink multi-writer write to hudi,the table type is cow,and enabled the async clustering,the flink job always restarted. **Environment Description*

[GitHub] [hudi] hudi-bot commented on pull request #8782: [HUDI-6201] use concurrent map when possible in filesystemview

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8782: URL: https://github.com/apache/hudi/pull/8782#issuecomment-1558310706 ## CI report: * 395e5a0d3310a8b35347c179e886e6831931f516 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1727

[hudi] branch master updated (bec544a0163 -> b74e6ad2eb9)

2023-05-22 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from bec544a0163 [HUDI-3775] Allow for offline compaction of MOR tables via spark streaming (#7632) add b74e6ad2eb9 [HUD

[GitHub] [hudi] yihua merged pull request #8779: [HUDI-6247] Add bundle validation for release candidates

2023-05-22 Thread via GitHub
yihua merged PR #8779: URL: https://github.com/apache/hudi/pull/8779 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

[GitHub] [hudi] hudi-bot commented on pull request #8779: [HUDI-6247] Add bundle validation for release candidates

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8779: URL: https://github.com/apache/hudi/pull/8779#issuecomment-1558254116 ## CI report: * aff465a8e6b11d76be2f9025013ca4b8eaa9c04a UNKNOWN * 87fa4e51b432788ab40dd28203540babea48c258 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[jira] [Updated] (HUDI-6220) Add HUDI code version to commit files and hoodie.properties

2023-05-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6220: - Labels: pull-request-available (was: ) > Add HUDI code version to commit files and hoodie.propert

[GitHub] [hudi] nsivabalan commented on a diff in pull request #8724: [HUDI-6220] Add HUDI code version to commit files and hoodie.properties.

2023-05-22 Thread via GitHub
nsivabalan commented on code in PR #8724: URL: https://github.com/apache/hudi/pull/8724#discussion_r1201317230 ## hudi-common/src/main/java/org/apache/hudi/common/table/HoodieTableConfig.java: ## @@ -310,12 +312,16 @@ private static Properties getOrderedPropertiesWithTableCheck

[jira] [Closed] (HUDI-3775) Allow for offline compaction of MOR tables via spark streaming

2023-05-22 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler closed HUDI-3775. - Resolution: Fixed > Allow for offline compaction of MOR tables via spark streaming > -

[jira] [Commented] (HUDI-5659) Support cleaning for archived files

2023-05-22 Thread clownxc (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17725139#comment-17725139 ] clownxc commented on HUDI-5659: --- I would like to give a try on this, can I take this tickets

[GitHub] [hudi] rahil-c commented on pull request #8682: [DO NOT MERGE] [HUDI-6198] Run gh actions with Spark 3.4.0

2023-05-22 Thread via GitHub
rahil-c commented on PR #8682: URL: https://github.com/apache/hudi/pull/8682#issuecomment-1558191054 Hi @danny0405 @xiarixiaoyao, we are trying to upgrade spark to 3.4.0 in hudi. However we are facing issues with several functional test failures due to another casting exception. For exampl

[GitHub] [hudi] nsivabalan commented on pull request #8759: Add metrics counters for compaction start/stop events.

2023-05-22 Thread via GitHub
nsivabalan commented on PR #8759: URL: https://github.com/apache/hudi/pull/8759#issuecomment-1558187571 @SteNicholas : I get your intention. we did take a look at the active timeline methods where we do the transition. As of now, HoodieActive timeline is lightweight and does not have much d

[GitHub] [hudi] nsivabalan commented on a diff in pull request #8759: Add metrics counters for compaction start/stop events.

2023-05-22 Thread via GitHub
nsivabalan commented on code in PR #8759: URL: https://github.com/apache/hudi/pull/8759#discussion_r1201293956 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/compact/RunCompactionActionExecutor.java: ## @@ -65,10 +73,14 @@ public RunCompactionAction

<    1   2   3   4   5   >