[jira] [Updated] (HUDI-5602) Troubleshoot METADATA_ONLY bootstrapped table not being able to read back partition path

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5602: Fix Version/s: 0.14.0 (was: 0.13.1) > Troubleshoot METADATA_ONLY bootstrapped table

[jira] [Updated] (HUDI-5608) Support decimals w/ precision > 30 in Column Stats

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5608: Fix Version/s: 0.14.0 (was: 0.13.1) > Support decimals w/ precision > 30 in Column

[jira] [Updated] (HUDI-5575) Support any record key generation along w/ any partition path generation for row writer

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5575: Fix Version/s: 0.14.0 (was: 0.13.1) > Support any record key generation along w/ any

[jira] [Updated] (HUDI-5574) Support auto record key generation with Spark SQL

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5574: Fix Version/s: 0.14.0 (was: 0.13.1) > Support auto record key generation with Spark

[jira] [Updated] (HUDI-5588) Fix Metadata table validator to deduce valid partitions when first commit where partition was added is failed

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5588: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix Metadata table validator to deduce valid

[jira] [Updated] (HUDI-5444) FileNotFound issue w/ metadata enabled

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5444: Fix Version/s: 0.14.0 (was: 0.13.1) > FileNotFound issue w/ metadata enabled >

[jira] [Updated] (HUDI-5507) SparkSQL can not read the latest change data without execute "refresh table xxx"

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5507: Fix Version/s: 0.14.0 (was: 0.13.1) > SparkSQL can not read the latest change data

[jira] [Updated] (HUDI-5557) Wrong candidate files found in metadata table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5557: Fix Version/s: 0.14.0 (was: 0.13.1) > Wrong candidate files found in metadata table

[jira] [Updated] (HUDI-5463) Apply rollback commits from data table as rollbacks in MDT instead of Delta commit

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5463: Fix Version/s: 0.14.0 (was: 0.13.1) > Apply rollback commits from data table as

[jira] [Updated] (HUDI-5442) Fix HiveHoodieTableFileIndex to use lazy listing

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5442: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix HiveHoodieTableFileIndex to use lazy

[jira] [Updated] (HUDI-5520) Fail MDT when list of log files grows unboundedly

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5520: Fix Version/s: 0.14.0 (was: 0.13.1) > Fail MDT when list of log files grows

[jira] [Updated] (HUDI-5436) Auto repair tool for MDT out of sync

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5436: Fix Version/s: 0.14.0 (was: 0.13.1) > Auto repair tool for MDT out of sync >

[jira] [Updated] (HUDI-5374) Use KeyGeneratorFactory class for instantiating a KeyGenerator

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5374: Fix Version/s: 0.14.0 (was: 0.13.1) > Use KeyGeneratorFactory class for

[jira] [Updated] (HUDI-5271) Inconsistent reader and writer schema in HoodieAvroDataBlock cause exception

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5271: Fix Version/s: 0.14.0 (was: 0.13.1) > Inconsistent reader and writer schema in

[jira] [Updated] (HUDI-5364) Make sure Hudi's Column Stats are wired into Spark's relation stats

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5364: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure Hudi's Column Stats are wired into

[jira] [Updated] (HUDI-5385) Make behavior of keeping File Writers open configurable

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5385: Fix Version/s: 0.14.0 (was: 0.13.1) > Make behavior of keeping File Writers open

[jira] [Updated] (HUDI-5322) Bulk-insert (row-writing) is not rewriting incoming dataset into Writer's schema

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5322: Fix Version/s: 0.14.0 (was: 0.13.1) > Bulk-insert (row-writing) is not rewriting

[jira] [Updated] (HUDI-5405) Avoid using Projections in generic Merge Into DMLs

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5405: Fix Version/s: 0.14.0 (was: 0.13.1) > Avoid using Projections in generic Merge Into

[jira] [Updated] (HUDI-5361) Propagate Hudi properties set in Spark's SQLConf to Hudi

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5361: Fix Version/s: 0.14.0 (was: 0.13.1) > Propagate Hudi properties set in Spark's

[jira] [Updated] (HUDI-5438) Benchmark calls w/ metadata enabled and ensure no calls to direct FS

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5438: Fix Version/s: 0.14.0 (was: 0.13.1) > Benchmark calls w/ metadata enabled and ensure

[jira] [Updated] (HUDI-5352) Jackson fails to serialize LocalDate when updating Delta Commit metadata

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5352: Fix Version/s: 0.14.0 (was: 0.13.1) > Jackson fails to serialize LocalDate when

[jira] [Updated] (HUDI-5319) NPE in Bloom Filter Index

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5319: Fix Version/s: 0.14.0 (was: 0.13.1) > NPE in Bloom Filter Index >

[jira] [Updated] (HUDI-4944) The encoded slash (%2F) in partition path is not properly decoded during Spark read

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4944: Fix Version/s: 0.14.0 (was: 0.13.1) > The encoded slash (%2F) in partition path is

[jira] [Updated] (HUDI-4937) Fix HoodieTable injecting HoodieBackedTableMetadata not reusing underlying MT readers

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4937: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix HoodieTable injecting

[jira] [Updated] (HUDI-4738) [MOR] Bloom Index missing new records inserted into Log files

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4738: Fix Version/s: 0.14.0 (was: 0.13.1) > [MOR] Bloom Index missing new records inserted

[jira] [Updated] (HUDI-5080) UnpersistRdds unpersist all rdds in the spark context

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5080: Fix Version/s: (was: 0.13.1) > UnpersistRdds unpersist all rdds in the spark context >

[jira] [Updated] (HUDI-4947) Missing .hoodie/hoodie.properties in Hudi table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4947: Fix Version/s: 0.14.0 (was: 0.13.1) > Missing .hoodie/hoodie.properties in Hudi

[jira] [Updated] (HUDI-4922) Presto query of bootstrapped data returns null

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4922: Fix Version/s: 0.14.0 (was: 0.13.1) > Presto query of bootstrapped data returns

[jira] [Updated] (HUDI-5092) Querying Hudi table throws NoSuchMethodError in Databricks runtime

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5092: Fix Version/s: 0.14.0 (was: 0.13.1) > Querying Hudi table throws NoSuchMethodError

[jira] [Updated] (HUDI-5015) Cleaner does not work properly when metadata table is enabled

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5015: Fix Version/s: 0.14.0 (was: 0.13.1) > Cleaner does not work properly when metadata

[jira] [Updated] (HUDI-4958) Provide accurate numDeletes in commit metadata

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4958: Fix Version/s: 0.14.0 (was: 0.13.1) > Provide accurate numDeletes in commit metadata

[jira] [Updated] (HUDI-5229) Add flink avro version entry in root pom

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5229: Fix Version/s: 0.14.0 (was: 0.13.1) > Add flink avro version entry in root pom >

[jira] [Updated] (HUDI-4777) Flink gen bucket index of mor table not consistent with spark lead to duplicate bucket issue

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4777: Fix Version/s: 0.14.0 (was: 0.13.1) > Flink gen bucket index of mor table not

[jira] [Updated] (HUDI-4921) Fix last completed commit in CleanPlanner

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4921: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix last completed commit in CleanPlanner >

[jira] [Updated] (HUDI-4854) Deltastreamer does not respect partition selector regex for metadata-only bootstrap

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4854: Fix Version/s: 0.14.0 (was: 0.13.1) > Deltastreamer does not respect partition

[jira] [Updated] (HUDI-4852) Incremental sync not updating pending file groups under clustering

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4852: Fix Version/s: 0.14.0 (was: 0.13.1) > Incremental sync not updating pending file

[jira] [Updated] (HUDI-4818) Using CustomKeyGenerator fails w/ SparkHoodieTableFileIndex

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4818: Fix Version/s: 0.14.0 (was: 0.13.1) > Using CustomKeyGenerator fails w/

[jira] [Updated] (HUDI-4632) Remove the force active property for flink1.14 profile

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4632: Fix Version/s: 0.14.0 (was: 0.13.1) > Remove the force active property for flink1.14

[jira] [Updated] (HUDI-4643) MergeInto syntax WHEN MATCHED is optional but must be set

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4643: Fix Version/s: 0.14.0 (was: 0.13.1) > MergeInto syntax WHEN MATCHED is optional but

[jira] [Updated] (HUDI-4704) bulk insert overwrite table will delete the table and then recreate a table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4704: Fix Version/s: 0.14.0 (was: 0.13.1) > bulk insert overwrite table will delete the

[jira] [Updated] (HUDI-4542) Flink streaming query fails with ClassNotFoundException

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4542: Fix Version/s: 0.14.0 (was: 0.13.1) > Flink streaming query fails with

[jira] [Updated] (HUDI-4573) Fix HoodieMultiTableDeltaStreamer to write all tables in continuous mode

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4573: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix HoodieMultiTableDeltaStreamer to write all

[jira] [Updated] (HUDI-4541) Flink job fails with column stats enabled in metadata table due to NotSerializableException

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4541: Fix Version/s: 0.14.0 (was: 0.13.1) > Flink job fails with column stats enabled in

[jira] [Updated] (HUDI-4457) Make sure IT docker test return code non-zero when failed

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4457: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure IT docker test return code non-zero

[jira] [Updated] (HUDI-4430) Incorrect type casting while reading HUDI table created with CustomKeyGenerator and unixtimestamp partitioning field

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4430: Fix Version/s: 0.14.0 (was: 0.13.1) > Incorrect type casting while reading HUDI

[jira] [Updated] (HUDI-4539) Make Hudi's CLI API consistent

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4539: Fix Version/s: 0.14.0 (was: 0.13.1) > Make Hudi's CLI API consistent >

[jira] [Updated] (HUDI-4330) NPE when trying to upsert into a dataset with no Meta Fields

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4330: Fix Version/s: 0.14.0 (was: 0.13.1) > NPE when trying to upsert into a dataset with

[jira] [Updated] (HUDI-4266) Flink streaming reader can not work when there are multiple partition fields

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4266: Fix Version/s: 0.14.0 (was: 0.13.1) > Flink streaming reader can not work when there

[jira] [Updated] (HUDI-4321) Fix Hudi to not write in Parquet legacy format

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4321: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix Hudi to not write in Parquet legacy format

[jira] [Updated] (HUDI-4184) Creating external table in Spark SQL modifies "hoodie.properties"

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4184: Fix Version/s: 0.14.0 (was: 0.13.1) > Creating external table in Spark SQL modifies

[jira] [Updated] (HUDI-4369) Hudi Kafka Connect Sink writing to GCS bucket

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4369: Fix Version/s: 0.14.0 (was: 0.13.1) > Hudi Kafka Connect Sink writing to GCS bucket

[jira] [Updated] (HUDI-4341) HoodieHFileReader is not compatible with Hadoop 3

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4341: Fix Version/s: 0.14.0 (was: 0.13.1) > HoodieHFileReader is not compatible with

[jira] [Updated] (HUDI-4185) Evaluate alternatives to using "hoodie.properties" as state store for Metadata Table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4185: Fix Version/s: 0.14.0 (was: 0.13.1) > Evaluate alternatives to using

[jira] [Updated] (HUDI-4306) ComplexKeyGenerator and ComplexAvroKeyGenerator support non-partitioned table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4306: Fix Version/s: 0.14.0 (was: 0.13.1) > ComplexKeyGenerator and

[jira] [Updated] (HUDI-3940) Lock manager does not increment retry count upon exception

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3940: Fix Version/s: (was: 0.13.1) > Lock manager does not increment retry count upon exception >

[jira] [Updated] (HUDI-3976) Newly introduced HiveSyncConfig config, syncAsSparkDataSourceTable is defaulted as true

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3976: Fix Version/s: 0.14.0 (was: 0.13.1) > Newly introduced HiveSyncConfig config,

[jira] [Updated] (HUDI-4112) Relax constraint in metadata table that rollback of a commit that got archived in MDT throws exception

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4112: Fix Version/s: 0.14.0 (was: 0.13.1) > Relax constraint in metadata table that

[jira] [Updated] (HUDI-3342) MOR Delta Block Rollbacks not applied if Lazy Block reading is disabled

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3342: Fix Version/s: 0.14.0 (was: 0.13.1) > MOR Delta Block Rollbacks not applied if Lazy

[jira] [Updated] (HUDI-4154) Unable to write HUDI Tables to S3 via Flink SQL

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4154: Fix Version/s: 0.14.0 (was: 0.13.1) > Unable to write HUDI Tables to S3 via Flink

[jira] [Updated] (HUDI-3646) The Hudi update syntax should not modify the nullability attribute of a column

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3646: Fix Version/s: 0.14.0 (was: 0.13.1) > The Hudi update syntax should not modify the

[jira] [Updated] (HUDI-3786) how to deduce what MDT partitions to update on the write path w/ async indexing

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3786: Fix Version/s: 0.14.0 (was: 0.13.1) > how to deduce what MDT partitions to update on

[jira] [Updated] (HUDI-3683) Support evolved schema for HFile Reader

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3683: Fix Version/s: 0.14.0 (was: 0.13.1) > Support evolved schema for HFile Reader >

[jira] [Updated] (HUDI-3626) Refactor TableSchemaResolver to remove `includeMetadataFields` flags

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3626: Fix Version/s: 0.14.0 (was: 0.13.1) > Refactor TableSchemaResolver to remove

[jira] [Updated] (HUDI-3603) Support read DateType for hive2/hive3

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3603: Fix Version/s: 0.14.0 (was: 0.13.1) > Support read DateType for hive2/hive3 >

[jira] [Updated] (HUDI-3639) [Incremental] Add Proper Incremental Records Filtering support into Hudi's custom RDD

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3639: Fix Version/s: 0.14.0 (was: 0.13.1) > [Incremental] Add Proper Incremental Records

[jira] [Updated] (HUDI-3887) Spark query can not read the data changes which written by flink after the spark query connection created

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3887: Fix Version/s: 0.14.0 (was: 0.13.1) > Spark query can not read the data changes

[jira] [Updated] (HUDI-3636) Clustering fails due to marker creation failure

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3636: Fix Version/s: 0.14.0 (was: 0.13.1) > Clustering fails due to marker creation

[jira] [Updated] (HUDI-3668) Fix failing unit tests in hudi-integ-test

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3668: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix failing unit tests in hudi-integ-test >

[jira] [Updated] (HUDI-3818) hudi doesn't support bytes column as primary key

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3818: Fix Version/s: 0.14.0 (was: 0.13.1) > hudi doesn't support bytes column as primary

[jira] [Updated] (HUDI-3648) Failed to execute rollback due to HoodieIOException: Could not delete instant

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3648: Fix Version/s: 0.14.0 (was: 0.13.1) > Failed to execute rollback due to

[jira] [Updated] (HUDI-3407) Make sure Restore operation is Not Concurrent w/ Writes in Multi-Writer scenario

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3407: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure Restore operation is Not Concurrent

[jira] [Updated] (HUDI-3487) The global index is enabled regardless of changelog

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3487: Fix Version/s: 0.14.0 (was: 0.13.1) > The global index is enabled regardless of

[jira] [Updated] (HUDI-3467) Check shutdown logic with async compaction in Spark Structured Streaming

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3467: Fix Version/s: 0.14.0 (was: 0.13.1) > Check shutdown logic with async compaction in

[jira] [Updated] (HUDI-3517) Unicode in partition path causes it to be resolved wrongly

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3517: Fix Version/s: 0.14.0 (was: 0.13.1) > Unicode in partition path causes it to be

[jira] [Updated] (HUDI-3300) Timeline server FSViewManager should avoid point lookup for metadata file partition

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3300: Fix Version/s: 0.14.0 (was: 0.13.1) > Timeline server FSViewManager should avoid

[jira] [Updated] (HUDI-3067) "Table already exists" error with multiple writers and dynamodb

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3067: Fix Version/s: 0.14.0 (was: 0.13.1) > "Table already exists" error with multiple

[jira] [Updated] (HUDI-1748) Read operation will possibility fail on mor table rt view when a write operations is concurrency running

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1748: Fix Version/s: 0.14.0 (was: 0.13.1) > Read operation will possibility fail on mor

[jira] [Updated] (HUDI-3117) Kafka Connect can not clearly distinguish every task log

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3117: Fix Version/s: 0.14.0 (was: 0.13.1) > Kafka Connect can not clearly distinguish

[jira] [Updated] (HUDI-3057) Instants should be generated strictly under locks

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3057: Fix Version/s: 0.14.0 (was: 0.13.1) > Instants should be generated strictly under

[jira] [Updated] (HUDI-3023) Fix order of tests

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3023: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix order of tests

[jira] [Updated] (HUDI-3055) Make sure that Compression Codec configuration is respected across the board

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3055: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure that Compression Codec configuration

[jira] [Updated] (HUDI-1779) Fail to bootstrap/upsert a table which contains timestamp column

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1779: Fix Version/s: 0.14.0 (was: 0.13.1) > Fail to bootstrap/upsert a table which

[jira] [Updated] (HUDI-3114) Kafka Connect can not connect Hive by jdbc

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3114: Fix Version/s: 0.14.0 (was: 0.13.1) > Kafka Connect can not connect Hive by jdbc >

[jira] [Updated] (HUDI-2930) Rollbacks are not archived when metadata table is enabled

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2930: Fix Version/s: 0.14.0 (was: 0.13.1) > Rollbacks are not archived when metadata table

[jira] [Updated] (HUDI-3019) Upserts with Datatype promotion only to a subset of partition fails

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3019: Fix Version/s: 0.14.0 (was: 0.13.1) > Upserts with Datatype promotion only to a

[jira] [Updated] (HUDI-2782) Fix marker based strategy for structured streaming

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2782: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix marker based strategy for structured

[jira] [Updated] (HUDI-2910) Hudi CLI "commits showarchived" throws NPE

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2910: Fix Version/s: 0.14.0 (was: 0.13.1) > Hudi CLI "commits showarchived" throws NPE >

[jira] [Updated] (HUDI-2745) Record count does not match input after compaction is scheduled when running Hudi Kafka Connect sink

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2745: Fix Version/s: 0.14.0 (was: 0.13.1) > Record count does not match input after

[jira] [Updated] (HUDI-2528) Flaky test: MERGE_ON_READ testTableOperationsWithRestore

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2528: Fix Version/s: 0.14.0 (was: 0.13.1) > Flaky test: MERGE_ON_READ

[jira] [Updated] (HUDI-1889) Support partition path in a nested field in HoodieFileIndex

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1889: Fix Version/s: 0.14.0 (was: 0.13.1) > Support partition path in a nested field in

[jira] [Updated] (HUDI-1380) Async cleaning does not work with Timeline Server

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1380: Fix Version/s: 0.14.0 (was: 0.13.1) > Async cleaning does not work with Timeline

[jira] [Updated] (HUDI-1369) Bootstrap datasource jobs from hanging via spark-submit

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1369: Fix Version/s: 0.14.0 (was: 0.13.1) > Bootstrap datasource jobs from hanging via

[jira] [Updated] (HUDI-1117) Add tdunning json library to spark and utilities bundle

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1117: Fix Version/s: 0.14.0 (was: 0.13.1) > Add tdunning json library to spark and

[jira] [Updated] (HUDI-1158) Optimizations in parallelized listing behaviour for markers and bootstrap source files

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1158: Fix Version/s: 0.14.0 (was: 0.13.1) > Optimizations in parallelized listing

[jira] [Updated] (HUDI-1036) HoodieCombineHiveInputFormat not picking up HoodieRealtimeFileSplit

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1036: Fix Version/s: 0.14.0 (was: 0.13.1) > HoodieCombineHiveInputFormat not picking up

[jira] [Updated] (HUDI-1145) Debug if Insert operation calls upsert in case of small file handling path.

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1145: Fix Version/s: 0.14.0 (was: 0.13.1) > Debug if Insert operation calls upsert in case

[jira] [Updated] (HUDI-1286) Merge On Read queries (_rt) fails on docker demo for test suite

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1286: Fix Version/s: 0.14.0 (was: 0.13.1) > Merge On Read queries (_rt) fails on docker

[jira] [Updated] (HUDI-234) Graceful degradation of ObjectSizeCalculator for non hotspot jvms

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-234: Fix Version/s: 0.14.0 (was: 0.13.1) > Graceful degradation of ObjectSizeCalculator for

[jira] [Updated] (HUDI-992) For hive-style partitioned source data, partition columns synced with Hive will always have String type

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-992: Fix Version/s: 0.14.0 (was: 0.13.1) > For hive-style partitioned source data, partition

[jira] [Updated] (HUDI-83) Map Timestamp type in spark to corresponding Timestamp type in Hive during Hive sync

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-83?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-83: Fix Version/s: 0.14.0 (was: 0.13.1) > Map Timestamp type in spark to corresponding
