[jira] [Updated] (HUDI-4539) Make Hudi's CLI API consistent

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4539: Fix Version/s: 0.14.0 (was: 0.13.1) > Make Hudi's CLI API consistent >

[jira] [Updated] (HUDI-4330) NPE when trying to upsert into a dataset with no Meta Fields

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4330: Fix Version/s: 0.14.0 (was: 0.13.1) > NPE when trying to upsert into a dataset with

[jira] [Updated] (HUDI-4266) Flink streaming reader can not work when there are multiple partition fields

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4266: Fix Version/s: 0.14.0 (was: 0.13.1) > Flink streaming reader can not work when there

[jira] [Updated] (HUDI-4321) Fix Hudi to not write in Parquet legacy format

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4321: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix Hudi to not write in Parquet legacy format

[jira] [Updated] (HUDI-4184) Creating external table in Spark SQL modifies "hoodie.properties"

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4184: Fix Version/s: 0.14.0 (was: 0.13.1) > Creating external table in Spark SQL modifies

[jira] [Updated] (HUDI-4369) Hudi Kafka Connect Sink writing to GCS bucket

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4369: Fix Version/s: 0.14.0 (was: 0.13.1) > Hudi Kafka Connect Sink writing to GCS bucket

[jira] [Updated] (HUDI-4341) HoodieHFileReader is not compatible with Hadoop 3

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4341: Fix Version/s: 0.14.0 (was: 0.13.1) > HoodieHFileReader is not compatible with

[jira] [Updated] (HUDI-4185) Evaluate alternatives to using "hoodie.properties" as state store for Metadata Table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4185: Fix Version/s: 0.14.0 (was: 0.13.1) > Evaluate alternatives to using

[jira] [Updated] (HUDI-4306) ComplexKeyGenerator and ComplexAvroKeyGenerator support non-partitioned table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4306: Fix Version/s: 0.14.0 (was: 0.13.1) > ComplexKeyGenerator and

[jira] [Updated] (HUDI-3940) Lock manager does not increment retry count upon exception

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3940: Fix Version/s: (was: 0.13.1) > Lock manager does not increment retry count upon exception >

[jira] [Updated] (HUDI-3976) Newly introduced HiveSyncConfig config, syncAsSparkDataSourceTable is defaulted as true

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3976: Fix Version/s: 0.14.0 (was: 0.13.1) > Newly introduced HiveSyncConfig config,

[jira] [Updated] (HUDI-4112) Relax constraint in metadata table that rollback of a commit that got archived in MDT throws exception

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4112: Fix Version/s: 0.14.0 (was: 0.13.1) > Relax constraint in metadata table that

[jira] [Updated] (HUDI-3342) MOR Delta Block Rollbacks not applied if Lazy Block reading is disabled

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3342: Fix Version/s: 0.14.0 (was: 0.13.1) > MOR Delta Block Rollbacks not applied if Lazy

[jira] [Updated] (HUDI-4154) Unable to write HUDI Tables to S3 via Flink SQL

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4154: Fix Version/s: 0.14.0 (was: 0.13.1) > Unable to write HUDI Tables to S3 via Flink

[jira] [Updated] (HUDI-3646) The Hudi update syntax should not modify the nullability attribute of a column

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3646: Fix Version/s: 0.14.0 (was: 0.13.1) > The Hudi update syntax should not modify the

[jira] [Updated] (HUDI-3786) how to deduce what MDT partitions to update on the write path w/ async indeing

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3786: Fix Version/s: 0.14.0 (was: 0.13.1) > how to deduce what MDT partitions to update on

[jira] [Updated] (HUDI-3683) Support evolved schema for HFile Reader

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3683: Fix Version/s: 0.14.0 (was: 0.13.1) > Support evolved schema for HFile Reader >

[jira] [Updated] (HUDI-3626) Refactor TableSchemaResolver to remove `includeMetadataFields` flags

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3626: Fix Version/s: 0.14.0 (was: 0.13.1) > Refactor TableSchemaResolver to remove

[jira] [Updated] (HUDI-3603) Support read DateType for hive2/hive3

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3603: Fix Version/s: 0.14.0 (was: 0.13.1) > Support read DateType for hive2/hive3 >

[jira] [Updated] (HUDI-3639) [Incremental] Add Proper Incremental Records FIltering support into Hudi's custom RDD

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3639: Fix Version/s: 0.14.0 (was: 0.13.1) > [Incremental] Add Proper Incremental Records

[jira] [Updated] (HUDI-3887) Spark query can not read the data changes which written by flink after the spark query connection created

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3887: Fix Version/s: 0.14.0 (was: 0.13.1) > Spark query can not read the data changes

[jira] [Updated] (HUDI-3636) Clustering fails due to marker creation failure

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3636: Fix Version/s: 0.14.0 (was: 0.13.1) > Clustering fails due to marker creation

[jira] [Updated] (HUDI-3668) Fix failing unit tests in hudi-integ-test

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3668: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix failing unit tests in hudi-integ-test >

[jira] [Updated] (HUDI-3818) hudi doesn't support bytes column as primary key

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3818: Fix Version/s: 0.14.0 (was: 0.13.1) > hudi doesn't support bytes column as primary

[jira] [Updated] (HUDI-3648) Failed to execute rollback due to HoodieIOException: Could not delete instant

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3648: Fix Version/s: 0.14.0 (was: 0.13.1) > Failed to execute rollback due to

[jira] [Updated] (HUDI-3407) Make sure Restore operation is Not Concurrent w/ Writes in Multi-Writer scenario

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3407: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure Restore operation is Not Concurrent

[jira] [Updated] (HUDI-3487) The global index is enabled regardless of changlog

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3487: Fix Version/s: 0.14.0 (was: 0.13.1) > The global index is enabled regardless of

[jira] [Updated] (HUDI-3467) Check shutdown logic with async compaction in Spark Structured Streaming

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3467: Fix Version/s: 0.14.0 (was: 0.13.1) > Check shutdown logic with async compaction in

[jira] [Updated] (HUDI-3517) Unicode in partition path causes it to be resolved wrongly

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3517: Fix Version/s: 0.14.0 (was: 0.13.1) > Unicode in partition path causes it to be

[jira] [Updated] (HUDI-3300) Timeline server FSViewManager should avoid point lookup for metadata file partition

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3300: Fix Version/s: 0.14.0 (was: 0.13.1) > Timeline server FSViewManager should avoid

[jira] [Updated] (HUDI-3067) "Table already exists" error with multiple writers and dynamodb

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3067: Fix Version/s: 0.14.0 (was: 0.13.1) > "Table already exists" error with multiple

[jira] [Updated] (HUDI-1748) Read operation will possibility fail on mor table rt view when a write operations is concurrency running

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1748: Fix Version/s: 0.14.0 (was: 0.13.1) > Read operation will possibility fail on mor

[jira] [Updated] (HUDI-3117) Kafka Connect can not clearly distinguish every task log

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3117: Fix Version/s: 0.14.0 (was: 0.13.1) > Kafka Connect can not clearly distinguish

[jira] [Updated] (HUDI-3057) Instants should be generated strictly under locks

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3057: Fix Version/s: 0.14.0 (was: 0.13.1) > Instants should be generated strictly under

[jira] [Updated] (HUDI-3023) Fix order of tests

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3023: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix order of tests > -- > >

[jira] [Updated] (HUDI-3055) Make sure that Compression Codec configuration is respected across the board

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3055: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure that Compression Codec configuration

[jira] [Updated] (HUDI-1779) Fail to bootstrap/upsert a table which contains timestamp column

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1779: Fix Version/s: 0.14.0 (was: 0.13.1) > Fail to bootstrap/upsert a table which

[jira] [Updated] (HUDI-3114) Kafka Connect can not connect Hive by jdbc

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3114: Fix Version/s: 0.14.0 (was: 0.13.1) > Kafka Connect can not connect Hive by jdbc >

[jira] [Updated] (HUDI-2930) Rollbacks are not archived when metadata table is enabled

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2930: Fix Version/s: 0.14.0 (was: 0.13.1) > Rollbacks are not archived when metadata table

[jira] [Updated] (HUDI-3019) Upserts with Dataype promotion only to a subset of partition fails

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3019: Fix Version/s: 0.14.0 (was: 0.13.1) > Upserts with Dataype promotion only to a

[jira] [Updated] (HUDI-2782) Fix marker based strategy for structured streaming

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2782: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix marker based strategy for structured

[jira] [Updated] (HUDI-2910) Hudi CLI "commits showarchived" throws NPE

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2910: Fix Version/s: 0.14.0 (was: 0.13.1) > Hudi CLI "commits showarchived" throws NPE >

[jira] [Updated] (HUDI-2745) Record count does not match input after compaction is scheduled when running Hudi Kafka Connect sink

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2745: Fix Version/s: 0.14.0 (was: 0.13.1) > Record count does not match input after

[jira] [Updated] (HUDI-2528) Flaky test: MERGE_ON_READ testTableOperationsWithRestore

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2528: Fix Version/s: 0.14.0 (was: 0.13.1) > Flaky test: MERGE_ON_READ

[jira] [Updated] (HUDI-1889) Support partition path in a nested field in HoodieFileIndex

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1889: Fix Version/s: 0.14.0 (was: 0.13.1) > Support partition path in a nested field in

[GitHub] [hudi] hudi-bot commented on pull request #8783: Archival enhancements

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8783: URL: https://github.com/apache/hudi/pull/8783#issuecomment-1558495944 ## CI report: * 9dbf3aa2367d1d78221a90ee1555188838424909 Azure:

[jira] [Updated] (HUDI-1380) Async cleaning does not work with Timeline Server

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1380: Fix Version/s: 0.14.0 (was: 0.13.1) > Async cleaning does not work with Timeline

[jira] [Updated] (HUDI-1369) Bootstrap datasource jobs from hanging via spark-submit

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1369: Fix Version/s: 0.14.0 (was: 0.13.1) > Bootstrap datasource jobs from hanging via

[jira] [Updated] (HUDI-1117) Add tdunning json library to spark and utilities bundle

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1117: Fix Version/s: 0.14.0 (was: 0.13.1) > Add tdunning json library to spark and

[jira] [Updated] (HUDI-1158) Optimizations in parallelized listing behaviour for markers and bootstrap source files

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1158: Fix Version/s: 0.14.0 (was: 0.13.1) > Optimizations in parallelized listing

[jira] [Updated] (HUDI-1036) HoodieCombineHiveInputFormat not picking up HoodieRealtimeFileSplit

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1036: Fix Version/s: 0.14.0 (was: 0.13.1) > HoodieCombineHiveInputFormat not picking up

[jira] [Updated] (HUDI-1145) Debug if Insert operation calls upsert in case of small file handling path.

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1145: Fix Version/s: 0.14.0 (was: 0.13.1) > Debug if Insert operation calls upsert in case

[jira] [Updated] (HUDI-1286) Merge On Read queries (_rt) fails on docker demo for test suite

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1286: Fix Version/s: 0.14.0 (was: 0.13.1) > Merge On Read queries (_rt) fails on docker

[jira] [Updated] (HUDI-234) Graceful degradation of ObjectSizeCalculator for non hotspot jvms

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-234: --- Fix Version/s: 0.14.0 (was: 0.13.1) > Graceful degradation of ObjectSizeCalculator for

[jira] [Updated] (HUDI-992) For hive-style partitioned source data, partition columns synced with Hive will always have String type

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-992: --- Fix Version/s: 0.14.0 (was: 0.13.1) > For hive-style partitioned source data, partition

[jira] [Updated] (HUDI-83) Map Timestamp type in spark to corresponding Timestamp type in Hive during Hive sync

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-83?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-83: -- Fix Version/s: 0.14.0 (was: 0.13.1) > Map Timestamp type in spark to corresponding

[GitHub] [hudi] hudi-bot commented on pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8076: URL: https://github.com/apache/hudi/pull/8076#issuecomment-1558455680 ## CI report: * 6a239ada8998fd440f19c0082b26d206ed589870 UNKNOWN * 1fadedfb975375bba6571e7ecf51de55d7e8dae2 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8076: URL: https://github.com/apache/hudi/pull/8076#issuecomment-1558451056 ## CI report: * 6a239ada8998fd440f19c0082b26d206ed589870 UNKNOWN * 1fadedfb975375bba6571e7ecf51de55d7e8dae2 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8771: [HUDI-6245] Automatically downgrade table version of metadata table

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8771: URL: https://github.com/apache/hudi/pull/8771#issuecomment-1558443009 ## CI report: * da61d395636ceb467f3ae534ecd34edd109ed7a4 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8452: [HUDI-6077] Add more partition push down filters

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8452: URL: https://github.com/apache/hudi/pull/8452#issuecomment-1558440341 ## CI report: * 8082df232089396b2a9f9be2b915e51b3645f172 UNKNOWN * 197d58ce002e65cbe5969b2193fb0e8dffe7eac2 Azure:

[GitHub] [hudi] boneanxs commented on pull request #8452: [HUDI-6077] Add more partition push down filters

2023-05-22 Thread via GitHub
boneanxs commented on PR #8452: URL: https://github.com/apache/hudi/pull/8452#issuecomment-1558420147 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[GitHub] [hudi] boneanxs commented on a diff in pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
boneanxs commented on code in PR #8076: URL: https://github.com/apache/hudi/pull/8076#discussion_r1201426260 ## hudi-spark-datasource/hudi-spark-common/src/main/java/org/apache/hudi/commit/BaseDatasetBulkInsertCommitActionExecutor.java: ## @@ -0,0 +1,124 @@ +/* + * Licensed to

[GitHub] [hudi] boneanxs commented on a diff in pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
boneanxs commented on code in PR #8076: URL: https://github.com/apache/hudi/pull/8076#discussion_r1201423428 ## hudi-spark-datasource/hudi-spark3.2plus-common/src/main/scala/org/apache/spark/sql/hudi/catalog/HoodieInternalV2Table.scala: ## @@ -106,8 +106,14 @@ private class

[GitHub] [hudi] boneanxs commented on a diff in pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
boneanxs commented on code in PR #8076: URL: https://github.com/apache/hudi/pull/8076#discussion_r1201417934 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestInsertTable.scala: ## @@ -599,138 +582,250 @@ class TestInsertTable extends

[GitHub] [hudi] c-f-cooper opened a new issue, #8784: [SUPPORT]Heartbeat for instant 20230523041356892 has expired, last heartbeat 0

2023-05-22 Thread via GitHub
c-f-cooper opened a new issue, #8784: URL: https://github.com/apache/hudi/issues/8784 **Describe the problem you faced** When i use flink multi-writer write to hudi,the table type is cow,and enabled the async clustering,the flink job always restarted. **Environment

[GitHub] [hudi] hudi-bot commented on pull request #8782: [HUDI-6201] use concurrent map when possible in filesystemview

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8782: URL: https://github.com/apache/hudi/pull/8782#issuecomment-1558310706 ## CI report: * 395e5a0d3310a8b35347c179e886e6831931f516 Azure:

[hudi] branch master updated (bec544a0163 -> b74e6ad2eb9)

2023-05-22 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from bec544a0163 [HUDI-3775] Allow for offline compaction of MOR tables via spark streaming (#7632) add b74e6ad2eb9

[GitHub] [hudi] yihua merged pull request #8779: [HUDI-6247] Add bundle validation for release candidates

2023-05-22 Thread via GitHub
yihua merged PR #8779: URL: https://github.com/apache/hudi/pull/8779 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] hudi-bot commented on pull request #8779: [HUDI-6247] Add bundle validation for release candidates

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8779: URL: https://github.com/apache/hudi/pull/8779#issuecomment-1558254116 ## CI report: * aff465a8e6b11d76be2f9025013ca4b8eaa9c04a UNKNOWN * 87fa4e51b432788ab40dd28203540babea48c258 Azure:

[jira] [Updated] (HUDI-6220) Add HUDI code version to commit files and hoodie.properties

2023-05-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6220: - Labels: pull-request-available (was: ) > Add HUDI code version to commit files and

[GitHub] [hudi] nsivabalan commented on a diff in pull request #8724: [HUDI-6220] Add HUDI code version to commit files and hoodie.properties.

2023-05-22 Thread via GitHub
nsivabalan commented on code in PR #8724: URL: https://github.com/apache/hudi/pull/8724#discussion_r1201317230 ## hudi-common/src/main/java/org/apache/hudi/common/table/HoodieTableConfig.java: ## @@ -310,12 +312,16 @@ private static Properties

[jira] [Closed] (HUDI-3775) Allow for offline compaction of MOR tables via spark streaming

2023-05-22 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler closed HUDI-3775. - Resolution: Fixed > Allow for offline compaction of MOR tables via spark streaming >

[jira] [Commented] (HUDI-5659) Support cleaning for archived files

2023-05-22 Thread clownxc (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17725139#comment-17725139 ] clownxc commented on HUDI-5659: --- I would like to give a try on this, can I take this tickets? > Support

[GitHub] [hudi] rahil-c commented on pull request #8682: [DO NOT MERGE] [HUDI-6198] Run gh actions with Spark 3.4.0

2023-05-22 Thread via GitHub
rahil-c commented on PR #8682: URL: https://github.com/apache/hudi/pull/8682#issuecomment-1558191054 Hi @danny0405 @xiarixiaoyao, we are trying to upgrade spark to 3.4.0 in hudi. However we are facing issues with several functional test failures due to another casting exception. For

[GitHub] [hudi] nsivabalan commented on pull request #8759: Add metrics counters for compaction start/stop events.

2023-05-22 Thread via GitHub
nsivabalan commented on PR #8759: URL: https://github.com/apache/hudi/pull/8759#issuecomment-1558187571 @SteNicholas : I get your intention. we did take a look at the active timeline methods where we do the transition. As of now, HoodieActive timeline is lightweight and does not have much

[GitHub] [hudi] nsivabalan commented on a diff in pull request #8759: Add metrics counters for compaction start/stop events.

2023-05-22 Thread via GitHub
nsivabalan commented on code in PR #8759: URL: https://github.com/apache/hudi/pull/8759#discussion_r1201293956 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/compact/RunCompactionActionExecutor.java: ## @@ -65,10 +73,14 @@ public

[GitHub] [hudi] hudi-bot commented on pull request #8781: [MINOR] disable schema validation in master

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8781: URL: https://github.com/apache/hudi/pull/8781#issuecomment-1558173486 ## CI report: * d26759cbd7fdef9819979804b94db7e019ac7490 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8771: [HUDI-6245] Automatically downgrade table version of metadata table

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8771: URL: https://github.com/apache/hudi/pull/8771#issuecomment-1558173385 ## CI report: * 3647baa5b949c0f12c22d50de63c913169c3c5d9 Azure:

[jira] [Updated] (HUDI-6213) Parallelize deletion of files during rollback.

2023-05-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6213: - Labels: pull-request-available (was: ) > Parallelize deletion of files during rollback. >

[GitHub] [hudi] nsivabalan commented on a diff in pull request #8717: [HUDI-6213] Parallelize deletion of files during rollback.

2023-05-22 Thread via GitHub
nsivabalan commented on code in PR #8717: URL: https://github.com/apache/hudi/pull/8717#discussion_r1201280843 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/rollback/ListingBasedRollbackStrategy.java: ## @@ -194,19 +194,15 @@ private String

[GitHub] [hudi] hudi-bot commented on pull request #8771: [HUDI-6245] Automatically downgrade table version of metadata table

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8771: URL: https://github.com/apache/hudi/pull/8771#issuecomment-1558164950 ## CI report: * 3647baa5b949c0f12c22d50de63c913169c3c5d9 Azure:

[hudi] branch master updated (e9cf0443815 -> bec544a0163)

2023-05-22 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from e9cf0443815 [HUDI-6237] Fix call stats_file_sizes failure error due to empty globRegex for partitioned tables

[GitHub] [hudi] nsivabalan merged pull request #7632: [HUDI-3775] Allow for offline compaction of MOR tables via spark streaming

2023-05-22 Thread via GitHub
nsivabalan merged PR #7632: URL: https://github.com/apache/hudi/pull/7632 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] hudi-bot commented on pull request #8783: Archival enhancements

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8783: URL: https://github.com/apache/hudi/pull/8783#issuecomment-1558110587 ## CI report: * 9dbf3aa2367d1d78221a90ee1555188838424909 Azure:

[GitHub] [hudi] forest455 commented on issue #8746: [SUPPORT]Will it be supported that incremental queries and point in time queries through hive connector and prestoDB ?

2023-05-22 Thread via GitHub
forest455 commented on issue #8746: URL: https://github.com/apache/hudi/issues/8746#issuecomment-1558107559 Thanks for your kindness to tell me that which makes me determined to use hudi other than iceberg. -- This is an automated message from the Apache Git Service. To respond to

[GitHub] [hudi] hudi-bot commented on pull request #8783: Archival enhancements

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8783: URL: https://github.com/apache/hudi/pull/8783#issuecomment-1558101580 ## CI report: * 9dbf3aa2367d1d78221a90ee1555188838424909 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] suryaprasanna opened a new pull request, #8783: Archival enhaceements

2023-05-22 Thread via GitHub
suryaprasanna opened a new pull request, #8783: URL: https://github.com/apache/hudi/pull/8783 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any

[GitHub] [hudi] hudi-bot commented on pull request #8782: [HUDI-6201] use concurrent map when possible in filesystemview

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8782: URL: https://github.com/apache/hudi/pull/8782#issuecomment-1558029561 ## CI report: * 395e5a0d3310a8b35347c179e886e6831931f516 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8779: [HUDI-6247] Add bundle validation for release candidates

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8779: URL: https://github.com/apache/hudi/pull/8779#issuecomment-1558029485 ## CI report: * aff465a8e6b11d76be2f9025013ca4b8eaa9c04a UNKNOWN * 8e57681d55a418092cb4c03716ee11228313af05 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8782: [HUDI-6201] use concurrent map when possible in filesystemview

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8782: URL: https://github.com/apache/hudi/pull/8782#issuecomment-1558017651 ## CI report: * 395e5a0d3310a8b35347c179e886e6831931f516 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #8779: [HUDI-6247] Add bundle validation for release candidates

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8779: URL: https://github.com/apache/hudi/pull/8779#issuecomment-1558017592 ## CI report: * aff465a8e6b11d76be2f9025013ca4b8eaa9c04a UNKNOWN * 86c4d5baf362e91f796108e16b4a38b7c94e5439 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8768: [HUDI-1407] Basic python reader for Hudi

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8768: URL: https://github.com/apache/hudi/pull/8768#issuecomment-1557997531 ## CI report: * 24f16ae491c097a3ce2c5a82819126898e035061 Azure:

[GitHub] [hudi] yihua commented on pull request #8779: [HUDI-6247] Add bundle validation for release candidates

2023-05-22 Thread via GitHub
yihua commented on PR #8779: URL: https://github.com/apache/hudi/pull/8779#issuecomment-1557976981 Java CI passes: https://github.com/apache/hudi/actions/runs/5048657788/jobs/9057118563 Going to disable the flag and land (Azure CI is unaffected). -- This is an automated message from

[jira] [Created] (HUDI-6249) Make maps in HoodieTableFileSystemView concurrent maps

2023-05-22 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-6249: - Summary: Make maps in HoodieTableFileSystemView concurrent maps Key: HUDI-6249 URL: https://issues.apache.org/jira/browse/HUDI-6249 Project: Apache Hudi

[GitHub] [hudi] yihua commented on a diff in pull request #8771: [HUDI-6245] Automatically downgrade table version of metadata table

2023-05-22 Thread via GitHub
yihua commented on code in PR #8771: URL: https://github.com/apache/hudi/pull/8771#discussion_r1201046443 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataWriteUtils.java: ## @@ -0,0 +1,175 @@ +/* + * Licensed to the Apache Software

[jira] [Updated] (HUDI-6201) Timeline server sometimes does not send bootstrap base path for a skeleton file

2023-05-22 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-6201: -- Status: Patch Available (was: In Progress) > Timeline server sometimes does not send bootstrap

[jira] [Updated] (HUDI-6201) Timeline server sometimes does not send bootstrap base path for a skeleton file

2023-05-22 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-6201: -- Status: In Progress (was: Open) > Timeline server sometimes does not send bootstrap base path

[jira] [Assigned] (HUDI-6201) Timeline server sometimes does not send bootstrap base path for a skeleton file

2023-05-22 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler reassigned HUDI-6201: - Assignee: Jonathan Vexler > Timeline server sometimes does not send bootstrap base path

[jira] [Updated] (HUDI-6201) Timeline server sometimes does not send bootstrap base path for a skeleton file

2023-05-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6201: - Labels: pull-request-available (was: ) > Timeline server sometimes does not send bootstrap base

[GitHub] [hudi] jonvex opened a new pull request, #8782: [HUDI-6201] use concurrent map when possible in filesystemview

2023-05-22 Thread via GitHub
jonvex opened a new pull request, #8782: URL: https://github.com/apache/hudi/pull/8782 ### Change Logs Issue presented as missing bootstrap base files when doing upsert. Each partition can be modifying the bootstrap fg map at the same time and mappings were dropped sometimes.

<    1   2   3   4   5   >