[jira] [Updated] (HUDI-7938) Missed HoodieSparkKryoRegistrar in Hadoop config by default

2024-07-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7938: - Labels: pull-request-available (was: ) > Missed HoodieSparkKryoRegistrar in Hadoop con

[jira] [Updated] (HUDI-7980) Optimize the configuration content when performing clustering with row writer

2024-07-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7980: - Labels: pull-request-available (was: ) > Optimize the configuration content when perform

[jira] [Updated] (HUDI-7976) Fix BUG introduced in HUDI-7955 due to usage of wrong class

2024-07-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7976: - Labels: pull-request-available (was: ) > Fix BUG introduced in HUDI-7955 due to usage of wr

[jira] [Updated] (HUDI-7979) Fix out of the box defaults with spillable memory configs

2024-07-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7979: - Labels: pull-request-available (was: ) > Fix out of the box defaults with spillable mem

[jira] [Updated] (HUDI-7978) Update docs for older versions to state that partitions should be ordered when creating multiple partitions

2024-07-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7978: - Labels: pull-request-available (was: ) > Update docs for older versions to state that partiti

[jira] [Updated] (HUDI-7977) improve bucket index partitioner

2024-07-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7977: - Labels: pull-request-available (was: ) > improve bucket index partitio

[jira] [Updated] (HUDI-7975) Transfer extra metadata to new commits when new data is not ingested to trigger table services on the dataset

2024-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7975: - Labels: pull-request-available (was: ) > Transfer extra metadata to new commits when new d

[jira] [Updated] (HUDI-7974) Create empty clean commit at a cadence and make it configurable

2024-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7974: - Labels: pull-request-available (was: ) > Create empty clean commit at a cadence and m

[jira] [Updated] (HUDI-7970) Add support to read partition fields when partition type is also stored in table config

2024-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7970: - Labels: pull-request-available (was: ) > Add support to read partition fields when partit

[jira] [Updated] (HUDI-7969) Fix data loss caused by concurrent write and clean

2024-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7969: - Labels: pull-request-available (was: ) > Fix data loss caused by concurrent write and cl

[jira] [Updated] (HUDI-7692) Move MDT partition type code in HoodieMetadataPayload to MetadataPartitionType

2024-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7692: - Labels: hudi-1.0.0-beta2 pull-request-available (was: hudi-1.0.0-beta2) > Move MDT partii

[jira] [Updated] (HUDI-7025) Merge Index and Functional Index Config

2024-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7025: - Labels: hudi-1.0.0-beta2 pull-request-available (was: hudi-1.0.0-beta2) > Merge In

[jira] [Updated] (HUDI-7967) Robust handling of spark task failures and retries

2024-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7967: - Labels: RobustWrites pull-request-available (was: RobustWrites) > Robust handling of spark t

[jira] [Updated] (HUDI-7968) RFC for robust handling of spark task failures and retries

2024-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7968: - Labels: RobustWrites pull-request-available (was: RobustWrites) > RFC for robust handl

[jira] [Updated] (HUDI-7962) Add show create table command

2024-07-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7962: - Labels: pull-request-available (was: ) > Add show create table comm

[jira] [Updated] (HUDI-7966) NPE from AvroSchemaUtils.createNewSchemaFromFieldsWithReference

2024-07-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7966: - Labels: pull-request-available (was: ) > NPE f

[jira] [Updated] (HUDI-7965) Clean up SchemaTestUtil code

2024-07-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7965: - Labels: pull-request-available (was: ) > Clean up SchemaTestUtil c

[jira] [Updated] (HUDI-7963) Avoid generating RLI records when disabled w/ MDT

2024-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7963: - Labels: pull-request-available (was: ) > Avoid generating RLI records when disabled w/

[jira] [Updated] (HUDI-7961) Optimize UpsertPartitioner for prepped write operations

2024-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7961: - Labels: pull-request-available (was: ) > Optimize UpsertPartitioner for prepped write operati

[jira] [Updated] (HUDI-7958) Create partition stats index for all columns when no columns specified

2024-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7958: - Labels: pull-request-available (was: ) > Create partition stats index for all columns w

[jira] [Updated] (HUDI-7957) data skew when writing with bulk_insert + bucket_index enabled

2024-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7957: - Labels: pull-request-available (was: ) > data skew when writing with bulk_insert + bucket_in

[jira] [Updated] (HUDI-7955) Account for WritableTimestampObjectInspector#getPrimitiveJavaObject Hive3 and Hive2 discrepancies

2024-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7955: - Labels: pull-request-available (was: ) > Account for WritableTimestampObjectInspec

[jira] [Updated] (HUDI-7954) Fix data skipping with secondary index when there are no log files

2024-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7954: - Labels: pull-request-available (was: ) > Fix data skipping with secondary index w

[jira] [Updated] (HUDI-7953) Improved the variable naming and formatting of HoodieActiveTimeline and HoodieIndex

2024-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7953: - Labels: pull-request-available (was: ) > Improved the variable naming and formatt

[jira] [Updated] (HUDI-6510) Java 17 compile time support

2024-07-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6510: - Labels: pull-request-available (was: ) > Java 17 compile time supp

[jira] [Updated] (HUDI-7929) Add Flink Hudi Example for K8s

2024-07-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7929: - Labels: pull-request-available (was: ) > Add Flink Hudi Example for

[jira] [Updated] (HUDI-7949) insert into hudi table with columns specified(reordered and not in table schema order) throws exception

2024-07-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7949: - Labels: pull-request-available (was: ) > insert into hudi table with columns specified(reorde

[jira] [Updated] (HUDI-7937) Fix handling of decimals in StreamSync and Clustering

2024-07-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7937: - Labels: pull-request-available (was: ) > Fix handling of decimals in StreamSync and Cluster

[jira] [Updated] (HUDI-7951) Classes using avro causing conflict in hudi-aws-bundle

2024-07-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7951: - Labels: pull-request-available (was: ) > Classes using avro causing conflict in hudi-aws-bun

[jira] [Updated] (HUDI-7950) Shade roaring bitmap dependency in root POM

2024-07-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7950: - Labels: pull-request-available (was: ) > Shade roaring bitmap dependency in root

[jira] [Updated] (HUDI-7941) add show_file_status procedure

2024-07-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7941: - Labels: pull-request-available (was: ) > add show_file_status proced

[jira] [Updated] (HUDI-7948) RFC-80: Support column families for wide tables

2024-07-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7948: - Labels: pull-request-available (was: ) > RFC-80: Support column families for wide tab

[jira] [Updated] (HUDI-7943) Resolve version conflict of fasterxml on spark3.2

2024-07-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7943: - Labels: pull-request-available (was: ) > Resolve version conflict of fasterxml on spark

[jira] [Updated] (HUDI-7883) Ensure 1.x commit instants are readable w/ 0.16.0

2024-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7883: - Labels: pull-request-available (was: ) > Ensure 1.x commit instants are readable w/ 0.1

[jira] [Updated] (HUDI-7945) Fix file pruning using PARTITION_STATS index in Spark

2024-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7945: - Labels: pull-request-available (was: ) > Fix file pruning using PARTITION_STATS index in Sp

[jira] [Updated] (HUDI-7940) Pass metrics to ErrorTableWriter to be able to emit metrics for Error Table

2024-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7940: - Labels: pull-request-available (was: ) > Pass metrics to ErrorTableWriter to be able to e

[jira] [Updated] (HUDI-7882) Umbrella ticket to track all changes required to support reading 1.x tables with 0.16.0

2024-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7882: - Labels: pull-request-available (was: ) > Umbrella ticket to track all changes requi

[jira] [Updated] (HUDI-7905) Use cluster action for clustering pending instants

2024-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7905: - Labels: pull-request-available (was: ) > Use cluster action for clustering pending insta

[jira] [Updated] (HUDI-7859) Rename instant files to be consistent with 0.x naming format

2024-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7859: - Labels: pull-request-available (was: ) > Rename instant files to be consistent with 0.x nam

[jira] [Updated] (HUDI-7915) Spark 4 support

2024-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7915: - Labels: pull-request-available (was: ) > Spark 4 supp

[jira] [Updated] (HUDI-4822) Extract the baseFile and logFiles from HoodieDeltaWriteStat in the right way

2024-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4822: - Labels: pull-request-available (was: ) > Extract the baseFile and logFiles f

[jira] [Updated] (HUDI-7903) Partition Stats Index not getting created with SQL

2024-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7903: - Labels: pull-request-available (was: ) > Partition Stats Index not getting created with

[jira] [Updated] (HUDI-7926) dataskipping failure mode should be strict in test

2024-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7926: - Labels: pull-request-available (was: ) > dataskipping failure mode should be strict in t

[jira] [Updated] (HUDI-7709) Class Cast Exception while reading the data using TimestampBasedKeyGenerator

2024-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7709: - Labels: pull-request-available (was: ) > Class Cast Exception while reading the data us

[jira] [Updated] (HUDI-7924) Capture Latency and Failure Metrics For Hive Table recreation

2024-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7924: - Labels: pull-request-available (was: ) > Capture Latency and Failure Metrics For Hive Ta

[jira] [Updated] (HUDI-7922) Add Hudi CLI bundle for Scala 2.13

2024-06-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7922: - Labels: pull-request-available (was: ) > Add Hudi CLI bundle for Scala 2

[jira] [Updated] (HUDI-7921) Chase down memory leaks in Writeclient with MDT enabled

2024-06-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7921: - Labels: pull-request-available (was: ) > Chase down memory leaks in Writeclient with MDT enab

[jira] [Updated] (HUDI-7911) Enable cdc log for MOR table

2024-06-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7911: - Labels: pull-request-available (was: ) > Enable cdc log for MOR ta

[jira] [Updated] (HUDI-7920) Make Spark 3.5 the default build profile for Spark integration

2024-06-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7920: - Labels: pull-request-available (was: ) > Make Spark 3.5 the default build profile for Sp

[jira] [Updated] (HUDI-7914) Incorrect schema produced in DELETE_PARTITION replacecommit

2024-06-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7914: - Labels: pull-request-available (was: ) > Incorrect schema produced in DELETE_PARTIT

[jira] [Updated] (HUDI-7909) Add Comment to the FieldSchema returned by Aws Glue Client

2024-06-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7909: - Labels: pull-request-available (was: ) > Add Comment to the FieldSchema returned by Aws G

[jira] [Updated] (HUDI-7906) improve the parallelism deduce in rdd write

2024-06-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7906: - Labels: pull-request-available (was: ) > improve the parallelism deduce in rdd wr

[jira] [Updated] (HUDI-7877) Add record position to record index metadata payload

2024-06-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7877: - Labels: pull-request-available (was: ) > Add record position to record index metadata payl

[jira] [Updated] (HUDI-7892) Building workload support set parallelism

2024-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7892: - Labels: pull-request-available (was: ) > Building workload support set parallel

[jira] [Updated] (HUDI-7891) Fix HoodieActiveTimeline#deleteCompletedRollback missing check for Action type

2024-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7891: - Labels: pull-request-available (was: ) > Fix HoodieActiveTimeline#deleteCompletedRollb

[jira] [Updated] (HUDI-7881) Handle table base path changes in meta syncs.

2024-06-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7881: - Labels: pull-request-available (was: ) > Handle table base path changes in meta sy

[jira] [Updated] (HUDI-7880) Support extraMetadata in Spark SQL Insert Into

2024-06-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7880: - Labels: pull-request-available (was: ) > Support extraMetadata in Spark SQL Ins

[jira] [Updated] (HUDI-7879) Optimize the redundant creation of HoodieTable in DataSourceInternalWriterHelper and the unnecessary parameters in createTable within BaseHoodieWriteClient.

2024-06-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7879: - Labels: pull-request-available (was: ) > Optimize the redundant creation of HoodieTa

[jira] [Updated] (HUDI-7876) Use TypedProperties to store the spillable map configs for the FG reader

2024-06-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7876: - Labels: pull-request-available (was: ) > Use TypedProperties to store the spillable map conf

[jira] [Updated] (HUDI-7874) Fail to read 2-level structure Parquet

2024-06-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7874: - Labels: pull-request-available (was: ) > Fail to read 2-level structure Parq

[jira] [Updated] (HUDI-7875) Remove tablePath from HoodieFileGroupReader

2024-06-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7875: - Labels: pull-request-available (was: ) > Remove tablePath from HoodieFileGroupRea

[jira] [Updated] (HUDI-7873) Remove getStorage method from HoodieReaderContext

2024-06-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7873: - Labels: pull-request-available (was: ) > Remove getStorage method from HoodieReaderCont

[jira] [Updated] (HUDI-7872) Recreate Glue table on certain types of exceptions

2024-06-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7872: - Labels: pull-request-available (was: ) > Recreate Glue table on certain types of excepti

[jira] [Updated] (HUDI-7871) Remove tableconfig from HoodieFilegroupReader params

2024-06-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7871: - Labels: pull-request-available (was: ) > Remove tableconfig from HoodieFilegroupReader par

[jira] [Updated] (HUDI-7867) Data deduplication caused by drawback in the delete invalid files before commit

2024-06-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7867: - Labels: pull-request-available (was: ) > Data deduplication caused by drawback in the del

[jira] [Updated] (HUDI-7838) Use Config hoodie.schema.cache.enable in HoodieBaseFileGroupRecordBuffer and AbstractHoodieLogRecordReader

2024-06-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7838: - Labels: pull-request-available (was: ) > Use Config hoodie.schema.cache.ena

[jira] [Updated] (HUDI-7671) Make Hudi timeline backward compatible

2024-06-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7671: - Labels: compatibility pull-request-available (was: compatibility) > Make Hudi timeline backw

[jira] [Updated] (HUDI-7869) Ensure properties are copied when modifying schema

2024-06-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7869: - Labels: pull-request-available (was: ) > Ensure properties are copied when modifying sch

[jira] [Updated] (HUDI-7779) Guarding archival to not archive unintended commits

2024-06-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7779: - Labels: pull-request-available (was: ) > Guarding archival to not archive unintended comm

[jira] [Updated] (HUDI-7847) Infer record merge mode during table upgrade

2024-06-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7847: - Labels: pull-request-available (was: ) > Infer record merge mode during table upgr

[jira] [Updated] (HUDI-7841) RLI and secondary index should consider only pruned partitions for file skipping

2024-06-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7841: - Labels: pull-request-available (was: ) > RLI and secondary index should consider only pru

[jira] [Updated] (HUDI-7855) Add ability to dynamically configure write parallelism for BULK_INSERT for HoodieStreamer

2024-06-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7855: - Labels: pull-request-available (was: ) > Add ability to dynamically configure write parallel

[jira] [Updated] (HUDI-7854) Bump AWS SDK v2 version to 2.25.69

2024-06-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7854: - Labels: pull-request-available (was: ) > Bump AWS SDK v2 version to 2.25

[jira] [Updated] (HUDI-7853) Fix missing serDe properties post migration from hiveSync to glueSync

2024-06-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7853: - Labels: pull-request-available (was: ) > Fix missing serDe properties post migration f

[jira] [Updated] (HUDI-7852) Constrain the comparison of different types of ordering values to limited cases

2024-06-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7852: - Labels: pull-request-available (was: ) > Constrain the comparison of different types of order

[jira] [Updated] (HUDI-7849) Reduce time spent on running testFiltersInFileFormat

2024-06-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7849: - Labels: pull-request-available (was: ) > Reduce time spent on running testFiltersInFileFor

[jira] [Updated] (HUDI-7851) Fix java doc of DeltaWriteProfile

2024-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7851: - Labels: pull-request-available (was: ) > Fix java doc of DeltaWriteProf

[jira] [Updated] (HUDI-7846) Bump apache-rat-plugin to 0.16.1 to eliminate thread-safe warning in maven parallel build

2024-06-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7846: - Labels: pull-request-available (was: ) > Bump apache-rat-plugin to 0.16.1 to eliminate thr

[jira] [Updated] (HUDI-7845) Call show_fsview_latest Procedure support path_regex

2024-06-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7845: - Labels: pull-request-available (was: ) > Call show_fsview_latest Procedure support path_re

[jira] [Updated] (HUDI-7844) Fix HoodieSparkSqlTestBase to throw error upon test failure

2024-06-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7844: - Labels: pull-request-available (was: ) > Fix HoodieSparkSqlTestBase to throw error upon t

[jira] [Updated] (HUDI-7390) [Regression] HoodieStreamer no longer works without --props being supplied

2024-06-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7390: - Labels: pull-request-available (was: ) > [Regression] HoodieStreamer no longer works with

[jira] [Updated] (HUDI-7840) Add position merging back to file group reader

2024-06-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7840: - Labels: pull-request-available (was: ) > Add position merging back to file group rea

[jira] [Updated] (HUDI-7834) Setup table versions to differentiate HUDI 0.16.x and 1.0-beta versions

2024-06-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7834: - Labels: pull-request-available (was: ) > Setup table versions to differentiate HUDI 0.1

[jira] [Updated] (HUDI-7830) Use predicate when calculating snapshot checkpoints.

2024-06-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7830: - Labels: pull-request-available source (was: source) > Use predicate when calculating snaps

[jira] [Updated] (HUDI-7414) Remove hoodie.gcp.bigquery.sync.base_path reference in the gcp docs

2024-06-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7414: - Labels: pull-request-available (was: ) > Remove hoodie.gcp.bigquery.sync.base_path refere

[jira] [Updated] (HUDI-7828) Support Flink 1.18.1

2024-06-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7828: - Labels: pull-request-available (was: ) > Support Flink 1.1

[jira] [Updated] (HUDI-7782) Task not serializable due to DynamoDBBasedLockProvider and HiveMetastoreBasedLockProvider in clean action

2024-06-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7782: - Labels: pull-request-available (was: ) > Task not serializable due to DynamoDBBasedLockProvi

(hudi) branch dependabot/maven/io.airlift-aircompressor-0.27 deleted (was 5042e73eb65)

2024-06-03 Thread github-bot
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/io.airlift-aircompressor-0.27 in repository https://gitbox.apache.org/repos/asf/hudi.git was 5042e73eb65 Bump io.airlift:aircompressor from 0.25 to 0.27 The revisions

[jira] [Updated] (HUDI-7747) In MetaClient remove getBasePathV2() and return StoragePath from getBasePath()

2024-06-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7747: - Labels: pull-request-available (was: ) > In MetaClient remove getBasePathV2() and ret

[jira] [Updated] (HUDI-7826) hoodie.write.set.null.for.missing.columns results in invalid objects

2024-06-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7826: - Labels: pull-request-available (was: ) > hoodie.write.set.null.for.missing.columns resu

(hudi) branch dependabot/maven/io.airlift-aircompressor-0.27 created (now 5042e73eb65)

2024-06-02 Thread github-bot
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/io.airlift-aircompressor-0.27 in repository https://gitbox.apache.org/repos/asf/hudi.git at 5042e73eb65 Bump io.airlift:aircompressor from 0.25 to 0.27 No new

[jira] [Updated] (HUDI-7825) Support Report pending clustering and compaction plan metric

2024-06-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7825: - Labels: pull-request-available (was: ) > Support Report pending clustering and compaction p

[jira] [Updated] (HUDI-7824) Fix incremental partitions fetch logic when savepoint is removed for Incr cleaner

2024-05-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7824: - Labels: pull-request-available (was: ) > Fix incremental partitions fetch logic when savepo

[jira] [Updated] (HUDI-7823) Simplify dependency management on exclusions

2024-05-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7823: - Labels: pull-request-available (was: ) > Simplify dependency management on exclusi

[jira] [Updated] (HUDI-7822) Resolve the conflicts between mixed hdfs and local path in Flink tests

2024-05-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7822: - Labels: pull-request-available (was: ) > Resolve the conflicts between mixed hdfs and local p

[jira] [Updated] (HUDI-7821) Handle schema evolution in proto to avro conversion

2024-05-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7821: - Labels: pull-request-available (was: ) > Handle schema evolution in proto to avro convers

[jira] [Updated] (HUDI-7819) Fix OptionsResolver#allowCommitOnEmptyBatch default value bug

2024-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7819: - Labels: pull-request-available (was: ) > Fix OptionsResolver#allowCommitOnEmptyBatch defa

[jira] [Updated] (HUDI-7817) Use Jackson Core instead of org.codehaus.jackson for JSON encoding

2024-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7817: - Labels: pull-request-available (was: ) > Use Jackson Core instead of org.codehaus.jack

[jira] [Updated] (HUDI-7816) Pass the source profile to the snapshot query splitter

2024-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7816: - Labels: pull-request-available (was: ) > Pass the source profile to the snapshot query split

[jira] [Updated] (HUDI-7815) Multiple writer with bulkinsert getAllPendingClusteringPlans should refresh timeline

2024-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7815: - Labels: pull-request-available (was: ) > Multiple writer with bulkins
