[jira] [Updated] (HUDI-7167) Type mismatch issue when Spark writes to Hudi and synchronizes metadata

2023-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7167: - Labels: pull-request-available (was: ) > Type mismatch issue when Spark writes to Hudi

[jira] [Updated] (HUDI-7166) Provide a Procedure to Calculate Column Stats Overlap Degree

2023-11-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7166: - Labels: pull-request-available (was: ) > Provide a Procedure to Calculate Column Stats Over

[jira] [Updated] (HUDI-6980) Spark job stuck after completion, due to some non daemon threads still running

2023-11-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6980: - Labels: pull-request-available (was: ) > Spark job stuck after completion, due to some

[jira] [Updated] (HUDI-7100) Data loss when using insert_overwrite_table with insert.drop.duplicates

2023-11-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7100: - Labels: pull-request-available (was: ) > Data loss when using insert_overwrite_table w

[jira] [Updated] (HUDI-7165) Flink multi writer not close the failed instant heartbeat

2023-11-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7165: - Labels: pull-request-available (was: ) > Flink multi writer not close the failed inst

[jira] [Updated] (HUDI-7163) DeltaStreamer compact failed caused by DateTimeParseException: Text '00000000000001999' could not be parsed

2023-11-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7163: - Labels: pull-request-available (was: ) > DeltaStreamer compact failed caused

[jira] [Updated] (HUDI-7164) Add start time query API in CompletionTimeQueryView

2023-11-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7164: - Labels: pull-request-available (was: ) > Add start time query API in CompletionTimeQueryV

[jira] [Updated] (HUDI-7077) Re-enable tests in TestSparkDataSource

2023-11-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7077: - Labels: pull-request-available (was: ) > Re-enable tests in TestSparkDataSou

[jira] [Updated] (HUDI-7161) Add commit action type and ext ra metadata to write callback on commit message

2023-11-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7161: - Labels: pull-request-available (was: ) > Add commit action type and ext ra metadata to wr

[jira] [Updated] (HUDI-7160) Avro Schema Properties are dropped when adding Hoodie Metadata columns

2023-11-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7160: - Labels: pull-request-available (was: ) > Avro Schema Properties are dropped when adding Hoo

[jira] [Updated] (HUDI-7146) Implement secondary index

2023-11-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7146: - Labels: pull-request-available (was: ) > Implement secondary in

[jira] [Updated] (HUDI-7159) Check the table type between hoodie.properies and table options

2023-11-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7159: - Labels: pull-request-available (was: ) > Check the table type between hoodie.properies and ta

[jira] [Updated] (HUDI-7158) Expose event time field to record event time in commit file

2023-11-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7158: - Labels: pull-request-available (was: ) > Expose event time field to record event time in com

[jira] [Updated] (HUDI-7153) Increasing Kafka minPartitions in Streamer causes corrupted offsets

2023-11-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7153: - Labels: pull-request-available (was: ) > Increasing Kafka minPartitions in Streamer cau

[jira] [Updated] (HUDI-6822) Fix deletes handling in hbase index when partition path is updated

2023-11-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6822: - Labels: pull-request-available (was: ) > Fix deletes handling in hbase index when partition p

[jira] [Updated] (HUDI-7154) Hudi Streamer with row writer enabled hits NPE with empty batch

2023-11-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7154: - Labels: pull-request-available (was: ) > Hudi Streamer with row writer enabled hits NPE w

[jira] [Updated] (HUDI-7150) ExternalSpillableMap support values method

2023-11-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7150: - Labels: pull-request-available (was: ) > ExternalSpillableMap support values met

[jira] [Updated] (HUDI-7149) Add a dbt example project with CDC capability

2023-11-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7149: - Labels: pull-request-available (was: ) > Add a dbt example project with CDC capabil

[jira] [Updated] (HUDI-6207) Files pruning for bucket index table pk filtering queries using Spark SQL

2023-11-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6207: - Labels: pull-request-available (was: ) > Files pruning for bucket index table pk filter

[jira] [Updated] (HUDI-7147) Hudi cdc write throws Unsupported Operation Exception

2023-11-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7147: - Labels: pull-request-available (was: ) > Hudi cdc write throws Unsupported Operation Except

[jira] [Updated] (HUDI-6497) Introduce a new HudiFileSystem & HudiPath abstraction to remove Hadoop from hudi-common

2023-11-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6497: - Labels: pull-request-available (was: ) > Introduce a new HudiFileSystem & HudiPath abst

[jira] [Updated] (HUDI-7114) Fix TestHoodieAWSCredentialsProviderFactory#testGetAWSCredentialsWithInvalidAssumeRole

2023-11-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7114: - Labels: pull-request-available (was: ) > Fix > TestHoodieAWSCredentialsProviderF

[jira] [Updated] (HUDI-7142) Support Custom partitioner in append mode

2023-11-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7142: - Labels: pull-request-available (was: ) > Support Custom partitioner in append m

[jira] [Updated] (HUDI-7140) Prepare 0.14.1 branch

2023-11-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7140: - Labels: pull-request-available (was: ) > Prepare 0.14.1 bra

[jira] [Updated] (HUDI-7139) Fix operation type for bulk insert with row writer in Hudi Streamer

2023-11-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7139: - Labels: pull-request-available (was: ) > Fix operation type for bulk insert with row writer

[jira] [Updated] (HUDI-7138) Fix instantiation issues with ErrorTableWriter and Schema Registry Provider

2023-11-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7138: - Labels: pull-request-available (was: ) > Fix instantiation issues with ErrorTableWriter

[jira] [Updated] (HUDI-7137) Implement bootstrap for new filegroup reader

2023-11-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7137: - Labels: pull-request-available (was: ) > Implement bootstrap for new filegroup rea

[jira] [Updated] (HUDI-7135) Spark reads hudi table error when flink creates the table without preCombine fields by catalog or factory

2023-11-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7135: - Labels: pull-request-available (was: ) > Spark reads hudi table error when flink creates

[jira] [Updated] (HUDI-7136) in the dfs catalog scenario, solve the problem of Primary key definition is missing

2023-11-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7136: - Labels: pull-request-available (was: ) > in the dfs catalog scenario, solve the problem

[jira] [Updated] (HUDI-7133) Improve dbt example for better guidance

2023-11-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7133: - Labels: pull-request-available (was: ) > Improve dbt example for better guida

[jira] [Updated] (HUDI-7023) Support querying without syncing partition metadata to catalog

2023-11-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7023: - Labels: pull-request-available (was: ) > Support querying without syncing partition metadata

[jira] [Updated] (HUDI-7034) Refresh view does not work(due to cache)

2023-11-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7034: - Labels: pull-request-available (was: ) > Refresh view does not work(due to ca

[jira] [Updated] (HUDI-7130) Add support to configure value serializer with JsonKafkaSource

2023-11-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7130: - Labels: pull-request-available (was: ) > Add support to configure value serializer w

[jira] [Updated] (HUDI-7128) DeleteMarkerProcedures support delete in batch mode

2023-11-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7128: - Labels: pull-request-available (was: ) > DeleteMarkerProcedures support delete in batch m

[jira] [Updated] (HUDI-7129) Fail to upgrade from table version 3 to table version 4 using UpgradeOrDowngradeProcedure

2023-11-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7129: - Labels: pull-request-available (was: ) > Fail to upgrade from table version 3 to table versio

[jira] [Updated] (HUDI-7127) Fix closure of Spark context in tests

2023-11-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7127: - Labels: pull-request-available (was: ) > Fix closure of Spark context in te

[jira] [Updated] (HUDI-7127) Fix closure of Spark context in tests

2023-11-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7127: - Labels: pull-request-available (was: ) > Fix closure of Spark context in te

[jira] [Updated] (HUDI-7125) Enable CDC queries for HadoopFsRelation

2023-11-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7125: - Labels: pull-request-available (was: ) > Enable CDC queries for HadoopFsRelat

[jira] [Updated] (HUDI-7123) Improve CI scripts

2023-11-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7123: - Labels: pull-request-available (was: ) > Improve CI scri

[jira] [Updated] (HUDI-7120) Performance improvements in deltastreamer executor code path

2023-11-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7120: - Labels: pull-request-available (was: ) > Performance improvements in deltastreamer executor c

[jira] [Updated] (HUDI-7119) Don't write hoodie.table.precombine.field=ts to hoodie.properties when create a insert table using flink.

2023-11-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7119: - Labels: pull-request-available (was: ) > Don't write hoodie.table.precombine.fie

[jira] [Updated] (HUDI-7118) Conf 'spark.sql.parquet.enableVectorizedReader' does not work properly

2023-11-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7118: - Labels: pull-request-available (was: ) > Conf 'spark.sql.parquet.enableVectorizedRead

[jira] [Updated] (HUDI-7107) Reused MetricsReporter fails to publish metrics in Spark streaming job

2023-11-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7107: - Labels: pull-request-available (was: ) > Reused MetricsReporter fails to publish metrics

[jira] [Updated] (HUDI-7111) Performance regression of spark job which written into simple bucket index table

2023-11-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7111: - Labels: pull-request-available (was: ) > Performance regression of spark job which written i

[jira] [Updated] (HUDI-7116) Add docker image for flink 1.14 and spark 2.4.8

2023-11-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7116: - Labels: pull-request-available (was: ) > Add docker image for flink 1.14 and spark 2.

[jira] [Updated] (HUDI-7115) Add more options for BigQuery Sync

2023-11-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7115: - Labels: pull-request-available (was: ) > Add more options for BigQuery S

[jira] [Updated] (HUDI-7113) Update release scripts and docs for Spark 3.5 support

2023-11-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7113: - Labels: pull-request-available (was: ) > Update release scripts and docs for Spark 3.5 supp

[jira] [Updated] (HUDI-7112) Allow reuse of timeline server across tables

2023-11-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7112: - Labels: pull-request-available (was: ) > Allow reuse of timeline server across tab

[jira] [Updated] (HUDI-7110) Add call procedure for show column stats information

2023-11-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7110: - Labels: pull-request-available (was: ) > Add call procedure for show column stats informat

[jira] [Updated] (HUDI-7109) Fix Flink may re-use a committed instant in append mode

2023-11-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7109: - Labels: pull-request-available (was: ) > Fix Flink may re-use a committed instant in append m

[jira] [Updated] (HUDI-7108) Ensure schema is refreshed for every batch when using KafkaAvroSchemaDeserializer

2023-11-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7108: - Labels: pull-request-available (was: ) > Ensure schema is refreshed for every batch when us

[jira] [Updated] (HUDI-7106) Fix SQS deletes logic for S3 events source.

2023-11-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7106: - Labels: pull-request-available (was: ) > Fix SQS deletes logic for S3 events sou

[jira] [Updated] (HUDI-7105) Add FileSystemViewManager configuable

2023-11-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7105: - Labels: clean pull-request-available (was: clean) > Add FileSystemViewManager configua

[jira] [Updated] (HUDI-7102) A bug for the time travel queries for MOR tables

2023-11-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7102: - Labels: pull-request-available (was: ) > A bug for the time travel queries for MOR tab

[jira] [Updated] (HUDI-7103) Enable Time travel queries for COW

2023-11-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7103: - Labels: pull-request-available (was: ) > Enable Time travel queries for

[jira] [Updated] (HUDI-7099) Providing metrics for archive and defining som string constants

2023-11-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7099: - Labels: pull-request-available (was: ) > Providing metrics for archive and defining som str

[jira] [Updated] (HUDI-7098) Add max bytes per partition w/ cloud store incr source

2023-11-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7098: - Labels: pull-request-available (was: ) > Add max bytes per partition w/ cloud store incr sou

[jira] [Updated] (HUDI-7097) Handle the way hms Uri is instantiated w/ HiveSyncTool

2023-11-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7097: - Labels: pull-request-available (was: ) > Handle the way hms Uri is instantiated w/ HiveSyncT

[jira] [Updated] (HUDI-7096) Improve Incr Query for partitions touched based on start and end

2023-11-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7096: - Labels: pull-request-available (was: ) > Improve Incr Query for partitions touched based

[jira] [Updated] (HUDI-7095) Perf fixes to Json serde

2023-11-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7095: - Labels: pull-request-available (was: ) > Perf fixes to Json se

[jira] [Updated] (HUDI-7094) AlterTableAddColumnCommand/AlterTableChangeColumnCommand miss to update ro/rt table

2023-11-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7094: - Labels: pull-request-available (was: ) > AlterTableAddColumnComm

[jira] [Updated] (HUDI-7092) Release notes for 1.0.0-beta1

2023-11-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7092: - Labels: pull-request-available (was: ) > Release notes for 1.0.0-be

[jira] [Updated] (HUDI-7090) Set maxParallelism for singleton operator ,for example compact_plan_generate、split_monitor、compact_commit

2023-11-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7090: - Labels: pull-request-available (was: ) > Set maxParallelism for singleton operator ,for exam

[jira] [Updated] (HUDI-7089) Add docs for new features in 1.0.0-beta1

2023-11-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7089: - Labels: pull-request-available (was: ) > Add docs for new features in 1.0.0-be

[jira] [Updated] (HUDI-6958) Update Schema Evolution Documentation

2023-11-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6958: - Labels: pull-request-available (was: ) > Update Schema Evolution Documentat

[jira] [Updated] (HUDI-7088) Hudi Spark datasource doesn't convert Avro Logical Type of Local timestamp

2023-11-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7088: - Labels: pull-request-available (was: ) > Hudi Spark datasource doesn't convert Avro Logi

[jira] [Updated] (HUDI-7086) Scale GCS event source to consume large no of msgs from queue

2023-11-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7086: - Labels: pull-request-available (was: ) > Scale GCS event source to consume large no of msgs f

[jira] [Updated] (HUDI-7085) Update release scripts

2023-11-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7085: - Labels: pull-request-available (was: ) > Update release scri

[jira] [Updated] (HUDI-7035) Exception while Reading from CDC table when new partition is added.

2023-11-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7035: - Labels: pull-request-available (was: ) > Exception while Reading from CDC table when

[jira] [Updated] (HUDI-7084) Handle schema retrieval for hudi table w/ empty commits

2023-11-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7084: - Labels: pull-request-available (was: ) > Handle schema retrieval for hudi table w/ empty comm

[jira] [Updated] (HUDI-7083) Support multiple table scraping w/ prometheus reporter

2023-11-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7083: - Labels: pull-request-available (was: ) > Support multiple table scraping w/ prometheus repor

[jira] [Updated] (HUDI-7082) Add Flink 1.14 and Spark 3.13 docker image script

2023-11-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7082: - Labels: pull-request-available (was: ) > Add Flink 1.14 and Spark 3.13 docker image scr

[jira] [Updated] (HUDI-6658) Implement MOR Incremental for new file format

2023-11-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6658: - Labels: pull-request-available (was: ) > Implement MOR Incremental for new file for

[jira] [Updated] (HUDI-6613) New file format does not work with in memory index

2023-11-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6613: - Labels: pull-request-available (was: ) > New file format does not work with in memory in

[jira] [Updated] (HUDI-7079) Disable new file reader for metadata table

2023-11-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7079: - Labels: pull-request-available (was: ) > Disable new file reader for metadata ta

[jira] [Updated] (HUDI-7076) Turn on new features by default through configs for 1.0.0-beta1

2023-11-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7076: - Labels: pull-request-available (was: ) > Turn on new features by default through configs

[jira] [Updated] (HUDI-7073) Fix schema projection in file group reader-based parquet file format

2023-11-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7073: - Labels: pull-request-available (was: ) > Fix schema projection in file group reader-ba

[jira] [Updated] (HUDI-7074) Flink incremental query for non-blocking concurrency control

2023-11-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7074: - Labels: pull-request-available (was: ) > Flink incremental query for non-blocking concurre

[jira] [Updated] (HUDI-7072) remove Flink 1.13

2023-11-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7072: - Labels: pull-request-available (was: ) > remove Flink 1

[jira] [Updated] (HUDI-7071) Compaction/Clustering job not fail when throw HoodieException

2023-11-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7071: - Labels: pull-request-available (was: ) > Compaction/Clustering job not fail when th

[jira] [Updated] (HUDI-7070) Disable partial update for MERGE INTO statement without update action

2023-11-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7070: - Labels: pull-request-available (was: ) > Disable partial update for MERGE INTO statement with

[jira] [Updated] (HUDI-7069) Optimize metaclient construction and include table config in write config for multi-table services.

2023-11-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7069: - Labels: pull-request-available (was: ) > Optimize metaclient construction and include ta

[jira] [Updated] (HUDI-7068) Disable vectorized reader for hoodie filegroup reader when schema isn't supported

2023-11-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7068: - Labels: pull-request-available (was: ) > Disable vectorized reader for hoodie filegroup rea

[jira] [Updated] (HUDI-7067) Add fallback to full update if all fields are updated in MERGE INTO statement

2023-11-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7067: - Labels: pull-request-available (was: ) > Add fallback to full update if all fields are upda

[jira] [Updated] (HUDI-7064) Temp view different reads cause issues with support batch

2023-11-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7064: - Labels: pull-request-available (was: ) > Temp view different reads cause issues with supp

[jira] [Updated] (HUDI-7063) Use existing relation logic for queries reading base files only in Spark

2023-11-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7063: - Labels: pull-request-available (was: ) > Use existing relation logic for queries reading b

[jira] [Updated] (HUDI-7062) Use caching iterator style in new filegroup reader

2023-11-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7062: - Labels: pull-request-available (was: ) > Use caching iterator style in new filegroup rea

[jira] [Updated] (HUDI-7060) Change query type triggering conditions

2023-11-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7060: - Labels: pull-request-available (was: ) > Change query type triggering conditi

[jira] [Updated] (HUDI-7059) Read record positions with filter pushdown using Spark parquet reader

2023-11-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7059: - Labels: pull-request-available (was: ) > Read record positions with filter pushdown using Sp

[jira] [Updated] (HUDI-7058) HoodieBaseFileGroupRecordBuffer doesn't check if option is empty

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7058: - Labels: pull-request-available (was: ) > HoodieBaseFileGroupRecordBuffer doesn't check i

[jira] [Updated] (HUDI-7057) Support CopyToTableProcedure with patitial column copy

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7057: - Labels: pull-request-available sparksql (was: sparksql) > Support CopyToTableProcedure w

[jira] [Updated] (HUDI-7056) Create config for choosing if you want to read using position based merging

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7056: - Labels: pull-request-available (was: ) > Create config for choosing if you want to read us

[jira] [Updated] (HUDI-7055) Support reading only log files in file group reader-based Spark parquet file format

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7055: - Labels: pull-request-available (was: ) > Support reading only log files in file group rea

[jira] [Updated] (HUDI-7054) ShowPartitionsCommand should consider lazy delete_partitions

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7054: - Labels: pull-request-available (was: ) > ShowPartitionsCommand should consider l

[jira] [Updated] (HUDI-7053) Fix the filter push down logic

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7053: - Labels: pull-request-available (was: ) > Fix the filter push down lo

[jira] [Updated] (HUDI-7052) Fix partition key validation for key generators.

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7052: - Labels: pull-request-available (was: ) > Fix partition key validation for key generat

[jira] [Updated] (HUDI-7050) Flink hoodiehivecatalog supports hadoop parameters

2023-11-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7050: - Labels: pull-request-available (was: ) > Flink hoodiehivecatalog supports hadoop paramet

[jira] [Updated] (HUDI-7046) Fix partial merging logic based on projected schema

2023-11-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7046: - Labels: pull-request-available (was: ) > Fix partial merging logic based on projected sch

[jira] [Updated] (HUDI-7049) Implement File System-based Metrics Reporter

2023-11-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7049: - Labels: pull-request-available (was: ) > Implement File System-based Metrics Repor

[jira] [Updated] (HUDI-7048) Fix checkpoint loss issue when changing MOR to COW in streamer

2023-11-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7048: - Labels: pull-request-available (was: ) > Fix checkpoint loss issue when changing MOR to COW

<    3   4   5   6   7   8   9   10   11   12   >