[jira] [Updated] (HUDI-7559) Fix issues with functional index (on column stats) based pruning

2024-04-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7559: - Labels: pull-request-available (was: ) > Fix issues with functional index (on column st

[jira] [Updated] (HUDI-7557) NoSuchElementException when commit corresponding to savepoint has been removed or archived

2024-04-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7557: - Labels: pull-request-available (was: ) > NoSuchElementException when commit correspond

[jira] [Updated] (HUDI-7552) Remove the suffix for MDT table service instants

2024-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7552: - Labels: pull-request-available (was: ) > Remove the suffix for MDT table service insta

[jira] [Updated] (HUDI-7526) Fix constructors for all bulk insert sort partitioners to ensure we could use it as user defined partitioners

2024-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7526: - Labels: pull-request-available (was: ) > Fix constructors for all bulk insert sort partition

[jira] [Updated] (HUDI-6884) hudi-cli should generate correct HoodieTimeGeneratorConfig

2024-03-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6884: - Labels: pull-request-available (was: ) > hudi-cli should generate corr

[jira] [Updated] (HUDI-7556) Fix MDT validator to account for additional partitions in MDT

2024-03-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7556: - Labels: pull-request-available (was: ) > Fix MDT validator to account for additional partiti

[jira] [Updated] (HUDI-6538) Refactor methods in TimelineDiffHelper class

2024-03-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6538: - Labels: pull-request-available (was: ) > Refactor methods in TimelineDiffHelper cl

[jira] [Updated] (HUDI-7551) Avoid loading all partitions into memory for cleaner planner

2024-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7551: - Labels: pull-request-available (was: ) > Avoid loading all partitions into memory for clea

[jira] [Updated] (HUDI-2458) Relax compaction in metadata being fenced based on inflight requests in data table

2024-03-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-2458: - Labels: pull-request-available (was: ) > Relax compaction in metadata being fenced ba

[jira] [Updated] (HUDI-7531) Consider pending clustering when scheduling a new clustering plan

2024-03-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7531: - Labels: pull-request-available (was: ) > Consider pending clustering when scheduling a

[jira] [Updated] (HUDI-7549) Data inconsistency issue w/ spurious log block detection

2024-03-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7549: - Labels: pull-request-available (was: ) > Data inconsistency issue w/ spurious log bl

(hudi) branch dependabot/maven/packaging/hudi-cli-bundle/org.apache.commons-commons-configuration2-2.10.1 created (now 882fb5ceed4)

2024-03-24 Thread github-bot
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/packaging/hudi-cli-bundle/org.apache.commons-commons-configuration2-2.10.1 in repository https://gitbox.apache.org/repos/asf/hudi.git at 882fb5ceed4 Bump

[jira] [Updated] (HUDI-7535) Add metrics for source parallelism for Kafka and S3/GCS sources

2024-03-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7535: - Labels: pull-request-available (was: ) > Add metrics for source parallelism for Kafka and S3/

[jira] [Updated] (HUDI-7534) Refactoring of handleUpdate in CommitActionExecutors and HoodieTables

2024-03-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7534: - Labels: pull-request-available (was: ) > Refactoring of handleUpdate in CommitActionExecut

[jira] [Updated] (HUDI-7532) Fix schedule compact to only consider DCs after last compaction commit

2024-03-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7532: - Labels: pull-request-available (was: ) > Fix schedule compact to only consider DCs after l

[jira] [Updated] (HUDI-7518) Fix HoodieMetadataPayload merging logic around repeated deletes

2024-03-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7518: - Labels: pull-request-available (was: ) > Fix HoodieMetadataPayload merging logic around repea

[jira] [Updated] (HUDI-7530) Refactoring of handleUpdateInternal in CommitActionExecutors and HoodieTables

2024-03-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7530: - Labels: pull-request-available (was: ) > Refactoring of handleUpdateInter

[jira] [Updated] (HUDI-7487) Investigate flaky test in MERGE INTO

2024-03-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7487: - Labels: pull-request-available (was: ) > Investigate flaky test in ME

[jira] [Updated] (HUDI-7528) Fix RowCustomColumnsSortPartitioner to use repartition instead of coalesce

2024-03-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7528: - Labels: pull-request-available (was: ) > Fix RowCustomColumnsSortPartitioner to use repartit

[jira] [Updated] (HUDI-7525) Prevent dag trigger in mappartitions if possible

2024-03-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7525: - Labels: pull-request-available (was: ) > Prevent dag trigger in mappartitions if possi

[jira] [Updated] (HUDI-7524) Ensure existing hoodie.properties are not overwritten with HoodieTableMetaClient creation

2024-03-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7524: - Labels: pull-request-available (was: ) > Ensure existing hoodie.properties are not overwrit

[jira] [Updated] (HUDI-7523) Add HOODIE_SPARK_DATASOURCE_OPTIONS to be used in HoodieIncrSource

2024-03-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7523: - Labels: pull-request-available (was: ) > Add HOODIE_SPARK_DATASOURCE_OPTIONS to be u

[jira] [Updated] (HUDI-7522) Delete the bucket index partition when bucket id multiple to confirm next write success

2024-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7522: - Labels: pull-request-available (was: ) > Delete the bucket index partition when bucket

[jira] [Updated] (HUDI-7517) Add ability to reset the checkpoint for kafka source

2024-03-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7517: - Labels: pull-request-available (was: ) > Add ability to reset the checkpoint for kafka sou

[jira] [Updated] (HUDI-7516) Put jdbc-h2 creds into static variables for hudi-utilities tests

2024-03-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7516: - Labels: pull-request-available (was: ) > Put jdbc-h2 creds into static variables for h

[jira] [Updated] (HUDI-7515) Fix partition metadata write failure

2024-03-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7515: - Labels: pull-request-available (was: ) > Fix partition metadata write fail

[jira] [Updated] (HUDI-7514) Update Manifest file after the parquet writer closed in LSMTimelineWriter

2024-03-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7514: - Labels: pull-request-available (was: ) > Update Manifest file after the parquet writer clo

[jira] [Updated] (HUDI-7513) Add jackson-module-scala to spark bundle

2024-03-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7513: - Labels: pull-request-available (was: ) > Add jackson-module-scala to spark bun

[jira] [Updated] (HUDI-7512) Support sorting of input records in insert operation

2024-03-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7512: - Labels: pull-request-available (was: ) > Support sorting of input records in insert operat

[jira] [Updated] (HUDI-7511) Offset range calculation in kafka should return all topic partitions

2024-03-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7511: - Labels: pull-request-available (was: ) > Offset range calculation in kafka should return

[jira] [Updated] (HUDI-7510) Loosen the compaction scheduling and rollback check for MDT

2024-03-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7510: - Labels: pull-request-available (was: ) > Loosen the compaction scheduling and rollback ch

[jira] [Updated] (HUDI-7508) Avoid converting iterator to list HoodieStreamerUtils.createHoodieRecords

2024-03-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7508: - Labels: pull-request-available (was: ) > Avoid converting iterator to l

[jira] [Updated] (HUDI-7421) Build HoodieDeltaWriteStat using CommitMetadataUtils#getHoodieDeltaWriteStatFromPreviousStat

2024-03-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7421: - Labels: pull-request-available (was: ) > Build HoodieDeltaWriteStat using > CommitMetadat

[jira] [Updated] (HUDI-7506) Compute offsetRanges based on eventsPerPartition allocated in each range

2024-03-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7506: - Labels: pull-request-available (was: ) > Compute offsetRanges based on eventsPerPartit

[jira] [Updated] (HUDI-7187) Fix integ test props to honor new streamer properties

2024-03-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7187: - Labels: pull-request-available (was: ) > Fix integ test props to honor new streamer propert

[jira] [Updated] (HUDI-7504) Replace expensive file existance check (in object store) with spark options

2024-03-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7504: - Labels: pull-request-available (was: ) > Replace expensive file existance check (in object st

[jira] [Updated] (HUDI-7502) Reorganize content in the "how to" sections in the side bar

2024-03-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7502: - Labels: pull-request-available (was: ) > Reorganize content in the "how to" section

[jira] [Updated] (HUDI-7501) Use source profile for S3 and GCS sources

2024-03-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7501: - Labels: pull-request-available (was: ) > Use source profile for S3 and GCS sour

[jira] [Updated] (HUDI-7480) initializeFunctionalIndexPartition is called multiple times

2024-03-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7480: - Labels: pull-request-available (was: ) > initializeFunctionalIndexPartition is called multi

[jira] [Updated] (HUDI-7500) SchemaProvider not deduced for some deltastreamer scenarios

2024-03-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7500: - Labels: pull-request-available (was: ) > SchemaProvider not deduced for some deltastrea

[jira] [Updated] (HUDI-7499) Support PrecombineAbsoluteGreaterPayload for Hudi

2024-03-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7499: - Labels: pull-request-available (was: ) > Support PrecombineAbsoluteGreaterPayload for H

[jira] [Updated] (HUDI-7493) Clean configuration for clean service

2024-03-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7493: - Labels: pull-request-available (was: ) > Clean configuration for clean serv

[jira] [Updated] (HUDI-7498) Fix schema for HoodieTimestampAwareParquetInputFormat

2024-03-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7498: - Labels: pull-request-available (was: ) > Fix schema for HoodieTimestampAwareParquetInputFor

[jira] [Updated] (HUDI-7497) Add a global timeline mingled with active and archived instants

2024-03-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7497: - Labels: pull-request-available (was: ) > Add a global timeline mingled with active and archi

(hudi) branch dependabot/maven/hudi-platform-service/hudi-metaserver/hudi-metaserver-server/org.mybatis-mybatis-3.5.6 deleted (was 695458d1c65)

2024-03-09 Thread github-bot
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/hudi-platform-service/hudi-metaserver/hudi-metaserver-server/org.mybatis-mybatis-3.5.6 in repository https://gitbox.apache.org/repos/asf/hudi.git was 695458d1c65 Bump

[jira] [Updated] (HUDI-7496) Bump mybatis from 3.4.6 to 3.5.6 in /hudi-platform-service/hudi-metaserver/hudi-metaserver-server

2024-03-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7496: - Labels: pull-request-available (was: ) > Bump mybatis from 3.4.6 to 3.5.6 in > /hudi-pl

(hudi) branch dependabot/maven/org.xerial.snappy-snappy-java-1.1.10.1 deleted (was 07147166d45)

2024-03-09 Thread github-bot
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/org.xerial.snappy-snappy-java-1.1.10.1 in repository https://gitbox.apache.org/repos/asf/hudi.git was 07147166d45 Bump snappy-java from 1.1.8.3 to 1.1.10.1

(hudi) branch dependabot/maven/hudi-platform-service/hudi-metaserver/hudi-metaserver-server/mysql-mysql-connector-java-8.0.28 deleted (was eb0f8809705)

2024-03-09 Thread github-bot
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/hudi-platform-service/hudi-metaserver/hudi-metaserver-server/mysql-mysql-connector-java-8.0.28 in repository https://gitbox.apache.org/repos/asf/hudi.git

[jira] [Updated] (HUDI-7495) Bump mysql-connector-java from 8.0.22 to 8.0.28 in /hudi-platform-service/hudi-metaserver/hudi-metaserver-server

2024-03-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7495: - Labels: pull-request-available (was: ) > Bump mysql-connector-java from 8.0.22 to 8.0

(hudi) branch dependabot/maven/packaging/hudi-cli-bundle/com.google.code.gson-gson-2.8.9 deleted (was c1b20713d7a)

2024-03-09 Thread github-bot
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch dependabot/maven/packaging/hudi-cli-bundle/com.google.code.gson-gson-2.8.9 in repository https://gitbox.apache.org/repos/asf/hudi.git was c1b20713d7a Bump gson from 2.6.2 to 2.8.9

[jira] [Updated] (HUDI-7494) multi writer sync partition to glue will missing some partitions

2024-03-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7494: - Labels: pull-request-available (was: ) > multi writer sync partition to glue will missing s

[jira] [Updated] (HUDI-7492) When using Flinkcatalog to create hudi multiple partitions or multiple primary keys, the keygenerator generation is incorrect

2024-03-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7492: - Labels: pull-request-available (was: ) > When using Flinkcatalog to create hudi multi

[jira] [Updated] (HUDI-6037) Improve compaction docs

2024-03-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6037: - Labels: docs pull-request-available (was: docs) > Improve compaction d

[jira] [Updated] (HUDI-7491) Handle null extra metadata w/ clean commit metadata

2024-03-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7491: - Labels: pull-request-available (was: ) > Handle null extra metadata w/ clean commit metad

[jira] [Updated] (HUDI-7489) Row writer clustering collects write statuses on the driver

2024-03-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7489: - Labels: pull-request-available (was: ) > Row writer clustering collects write statu

[jira] [Updated] (HUDI-7466) AWS Glue sync

2024-03-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7466: - Labels: pull-request-available (was: ) > AWS Glue sync > - > >

[jira] [Updated] (HUDI-7488) The BigQuerySyncTool can't work well when the hudi table schema changed

2024-03-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7488: - Labels: pull-request-available (was: ) > The BigQuerySyncTool can't work well when the h

[jira] [Updated] (HUDI-7486) Classify exceptions as schema exceptions when converting from avro to spark row format

2024-03-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7486: - Labels: pull-request-available (was: ) > Classify exceptions as schema exceptions w

[jira] [Updated] (HUDI-7482) Update schema evolution docs to explicitly state allowed type promotions

2024-03-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7482: - Labels: pull-request-available (was: ) > Update schema evolution docs to explicitly st

[jira] [Updated] (HUDI-7475) Disable ITs in hudi-aws module

2024-03-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7475: - Labels: pull-request-available (was: ) > Disable ITs in hudi-aws mod

[jira] [Updated] (HUDI-7478) Fix max delta commits guard check w/ MDT

2024-03-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7478: - Labels: pull-request-available (was: ) > Fix max delta commits guard check w/

[jira] [Updated] (HUDI-7479) SQL confs don't propagate to spark row writer

2024-03-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7479: - Labels: pull-request-available (was: ) > SQL confs don't propagate to spark row wri

[jira] [Updated] (HUDI-6947) Clean up HoodieSparkSqlWriter.deduceWriterSchema

2024-03-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6947: - Labels: pull-request-available (was: ) > Clean up HoodieSparkSqlWriter.deduceWriterSch

[jira] [Updated] (HUDI-7476) Incremental loading for archived timeline

2024-03-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7476: - Labels: pull-request-available (was: ) > Incremental loading for archived timel

[jira] [Updated] (HUDI-7473) Rebalance CI

2024-03-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7473: - Labels: pull-request-available (was: ) > Rebalance CI > > >

[jira] [Updated] (HUDI-7472) Creating a functional index implicitly drops metadata RLI partition

2024-03-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7472: - Labels: pull-request-available (was: ) > Creating a functional index implicitly drops metad

[jira] [Updated] (HUDI-7471) Increase the number of Spark executors in tests

2024-03-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7471: - Labels: pull-request-available (was: ) > Increase the number of Spark executors in te

[jira] [Updated] (HUDI-7470) Compaction completed not need write to mdt if it is disable

2024-03-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7470: - Labels: pull-request-available (was: ) > Compaction completed not need write to

[jira] [Updated] (HUDI-7469) Reduce redundant tests with Hudi record types

2024-03-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7469: - Labels: pull-request-available (was: ) > Reduce redundant tests with Hudi record ty

[jira] [Updated] (HUDI-7465) Split tests in CI further to reduce total CI elapsed time

2024-03-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7465: - Labels: pull-request-available (was: ) > Split tests in CI further to reduce total CI elap

[jira] [Updated] (HUDI-7464) JsonKafkaSource Metadata Bug

2024-03-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7464: - Labels: pull-request-available (was: ) > JsonKafkaSource Metadata

[jira] [Updated] (HUDI-7463) Bump Spark 3.5 version to Spark 3.5.1

2024-03-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7463: - Labels: pull-request-available (was: ) > Bump Spark 3.5 version to Spark 3.

[jira] [Updated] (HUDI-7462) Refactor checkTopicCheckpoint in KafkaOffsetGen for reusability

2024-03-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7462: - Labels: pull-request-available (was: ) > Refactor checkTopicCheckpoint in KafkaOffset

[jira] [Updated] (HUDI-7458) Creating multiple functional index fails

2024-03-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7458: - Labels: pull-request-available (was: ) > Creating multiple functional index fa

[jira] [Updated] (HUDI-7460) Fix compaction schedule with pending delta commits

2024-02-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7460: - Labels: pull-request-available (was: ) > Fix compaction schedule with pending delta comm

[jira] [Updated] (HUDI-7459) Update hudi-gcp-bundle pom to make it consistent with hudi-gcp

2024-02-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7459: - Labels: pull-request-available (was: ) > Update hudi-gcp-bundle pom to make it consist

[jira] [Updated] (HUDI-7457) Remove runtime shutdown hook from HoodieLogFormatWriter

2024-02-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7457: - Labels: pull-request-available (was: ) > Remove runtime shutdown hook from HoodieLogFormatWri

[jira] [Updated] (HUDI-7456) Set 'hudi' as the explicit provider for new table properties when create table by spark

2024-02-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7456: - Labels: pull-request-available (was: ) > Set 'hudi' as the explicit provider for new ta

[jira] [Updated] (HUDI-7452) Repartition row dataset in S3/GCS based on task size

2024-02-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7452: - Labels: pull-request-available (was: ) > Repartition row dataset in S3/GCS based on task s

[jira] [Updated] (HUDI-7396) Improve Flink doc related to table services

2024-02-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7396: - Labels: pull-request-available (was: ) > Improve Flink doc related to table servi

[jira] [Updated] (HUDI-7450) Fix invalid kafka offset range bug in KafkaOffsetGen.computeOffsetRanges

2024-02-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7450: - Labels: pull-request-available (was: ) > Fix invalid kafka offset range

[jira] [Updated] (HUDI-7447) Fix not bootstrap when subTask restart when OPCoordinator handle CheckPointComplete not finished

2024-02-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7447: - Labels: pull-request-available (was: ) > Fix not bootstrap when subTask restart w

[jira] [Updated] (HUDI-7446) Enable CI on branch-0.x

2024-02-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7446: - Labels: pull-request-available (was: ) > Enable CI on branch-

[jira] [Updated] (HUDI-7429) Fix avg record size estimation for delta commits and replace commits

2024-02-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7429: - Labels: pull-request-available (was: ) > Fix avg record size estimation for delta comm

[jira] [Updated] (HUDI-7444) Make hoodiestreamer docs up to date

2024-02-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7444: - Labels: pull-request-available (was: ) > Make hoodiestreamer docs up to d

[jira] [Updated] (HUDI-7445) Move PR size labeling to GitHub scheduled workflow

2024-02-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7445: - Labels: pull-request-available (was: ) > Move PR size labeling to GitHub scheduled workf

[jira] [Updated] (HUDI-7008) Fixing usage of Kafka Avro deserializer w/ debezium sources

2024-02-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7008: - Labels: pull-request-available (was: ) > Fixing usage of Kafka Avro deserializer w/ debez

[jira] [Updated] (HUDI-7398) clarify clustering strategy for java client

2024-02-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7398: - Labels: pull-request-available (was: ) > clarify clustering strategy for java cli

[jira] [Updated] (HUDI-7443) Improve Compatibility for Legacy Decimal Types with Bytes as Actual Data Representationype that have bytes as actual type

2024-02-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7443: - Labels: pull-request-available (was: ) > Improve Compatibility for Legacy Decimal Ty

[jira] [Updated] (HUDI-7441) Decouple hive dependencies during flink streaming read hudi

2024-02-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7441: - Labels: pull-request-available (was: ) > Decouple hive dependencies during flink streaming r

[jira] [Updated] (HUDI-7440) Verify field exist in schema before fetching the value

2024-02-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7440: - Labels: pull-request-available (was: ) > Verify field exist in schema before fetching the va

[jira] [Updated] (HUDI-7275) org.apache.hudi.TestHoodieSparkSqlWriter#testInsertDatasetWithTimelineTimezoneUTC causes issues with following tests

2024-02-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7275: - Labels: pull-request-available (was: ) > org.apache.hudi.TestHoodieSparkSqlWri

[jira] [Updated] (HUDI-7437) bug fix: TestAvroKafkaSource. testAppendKafkaOffsetsSourceFormatAdapter

2024-02-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7437: - Labels: pull-request-available (was: ) > bug fix: TestAvroKafkaSou

[jira] [Updated] (HUDI-7438) Add GitHub action to check Azure CI report

2024-02-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7438: - Labels: pull-request-available (was: ) > Add GitHub action to check Azure CI rep

[jira] [Updated] (HUDI-6089) Handle default insert behaviour to ingest duplicates

2024-02-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6089: - Labels: insert pull-request-available (was: insert) > Handle default insert behaviour to ing

[jira] [Updated] (HUDI-7436) Not need to reWriteRecord when method SchemaCompatibility.checkReaderWriterCompatibility return SchemaCompatibilityType.COMPATIBLE

2024-02-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7436: - Labels: pull-request-available (was: ) > Not need to reWriteRecord when met

[jira] [Updated] (HUDI-7435) Remove Shaded of codahale metrics in flink bundle

2024-02-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7435: - Labels: pull-request-available (was: ) > Remove Shaded of codahale metrics in flink bun

[jira] [Updated] (HUDI-7432) Fix excessive object creation in KeyGenUtils

2024-02-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7432: - Labels: pull-request-available (was: ) > Fix excessive object creation in KeyGenUt

[jira] [Updated] (HUDI-7430) Fix the empty schema for compactor

2024-02-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7430: - Labels: pull-request-available (was: ) > Fix the empty schema for compac

[jira] [Updated] (HUDI-7431) Add replication and block size to StoragePathInfo to be backwards compatible

2024-02-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7431: - Labels: pull-request-available (was: ) > Add replication and block size to StoragePathI

<    1   2   3   4   5   6   7   8   9   10   >