[jira] [Updated] (HUDI-2616) Implement BloomIndex for Dataset

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2616: - Story Points: 2 > Implement BloomIndex for Dataset > - > >

[jira] [Created] (HUDI-2619) Make table services work with Dataset

2021-10-25 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2619: Summary: Make table services work with Dataset Key: HUDI-2619 URL: https://issues.apache.org/jira/browse/HUDI-2619 Project: Apache Hudi Issue Type: Sub-task

[jira] [Updated] (HUDI-2618) Implement operations other than upsert in SparkDataFrameWriteClient

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2618: - Story Points: 3 (was: 4) > Implement operations other than upsert in SparkDataFrameWriteClient >

[jira] [Updated] (HUDI-2618) Implement operations other than upsert in SparkDataFrameWriteClient

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2618: - Story Points: 4 > Implement operations other than upsert in SparkDataFrameWriteClient >

[jira] [Created] (HUDI-2618) Implement operations other than upsert in SparkDataFrameWriteClient

2021-10-25 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2618: Summary: Implement operations other than upsert in SparkDataFrameWriteClient Key: HUDI-2618 URL: https://issues.apache.org/jira/browse/HUDI-2618 Project: Apache Hudi

[jira] [Created] (HUDI-2620) Benchmark SparkDataFrameWriteClient

2021-10-25 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2620: Summary: Benchmark SparkDataFrameWriteClient Key: HUDI-2620 URL: https://issues.apache.org/jira/browse/HUDI-2620 Project: Apache Hudi Issue Type: Sub-task

[jira] [Created] (HUDI-2622) Enhance DataFrameWriter with LazyIterator and SpillableMap

2021-10-25 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2622: Summary: Enhance DataFrameWriter with LazyIterator and SpillableMap Key: HUDI-2622 URL: https://issues.apache.org/jira/browse/HUDI-2622 Project: Apache Hudi Issue

[jira] [Assigned] (HUDI-2623) Make hudi-bot comment at PR thread bottom

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-2623: Assignee: Raymond Xu > Make hudi-bot comment at PR thread bottom >

[jira] [Assigned] (HUDI-2620) Benchmark SparkDataFrameWriteClient

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-2620: Assignee: Raymond Xu > Benchmark SparkDataFrameWriteClient > --- >

[jira] [Commented] (HUDI-2781) Test 0.10 RC for Spark 3.x

2021-11-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17451546#comment-17451546 ] Raymond Xu commented on HUDI-2781: -- [~biyan900...@gmail.com] a quick discussion recap: # current test

[jira] [Commented] (HUDI-2151) Make performant out-of-box configs

2021-12-03 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17453199#comment-17453199 ] Raymond Xu commented on HUDI-2151: -- Codec change: change default codec to SNAPPY instead of GZIP * it's

[jira] [Updated] (HUDI-2811) Support Spark 3.2 and Parquet 1.12.x

2021-12-04 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2811: - Description: Reported issues * [https://github.com/apache/hudi/issues/4001] *

[jira] [Closed] (HUDI-2418) Support HiveSchemaProvider

2021-12-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-2418. > Support HiveSchemaProvider > --- > > Key: HUDI-2418 >

[jira] [Resolved] (HUDI-2418) Support HiveSchemaProvider

2021-12-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu resolved HUDI-2418. -- > Support HiveSchemaProvider > --- > > Key: HUDI-2418 >

[jira] [Updated] (HUDI-2811) Support Spark 3.2 and Parquet 1.12.x

2021-12-04 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2811: - Description: Reported issues * [https://github.com/apache/hudi/issues/4001] *

[jira] [Assigned] (HUDI-3065) spark auto partition discovery does not work from 0.9.0

2021-12-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-3065: Assignee: Yann Byron (was: Raymond Xu) > spark auto partition discovery does not work from 0.9.0

[jira] [Closed] (HUDI-3052) Flaky TestJsonKafkaSource in CI runs

2021-12-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-3052. Resolution: Fixed > Flaky TestJsonKafkaSource in CI runs > - > >

[jira] [Updated] (HUDI-2970) Archival fails with Delete_partition commits

2021-12-17 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2970: - Status: Patch Available (was: In Progress) > Archival fails with Delete_partition commits >

[jira] [Updated] (HUDI-2454) Remove travis config from asf-site branch

2021-12-17 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2454: - Fix Version/s: 0.11.0 > Remove travis config from asf-site branch >

[jira] [Updated] (HUDI-2987) event time not recorded in commit metadata when insert or bulk insert

2021-12-17 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2987: - Status: In Progress (was: Open) > event time not recorded in commit metadata when insert or bulk insert

[jira] [Updated] (HUDI-2987) event time not recorded in commit metadata when insert or bulk insert

2021-12-17 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2987: - Labels: sev:high (was: ) > event time not recorded in commit metadata when insert or bulk insert >

[jira] [Closed] (HUDI-2454) Remove travis config from asf-site branch

2021-12-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-2454. Resolution: Done > Remove travis config from asf-site branch > - >

[jira] [Updated] (HUDI-2987) event time not recorded in commit metadata when insert or bulk insert

2021-12-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2987: - Status: Patch Available (was: In Progress) > event time not recorded in commit metadata when insert or

[jira] [Updated] (HUDI-2989) Hive sync to Glue tables not updating S3 location

2021-12-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2989: - Status: Patch Available (was: In Progress) > Hive sync to Glue tables not updating S3 location >

[jira] [Updated] (HUDI-3112) KafkaConnect can not sync to Hive

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3112: - Sprint: Hudi-Sprint-0.10.1 > KafkaConnect can not sync to Hive > - > >

[jira] [Updated] (HUDI-3098) Billing mode in dynamo db based lock configs has a default value, but still checks for mandatory setting

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3098: - Sprint: Hudi-Sprint-0.10.1 > Billing mode in dynamo db based lock configs has a default value, but still

[jira] [Updated] (HUDI-3021) HoodieAppendHandle#appendDataAndDeleteBlocks writer object occurred NPE

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3021: - Sprint: Hudi-Sprint-0.10.1 > HoodieAppendHandle#appendDataAndDeleteBlocks writer object occurred NPE >

[jira] [Updated] (HUDI-2938) Code Refactor: Metadata util to get latest file slices for readers and writers

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2938: - Sprint: Hudi-Sprint-0.10.1 > Code Refactor: Metadata util to get latest file slices for readers and

[jira] [Updated] (HUDI-3104) Hudi-kafka-connect can not scan hadoop config files by HADOOP_CONF_DIR

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3104: - Sprint: Hudi-Sprint-0.10.1 > Hudi-kafka-connect can not scan hadoop config files by HADOOP_CONF_DIR >

[jira] [Updated] (HUDI-2558) Clustering w/ sort columns with null values fails

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2558: - Sprint: Hudi-Sprint-0.10.1 > Clustering w/ sort columns with null values fails >

[jira] [Commented] (HUDI-3132) Minor fixes for HoodieCatalog

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17467075#comment-17467075 ] Raymond Xu commented on HUDI-3132: -- [~danny0405] can you put more details in the ticket? trying to

[jira] [Updated] (HUDI-2465) Fix merge, update for spark sql dml support to test suite infra

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2465: - Fix Version/s: (was: 0.10.1) > Fix merge, update for spark sql dml support to test suite infra >

[jira] [Updated] (HUDI-3120) Cache compactionPlan in buffer

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3120: - Fix Version/s: (was: 0.10.1) > Cache compactionPlan in buffer > -- > >

[jira] [Updated] (HUDI-3001) clean up temp marker directory when finish bootstrap operation.

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3001: - Sprint: Hudi-Sprint-0.10.1 > clean up temp marker directory when finish bootstrap operation. >

[jira] [Updated] (HUDI-2780) Mor reads the log file and skips the complete block as a bad block, resulting in data loss

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2780: - Parent: (was: HUDI-2749) Issue Type: Bug (was: Sub-task) > Mor reads the log file and skips

[jira] [Updated] (HUDI-3097) Address dependency issue with hudi-trino-bundle in connector

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3097: - Fix Version/s: (was: 0.10.1) > Address dependency issue with hudi-trino-bundle in connector >

[jira] [Updated] (HUDI-2876) hudi should remove the temp file which create by HoodieMergedLogRecordScanner, when we use presto

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2876: - Sprint: Hudi-Sprint-0.10.1 > hudi should remove the temp file which create by >

[jira] [Updated] (HUDI-2675) Not an Avro data file

2021-12-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2675: - Sprint: Hudi-Sprint-0.10.1 > Not an Avro data file > - > > Key:

[jira] [Updated] (HUDI-2837) The original hoodie.table.name should be maintained in Spark SQL

2021-12-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2837: - Fix Version/s: 0.10.1 > The original hoodie.table.name should be maintained in Spark SQL >

[jira] [Closed] (HUDI-2970) Archival fails with Delete_partition commits

2021-12-20 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-2970. > Archival fails with Delete_partition commits > > >

[jira] [Updated] (HUDI-2970) Archival fails with Delete_partition commits

2021-12-20 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2970: - Fix Version/s: 0.10.1 > Archival fails with Delete_partition commits >

[jira] [Updated] (HUDI-2986) Deltastreamer continuous mode run into Too many open files exception

2021-12-20 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2986: - Fix Version/s: 0.10.1 > Deltastreamer continuous mode run into Too many open files exception >

[jira] [Updated] (HUDI-2989) Hive sync to Glue tables not updating S3 location

2021-12-20 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2989: - Fix Version/s: 0.10.1 > Hive sync to Glue tables not updating S3 location >

[jira] [Updated] (HUDI-2987) event time not recorded in commit metadata when insert or bulk insert

2021-12-20 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2987: - Fix Version/s: 0.10.1 > event time not recorded in commit metadata when insert or bulk insert >

[jira] [Commented] (HUDI-2834) Validate against supported hive versions

2021-12-20 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17462988#comment-17462988 ] Raymond Xu commented on HUDI-2834: -- [~codope] [~shivnarayan] This was deprioritized from 0.10.0 release.

[jira] [Updated] (HUDI-3070) Improve Test

2021-12-20 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3070: - Component/s: Testing > Improve Test > > > Key: HUDI-3070 >

[jira] [Closed] (HUDI-3070) Improve Test

2021-12-20 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-3070. Assignee: Yue Zhang Resolution: Done > Improve Test > > > Key: HUDI-3070

[jira] [Updated] (HUDI-3070) Improve Test

2021-12-20 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3070: - Fix Version/s: 0.11.0 0.10.1 > Improve Test > > > Key:

[jira] [Updated] (HUDI-3100) Hive Conditional sync cannot be set from deltastreamer

2021-12-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3100: - Fix Version/s: 0.11.0 0.10.1 > Hive Conditional sync cannot be set from deltastreamer

[jira] [Created] (HUDI-3100) Hive Conditional sync cannot be set from deltastreamer

2021-12-23 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-3100: Summary: Hive Conditional sync cannot be set from deltastreamer Key: HUDI-3100 URL: https://issues.apache.org/jira/browse/HUDI-3100 Project: Apache Hudi Issue Type:

[jira] [Assigned] (HUDI-3100) Hive Conditional sync cannot be set from deltastreamer

2021-12-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-3100: Assignee: Raymond Xu > Hive Conditional sync cannot be set from deltastreamer >

[jira] [Updated] (HUDI-1248) [UMBRELLA] Tests cleanup and fixes

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1248: - Issue Type: Epic (was: Improvement) > [UMBRELLA] Tests cleanup and fixes >

[jira] [Updated] (HUDI-1958) [Umbrella] Follow up items from 1 pass over GH issues

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1958: - Issue Type: Epic (was: Improvement) > [Umbrella] Follow up items from 1 pass over GH issues >

[jira] [Updated] (HUDI-60) [UMBRELLA] Support Apache Beam for incremental tailing

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-60: --- Issue Type: Epic (was: Improvement) > [UMBRELLA] Support Apache Beam for incremental tailing >

[jira] [Updated] (HUDI-2531) [UMBRELLA] Support Dataset APIs in writer paths

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2531: - Issue Type: Epic (was: New Feature) > [UMBRELLA] Support Dataset APIs in writer paths >

[jira] [Updated] (HUDI-2687) [UMBRELLA] A new Trino connector for Hudi

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2687: - Issue Type: Epic (was: New Feature) > [UMBRELLA] A new Trino connector for Hudi >

[jira] [Updated] (HUDI-1042) [Umbrella] Support clustering on filegroups

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1042: - Issue Type: Epic (was: Bug) > [Umbrella] Support clustering on filegroups >

[jira] [Updated] (HUDI-1297) [Umbrella] Spark Datasource Support

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1297: - Issue Type: Epic (was: Improvement) > [Umbrella] Spark Datasource Support >

[jira] [Updated] (HUDI-2564) Support comprehensive schema evolution in Hudi (rename, drop etc)

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2564: - Issue Type: Epic (was: Improvement) > Support comprehensive schema evolution in Hudi (rename, drop etc)

[jira] [Updated] (HUDI-1239) [UMBRELLA] Config clean up

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1239: - Issue Type: Epic (was: Improvement) > [UMBRELLA] Config clean up > -- > >

[jira] [Updated] (HUDI-2224) Hudi Examples

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2224: - Issue Type: Epic (was: Task) > Hudi Examples > - > > Key: HUDI-2224 >

[jira] [Updated] (HUDI-57) [UMBRELLA] Support ORC Storage

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-57?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-57: --- Issue Type: Epic (was: Improvement) > [UMBRELLA] Support ORC Storage > -- > >

[jira] [Updated] (HUDI-2429) [UMBRELLA] Comprehensive Schema evolution in Hudi

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2429: - Issue Type: Epic (was: New Feature) > [UMBRELLA] Comprehensive Schema evolution in Hudi >

[jira] [Updated] (HUDI-1387) [UMBRELLA] Support Apache Calcite for writing/querying Hudi datasets

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1387: - Issue Type: Epic (was: New Feature) > [UMBRELLA] Support Apache Calcite for writing/querying Hudi

[jira] [Updated] (HUDI-1390) [UMBRELLA] Support schema inference for unstructured data

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1390: - Issue Type: Epic (was: Improvement) > [UMBRELLA] Support schema inference for unstructured data >

[jira] [Updated] (HUDI-2235) [UMBRELLA] Add virtual key support to Hudi

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2235: - Issue Type: Epic (was: New Feature) > [UMBRELLA] Add virtual key support to Hudi >

[jira] [Updated] (HUDI-2565) Metadata based indices

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2565: - Issue Type: Epic (was: Improvement) > Metadata based indices > --- > >

[jira] [Updated] (HUDI-868) [UMBRELLA] Insert Overwrite API

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-868: Issue Type: Epic (was: New Feature) > [UMBRELLA] Insert Overwrite API > --- > >

[jira] [Updated] (HUDI-3039) [Umbrella] Bucket Index

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3039: - Issue Type: Epic (was: New Feature) > [Umbrella] Bucket Index > --- > >

[jira] [Updated] (HUDI-1822) [Umbrella] Multi Modal Indexing

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1822: - Issue Type: Epic (was: New Feature) > [Umbrella] Multi Modal Indexing > ---

[jira] [Updated] (HUDI-1292) [Umbrella] RFC-15 : File Listing and Query Planning Optimizations

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1292: - Issue Type: Epic (was: Improvement) > [Umbrella] RFC-15 : File Listing and Query Planning Optimizations

[jira] [Updated] (HUDI-270) [UMBRELLA] Improve Hudi website UI and documentation

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-270: Issue Type: Epic (was: Task) > [UMBRELLA] Improve Hudi website UI and documentation >

[jira] [Updated] (HUDI-1385) [UMBRELLA] Improve source ingestion support in DeltaStreamer

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1385: - Issue Type: Epic (was: Improvement) > [UMBRELLA] Improve source ingestion support in DeltaStreamer >

[jira] [Updated] (HUDI-538) [UMBRELLA] Restructuring hudi client module for multi engine support

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-538: Issue Type: Epic (was: Improvement) > [UMBRELLA] Restructuring hudi client module for multi engine support

[jira] [Updated] (HUDI-2505) [UMBRELLA] Spark DataSource APIs and Spark SQL discrepancies

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2505: - Issue Type: Epic (was: Improvement) > [UMBRELLA] Spark DataSource APIs and Spark SQL discrepancies >

[jira] [Updated] (HUDI-1290) Implement Debezium avro source for Delta Streamer

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1290: - Issue Type: Epic (was: Improvement) > Implement Debezium avro source for Delta Streamer >

[jira] [Updated] (HUDI-466) [Umbrella] Record level, global low-latency index implementation

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-466: Issue Type: Epic (was: Improvement) > [Umbrella] Record level, global low-latency index implementation >

[jira] [Updated] (HUDI-1859) [UMBRELLA] RFC - 14 : JDBC incremental puller

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1859: - Issue Type: Epic (was: New Feature) > [UMBRELLA] RFC - 14 : JDBC incremental puller >

[jira] [Updated] (HUDI-1238) [UMBRELLA] Perf test env

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1238: - Issue Type: Epic (was: Improvement) > [UMBRELLA] Perf test env > > >

[jira] [Updated] (HUDI-2519) [UMBRELLA] Seamless Hudi Hive Sync

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2519: - Issue Type: Epic (was: Improvement) > [UMBRELLA] Seamless Hudi Hive Sync >

[jira] [Updated] (HUDI-1896) [UMBRELLA] Implement DeltaStreamer Source for cloud object stores

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1896: - Issue Type: Epic (was: New Feature) > [UMBRELLA] Implement DeltaStreamer Source for cloud object stores

[jira] [Updated] (HUDI-1249) [UMBRELLA] refactor tests for ease of development

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1249: - Issue Type: Epic (was: Improvement) > [UMBRELLA] refactor tests for ease of development >

[jira] [Updated] (HUDI-1250) [UMBRELLA] Test coverage

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1250: - Issue Type: Epic (was: Improvement) > [UMBRELLA] Test coverage > > >

[jira] [Updated] (HUDI-1237) [UMBRELLA] Checkstyle, formatting, warnings, spotless

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1237: - Issue Type: Epic (was: Improvement) > [UMBRELLA] Checkstyle, formatting, warnings, spotless >

[jira] [Updated] (HUDI-1388) [UMBRELLA] Improve CLI features and usabilities

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1388: - Issue Type: Epic (was: Improvement) > [UMBRELLA] Improve CLI features and usabilities >

[jira] [Updated] (HUDI-1628) [Umbrella] Improve data locality during ingestion

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1628: - Issue Type: Epic (was: New Feature) > [Umbrella] Improve data locality during ingestion >

[jira] [Updated] (HUDI-2100) [UMBRELLA] Support Space curve for hudi

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2100: - Issue Type: Epic (was: New Feature) > [UMBRELLA] Support Space curve for hudi >

[jira] [Updated] (HUDI-1251) [UMBRELLA] Migrate CI to azure

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1251: - Issue Type: Epic (was: Improvement) > [UMBRELLA] Migrate CI to azure > ---

[jira] [Updated] (HUDI-1236) [UMBRELLA] Integ Test suite infra

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1236: - Issue Type: Epic (was: Test) > [UMBRELLA] Integ Test suite infra > -- >

[jira] [Updated] (HUDI-1658) [UMBRELLA] Spark Sql Support For Hudi

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1658: - Issue Type: Epic (was: New Feature) > [UMBRELLA] Spark Sql Support For Hudi >

[jira] [Updated] (HUDI-3000) [UMBRELLA] Consistent Hashing Index

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3000: - Issue Type: Epic (was: New Feature) > [UMBRELLA] Consistent Hashing Index >

[jira] [Updated] (HUDI-2832) [Umbrella] [RFC-40] Implement SnowflakeSyncTool to support Hudi to Snowflake Integration

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2832: - Issue Type: Epic (was: New Feature) > [Umbrella] [RFC-40] Implement SnowflakeSyncTool to support Hudi to

[jira] [Updated] (HUDI-1928) [UMBRELLA] Improve user experience by simplifying hudi task parameter configuration

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1928: - Issue Type: Epic (was: Task) > [UMBRELLA] Improve user experience by simplifying hudi task parameter >

[jira] [Updated] (HUDI-909) [UMBRELLA]Integrate hudi with flink engine

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-909: Issue Type: Epic (was: New Feature) > [UMBRELLA]Integrate hudi with flink engine >

[jira] [Updated] (HUDI-2948) [UMBRELLA] Hudi Clustering Performance

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2948: - Issue Type: Epic (was: Task) > [UMBRELLA] Hudi Clustering Performance >

[jira] [Updated] (HUDI-2575) [UMBRELLA] Revamp CI bot

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2575: - Issue Type: Epic (was: New Feature) > [UMBRELLA] Revamp CI bot > - > >

[jira] [Updated] (HUDI-3081) [UMBRELLA] Revisiting Read Path Infra across Query Engines

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3081: - Issue Type: Epic (was: Bug) > [UMBRELLA] Revisiting Read Path Infra across Query Engines >

[jira] [Updated] (HUDI-2438) [Umbrella] [RFC-34] Implement BigQuerySyncTool for BigQuery Sync

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2438: - Issue Type: Epic (was: New Feature) > [Umbrella] [RFC-34] Implement BigQuerySyncTool for BigQuery Sync >

[jira] [Updated] (HUDI-3128) [UMBRELLA] Test Hudi Epic

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3128: - Issue Type: Epic (was: New Feature) > [UMBRELLA] Test Hudi Epic > - > >

[jira] [Updated] (HUDI-2324) [UMBRELLA] Implement Hudi Transaction writes for Kafka Connect platform

2022-01-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2324: - Fix Version/s: 0.11.0 > [UMBRELLA] Implement Hudi Transaction writes for Kafka Connect platform >

<    3   4   5   6   7   8   9   10   11   12   >