[jira] [Updated] (HUDI-538) [UMBRELLA] Restructuring hudi client module for multi engine support

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-538: Fix Version/s: (was: 0.7.0) > [UMBRELLA] Restructuring hudi client module for multi engine suppor

[jira] [Updated] (HUDI-1502) Restore on MOR table leaves metadata table out-of-sync from data table

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1502: - Status: Closed (was: Patch Available) > Restore on MOR table leaves metadata table out-of-sync fr

[jira] [Updated] (HUDI-954) Test COW : Presto Read Optimized Query with metadata bootstrap

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-954: Fix Version/s: (was: 0.7.0) 0.8.0 > Test COW : Presto Read Optimized Query wit

[jira] [Updated] (HUDI-955) Test MOR : Presto Read Optimized Query with metadata bootstrap

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-955: Fix Version/s: (was: 0.7.0) 0.8.0 > Test MOR : Presto Read Optimized Query wit

[jira] [Updated] (HUDI-945) Cleanup spillable map files eagerly as part of close

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-945: Fix Version/s: (was: 0.7.0) 0.8.0 > Cleanup spillable map files eagerly as par

[jira] [Updated] (HUDI-956) Test COW : Presto Realtime Query with metadata bootstrap

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-956: Fix Version/s: (was: 0.7.0) 0.8.0 > Test COW : Presto Realtime Query with meta

[jira] [Updated] (HUDI-837) Fix AvroKafkaSource to use the latest schema for reading

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-837: Fix Version/s: (was: 0.7.0) 0.8.0 > Fix AvroKafkaSource to use the latest sche

[jira] [Updated] (HUDI-1120) Support spotless for scala

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1120: - Fix Version/s: (was: 0.7.0) 0.8.0 > Support spotless for scala > --

[jira] [Updated] (HUDI-1214) Need ability to set deltastreamer checkpoints when doing Spark datasource writes

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1214: - Fix Version/s: (was: 0.7.0) 0.8.0 > Need ability to set deltastreamer check

[jira] [Updated] (HUDI-1201) HoodieDeltaStreamer: Allow user overrides to read from earliest kafka offset when commit files do not have checkpoint

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1201: - Fix Version/s: (was: 0.7.0) 0.8.0 > HoodieDeltaStreamer: Allow user overrid

[jira] [Updated] (HUDI-1280) Add tool to capture earliest or latest offsets in kafka topics

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1280: - Fix Version/s: (was: 0.7.0) 0.8.0 > Add tool to capture earliest or latest

[jira] [Updated] (HUDI-1264) incremental read support with replace

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1264: - Fix Version/s: (was: 0.7.0) 0.8.0 > incremental read support with replace >

[jira] [Updated] (HUDI-1353) Incremental timeline support for pending clustering operations

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1353: - Fix Version/s: (was: 0.7.0) 0.8.0 > Incremental timeline support for pendin

[jira] [Updated] (HUDI-1284) preCombine all HoodieRecords and update all fields(which is not DefaultValue) according to orderingVal

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1284: - Fix Version/s: (was: 0.7.0) 0.8.0 > preCombine all HoodieRecords and update

[jira] [Updated] (HUDI-1363) Provide Option to drop columns after they are used to generate partition or record keys

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1363: - Fix Version/s: (was: 0.7.0) 0.8.0 > Provide Option to drop columns after th

[jira] [Updated] (HUDI-993) Use hoodie.delete.shuffle.parallelism for Delete API

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-993: Fix Version/s: (was: 0.8.0) 0.7.0 > Use hoodie.delete.shuffle.parallelism for

[jira] [Closed] (HUDI-993) Use hoodie.delete.shuffle.parallelism for Delete API

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar closed HUDI-993. --- Resolution: Fixed > Use hoodie.delete.shuffle.parallelism for Delete API >

[jira] [Updated] (HUDI-993) Use hoodie.delete.shuffle.parallelism for Delete API

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-993: Status: Open (was: New) > Use hoodie.delete.shuffle.parallelism for Delete API > ---

[GitHub] [hudi] danny0405 commented on a change in pull request #2449: [HUDI-1528] hudi-sync-tools supports synchronization to remote hive

2021-01-20 Thread GitBox
danny0405 commented on a change in pull request #2449: URL: https://github.com/apache/hudi/pull/2449#discussion_r561628258 ## File path: hudi-spark-datasource/hudi-spark-common/src/main/java/org/apache/hudi/DataSourceUtils.java ## @@ -284,6 +284,9 @@ public static HiveSyncConf

[jira] [Closed] (HUDI-1427) Throw a FileAlreadyExistsException when set HOODIE_AUTO_COMMIT_PROP to true

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar closed HUDI-1427. Resolution: Fixed > Throw a FileAlreadyExistsException when set HOODIE_AUTO_COMMIT_PROP to true > --

[jira] [Updated] (HUDI-1427) Throw a FileAlreadyExistsException when set HOODIE_AUTO_COMMIT_PROP to true

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1427: - Status: Open (was: New) > Throw a FileAlreadyExistsException when set HOODIE_AUTO_COMMIT_PROP to

[jira] [Updated] (HUDI-1427) Throw a FileAlreadyExistsException when set HOODIE_AUTO_COMMIT_PROP to true

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1427: - Fix Version/s: (was: 0.8.0) 0.7.0 > Throw a FileAlreadyExistsException when

[jira] [Closed] (HUDI-1424) Write Type changed to BULK_INSERT when set ENABLE_ROW_WRITER_OPT_KEY=true

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar closed HUDI-1424. Resolution: Fixed > Write Type changed to BULK_INSERT when set ENABLE_ROW_WRITER_OPT_KEY=true >

[jira] [Updated] (HUDI-1424) Write Type changed to BULK_INSERT when set ENABLE_ROW_WRITER_OPT_KEY=true

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1424: - Status: Open (was: New) > Write Type changed to BULK_INSERT when set ENABLE_ROW_WRITER_OPT_KEY=tr

[jira] [Updated] (HUDI-1424) Write Type changed to BULK_INSERT when set ENABLE_ROW_WRITER_OPT_KEY=true

2021-01-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1424: - Fix Version/s: (was: 0.8.0) 0.7.0 > Write Type changed to BULK_INSERT when

[GitHub] [hudi] jessica0530 commented on issue #143: Tracking ticket for folks to be added to slack group

2021-01-20 Thread GitBox
jessica0530 commented on issue #143: URL: https://github.com/apache/hudi/issues/143#issuecomment-764415027 please add me wjxdtc10...@gmail.com thanks This is an automated message from the Apache Git Service. To respond to th

svn commit: r45524 - /dev/hudi/KEYS

2021-01-20 Thread vinoth
Author: vinoth Date: Thu Jan 21 06:59:10 2021 New Revision: 45524 Log: Updating Vinoth Chandar's key to hudi keys Modified: dev/hudi/KEYS Modified: dev/hudi/KEYS == --- dev/hudi/KEYS (original) +++ dev/hudi/KEYS Thu

svn commit: r45525 - /release/hudi/KEYS

2021-01-20 Thread vinoth
Author: vinoth Date: Thu Jan 21 06:59:35 2021 New Revision: 45525 Log: Adding Vinoth Chandar's key to hudi release keys Modified: release/hudi/KEYS Modified: release/hudi/KEYS == --- release/hudi/KEYS (original) +++

[hudi] branch release-0.7.0 updated (2c69f69 -> ab4319d)

2021-01-20 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a change to branch release-0.7.0 in repository https://gitbox.apache.org/repos/asf/hudi.git. from 2c69f69 Create release branch for version 0.7.0. new bead433 [MINOR] Disable flaky tests new ab4319d

[hudi] 02/02: [MINOR] Make a separate travis CI job for hudi-utilities

2021-01-20 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch release-0.7.0 in repository https://gitbox.apache.org/repos/asf/hudi.git commit ab4319ddbc07dacf6f3e34c4e08d1156391ae63a Author: Vinoth Chandar AuthorDate: Wed Jan 20 20:07:26 2021 -0800 [MI

[hudi] 01/02: [MINOR] Disable flaky tests

2021-01-20 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch release-0.7.0 in repository https://gitbox.apache.org/repos/asf/hudi.git commit bead4331ba17c67b31425722f9bc22d3c303979d Author: Vinoth Chandar AuthorDate: Wed Jan 20 18:58:27 2021 -0800 [MI

[GitHub] [hudi] vrtrepp commented on issue #2461: All records are present in athena query result on glue crawled Hudi tables

2021-01-20 Thread GitBox
vrtrepp commented on issue #2461: URL: https://github.com/apache/hudi/issues/2461#issuecomment-76438 Hi Rubenssoto, That is how we are planning but it will involve writing few more steps in the pipeline.However our current architecture is based on running glue crawlers and removing

[GitHub] [hudi] Trevor-zhang commented on a change in pull request #2449: [HUDI-1528] hudi-sync-tools supports synchronization to remote hive

2021-01-20 Thread GitBox
Trevor-zhang commented on a change in pull request #2449: URL: https://github.com/apache/hudi/pull/2449#discussion_r561662407 ## File path: hudi-spark-datasource/hudi-spark-common/src/main/java/org/apache/hudi/DataSourceUtils.java ## @@ -284,6 +284,9 @@ public static HiveSyncC

<    1   2   3