[GitHub] [hudi] hudi-bot edited a comment on pull request #3744: [HUDI-2108] Fix flakiness in TestHoodieBackedMetadata

2021-10-04 Thread GitBox
hudi-bot edited a comment on pull request #3744: URL: https://github.com/apache/hudi/pull/3744#issuecomment-932845780 ## CI report: * 5a724c6c859d67980473db571c9a90b8babcf710 UNKNOWN * 6e3f4d10734095ac947d0821db536ead47abda7c Azure:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3744: [HUDI-2108] Fix flakiness in TestHoodieBackedMetadata

2021-10-04 Thread GitBox
hudi-bot edited a comment on pull request #3744: URL: https://github.com/apache/hudi/pull/3744#issuecomment-932845780 ## CI report: * 5a724c6c859d67980473db571c9a90b8babcf710 UNKNOWN * 6e3f4d10734095ac947d0821db536ead47abda7c Azure:

[GitHub] [hudi] xushiyan commented on pull request #3748: [HUDI-2516] Upgrade JUnit to 5.8.1

2021-10-04 Thread GitBox
xushiyan commented on pull request #3748: URL: https://github.com/apache/hudi/pull/3748#issuecomment-934056856 put on hold as i look further into why CI env does not trigger the FT tests with the new annotations while local env does. -- This is an automated message from the Apache Git

[GitHub] [hudi] nsivabalan commented on pull request #3590: [HUDI-2285][HUDI-2476] Metadata table synchronous design. Rebased and Squashed from pull/3426

2021-10-04 Thread GitBox
nsivabalan commented on pull request #3590: URL: https://github.com/apache/hudi/pull/3590#issuecomment-934032408 @hudi-bot azure run -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] nsivabalan commented on a change in pull request #3740: [HUDI-2496] Insert duplicate keys when precombined is deactivated

2021-10-04 Thread GitBox
nsivabalan commented on a change in pull request #3740: URL: https://github.com/apache/hudi/pull/3740#discussion_r721856150 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/HoodieConcatHandle.java ## @@ -94,4 +104,22 @@ public void

[GitHub] [hudi] nsivabalan commented on a change in pull request #3740: [HUDI-2496] Insert duplicate keys when precombined is deactivated

2021-10-04 Thread GitBox
nsivabalan commented on a change in pull request #3740: URL: https://github.com/apache/hudi/pull/3740#discussion_r721853559 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/HoodieMergeHandle.java ## @@ -257,6 +257,21 @@ private boolean

[GitHub] [hudi] nsivabalan commented on pull request #3416: [HUDI-2362] Add external config file support

2021-10-04 Thread GitBox
nsivabalan commented on pull request #3416: URL: https://github.com/apache/hudi/pull/3416#issuecomment-934016862 @xushiyan : I will let you review this PR. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [hudi] nsivabalan commented on pull request #3416: [HUDI-2362] Add external config file support

2021-10-04 Thread GitBox
nsivabalan commented on pull request #3416: URL: https://github.com/apache/hudi/pull/3416#issuecomment-934016721 Can you please update the description of the patch as well. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [hudi] nsivabalan edited a comment on pull request #3590: [HUDI-2285][HUDI-2476] Metadata table synchronous design. Rebased and Squashed from pull/3426

2021-10-04 Thread GitBox
nsivabalan edited a comment on pull request #3590: URL: https://github.com/apache/hudi/pull/3590#issuecomment-933938711 > @nsivabalan have we unit tested `HoodieBackedTableMetadata` or `BaseTableMetadata` here? I started writing unit tests and later realized that all of our existing

[GitHub] [hudi] nsivabalan commented on pull request #3590: [HUDI-2285][HUDI-2476] Metadata table synchronous design. Rebased and Squashed from pull/3426

2021-10-04 Thread GitBox
nsivabalan commented on pull request #3590: URL: https://github.com/apache/hudi/pull/3590#issuecomment-933938711 > @nsivabalan have we unit tested `HoodieBackedTableMetadata` or `BaseTableMetadata` here? I started writing unit tests and later realized that all of our existing

[GitHub] [hudi] nsivabalan commented on pull request #3590: [HUDI-2285][HUDI-2476] Metadata table synchronous design. Rebased and Squashed from pull/3426

2021-10-04 Thread GitBox
nsivabalan commented on pull request #3590: URL: https://github.com/apache/hudi/pull/3590#issuecomment-933930608 @hudi-bot azure run -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[jira] [Commented] (HUDI-1542) Fix Flaky test : TestHoodieMetadata#testSync

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17424216#comment-17424216 ] sivabalan narayanan commented on HUDI-1542: --- Fixed it along w/ metadata sync patch

[jira] [Resolved] (HUDI-1542) Fix Flaky test : TestHoodieMetadata#testSync

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-1542. --- Resolution: Fixed > Fix Flaky test : TestHoodieMetadata#testSync >

[jira] [Assigned] (HUDI-1537) Move validation of file listings to something that happens before each write

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-1537: - Assignee: Prashant Wason (was: sivabalan narayanan) > Move validation of file

[jira] [Resolved] (HUDI-1537) Move validation of file listings to something that happens before each write

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-1537. --- Resolution: Fixed > Move validation of file listings to something that happens before

[jira] [Comment Edited] (HUDI-1492) Handle DeltaWriteStat correctly for storage schemes that support appends

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17424212#comment-17424212 ] sivabalan narayanan edited comment on HUDI-1492 at 10/4/21, 10:22 PM: --

[jira] [Comment Edited] (HUDI-1492) Handle DeltaWriteStat correctly for storage schemes that support appends

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17424212#comment-17424212 ] sivabalan narayanan edited comment on HUDI-1492 at 10/4/21, 10:21 PM: --

[jira] [Commented] (HUDI-1492) Handle DeltaWriteStat correctly for storage schemes that support appends

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17424212#comment-17424212 ] sivabalan narayanan commented on HUDI-1492: --- While merging multiple entires in

[GitHub] [hudi] Rap70r edited a comment on issue #3697: [SUPPORT] Performance Tuning: How to speed up stages?

2021-10-04 Thread GitBox
Rap70r edited a comment on issue #3697: URL: https://github.com/apache/hudi/issues/3697#issuecomment-933893616 Hi @xushiyan, Here is an update for our latest tests. I have switched to d3.xlarge instance type and used the following configs: `spark-submit --deploy-mode cluster

[GitHub] [hudi] Rap70r edited a comment on issue #3697: [SUPPORT] Performance Tuning: How to speed up stages?

2021-10-04 Thread GitBox
Rap70r edited a comment on issue #3697: URL: https://github.com/apache/hudi/issues/3697#issuecomment-933893616 Hi @xushiyan, Here is an update for our latest tests. I have switched to d3.xlarge instance type and used the following configs: `spark-submit --deploy-mode cluster

[GitHub] [hudi] Rap70r commented on issue #3697: [SUPPORT] Performance Tuning: How to speed up stages?

2021-10-04 Thread GitBox
Rap70r commented on issue #3697: URL: https://github.com/apache/hudi/issues/3697#issuecomment-933893616 Hi @xushiyan, Here is an update for our latest tests. I have switched to d3.xlarge instance type and used the following configs: `spark-submit --deploy-mode cluster --conf

[GitHub] [hudi] hudi-bot edited a comment on pull request #3694: [HUDI-2469] [Kafka Connect] Replace json based payload with protobuf for Transaction protocol.

2021-10-04 Thread GitBox
hudi-bot edited a comment on pull request #3694: URL: https://github.com/apache/hudi/pull/3694#issuecomment-923312239 ## CI report: * 7b1c98735a9d96bafdd6f848e307c35d3f29ad8d Azure:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3694: [HUDI-2469] [Kafka Connect] Replace json based payload with protobuf for Transaction protocol.

2021-10-04 Thread GitBox
hudi-bot edited a comment on pull request #3694: URL: https://github.com/apache/hudi/pull/3694#issuecomment-923312239 ## CI report: * 57f95f0c70db96ca47a877831411bbf753502d9a Azure:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3694: [HUDI-2469] [Kafka Connect] Replace json based payload with protobuf for Transaction protocol.

2021-10-04 Thread GitBox
hudi-bot edited a comment on pull request #3694: URL: https://github.com/apache/hudi/pull/3694#issuecomment-923312239 ## CI report: * 57f95f0c70db96ca47a877831411bbf753502d9a Azure:

[GitHub] [hudi] davehagman edited a comment on issue #3733: [SUPPORT] Periodic and sustained latency spikes during index lookup

2021-10-04 Thread GitBox
davehagman edited a comment on issue #3733: URL: https://github.com/apache/hudi/issues/3733#issuecomment-933845183 > just partitioning on year, month and day did not work out for you and hence you have to go w/ hour as well? We tested multiple partitioning schemes and this gave us

[GitHub] [hudi] davehagman commented on issue #3733: [SUPPORT] Periodic and sustained latency spikes during index lookup

2021-10-04 Thread GitBox
davehagman commented on issue #3733: URL: https://github.com/apache/hudi/issues/3733#issuecomment-933845183 > just partitioning on year, month and day did not work out for you and hence you have to go w/ hour as well? We tested multiple partitioning schemes and this gave us a good

[GitHub] [hudi] nsivabalan commented on issue #3733: [SUPPORT] Periodic and sustained latency spikes during index lookup

2021-10-04 Thread GitBox
nsivabalan commented on issue #3733: URL: https://github.com/apache/hudi/issues/3733#issuecomment-933830007 But you are most welcome for contributions. I can help review it if you have some patch helping hudi improve index look up. This is very core to hudi and will definitely help

[GitHub] [hudi] nsivabalan commented on issue #3733: [SUPPORT] Periodic and sustained latency spikes during index lookup

2021-10-04 Thread GitBox
nsivabalan commented on issue #3733: URL: https://github.com/apache/hudi/issues/3733#issuecomment-933829540 Hey Dave. so, my understanding is that, just partitioning on year, month and day did not work out for you and hence you have to go w/ hour as well? bcoz, cardinality of

[jira] [Comment Edited] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-10-04 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17424154#comment-17424154 ] Dave Hagman edited comment on HUDI-2275 at 10/4/21, 8:09 PM: - [~vinoth] I'd

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-10-04 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17424154#comment-17424154 ] Dave Hagman commented on HUDI-2275: --- [~vinoth] I'd argue that this is still a blocker as it completely

[jira] [Comment Edited] (HUDI-2005) Audit and remove references of fs.listStatus() and fs.getFileStatus() or fs.exists()

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17408836#comment-17408836 ] sivabalan narayanan edited comment on HUDI-2005 at 10/4/21, 7:14 PM: -

[GitHub] [hudi] hudi-bot edited a comment on pull request #3740: [HUDI-2496] Insert duplicate keys when precombined is deactivated

2021-10-04 Thread GitBox
hudi-bot edited a comment on pull request #3740: URL: https://github.com/apache/hudi/pull/3740#issuecomment-931381693 ## CI report: * e4b4c092dc5e911ba265e6386d736faa932e5c7c UNKNOWN * 849c3476ecd00486984052ce2b33c25924532add UNKNOWN *

[jira] [Updated] (HUDI-2494) Fix usage of different key generators with metadata enabled

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2494: -- Description: With [sync metadata patch|https://github.com/apache/hudi/pull/3590/], when

[jira] [Assigned] (HUDI-1242) Clean up all warnings during compilation

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-1242: - Assignee: sivabalan narayanan > Clean up all warnings during compilation >

[jira] [Updated] (HUDI-1431) Pending items for Bulk Insert with Row

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1431: -- Status: Open (was: New) > Pending items for Bulk Insert with Row >

[jira] [Assigned] (HUDI-1431) Pending items for Bulk Insert with Row

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-1431: - Assignee: sivabalan narayanan > Pending items for Bulk Insert with Row >

[jira] [Updated] (HUDI-1431) Pending items for Bulk Insert with Row

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1431: -- Status: In Progress (was: Open) > Pending items for Bulk Insert with Row >

[jira] [Resolved] (HUDI-1431) Pending items for Bulk Insert with Row

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-1431. --- Fix Version/s: (was: 0.10.0) 0.9.0 Resolution: Fixed >

[jira] [Updated] (HUDI-1604) Fix archival max log size and potentially a bug in archival

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1604: -- Labels: sev:high sev:triage user-support-issues (was: sev:triage user-support-issues)

[jira] [Assigned] (HUDI-1604) Fix archival max log size and potentially a bug in archival

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-1604: - Assignee: sivabalan narayanan > Fix archival max log size and potentially a bug

[jira] [Updated] (HUDI-1608) MOR fetches all records for read optimized query w/ spark sql

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1608: -- Labels: pull-request-available sev:high (was: pull-request-available sev:critical) >

[jira] [Assigned] (HUDI-1609) Issues w/ using hive metastore by disabling jdbc

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-1609: - Assignee: Sagar Sumit > Issues w/ using hive metastore by disabling jdbc >

[jira] [Closed] (HUDI-1769) websites updates for 0.8.0 release

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-1769. - Fix Version/s: 0.8.0 Resolution: Fixed > websites updates for 0.8.0 release >

[jira] [Updated] (HUDI-1769) websites updates for 0.8.0 release

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1769: -- Status: In Progress (was: Open) > websites updates for 0.8.0 release >

[jira] [Closed] (HUDI-2001) SnapshotRead on MOR table runs into NoSuchMethodError w/ PartitionedFile.init in MergeOnReadSnapshotRelation

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-2001. - Assignee: sivabalan narayanan Resolution: Invalid > SnapshotRead on MOR table runs

[jira] [Updated] (HUDI-2025) Ensure parity between row writer bulk_insert and rdd based bulk_insert

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2025: -- Status: In Progress (was: Open) > Ensure parity between row writer bulk_insert and rdd

[jira] [Resolved] (HUDI-2025) Ensure parity between row writer bulk_insert and rdd based bulk_insert

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2025. --- Fix Version/s: 0.9.0 Resolution: Fixed > Ensure parity between row writer

[jira] [Assigned] (HUDI-2025) Ensure parity between row writer bulk_insert and rdd based bulk_insert

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-2025: - Assignee: sivabalan narayanan > Ensure parity between row writer bulk_insert and

[jira] [Updated] (HUDI-2065) Fix flaky test: TestHoodieBackedMetadata#testOnlyValidPartitionsAdded

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2065: -- Fix Version/s: 0.10.0 > Fix flaky test:

[jira] [Commented] (HUDI-2065) Fix flaky test: TestHoodieBackedMetadata#testOnlyValidPartitionsAdded

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17424074#comment-17424074 ] sivabalan narayanan commented on HUDI-2065: --- rewrote the test using test table with synchronous

[jira] [Updated] (HUDI-2027) Certify bulk_insert row writing for COW and MOR w/ test suite infra

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2027: -- Status: In Progress (was: Open) > Certify bulk_insert row writing for COW and MOR w/

[jira] [Resolved] (HUDI-2027) Certify bulk_insert row writing for COW and MOR w/ test suite infra

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2027. --- Fix Version/s: 0.9.0 Resolution: Fixed > Certify bulk_insert row writing for

[jira] [Assigned] (HUDI-2027) Certify bulk_insert row writing for COW and MOR w/ test suite infra

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-2027: - Assignee: sivabalan narayanan > Certify bulk_insert row writing for COW and MOR

[jira] [Resolved] (HUDI-2065) Fix flaky test: TestHoodieBackedMetadata#testOnlyValidPartitionsAdded

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2065. --- Resolution: Fixed > Fix flaky test:

[jira] [Updated] (HUDI-2065) Fix flaky test: TestHoodieBackedMetadata#testOnlyValidPartitionsAdded

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2065: -- Status: In Progress (was: Open) > Fix flaky test:

[jira] [Updated] (HUDI-2308) Support delete partitions via alter table

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2308: -- Labels: sev:high (was: ) > Support delete partitions via alter table >

[jira] [Updated] (HUDI-2308) Support delete partitions via alter table

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2308: -- Fix Version/s: 0.10.0 > Support delete partitions via alter table >

[jira] [Resolved] (HUDI-2311) Fix parsingPartitionColumns for spark and flink for upgrade infra

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2311. --- Fix Version/s: (was: 0.10.0) 0.9.0 Resolution: Fixed >

[GitHub] [hudi] hudi-bot edited a comment on pull request #3740: [HUDI-2496] Insert duplicate keys when precombined is deactivated

2021-10-04 Thread GitBox
hudi-bot edited a comment on pull request #3740: URL: https://github.com/apache/hudi/pull/3740#issuecomment-931381693 ## CI report: * e4b4c092dc5e911ba265e6386d736faa932e5c7c UNKNOWN * 849c3476ecd00486984052ce2b33c25924532add UNKNOWN *

[jira] [Updated] (HUDI-2311) Fix parsingPartitionColumns for spark and flink for upgrade infra

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2311: -- Status: In Progress (was: Open) > Fix parsingPartitionColumns for spark and flink for

[jira] [Resolved] (HUDI-2349) Add sparkDeleteNode to integ test suite

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2349. --- Fix Version/s: (was: 0.10.0) 0.9.0 Resolution: Fixed >

[jira] [Updated] (HUDI-2348) Publish a blog on schema evolution with KafkaAvroCustomDeserializer

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2348: -- Status: In Progress (was: Open) > Publish a blog on schema evolution with

[jira] [Assigned] (HUDI-2311) Fix parsingPartitionColumns for spark and flink for upgrade infra

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-2311: - Assignee: sivabalan narayanan > Fix parsingPartitionColumns for spark and flink

[jira] [Resolved] (HUDI-2348) Publish a blog on schema evolution with KafkaAvroCustomDeserializer

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2348. --- Fix Version/s: (was: 0.10.0) 0.9.0 Resolution: Fixed >

[jira] [Resolved] (HUDI-2356) Fix spark-sql docs with spark quick start guide

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2356. --- Fix Version/s: 0.10.0 Resolution: Fixed > Fix spark-sql docs with spark quick

[jira] [Resolved] (HUDI-2395) Make metadata tests lean and consistent

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2395. --- Fix Version/s: 0.10.0 Resolution: Fixed > Make metadata tests lean and

[jira] [Assigned] (HUDI-2349) Add sparkDeleteNode to integ test suite

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-2349: - Assignee: sivabalan narayanan > Add sparkDeleteNode to integ test suite >

[jira] [Updated] (HUDI-2381) Fix spark quick start guide for minor issues

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2381: -- Status: In Progress (was: Open) > Fix spark quick start guide for minor issues >

[jira] [Assigned] (HUDI-2356) Fix spark-sql docs with spark quick start guide

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-2356: - Assignee: sivabalan narayanan > Fix spark-sql docs with spark quick start guide

[jira] [Assigned] (HUDI-2381) Fix spark quick start guide for minor issues

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-2381: - Assignee: sivabalan narayanan > Fix spark quick start guide for minor issues >

[jira] [Updated] (HUDI-2356) Fix spark-sql docs with spark quick start guide

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2356: -- Status: In Progress (was: Open) > Fix spark-sql docs with spark quick start guide >

[jira] [Updated] (HUDI-2349) Add sparkDeleteNode to integ test suite

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2349: -- Status: In Progress (was: Open) > Add sparkDeleteNode to integ test suite >

[jira] [Resolved] (HUDI-2381) Fix spark quick start guide for minor issues

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2381. --- Fix Version/s: 0.10.0 Resolution: Fixed > Fix spark quick start guide for

[jira] [Updated] (HUDI-2395) Make metadata tests lean and consistent

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2395: -- Status: In Progress (was: Open) > Make metadata tests lean and consistent >

[jira] [Resolved] (HUDI-2474) Fix refreshing timeline for every operation

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2474. --- Resolution: Fixed > Fix refreshing timeline for every operation >

[jira] [Updated] (HUDI-2474) Fix refreshing timeline for every operation

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2474: -- Status: In Progress (was: Open) > Fix refreshing timeline for every operation >

[jira] [Updated] (HUDI-2488) Support bootstrapping a single or more partitions in metadata table while regular writers and table services are in progress

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2488: -- Fix Version/s: 0.10.0 > Support bootstrapping a single or more partitions in metadata

[jira] [Updated] (HUDI-2493) Verify removing glob pattern works w/ all key generators

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2493: -- Labels: sev:critical (was: ) > Verify removing glob pattern works w/ all key

[jira] [Updated] (HUDI-2494) Fix usage of different key generators with metadata enabled

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2494: -- Labels: sev:critical (was: ) > Fix usage of different key generators with metadata

[jira] [Updated] (HUDI-2511) Aggressive archival configs compared to cleaner configs make cleaning moot

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2511: -- Labels: sev:high user-support-issues (was: ) > Aggressive archival configs compared to

[jira] [Updated] (HUDI-2495) Difference in behavior between GenericRecord based key gen and Row based key gen

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2495: -- Labels: sev:critical user-support-issues (was: sev:critical) > Difference in behavior

[jira] [Updated] (HUDI-2512) Multi-writer w/ DeltaStreamer and Spark datasource does not work

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2512: -- Labels: pull-request-available sev:critical user-support-issues (was:

[jira] [Updated] (HUDI-2525) Test prometheus metrics with hudi (both spark ds and deltastreamer)

2021-10-04 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2525: -- Labels: sev:critical user-support-issues (was: ) > Test prometheus metrics with hudi

[GitHub] [hudi] hudi-bot edited a comment on pull request #3740: [HUDI-2496] Insert duplicate keys when precombined is deactivated

2021-10-04 Thread GitBox
hudi-bot edited a comment on pull request #3740: URL: https://github.com/apache/hudi/pull/3740#issuecomment-931381693 ## CI report: * e4b4c092dc5e911ba265e6386d736faa932e5c7c UNKNOWN * 849c3476ecd00486984052ce2b33c25924532add UNKNOWN *

[jira] [Assigned] (HUDI-1958) [Umbrella] Follow up items from 1 pass over GH issues

2021-10-04 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-1958: Assignee: Vinoth Chandar (was: Raymond Xu) > [Umbrella] Follow up items from 1 pass over

[jira] [Updated] (HUDI-1958) [Umbrella] Follow up items from 1 pass over GH issues

2021-10-04 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1958: - Labels: Docs hudi-umbrellas release-blocker (was: hudi-umbrellas release-blocker) > [Umbrella]

[jira] [Assigned] (HUDI-2337) Implement Multiwriter support for Kafka connect

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2337: --- Assignee: Ethan Guo (was: Rajesh Mahindra) > Implement Multiwriter support for Kafka connect >

[jira] [Assigned] (HUDI-2445) Implement time based bucketing of inserts

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2445: --- Assignee: Ethan Guo (was: Vinoth Chandar) > Implement time based bucketing of inserts >

[jira] [Assigned] (HUDI-2326) Marker file integration for Kafka connect in the Coordinator with timeline server

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2326: --- Assignee: Ethan Guo (was: Rajesh Mahindra) > Marker file integration for Kafka connect in the

[jira] [Assigned] (HUDI-2336) Metadata table integration for Kafka connect

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2336: --- Assignee: Ethan Guo (was: Rajesh Mahindra) > Metadata table integration for Kafka connect >

[jira] [Assigned] (HUDI-2353) Rewrite the java write client with the right abstractions

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2353: --- Assignee: Ethan Guo (was: Rajesh Mahindra) > Rewrite the java write client with the right

[jira] [Assigned] (HUDI-2469) Implement protobuf based protocol for control plane instead of json

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2469: --- Assignee: Ethan Guo (was: Rajesh Mahindra) > Implement protobuf based protocol for control plane

[jira] [Assigned] (HUDI-2325) Implement and test Hive Sync support for Kafka Connect

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2325: --- Assignee: Ethan Guo (was: Rajesh Mahindra) > Implement and test Hive Sync support for Kafka Connect

[jira] [Assigned] (HUDI-2331) Implement keyed data for Updates/ deletes for Java Kafka Client

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2331: --- Assignee: Ethan Guo (was: Rajesh Mahindra) > Implement keyed data for Updates/ deletes for Java

[jira] [Assigned] (HUDI-2334) Implement monitoring of hudi stats for Kafka connect

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2334: --- Assignee: Ethan Guo (was: Rajesh Mahindra) > Implement monitoring of hudi stats for Kafka connect >

[jira] [Assigned] (HUDI-2446) Test edge cases around correctness and offset management and certify

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2446: --- Assignee: Ethan Guo > Test edge cases around correctness and offset management and certify >

[jira] [Assigned] (HUDI-2431) Reimplement BufferedWriter in streaming fashion

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2431: --- Assignee: Ethan Guo (was: Vinoth Chandar) > Reimplement BufferedWriter in streaming fashion >

[jira] [Assigned] (HUDI-2332) Implement scheduling of compaction/ clustering for Kafka Connect

2021-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2332: --- Assignee: Ethan Guo (was: Vinoth Chandar) > Implement scheduling of compaction/ clustering for

[GitHub] [hudi] hudi-bot edited a comment on pull request #3740: [HUDI-2496] Insert duplicate keys when precombined is deactivated

2021-10-04 Thread GitBox
hudi-bot edited a comment on pull request #3740: URL: https://github.com/apache/hudi/pull/3740#issuecomment-931381693 ## CI report: * e4b4c092dc5e911ba265e6386d736faa932e5c7c UNKNOWN * 849c3476ecd00486984052ce2b33c25924532add UNKNOWN *

[jira] [Created] (HUDI-2526) Make spark.sql.parquet.writeLegacyFormat configurable

2021-10-04 Thread Sagar Sumit (Jira)
Sagar Sumit created HUDI-2526: - Summary: Make spark.sql.parquet.writeLegacyFormat configurable Key: HUDI-2526 URL: https://issues.apache.org/jira/browse/HUDI-2526 Project: Apache Hudi Issue

  1   2   3   >