[jira] [Created] (HUDI-6339) Ability to Disable Partition Deletes during Clean

2023-06-08 Thread Dave Hagman (Jira)
Dave Hagman created HUDI-6339: - Summary: Ability to Disable Partition Deletes during Clean Key: HUDI-6339 URL: https://issues.apache.org/jira/browse/HUDI-6339 Project: Apache Hudi Issue Type: Imp

[jira] [Updated] (HUDI-6339) Ability to Disable Partition Deletes during Clean

2023-06-08 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman updated HUDI-6339: -- Description: We recently experienced a large data loss in one of our largest Hudi tables. We observed t

[jira] [Comment Edited] (HUDI-1307) spark datasource load path format is confused for snapshot and increment read mode

2022-02-22 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17496163#comment-17496163 ] Dave Hagman edited comment on HUDI-1307 at 2/22/22, 3:40 PM: -

[jira] [Commented] (HUDI-1307) spark datasource load path format is confused for snapshot and increment read mode

2022-02-22 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17496163#comment-17496163 ] Dave Hagman commented on HUDI-1307: --- [~vinoth] [~309637554] [~xushiyan]   I'd like to +1

[jira] [Commented] (HUDI-2173) Enhancing DynamoDB based LockProvider

2022-01-26 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17482553#comment-17482553 ] Dave Hagman commented on HUDI-2173: --- [~shivnarayan] I did not yet get a chance to run te

[jira] [Commented] (HUDI-2173) Enhancing DynamoDB based LockProvider

2021-11-19 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446559#comment-17446559 ] Dave Hagman commented on HUDI-2173: --- Will do! > Enhancing DynamoDB based LockProvider >

[jira] [Closed] (HUDI-2549) Exceptions when using second writer into Hudi table managed by DeltaStreamer

2021-10-19 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman closed HUDI-2549. - Resolution: Duplicate > Exceptions when using second writer into Hudi table managed by DeltaStreamer > ---

[jira] [Created] (HUDI-2579) Deltastreamer checkpoint metadata is not merged from previous commit instant

2021-10-19 Thread Dave Hagman (Jira)
Dave Hagman created HUDI-2579: - Summary: Deltastreamer checkpoint metadata is not merged from previous commit instant Key: HUDI-2579 URL: https://issues.apache.org/jira/browse/HUDI-2579 Project: Apache Hu

[jira] [Commented] (HUDI-2559) Ensure unique timestamps are generated for commit times with concurrent writers

2021-10-18 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17430050#comment-17430050 ] Dave Hagman commented on HUDI-2559: --- Testing approach 1 should be very easy given the wa

[jira] [Comment Edited] (HUDI-2559) Ensure unique timestamps are generated for commit times with concurrent writers

2021-10-18 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17430048#comment-17430048 ] Dave Hagman edited comment on HUDI-2559 at 10/18/21, 2:40 PM: --

[jira] [Commented] (HUDI-2559) Ensure unique timestamps are generated for commit times with concurrent writers

2021-10-18 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17430048#comment-17430048 ] Dave Hagman commented on HUDI-2559: --- I have been extensively testing approach #2 and so

[jira] [Commented] (HUDI-2549) Exceptions when using second writer into Hudi table managed by DeltaStreamer

2021-10-14 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17428798#comment-17428798 ] Dave Hagman commented on HUDI-2549: --- OK so you only ran 1 iteration of each (1 commit fr

[jira] [Commented] (HUDI-2549) Exceptions when using second writer into Hudi table managed by DeltaStreamer

2021-10-13 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17428499#comment-17428499 ] Dave Hagman commented on HUDI-2549: --- While continuing to test, I found that the _*FileAl

[jira] [Closed] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-10-12 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman closed HUDI-2275. - Fix Version/s: (was: 0.10.0) 0.9.0 Resolution: Fixed > HoodieDeltaStreamerEx

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-10-12 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17427809#comment-17427809 ] Dave Hagman commented on HUDI-2275: --- We are experiencing new issues when migrating to ve

[jira] [Commented] (HUDI-2549) Exceptions when using second writer into Hudi table managed by DeltaStreamer

2021-10-12 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17427807#comment-17427807 ] Dave Hagman commented on HUDI-2549: --- In order to try and validate my hypothesis about ra

[jira] [Created] (HUDI-2549) Exceptions when using second writer into Hudi table managed by DeltaStreamer

2021-10-12 Thread Dave Hagman (Jira)
Dave Hagman created HUDI-2549: - Summary: Exceptions when using second writer into Hudi table managed by DeltaStreamer Key: HUDI-2549 URL: https://issues.apache.org/jira/browse/HUDI-2549 Project: Apache Hu

[jira] [Updated] (HUDI-2549) Exceptions when using second writer into Hudi table managed by DeltaStreamer

2021-10-12 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman updated HUDI-2549: -- Description: When running the DeltaStreamer along with a second spark datasource writer (with [ZK-based

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-10-07 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17425773#comment-17425773 ] Dave Hagman commented on HUDI-2275: --- Also I am not specifying a value for _hoodie.metada

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-10-07 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17425756#comment-17425756 ] Dave Hagman commented on HUDI-2275: --- [~shivnarayan] I received the same error (above) ev

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-10-07 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17425741#comment-17425741 ] Dave Hagman commented on HUDI-2275: --- I did not try that patch. I will do that and report

[jira] [Comment Edited] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-10-04 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17424154#comment-17424154 ] Dave Hagman edited comment on HUDI-2275 at 10/4/21, 8:09 PM: -

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-10-04 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17424154#comment-17424154 ] Dave Hagman commented on HUDI-2275: --- [~vinoth] I'd argue that this is still a blocker as

[jira] [Comment Edited] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-12 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17398090#comment-17398090 ] Dave Hagman edited comment on HUDI-2275 at 8/12/21, 2:46 PM: -

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-12 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17398090#comment-17398090 ] Dave Hagman commented on HUDI-2275: --- Continuing to investigate this and found something

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-12 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17397979#comment-17397979 ] Dave Hagman commented on HUDI-2275: --- I have run into the same issue even with the prefix

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-11 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17397582#comment-17397582 ] Dave Hagman commented on HUDI-2275: --- Perfect tyvm. I will try this now! > HoodieDeltaSt

[jira] [Comment Edited] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-11 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17397454#comment-17397454 ] Dave Hagman edited comment on HUDI-2275 at 8/11/21, 4:19 PM: -

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-11 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17397454#comment-17397454 ] Dave Hagman commented on HUDI-2275: --- [~vinoth] just so I understand, this configuration

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-11 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17397328#comment-17397328 ] Dave Hagman commented on HUDI-2275: --- I will try that this morning and report back.  > H

[jira] [Comment Edited] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-10 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17396830#comment-17396830 ] Dave Hagman edited comment on HUDI-2275 at 8/10/21, 6:25 PM: -

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-10 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17396830#comment-17396830 ] Dave Hagman commented on HUDI-2275: --- [~vinoth] OK that at least ties together what I was

[jira] [Commented] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-05 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17394190#comment-17394190 ] Dave Hagman commented on HUDI-2275: --- Comment in Slack from Shiv Narayan:   {code:java}

[jira] [Updated] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-04 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman updated HUDI-2275: -- Description:  I am trying to utilize [Optimistic Concurrency Control|https://hudi.apache.org/docs/concu

[jira] [Updated] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-04 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman updated HUDI-2275: -- Description:  I am trying to utilize [Optimistic Concurrency Control|https://hudi.apache.org/docs/concu

[jira] [Updated] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-04 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman updated HUDI-2275: -- Description:  I am trying to utilize [Optimistic Concurrency Control|https://hudi.apache.org/docs/concu

[jira] [Updated] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-04 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman updated HUDI-2275: -- Description:  I am trying to utilize [Optimistic Concurrency Control|https://hudi.apache.org/docs/concu

[jira] [Created] (HUDI-2275) HoodieDeltaStreamerException when using OCC and a second concurrent writer

2021-08-04 Thread Dave Hagman (Jira)
Dave Hagman created HUDI-2275: - Summary: HoodieDeltaStreamerException when using OCC and a second concurrent writer Key: HUDI-2275 URL: https://issues.apache.org/jira/browse/HUDI-2275 Project: Apache Hudi

[jira] [Assigned] (HUDI-2230) "Task not serializable" exception due to non-serializable Codahale Timers

2021-07-27 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman reassigned HUDI-2230: - Assignee: Dave Hagman > "Task not serializable" exception due to non-serializable Codahale Timers

[jira] [Updated] (HUDI-2230) "Task not serializable" exception due to non-serializable Codahale Timers

2021-07-27 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman updated HUDI-2230: -- Description: Steps to reproduce: * Enable graphite metrics via props file. Example: {noformat} hoodie

[jira] [Updated] (HUDI-2230) "Task not serializable" exception due to non-serializable Codahale Timers

2021-07-27 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman updated HUDI-2230: -- Description: Steps to reproduce: * Enable graphite metrics via props file. Example: {noformat} hoodie

[jira] [Created] (HUDI-2230) "Task not serializable" exception due to non-serializable Codahale Timers

2021-07-27 Thread Dave Hagman (Jira)
Dave Hagman created HUDI-2230: - Summary: "Task not serializable" exception due to non-serializable Codahale Timers Key: HUDI-2230 URL: https://issues.apache.org/jira/browse/HUDI-2230 Project: Apache Hudi

[jira] [Comment Edited] (HUDI-2173) Implement a DynamoDB based LockProvider

2021-07-18 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17382962#comment-17382962 ] Dave Hagman edited comment on HUDI-2173 at 7/19/21, 2:58 AM: -

[jira] [Comment Edited] (HUDI-2173) Implement a DynamoDB based LockProvider

2021-07-18 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17382962#comment-17382962 ] Dave Hagman edited comment on HUDI-2173 at 7/19/21, 2:57 AM: -

[jira] [Commented] (HUDI-2173) Implement a DynamoDB based LockProvider

2021-07-18 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17382962#comment-17382962 ] Dave Hagman commented on HUDI-2173: --- Very basic pseudocode for an optimistic locking imp

[jira] [Comment Edited] (HUDI-2173) Implement a DynamoDB based LockProvider

2021-07-18 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17382961#comment-17382961 ] Dave Hagman edited comment on HUDI-2173 at 7/19/21, 2:42 AM: -

[jira] [Commented] (HUDI-2173) Implement a DynamoDB based LockProvider

2021-07-18 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17382961#comment-17382961 ] Dave Hagman commented on HUDI-2173: --- I have started to look into the various options we

[jira] [Updated] (HUDI-2173) Implement a DynamoDB based LockProvider

2021-07-18 Thread Dave Hagman (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Hagman updated HUDI-2173: -- Status: In Progress (was: Open) > Implement a DynamoDB based LockProvider > ---