[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2021-01-30 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17275658#comment-17275658 ] sivabalan narayanan commented on HUDI-1007: --- [~liujinhui]: So, you mean to say w

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-07-13 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17156581#comment-17156581 ] liujinhui commented on HUDI-1007: -   I have used spark2.4 and still have this problem 2.4.

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-07-08 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17154174#comment-17154174 ] Vinoth Chandar commented on HUDI-1007: -- [~liujinhui] please take a look at [https://g

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-16 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1713#comment-1713 ] Vinoth Chandar commented on HUDI-1007: -- Interesting.. >seeking to offset 4479769508

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-16 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136660#comment-17136660 ] liujinhui commented on HUDI-1007: -   So the best way at present is to discover the data de

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-16 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136655#comment-17136655 ] liujinhui commented on HUDI-1007: - > in your case, is this true? does setting the flag hel

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-16 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136653#comment-17136653 ] Vinoth Chandar commented on HUDI-1007: -- Starting from latest, is problematic IMO, due

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-16 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136651#comment-17136651 ] Vinoth Chandar commented on HUDI-1007: -- > If this is a compacted topic, consider enab

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-12 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17134635#comment-17134635 ] liujinhui commented on HUDI-1007: - I think that starting from the latest offect can indeed

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-12 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17134634#comment-17134634 ] liujinhui commented on HUDI-1007: - Caused by: org.apache.spark.SparkException: Job aborted

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-11 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17133412#comment-17133412 ] Vinoth Chandar commented on HUDI-1007: -- a scenario where the expiry is happening cont

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-08 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17128400#comment-17128400 ] liujinhui commented on HUDI-1007: - Yes, every run will check the offset of the earliest in

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-08 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17128395#comment-17128395 ] liujinhui commented on HUDI-1007: - # This test case is really special and requires a produ

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-08 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17128238#comment-17128238 ] Vinoth Chandar commented on HUDI-1007: -- [~liujinhui] My understanding is this must be

[jira] [Commented] (HUDI-1007) When earliestOffsets is greater than checkpoint, Hudi will not be able to successfully consume data

2020-06-08 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17128220#comment-17128220 ] liujinhui commented on HUDI-1007: - *[~vinoth]  What is your idea?* > When earliestOffsets