[jira] [Commented] (HUDI-920) Incremental view on MOR table using Spark Datasource

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17173594#comment-17173594 ] Yanjia Gary Li commented on HUDI-920: - The most challenging thing of the incremental qu

[GitHub] [hudi] bvaradar commented on a change in pull request #1869: [HUDI-427] Implement CLI support for performing bootstrap

2020-08-07 Thread GitBox
bvaradar commented on a change in pull request #1869: URL: https://github.com/apache/hudi/pull/1869#discussion_r467370346 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/BootstrapCommand.java ## @@ -0,0 +1,185 @@ +/* + * Licensed to the Apache Software Founda

[GitHub] [hudi] xushiyan commented on a change in pull request #1871: [WIP] [HUDI-781] Introduce HoodieTestTable for test preparation

2020-08-07 Thread GitBox
xushiyan commented on a change in pull request #1871: URL: https://github.com/apache/hudi/pull/1871#discussion_r467367212 ## File path: hudi-client/src/test/java/org/apache/hudi/testutils/HoodieClientTestUtils.java ## @@ -74,57 +74,6 @@ public class HoodieClientTestUtils { R

[GitHub] [hudi] bvaradar commented on a change in pull request #1869: [HUDI-427] Implement CLI support for performing bootstrap

2020-08-07 Thread GitBox
bvaradar commented on a change in pull request #1869: URL: https://github.com/apache/hudi/pull/1869#discussion_r467369486 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/BootstrapCommand.java ## @@ -0,0 +1,185 @@ +/* + * Licensed to the Apache Software Founda

[GitHub] [hudi] bvaradar commented on a change in pull request #1869: [HUDI-427] Implement CLI support for performing bootstrap

2020-08-07 Thread GitBox
bvaradar commented on a change in pull request #1869: URL: https://github.com/apache/hudi/pull/1869#discussion_r467369460 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/BootstrapCommand.java ## @@ -0,0 +1,185 @@ +/* + * Licensed to the Apache Software Founda

[GitHub] [hudi] bvaradar commented on a change in pull request #1869: [HUDI-427] Implement CLI support for performing bootstrap

2020-08-07 Thread GitBox
bvaradar commented on a change in pull request #1869: URL: https://github.com/apache/hudi/pull/1869#discussion_r467369382 ## File path: hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/BootstrapIndex.java ## @@ -73,12 +73,12 @@ public final boolean useIndex() {

[GitHub] [hudi] bvaradar commented on a change in pull request #1869: [HUDI-427] Implement CLI support for performing bootstrap

2020-08-07 Thread GitBox
bvaradar commented on a change in pull request #1869: URL: https://github.com/apache/hudi/pull/1869#discussion_r467368588 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/BootstrapCommand.java ## @@ -0,0 +1,184 @@ +/* + * Licensed to the Apache Software Founda

[GitHub] [hudi] UZi5136225 commented on pull request #1931: [HUDI-210] hudi-support-prometheus-pushgateway

2020-08-07 Thread GitBox
UZi5136225 commented on pull request #1931: URL: https://github.com/apache/hudi/pull/1931#issuecomment-670830555 rerun tests This is an automated message from the Apache Git Service. To respond to the message, please log on t

[GitHub] [hudi] wangxianghu commented on pull request #1935: [HUDI-1121][DOC]Provide a document describing how to use callback

2020-08-07 Thread GitBox
wangxianghu commented on pull request #1935: URL: https://github.com/apache/hudi/pull/1935#issuecomment-670828182 @yanghua @leesf please take a look when free This is an automated message from the Apache Git Service. To respo

[GitHub] [hudi] wangxianghu opened a new pull request #1935: [HUDI-1121][DOC]Provide a document describing how to use callback

2020-08-07 Thread GitBox
wangxianghu opened a new pull request #1935: URL: https://github.com/apache/hudi/pull/1935 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the

[jira] [Updated] (HUDI-1121) Provide a document describing how to use callback

2020-08-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1121: - Labels: pull-request-available (was: ) > Provide a document describing how to use callback >

[GitHub] [hudi] bvaradar commented on pull request #1702: [HUDI-426] Bootstrap datasource integration

2020-08-07 Thread GitBox
bvaradar commented on pull request #1702: URL: https://github.com/apache/hudi/pull/1702#issuecomment-670826574 @umehrot2 : Can you confirm if all review comments are resolved and the PR is ready otherwise. This is an automa

[GitHub] [hudi] bvaradar commented on pull request #1702: [HUDI-426] Bootstrap datasource integration

2020-08-07 Thread GitBox
bvaradar commented on pull request #1702: URL: https://github.com/apache/hudi/pull/1702#issuecomment-670826438 @umehrot2 : Thanks for the update. Yeah, the integration test flakiness is a know issue and the logs shows the same pattern. Let me do one pass of it along with other bootstrap PR

[GitHub] [hudi] xushiyan opened a new pull request #1934: [MINOR] Move a test method to Transformations

2020-08-07 Thread GitBox
xushiyan opened a new pull request #1934: URL: https://github.com/apache/hudi/pull/1934 - Move TestHoodieKeyLocationFetchHandle#getRecordsPerPartition to Transformations - Improve some var namings ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please r

[GitHub] [hudi] leesf commented on pull request #1931: [HUDI-210] hudi-support-prometheus-pushgateway

2020-08-07 Thread GitBox
leesf commented on pull request #1931: URL: https://github.com/apache/hudi/pull/1931#issuecomment-670820501 rerun tests This is an automated message from the Apache Git Service. To respond to the message, please log on to Git

[hudi] branch master updated: [HUDI-1026] Removed slf4j dependency from HoodieClientTestHarness (#1928)

2020-08-07 Thread leesf
This is an automated email from the ASF dual-hosted git repository. leesf pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 1072f27 [HUDI-1026] Removed slf4j dependency from

[GitHub] [hudi] leesf merged pull request #1928: [HUDI-1026]: removed slf4j dependency from HoodieClientTestHarness

2020-08-07 Thread GitBox
leesf merged pull request #1928: URL: https://github.com/apache/hudi/pull/1928 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the s

[GitHub] [hudi] leesf merged pull request #1932: [MINOR]Remove unused import

2020-08-07 Thread GitBox
leesf merged pull request #1932: URL: https://github.com/apache/hudi/pull/1932 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the s

[hudi] branch master updated: [MINOR] Remove unused import (#1932)

2020-08-07 Thread leesf
This is an automated email from the ASF dual-hosted git repository. leesf pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 8b66524 [MINOR] Remove unused import (#1932) 8b665

[GitHub] [hudi] garyli1019 commented on pull request #1702: [HUDI-426] Bootstrap datasource integration

2020-08-07 Thread GitBox
garyli1019 commented on pull request #1702: URL: https://github.com/apache/hudi/pull/1702#issuecomment-670818469 The integration test fails sometimes for no reason. I have been seeing this for a few times. Maybe rerun will fix if lucky.

Build failed in Jenkins: hudi-snapshot-deployment-0.5 #363

2020-08-07 Thread Apache Jenkins Server
See Changes: -- [...truncated 2.59 KB...] cdi-api-1.0.jar cdi-api.license commons-cli-1.4.jar commons-cli.license commons-io-2.5.jar commons-io.license commons-lang3-3.5.jar

[GitHub] [hudi] umehrot2 commented on pull request #1702: [HUDI-426] Bootstrap datasource integration

2020-08-07 Thread GitBox
umehrot2 commented on pull request #1702: URL: https://github.com/apache/hudi/pull/1702#issuecomment-670814850 @vinothchandar the unit tests issues are resolved now. But the integration tests are behaving crazy. They passed the last time, and failed now even though I didn't make any code c

[GitHub] [hudi] umehrot2 commented on a change in pull request #1702: [HUDI-426] Bootstrap datasource integration

2020-08-07 Thread GitBox
umehrot2 commented on a change in pull request #1702: URL: https://github.com/apache/hudi/pull/1702#discussion_r46735 ## File path: hudi-spark/src/test/scala/org/apache/hudi/functional/TestDataSourceForBootstrap.scala ## @@ -0,0 +1,616 @@ +/* + * Licensed to the Apache Sof

[GitHub] [hudi] garyli1019 commented on a change in pull request #1702: [HUDI-426] Bootstrap datasource integration

2020-08-07 Thread GitBox
garyli1019 commented on a change in pull request #1702: URL: https://github.com/apache/hudi/pull/1702#discussion_r467341322 ## File path: hudi-spark/src/test/scala/org/apache/hudi/functional/TestDataSourceForBootstrap.scala ## @@ -0,0 +1,616 @@ +/* + * Licensed to the Apache S

[GitHub] [hudi] garyli1019 commented on pull request #1702: [HUDI-426] Bootstrap datasource integration

2020-08-07 Thread GitBox
garyli1019 commented on pull request #1702: URL: https://github.com/apache/hudi/pull/1702#issuecomment-670799557 > @vinothchandar I fixed the rebase issue, and resolved the `bootstrap` related test failures. I still see `MOR data source` related unit test failures because of `spark context

[GitHub] [hudi] umehrot2 commented on pull request #1702: [HUDI-426] Bootstrap datasource integration

2020-08-07 Thread GitBox
umehrot2 commented on pull request #1702: URL: https://github.com/apache/hudi/pull/1702#issuecomment-670793211 @vinothchandar I fixed the rebase issue, and resolved the `bootstrap` related test failures. I still see `MOR data source` related unit test failures because of `spark context`. I

[GitHub] [hudi] umehrot2 commented on pull request #1702: [HUDI-426] Bootstrap datasource integration

2020-08-07 Thread GitBox
umehrot2 commented on pull request #1702: URL: https://github.com/apache/hudi/pull/1702#issuecomment-670717238 > @umehrot2 some tests are failing . looking at them later today. > > Before we head into the weekend, is this PR ready from your perspective. if so, I will take care of mak

[GitHub] [hudi] zhedoubushishi commented on a change in pull request #1869: [HUDI-427] Implement CLI support for performing bootstrap

2020-08-07 Thread GitBox
zhedoubushishi commented on a change in pull request #1869: URL: https://github.com/apache/hudi/pull/1869#discussion_r467272326 ## File path: hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/HFileBootstrapIndex.java ## @@ -240,13 +240,21 @@ private HoodieBootst

[jira] [Resolved] (HUDI-69) Support realtime view in Spark datasource #136

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-69?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-69. Resolution: Fixed > Support realtime view in Spark datasource #136 > -

[jira] [Reopened] (HUDI-69) Support realtime view in Spark datasource #136

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-69?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li reopened HUDI-69: > Support realtime view in Spark datasource #136 > -- > >

[jira] [Resolved] (HUDI-1051) Improve MOR datasource reader file listing and path handling

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-1051. -- Resolution: Fixed > Improve MOR datasource reader file listing and path handling > -

[jira] [Updated] (HUDI-69) Support realtime view in Spark datasource #136

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-69?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-69: --- Status: Closed (was: Patch Available) > Support realtime view in Spark datasource #136 > --

[jira] [Resolved] (HUDI-1052) Support vectorized reader for MOR datasource reader

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-1052. -- Resolution: Fixed > Support vectorized reader for MOR datasource reader > --

[jira] [Resolved] (HUDI-1050) Support filter pushdown and column pruning for MOR table on Spark Datasource

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-1050. -- Resolution: Fixed > Support filter pushdown and column pruning for MOR table on Spark Datasource

[jira] [Updated] (HUDI-1052) Support vectorized reader for MOR datasource reader

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1052: - Status: In Progress (was: Open) > Support vectorized reader for MOR datasource reader > -

[GitHub] [hudi] rufferjr commented on issue #1923: [SUPPORT] Hive Sync fails to add decimal partition

2020-08-07 Thread GitBox
rufferjr commented on issue #1923: URL: https://github.com/apache/hudi/issues/1923#issuecomment-670636479 @bvaradar would you like the S3 partition path? If so, the following examples may be of use: s3://data-beta/vault/cod_combinations/partition_val=1003 s3://data-beta/vault/cod_

[jira] [Comment Edited] (HUDI-1146) DeltaStreamer fails to start when No updated records + schemaProvider not supplied

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17173348#comment-17173348 ] Balaji Varadarajan edited comment on HUDI-1146 at 8/7/20, 5:25 PM: -

[jira] [Comment Edited] (HUDI-1146) DeltaStreamer fails to start when No updated records + schemaProvider not supplied

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17173348#comment-17173348 ] Balaji Varadarajan edited comment on HUDI-1146 at 8/7/20, 5:25 PM: -

[jira] [Comment Edited] (HUDI-1146) DeltaStreamer fails to start when No updated records + schemaProvider not supplied

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17173348#comment-17173348 ] Balaji Varadarajan edited comment on HUDI-1146 at 8/7/20, 5:24 PM: -

[jira] [Commented] (HUDI-1146) DeltaStreamer fails to start when No updated records + schemaProvider not supplied

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17173348#comment-17173348 ] Balaji Varadarajan commented on HUDI-1146: -- [~bdscheller]: I think InputBatch::g

[jira] [Created] (HUDI-1171) Hudi 0.5.2 with ScalaTest and Spark 2.4.0 java.lang.NoClassDefFoundError: Could not initialize class org.apache.hudi.avro.model.HoodieCleanerPlan

2020-08-07 Thread Prashanth (Jira)
Prashanth created HUDI-1171: --- Summary: Hudi 0.5.2 with ScalaTest and Spark 2.4.0 java.lang.NoClassDefFoundError: Could not initialize class org.apache.hudi.avro.model.HoodieCleanerPlan Key: HUDI-1171 URL: https://issue

[GitHub] [hudi] zhedoubushishi commented on pull request #1933: [HUDI-971] Fix HFileBootstrapIndexReader.getIndexedPartitions() returns unclean partition name

2020-08-07 Thread GitBox
zhedoubushishi commented on pull request #1933: URL: https://github.com/apache/hudi/pull/1933#issuecomment-670594285 > @vinothchandar : It looks like @zhedoubushishi had addressed the same issue in his original PR. So, I am going to close this one. @zhedoubushishi : Can you us the changes

[GitHub] [hudi] UZi5136225 commented on pull request #1931: [HUDI-210] hudi-support-prometheus-pushgateway

2020-08-07 Thread GitBox
UZi5136225 commented on pull request #1931: URL: https://github.com/apache/hudi/pull/1931#issuecomment-670594131 @leesf Please review This is an automated message from the Apache Git Service. To respond to the message, please

[jira] [Assigned] (HUDI-971) Fix HFileBootstrapIndexReader.getIndexedPartitions() returns unclean partition name

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-971: --- Assignee: Wenning Ding (was: Balaji Varadarajan) > Fix HFileBootstrapIndexReader.getI

[GitHub] [hudi] bvaradar commented on a change in pull request #1869: [HUDI-427] Implement CLI support for performing bootstrap

2020-08-07 Thread GitBox
bvaradar commented on a change in pull request #1869: URL: https://github.com/apache/hudi/pull/1869#discussion_r467135559 ## File path: hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/HFileBootstrapIndex.java ## @@ -240,13 +240,21 @@ private HoodieBootstrapInd

[GitHub] [hudi] bvaradar edited a comment on pull request #1933: [HUDI-971] Fix HFileBootstrapIndexReader.getIndexedPartitions() returns unclean partition name

2020-08-07 Thread GitBox
bvaradar edited a comment on pull request #1933: URL: https://github.com/apache/hudi/pull/1933#issuecomment-670592925 @vinothchandar : It looks like @zhedoubushishi had addressed the same issue in his original PR. So, I am going to close this one. @zhedoubushishi : Can you us the changes

[GitHub] [hudi] bvaradar commented on pull request #1933: [HUDI-971] Fix HFileBootstrapIndexReader.getIndexedPartitions() returns unclean partition name

2020-08-07 Thread GitBox
bvaradar commented on pull request #1933: URL: https://github.com/apache/hudi/pull/1933#issuecomment-670592925 @vinothchandar : It looks like @zhedoubushishi had addressed it in his original PR. So, I am going to close this one. @zhedoubushishi : Can you us the changes in this PR regardin

[GitHub] [hudi] bvaradar closed pull request #1933: [HUDI-971] Fix HFileBootstrapIndexReader.getIndexedPartitions() returns unclean partition name

2020-08-07 Thread GitBox
bvaradar closed pull request #1933: URL: https://github.com/apache/hudi/pull/1933 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to th

[GitHub] [hudi] bvaradar commented on a change in pull request #1869: [HUDI-427] Implement CLI support for performing bootstrap

2020-08-07 Thread GitBox
bvaradar commented on a change in pull request #1869: URL: https://github.com/apache/hudi/pull/1869#discussion_r467133890 ## File path: hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/HFileBootstrapIndex.java ## @@ -240,13 +240,21 @@ private HoodieBootstrapInd

[GitHub] [hudi] bvaradar commented on a change in pull request #1869: [HUDI-427] Implement CLI support for performing bootstrap

2020-08-07 Thread GitBox
bvaradar commented on a change in pull request #1869: URL: https://github.com/apache/hudi/pull/1869#discussion_r467133890 ## File path: hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/HFileBootstrapIndex.java ## @@ -240,13 +240,21 @@ private HoodieBootstrapInd

[jira] [Commented] (HUDI-1015) Audit all getAllPartitionPaths() calls and keep em out of fast path

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17173242#comment-17173242 ] Balaji Varadarajan commented on HUDI-1015: -- Subtasks added to track all location

[jira] [Updated] (HUDI-1170) File Listing during log file rollback is affecting ingestion latency in S3

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1170: - Fix Version/s: 0.6.1 > File Listing during log file rollback is affecting ingestion latenc

[jira] [Updated] (HUDI-1170) File Listing during log file rollback is affecting ingestion latency in S3

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1170: - Status: Open (was: New) > File Listing during log file rollback is affecting ingestion la

[jira] [Updated] (HUDI-1170) File Listing during log file rollback is affecting ingestion latency in S3

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1170: - Description: (Source : [https://github.com/apache/hudi/issues/1852])   :  sun.net.www.pr

[jira] [Created] (HUDI-1170) File Listing during log file rollback is affecting ingestion latency in S3

2020-08-07 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1170: Summary: File Listing during log file rollback is affecting ingestion latency in S3 Key: HUDI-1170 URL: https://issues.apache.org/jira/browse/HUDI-1170 Projec

[jira] [Updated] (HUDI-1015) Audit all getAllPartitionPaths() calls and keep em out of fast path

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1015: - Priority: Major (was: Blocker) > Audit all getAllPartitionPaths() calls and keep em out o

[jira] [Updated] (HUDI-1015) Audit all getAllPartitionPaths() calls and keep em out of fast path

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1015: - Fix Version/s: (was: 0.6.0) 0.6.1 > Audit all getAllPartitionPaths(

[jira] [Created] (HUDI-1169) Audit Partition Listing : Snapshot Copier and Exporter Utilities

2020-08-07 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1169: Summary: Audit Partition Listing : Snapshot Copier and Exporter Utilities Key: HUDI-1169 URL: https://issues.apache.org/jira/browse/HUDI-1169 Project: Apache

[jira] [Created] (HUDI-1168) Audit Partition Listing : Savepoint Creation

2020-08-07 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1168: Summary: Audit Partition Listing : Savepoint Creation Key: HUDI-1168 URL: https://issues.apache.org/jira/browse/HUDI-1168 Project: Apache Hudi Issue

[GitHub] [hudi] UZi5136225 commented on pull request #1931: [HUDI-210] hudi-support-prometheus-pushgateway

2020-08-07 Thread GitBox
UZi5136225 commented on pull request #1931: URL: https://github.com/apache/hudi/pull/1931#issuecomment-670584668 ![9C0310E1-BE19-4cbf-9476-5351C72908FC](https://user-images.githubusercontent.com/25769285/89663903-e600cf80-d908-11ea-9d96-ada9f7a039f2.png) ![9C0310E1-BE19-4cbf-9476-5351C7

[GitHub] [hudi] UZi5136225 commented on pull request #1931: [HUDI-210] hudi-support-prometheus-pushgateway

2020-08-07 Thread GitBox
UZi5136225 commented on pull request #1931: URL: https://github.com/apache/hudi/pull/1931#issuecomment-670584994 ![9C0310E1-BE19-4cbf-9476-5351C72908FC](https://user-images.githubusercontent.com/25769285/89664026-19435e80-d909-11ea-936a-dff502005fff.png) ![BC04E798-8AFA-40a0-8FFF-43D6F8

[GitHub] [hudi] vinothchandar commented on pull request #1702: [HUDI-426] Bootstrap datasource integration

2020-08-07 Thread GitBox
vinothchandar commented on pull request #1702: URL: https://github.com/apache/hudi/pull/1702#issuecomment-670584927 @umehrot2 some tests are failing . looking at them later today. Before we head into the weekend, is this PR ready from your perspective. if so, I will take care of mak

[GitHub] [hudi] UZi5136225 removed a comment on pull request #1931: [HUDI-210] hudi-support-prometheus-pushgateway

2020-08-07 Thread GitBox
UZi5136225 removed a comment on pull request #1931: URL: https://github.com/apache/hudi/pull/1931#issuecomment-670584668 ![9C0310E1-BE19-4cbf-9476-5351C72908FC](https://user-images.githubusercontent.com/25769285/89663903-e600cf80-d908-11ea-9d96-ada9f7a039f2.png) ![9C0310E1-BE19-4cbf-947

[jira] [Created] (HUDI-1167) Audit Partition Listing : Hive Syncing

2020-08-07 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1167: Summary: Audit Partition Listing : Hive Syncing Key: HUDI-1167 URL: https://issues.apache.org/jira/browse/HUDI-1167 Project: Apache Hudi Issue Type:

[GitHub] [hudi] UZi5136225 commented on pull request #1931: [HUDI-210] hudi-support-prometheus-pushgateway

2020-08-07 Thread GitBox
UZi5136225 commented on pull request #1931: URL: https://github.com/apache/hudi/pull/1931#issuecomment-670583885 ![61960BDB-6E83-4086-BA4C-F0F0DBBC6722](https://user-images.githubusercontent.com/25769285/89663836-c8cc0100-d908-11ea-9d96-a0666441986a.png) --

[jira] [Created] (HUDI-1166) Audit Partition Listing : Rollback By Listing

2020-08-07 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1166: Summary: Audit Partition Listing : Rollback By Listing Key: HUDI-1166 URL: https://issues.apache.org/jira/browse/HUDI-1166 Project: Apache Hudi Issue

[jira] [Created] (HUDI-1165) Audit Partition Listing : Compaction Scheduling

2020-08-07 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1165: Summary: Audit Partition Listing : Compaction Scheduling Key: HUDI-1165 URL: https://issues.apache.org/jira/browse/HUDI-1165 Project: Apache Hudi Iss

[GitHub] [hudi] jpugliesi commented on issue #1925: [SUPPORT] Support for Confluent Cloud SchemaRegistryProvider

2020-08-07 Thread GitBox
jpugliesi commented on issue #1925: URL: https://github.com/apache/hudi/issues/1925#issuecomment-670581206 @bvaradar brilliant, didn't think of this - I'll give it a try and report back. This is an automated message from the

[GitHub] [hudi] vinothchandar commented on pull request #1933: [HUDI-971] Fix HFileBootstrapIndexReader.getIndexedPartitions() returns unclean partition name

2020-08-07 Thread GitBox
vinothchandar commented on pull request #1933: URL: https://github.com/apache/hudi/pull/1933#issuecomment-670580147 @bvaradar is this a release blocker? sounds like that? This is an automated message from the Apache Git Ser

[jira] [Updated] (HUDI-1164) Audit Partition Listing Location : CleanPlanner.getPartitionPathsForFullCleaning

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1164: - Description: This ticket is to track all known locations where we call FSUtils.getAllPart

[jira] [Updated] (HUDI-1164) Audit Partition Listing Location : CleanPlanner.getPartitionPathsForFullCleaning

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1164: - Status: Open (was: New) > Audit Partition Listing Location : > CleanPlanner.getPartition

[jira] [Created] (HUDI-1164) Audit Partition Listing Location : CleanPlanner.getPartitionPathsForFullCleaning

2020-08-07 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1164: Summary: Audit Partition Listing Location : CleanPlanner.getPartitionPathsForFullCleaning Key: HUDI-1164 URL: https://issues.apache.org/jira/browse/HUDI-1164

[jira] [Updated] (HUDI-1163) Audit Partition Listing Location : Global Simple Index lookup

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1163: - Status: Open (was: New) > Audit Partition Listing Location : Global Simple Index lookup >

[jira] [Created] (HUDI-1163) Audit Partition Listing Location : Global Simple Index lookup

2020-08-07 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1163: Summary: Audit Partition Listing Location : Global Simple Index lookup Key: HUDI-1163 URL: https://issues.apache.org/jira/browse/HUDI-1163 Project: Apache Hud

[jira] [Updated] (HUDI-1162) Audit Partition Listing Location : Global Bloom Index lookup

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1162: - Status: Open (was: New) > Audit Partition Listing Location : Global Bloom Index lookup >

[jira] [Created] (HUDI-1162) Audit Partition Listing Location : Global Bloom Index lookup

2020-08-07 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1162: Summary: Audit Partition Listing Location : Global Bloom Index lookup Key: HUDI-1162 URL: https://issues.apache.org/jira/browse/HUDI-1162 Project: Apache Hudi

[jira] [Updated] (HUDI-971) Fix HFileBootstrapIndexReader.getIndexedPartitions() returns unclean partition name

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-971: Status: Patch Available (was: In Progress) > Fix HFileBootstrapIndexReader.getIndexedPartiti

[jira] [Updated] (HUDI-971) Fix HFileBootstrapIndexReader.getIndexedPartitions() returns unclean partition name

2020-08-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-971: Status: In Progress (was: Open) > Fix HFileBootstrapIndexReader.getIndexedPartitions() retur

[jira] [Updated] (HUDI-971) Fix HFileBootstrapIndexReader.getIndexedPartitions() returns unclean partition name

2020-08-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-971: Labels: pull-request-available (was: ) > Fix HFileBootstrapIndexReader.getIndexedPartitions() return

[GitHub] [hudi] bvaradar commented on pull request #1933: [HUDI-971] Fix HFileBootstrapIndexReader.getIndexedPartitions() returns unclean partition name

2020-08-07 Thread GitBox
bvaradar commented on pull request #1933: URL: https://github.com/apache/hudi/pull/1933#issuecomment-670573069 @zhedoubushishi : Can you review this. This would impact your bootstrap CLI. This is an automated message from

[GitHub] [hudi] bvaradar opened a new pull request #1933: [HUDI-971] Fix HFileBootstrapIndexReader.getIndexedPartitions() returns unclean partition name

2020-08-07 Thread GitBox
bvaradar opened a new pull request #1933: URL: https://github.com/apache/hudi/pull/1933 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [hudi] Yungthuis opened a new pull request #1932: [MINOR]Remove unused import

2020-08-07 Thread GitBox
Yungthuis opened a new pull request #1932: URL: https://github.com/apache/hudi/pull/1932 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the p

[GitHub] [hudi] UZi5136225 opened a new pull request #1931: [HUDI-210] hudi-support-prometheus-pushgateway

2020-08-07 Thread GitBox
UZi5136225 opened a new pull request #1931: URL: https://github.com/apache/hudi/pull/1931 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the

[GitHub] [hudi] UZi5136225 closed pull request #1930: [HUDI-210] hudi-support-prometheus-pushgateway

2020-08-07 Thread GitBox
UZi5136225 closed pull request #1930: URL: https://github.com/apache/hudi/pull/1930 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] UZi5136225 opened a new pull request #1930: [HUDI-210] hudi-support-prometheus-pushgateway

2020-08-07 Thread GitBox
UZi5136225 opened a new pull request #1930: URL: https://github.com/apache/hudi/pull/1930 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the

[GitHub] [hudi] UZi5136225 closed pull request #1726: [HUDI-210]Hudi support prometheus

2020-08-07 Thread GitBox
UZi5136225 closed pull request #1726: URL: https://github.com/apache/hudi/pull/1726 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[jira] [Updated] (HUDI-1159) Parquet encryption policy interface

2020-08-07 Thread Gidon Gershinsky (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gidon Gershinsky updated HUDI-1159: --- Description: Provide an interface for Parquet column encryption policy engine clients. > Parq

[jira] [Updated] (HUDI-1159) Parquet encryption policy interface

2020-08-07 Thread Gidon Gershinsky (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gidon Gershinsky updated HUDI-1159: --- Summary: Parquet encryption policy interface (was: Encryption policy interface) > Parquet en

[GitHub] [hudi] Mathieu1124 commented on pull request #1901: [HUDI-532]Add java doc for hudi test suite test classes

2020-08-07 Thread GitBox
Mathieu1124 commented on pull request #1901: URL: https://github.com/apache/hudi/pull/1901#issuecomment-670506233 @cheshta2904 @pratyakshsharma I have addressed all your concerns, thanks for your detailed review :) This is

[GitHub] [hudi] Mathieu1124 commented on a change in pull request #1901: [HUDI-532]Add java doc for hudi test suite test classes

2020-08-07 Thread GitBox
Mathieu1124 commented on a change in pull request #1901: URL: https://github.com/apache/hudi/pull/1901#discussion_r467025816 ## File path: hudi-integ-test/src/test/java/org/apache/hudi/integ/testsuite/dag/HiveSyncDagGenerator.java ## @@ -31,6 +31,9 @@ import org.apache.hudi.i

[GitHub] [hudi] Mathieu1124 commented on a change in pull request #1901: [HUDI-532]Add java doc for hudi test suite test classes

2020-08-07 Thread GitBox
Mathieu1124 commented on a change in pull request #1901: URL: https://github.com/apache/hudi/pull/1901#discussion_r467025753 ## File path: hudi-integ-test/src/test/java/org/apache/hudi/integ/testsuite/dag/ComplexDagGenerator.java ## @@ -33,6 +33,9 @@ import org.apache.hudi.in

[GitHub] [hudi] Mathieu1124 commented on a change in pull request #1901: [HUDI-532]Add java doc for hudi test suite test classes

2020-08-07 Thread GitBox
Mathieu1124 commented on a change in pull request #1901: URL: https://github.com/apache/hudi/pull/1901#discussion_r467025989 ## File path: hudi-integ-test/src/test/java/org/apache/hudi/integ/testsuite/utils/TestUtils.java ## @@ -28,6 +28,9 @@ import org.apache.spark.api.java.

[GitHub] [hudi] Mathieu1124 commented on a change in pull request #1901: [HUDI-532]Add java doc for hudi test suite test classes

2020-08-07 Thread GitBox
Mathieu1124 commented on a change in pull request #1901: URL: https://github.com/apache/hudi/pull/1901#discussion_r467025252 ## File path: hudi-integ-test/src/test/java/org/apache/hudi/integ/ITTestBase.java ## @@ -48,6 +48,9 @@ import static org.junit.jupiter.api.Assertions.as

[jira] [Updated] (HUDI-1160) Support update partial fields for CoW table

2020-08-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1160: - Labels: pull-request-available (was: ) > Support update partial fields for CoW table > --

[GitHub] [hudi] leesf opened a new pull request #1929: [HUDI-1160] Support update partial fields for CoW table

2020-08-07 Thread GitBox
leesf opened a new pull request #1929: URL: https://github.com/apache/hudi/pull/1929 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the pull

[jira] [Created] (HUDI-1161) Support update partial fields for MoR table

2020-08-07 Thread leesf (Jira)
leesf created HUDI-1161: --- Summary: Support update partial fields for MoR table Key: HUDI-1161 URL: https://issues.apache.org/jira/browse/HUDI-1161 Project: Apache Hudi Issue Type: Sub-task Co

[jira] [Created] (HUDI-1160) Support update partial fields for CoW table

2020-08-07 Thread leesf (Jira)
leesf created HUDI-1160: --- Summary: Support update partial fields for CoW table Key: HUDI-1160 URL: https://issues.apache.org/jira/browse/HUDI-1160 Project: Apache Hudi Issue Type: Sub-task Co

[GitHub] [hudi] nsivabalan commented on pull request #1912: [HUDI-1098] Adding TimedWaitOnAppearConsistencyGuard

2020-08-07 Thread GitBox
nsivabalan commented on pull request #1912: URL: https://github.com/apache/hudi/pull/1912#issuecomment-670480225 @umehrot2 : Would appreciate if you agree on the approach here. Before I go ahead and address feedback want to have consensus.

[GitHub] [hudi] nsivabalan commented on pull request #1912: [HUDI-1098] Adding TimedWaitOnAppearConsistencyGuard

2020-08-07 Thread GitBox
nsivabalan commented on pull request #1912: URL: https://github.com/apache/hudi/pull/1912#issuecomment-670479909 @bvaradar : since you suggested to have the TimedWaitOnAppearCG as default opt in, I would suggest to introduce a new config for the sleep time. so that we can set it to 2 or 3

[GitHub] [hudi] nsivabalan commented on a change in pull request #1858: [HUDI-1014] Adding Upgrade and downgrade infra for smooth transitioning from list based rollback to marker based rollback

2020-08-07 Thread GitBox
nsivabalan commented on a change in pull request #1858: URL: https://github.com/apache/hudi/pull/1858#discussion_r466991187 ## File path: hudi-client/src/main/java/org/apache/hudi/client/AbstractHoodieWriteClient.java ## @@ -186,10 +188,14 @@ public HoodieMetrics getMetrics()

  1   2   >