[GitHub] [hudi] minihippo commented on pull request #3193: [HUDI-2107] Support Read Log Only MOR Table For Spark

2021-07-06 Thread GitBox
minihippo commented on pull request #3193: URL: https://github.com/apache/hudi/pull/3193#issuecomment-874526750 LGTM -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To uns

[jira] [Commented] (HUDI-2107) Support Read Log Only MOR Table For Spark

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375316#comment-17375316 ] ASF GitHub Bot commented on HUDI-2107: -- minihippo commented on a change in pull reque

[jira] [Commented] (HUDI-2138) Implement Parquest Data blocks for file inlining

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375315#comment-17375315 ] ASF GitHub Bot commented on HUDI-2138: -- codecov-commenter commented on pull request #

[GitHub] [hudi] minihippo commented on a change in pull request #3193: [HUDI-2107] Support Read Log Only MOR Table For Spark

2021-07-06 Thread GitBox
minihippo commented on a change in pull request #3193: URL: https://github.com/apache/hudi/pull/3193#discussion_r664301735 ## File path: hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestMergeIntoTable2.scala ## @@ -0,0 +1,91 @@ +/* + * Licensed to

[GitHub] [hudi] minihippo commented on a change in pull request #3193: [HUDI-2107] Support Read Log Only MOR Table For Spark

2021-07-06 Thread GitBox
minihippo commented on a change in pull request #3193: URL: https://github.com/apache/hudi/pull/3193#discussion_r664301524 ## File path: hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/utils/HoodieRealtimeInputFormatUtils.java ## @@ -161,29 +162,21 @@ .map(ins

[jira] [Commented] (HUDI-2107) Support Read Log Only MOR Table For Spark

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375317#comment-17375317 ] ASF GitHub Bot commented on HUDI-2107: -- minihippo commented on a change in pull reque

[GitHub] [hudi] codecov-commenter commented on pull request #3228: [HUDI-2138] Add Parquest Log Formats

2021-07-06 Thread GitBox
codecov-commenter commented on pull request #3228: URL: https://github.com/apache/hudi/pull/3228#issuecomment-874525846 # [Codecov](https://codecov.io/gh/apache/hudi/pull/3228?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+Apache

[jira] [Commented] (HUDI-2120) Update docs about schema in flink sql configuration

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375314#comment-17375314 ] ASF GitHub Bot commented on HUDI-2120: -- wangxianghu opened a new pull request #3229:

[jira] [Updated] (HUDI-2120) Update docs about schema in flink sql configuration

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-2120: - Labels: pull-request-available (was: ) > Update docs about schema in flink sql configuration > --

[GitHub] [hudi] wangxianghu opened a new pull request #3229: [HUDI-2120] Update docs about schema in flink sql configuration

2021-07-06 Thread GitBox
wangxianghu opened a new pull request #3229: URL: https://github.com/apache/hudi/pull/3229 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the

[jira] [Commented] (HUDI-2138) Implement Parquest Data blocks for file inlining

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375313#comment-17375313 ] ASF GitHub Bot commented on HUDI-2138: -- hudi-bot edited a comment on pull request #32

[GitHub] [hudi] hudi-bot edited a comment on pull request #3228: [HUDI-2138] Add Parquest Log Formats

2021-07-06 Thread GitBox
hudi-bot edited a comment on pull request #3228: URL: https://github.com/apache/hudi/pull/3228#issuecomment-874522164 ## CI report: * 169a9b4f401bfabd2c23ec2cf999a806f7cb5fe8 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/res

[jira] [Commented] (HUDI-2138) Implement Parquest Data blocks for file inlining

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375311#comment-17375311 ] ASF GitHub Bot commented on HUDI-2138: -- hudi-bot commented on pull request #3228: URL

[GitHub] [hudi] hudi-bot commented on pull request #3228: [HUDI-2138] Add Parquest Log Formats

2021-07-06 Thread GitBox
hudi-bot commented on pull request #3228: URL: https://github.com/apache/hudi/pull/3228#issuecomment-874522164 ## CI report: * 169a9b4f401bfabd2c23ec2cf999a806f7cb5fe8 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run travis`

[jira] [Updated] (HUDI-2138) Implement Parquest Data blocks for file inlining

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-2138: - Labels: performance pull-request-available (was: performance) > Implement Parquest Data blocks fo

[jira] [Commented] (HUDI-2138) Implement Parquest Data blocks for file inlining

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375309#comment-17375309 ] ASF GitHub Bot commented on HUDI-2138: -- rmahindra123 opened a new pull request #3228:

[GitHub] [hudi] rmahindra123 opened a new pull request #3228: [HUDI-2138] Add Parquest Log Formats

2021-07-06 Thread GitBox
rmahindra123 opened a new pull request #3228: URL: https://github.com/apache/hudi/pull/3228 WIP PR for Add Parquest Log Formats

[jira] [Resolved] (HUDI-2093) Fix empty avro schema path caused by duplicate parameters

2021-07-06 Thread Xianghu Wang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianghu Wang resolved HUDI-2093. Resolution: Resolved Resolved via master : f2621da32f33bf16fb53f8a71eaf7bca24e6d166 > Fix empty avr

[jira] [Assigned] (HUDI-2138) Implement Parquest Data blocks for file inlining

2021-07-06 Thread Rajesh Mahindra (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Mahindra reassigned HUDI-2138: - Assignee: Rajesh Mahindra > Implement Parquest Data blocks for file inlining > --

[jira] [Commented] (HUDI-2009) Fix extra commit metadata in row writer path

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375307#comment-17375307 ] ASF GitHub Bot commented on HUDI-2009: -- vinothchandar commented on a change in pull r

[jira] [Commented] (HUDI-2055) Add metric for time of lastSync

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375305#comment-17375305 ] ASF GitHub Bot commented on HUDI-2055: -- sbernauer commented on a change in pull reque

[jira] [Commented] (HUDI-2093) Fix empty avro schema path caused by duplicate parameters

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375306#comment-17375306 ] ASF GitHub Bot commented on HUDI-2093: -- wangxianghu merged pull request #3177: URL: h

[jira] [Created] (HUDI-2138) Implement Parquest Data blocks for file inlining

2021-07-06 Thread Rajesh Mahindra (Jira)
Rajesh Mahindra created HUDI-2138: - Summary: Implement Parquest Data blocks for file inlining Key: HUDI-2138 URL: https://issues.apache.org/jira/browse/HUDI-2138 Project: Apache Hudi Issue Ty

[GitHub] [hudi] vinothchandar commented on a change in pull request #3075: [HUDI-2009] Fixing extra commit metadata in row writer path

2021-07-06 Thread GitBox
vinothchandar commented on a change in pull request #3075: URL: https://github.com/apache/hudi/pull/3075#discussion_r664291612 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/HoodieSparkSqlWriter.scala ## @@ -337,6 +337,11 @@ object HoodieSparkSql

[hudi] branch master updated (60e0254 -> f2621da)

2021-07-06 Thread wangxianghu
This is an automated email from the ASF dual-hosted git repository. wangxianghu pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 60e0254 [HUDI-1996] Adding functionality to allow the providing of basic auth creds for confluent cloud schema r

[GitHub] [hudi] wangxianghu merged pull request #3177: [HUDI-2093] Fix empty avro schema path caused by duplicate parameters

2021-07-06 Thread GitBox
wangxianghu merged pull request #3177: URL: https://github.com/apache/hudi/pull/3177

[GitHub] [hudi] sbernauer commented on a change in pull request #3129: [HUDI-2055] Added metric for time of lastSync

2021-07-06 Thread GitBox
sbernauer commented on a change in pull request #3129: URL: https://github.com/apache/hudi/pull/3129#discussion_r664294219 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/DeltaSync.java ## @@ -289,6 +289,8 @@ public void refreshTimeline() thr

[jira] [Updated] (HUDI-2029) Implement compression for DiskBasedMap in Spillable Map

2021-07-06 Thread Rajesh Mahindra (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Mahindra updated HUDI-2029: -- Status: In Progress (was: Open) > Implement compression for DiskBasedMap in Spillable Map > ---

[jira] [Closed] (HUDI-2028) Implement RockDbBasedMap as an alternate to DiskBasedMap in SpillableMap

2021-07-06 Thread Rajesh Mahindra (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Mahindra closed HUDI-2028. - Resolution: Fixed > Implement RockDbBasedMap as an alternate to DiskBasedMap in SpillableMap > ---

[jira] [Commented] (HUDI-1904) Make SchemaProvider spark free and move it to hudi-client-common

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375303#comment-17375303 ] ASF GitHub Bot commented on HUDI-1904: -- wangxianghu commented on pull request #2963:

[GitHub] [hudi] wangxianghu commented on pull request #2963: [HUDI-1904] Introduce SchemaProviderInterface to make SchemaProvider unified

2021-07-06 Thread GitBox
wangxianghu commented on pull request #2963: URL: https://github.com/apache/hudi/pull/2963#issuecomment-874518387 Hi, @vinothchandar, any thoughts on the PR? In this patch, I introduced a `SchemaProviderInterface` as the base class of all `SchemaProvider`s, and it is engine-free. it i

[jira] [Updated] (HUDI-2028) Implement RockDbBasedMap as an alternate to DiskBasedMap in SpillableMap

2021-07-06 Thread Rajesh Mahindra (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Mahindra updated HUDI-2028: -- Status: In Progress (was: Open) > Implement RockDbBasedMap as an alternate to DiskBasedMap in S

[jira] [Commented] (HUDI-2009) Fix extra commit metadata in row writer path

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375302#comment-17375302 ] ASF GitHub Bot commented on HUDI-2009: -- vinothchandar commented on a change in pull r

[GitHub] [hudi] vinothchandar commented on a change in pull request #3075: [HUDI-2009] Fixing extra commit metadata in row writer path

2021-07-06 Thread GitBox
vinothchandar commented on a change in pull request #3075: URL: https://github.com/apache/hudi/pull/3075#discussion_r660152303 ## File path: hudi-spark-datasource/hudi-spark-common/src/main/java/org/apache/hudi/DataSourceUtils.java ## @@ -111,6 +112,18 @@ public static HoodieR

[jira] [Commented] (HUDI-1105) Bulk insert dataset - Dedup

2021-07-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375301#comment-17375301 ] ASF GitHub Bot commented on HUDI-1105: -- vinothchandar commented on a change in pull r

[GitHub] [hudi] vinothchandar commented on a change in pull request #2206: [HUDI-1105] Adding dedup support for Bulk Insert w/ Rows

2021-07-06 Thread GitBox
vinothchandar commented on a change in pull request #2206: URL: https://github.com/apache/hudi/pull/2206#discussion_r664284205 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieWriteConfig.java ## @@ -98,6 +98,8 @@ public static final S
