[GitHub] [hudi] yanghua commented on pull request #2271: [HUDI-1335] Introduce FlinkHoodieSimpleIndex to hudi-flink-client

2021-01-29 Thread GitBox
yanghua commented on pull request #2271: URL: https://github.com/apache/hudi/pull/2271#issuecomment-770146052 @wangxianghu Please fix the conflicting file. This is an automated message from the Apache Git Service. To respond

[GitHub] [hudi] yanghua commented on pull request #2337: [HUDI-982] Flink support mor table

2021-01-29 Thread GitBox
yanghua commented on pull request #2337: URL: https://github.com/apache/hudi/pull/2337#issuecomment-770142516 conflicting files should be fixed

[GitHub] [hudi] yanghua commented on a change in pull request #2443: [HUDI-1269] Make whether the failure of connect hive affects hudi ingest process configurable

2021-01-29 Thread GitBox
yanghua commented on a change in pull request #2443: URL: https://github.com/apache/hudi/pull/2443#discussion_r567184581 ## File path: hudi-spark-datasource/hudi-spark-common/src/main/java/org/apache/hudi/DataSourceUtils.java ## @@ -293,6 +293,8 @@ public static HiveSyncConfig

[hudi] branch master updated: [MINOR] Quickstart.generateUpdates method add check (#2505)

2021-01-29 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 5d053b4 [MINOR] Quickstart.generateUpdates meth

[GitHub] [hudi] yanghua merged pull request #2505: [MINOR] Quickstart.generateUpdates method add check

2021-01-29 Thread GitBox
yanghua merged pull request #2505: URL: https://github.com/apache/hudi/pull/2505

[GitHub] [hudi] yanghua commented on pull request #2505: [MINOR] Quickstart.generateUpdates method add check

2021-01-29 Thread GitBox
yanghua commented on pull request #2505: URL: https://github.com/apache/hudi/pull/2505#issuecomment-770140718 > @wangxianghu You can try to merge after the CI is OK. Since you do not have merge permission yet, I'll merge it right now.

[GitHub] [hudi] prashantwason commented on pull request #2496: [HUDI-1554] Introduced buffering for streams in HUDI.

2021-01-29 Thread GitBox
prashantwason commented on pull request #2496: URL: https://github.com/apache/hudi/pull/2496#issuecomment-770134465 @n3nash I have implemented the on/off config. PTAL and approve.

[GitHub] [hudi] prashantwason commented on pull request #2496: [HUDI-1554] Introduced buffering for streams in HUDI.

2021-01-29 Thread GitBox
prashantwason commented on pull request #2496: URL: https://github.com/apache/hudi/pull/2496#issuecomment-770133276 > On passing configs, the way I can think of is to transfer the values from writeConfig to the hadoop configuration object. Implemented this. @vinothchandar PTAL

[GitHub] [hudi] prashantwason commented on a change in pull request #2496: [HUDI-1554] Introduced buffering for streams in HUDI.

2021-01-29 Thread GitBox
prashantwason commented on a change in pull request #2496: URL: https://github.com/apache/hudi/pull/2496#discussion_r567157772 ## File path: hudi-common/src/main/java/org/apache/hudi/common/fs/HoodieWrapperFileSystem.java ## @@ -192,76 +233,110 @@ public FSDataOutputStream cre

[GitHub] [hudi] prashantwason commented on a change in pull request #2496: [HUDI-1554] Introduced buffering for streams in HUDI.

2021-01-29 Thread GitBox
prashantwason commented on a change in pull request #2496: URL: https://github.com/apache/hudi/pull/2496#discussion_r567157271 ## File path: hudi-common/src/main/java/org/apache/hudi/common/fs/HoodieWrapperFileSystem.java ## @@ -79,8 +82,16 @@ public static void setMetricsRegi

[GitHub] [hudi] prashantwason commented on a change in pull request #2496: [HUDI-1554] Introduced buffering for streams in HUDI.

2021-01-29 Thread GitBox
prashantwason commented on a change in pull request #2496: URL: https://github.com/apache/hudi/pull/2496#discussion_r567157247 ## File path: hudi-common/src/main/java/org/apache/hudi/common/fs/HoodieWrapperFileSystem.java ## @@ -192,76 +233,110 @@ public FSDataOutputStream cre

[GitHub] [hudi] codecov-io edited a comment on pull request #2485: [HUDI-1109] Support Spark Structured Streaming read from Hudi table

2021-01-29 Thread GitBox
codecov-io edited a comment on pull request #2485: URL: https://github.com/apache/hudi/pull/2485#issuecomment-766519181 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2485?src=pr&el=h1) Report > Merging [#2485](https://codecov.io/gh/apache/hudi/pull/2485?src=pr&el=desc) (eedd49b) in

[GitHub] [hudi] codecov-io commented on pull request #2497: [HUDI-1550] Incorrect query result for MOR table when merge base data…

2021-01-29 Thread GitBox
codecov-io commented on pull request #2497: URL: https://github.com/apache/hudi/pull/2497#issuecomment-770090356 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2497?src=pr&el=h1) Report > Merging [#2497](https://codecov.io/gh/apache/hudi/pull/2497?src=pr&el=desc) (7b3d36e) into [ma

[GitHub] [hudi] vinothchandar commented on issue #2470: [SUPPORT] Heavy skew in ListingBasedRollbackHelper

2021-01-29 Thread GitBox
vinothchandar commented on issue #2470: URL: https://github.com/apache/hudi/issues/2470#issuecomment-770029110 Thanks @jtmzheng. With 0.7.0 and `hoodie.metadata.enable=true`, this should be much faster to go over the file listings. Marker-based rollback avoids that altogether and can effi
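
A minimal sketch of how the two settings mentioned in this comment might appear in a writer's option map. Only `hoodie.metadata.enable` is named in the message. The key `hoodie.rollback.using.markers` is an assumption about the name of the marker-based rollback switch and should be checked against the Hudi configuration reference for the release in use. The table name is a placeholder.

```python
# Hypothetical option map for a Hudi write; values are illustrative only.
hudi_write_options = {
    # Placeholder table name, not taken from the thread.
    "hoodie.table.name": "example_table",
    # Named in the comment above: serve file listings from the metadata table.
    "hoodie.metadata.enable": "true",
    # Assumed key for marker-based rollback; verify before relying on it.
    "hoodie.rollback.using.markers": "true",
}

# Print the options in key=value form, as they would appear in a properties file.
for key, value in sorted(hudi_write_options.items()):
    print(f"{key}={value}")
```

With pyspark, such a map would typically be passed through `.options(**hudi_write_options)` on the DataFrame writer.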

[GitHub] [hudi] vinothchandar commented on pull request #2496: [HUDI-1554] Introduced buffering for streams in HUDI.

2021-01-29 Thread GitBox
vinothchandar commented on pull request #2496: URL: https://github.com/apache/hudi/pull/2496#issuecomment-770018126 Once we fix CI and the minor stuff, we can land

[GitHub] [hudi] vinothchandar commented on a change in pull request #2496: [HUDI-1554] Introduced buffering for streams in HUDI.

2021-01-29 Thread GitBox
vinothchandar commented on a change in pull request #2496: URL: https://github.com/apache/hudi/pull/2496#discussion_r567056507 ## File path: hudi-common/src/main/java/org/apache/hudi/common/fs/HoodieWrapperFileSystem.java ## @@ -79,8 +82,16 @@ public static void setMetricsRegi

[GitHub] [hudi] vinothchandar commented on pull request #2496: [HUDI-1554] Introduced buffering for streams in HUDI.

2021-01-29 Thread GitBox
vinothchandar commented on pull request #2496: URL: https://github.com/apache/hudi/pull/2496#issuecomment-770011435 cc @umehrot2 would this additional buffering pose inefficiencies for S3 FileSystem? TL;DR HDFS's `DistributedFileSystem` does not buffer reads, neither does the parquet reade
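
The concern about unbuffered reads can be illustrated with a small, self-contained sketch. This is not the code from the pull request and does not use Hadoop or Hudi APIs; it uses Python's standard `io` module as a stand-in, and the names `CountingRaw` and `count_underlying_reads` are hypothetical. It counts how many read calls reach the underlying stream when a caller reads one byte at a time, with and without a buffering wrapper.

```python
import io


class CountingRaw(io.RawIOBase):
    """In-memory raw stream that counts how many reads reach it."""

    def __init__(self, data: bytes):
        super().__init__()
        self._data = data
        self._pos = 0
        self.calls = 0

    def readable(self) -> bool:
        return True

    def readinto(self, b) -> int:
        # Every call here stands in for one trip to the underlying file system.
        self.calls += 1
        chunk = self._data[self._pos:self._pos + len(b)]
        n = len(chunk)
        if n:
            b[:n] = chunk
            self._pos += n
        return n  # 0 signals end of stream


def count_underlying_reads(buffer_size: int, payload: bytes) -> int:
    """Read payload one byte at a time; return the number of raw reads issued.

    A buffer_size of 0 means no buffering wrapper is used.
    """
    raw = CountingRaw(payload)
    stream = raw if buffer_size == 0 else io.BufferedReader(raw, buffer_size=buffer_size)
    while stream.read(1):
        pass
    return raw.calls


if __name__ == "__main__":
    payload = bytes(4096)
    print("unbuffered raw reads:", count_underlying_reads(0, payload))
    print("buffered raw reads:  ", count_underlying_reads(1024, payload))
```

The buffered variant issues far fewer reads against the underlying stream, which is the effect the pull request is after. Whether an extra buffering layer helps or hurts on an object store such as S3 is the open question raised in the comment, and this sketch does not answer it.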

[GitHub] [hudi] n3nash commented on a change in pull request #2496: [HUDI-1554] Introduced buffering for streams in HUDI.

2021-01-29 Thread GitBox
n3nash commented on a change in pull request #2496: URL: https://github.com/apache/hudi/pull/2496#discussion_r567045896 ## File path: hudi-common/src/main/java/org/apache/hudi/common/fs/HoodieWrapperFileSystem.java ## @@ -90,11 +94,18 @@ public static void setMetricsRegistry(R

[jira] [Created] (HUDI-1566) Typo in account request caused wrong name in Apache id

2021-01-29 Thread Craig L Russell (Jira)
Craig L Russell created HUDI-1566: - Summary: Typo in account request caused wrong name in Apache id Key: HUDI-1566 URL: https://issues.apache.org/jira/browse/HUDI-1566 Project: Apache Hudi Is

[GitHub] [hudi] stackfun commented on issue #2367: [SUPPORT] Seek error when querying MOR Tables in GCP

2021-01-29 Thread GitBox
stackfun commented on issue #2367: URL: https://github.com/apache/hudi/issues/2367#issuecomment-769977226 I'll test this patch in the next few days. Wondering what level of testing is done on GCP before a release?

[hudi] branch master updated: [HUDI-1266] Add unit test for validating replacecommit rollback (#2418)

2021-01-29 Thread satish
satish pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 9cb6cb8 [HUDI-1266] Add unit test for validating

[GitHub] [hudi] satishkotha merged pull request #2418: [HUDI-1266] Add unit test for validating replacecommit rollback

2021-01-29 Thread GitBox
satishkotha merged pull request #2418: URL: https://github.com/apache/hudi/pull/2418

[hudi] branch master updated (23f2ef3 -> 2d2d5c8)

2021-01-29 Thread satish
satish pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 23f2ef3 [HUDI-623] Remove UpgradePayloadFromUberToApache (#2455) add 2d2d5c8 [HUDI-1555] Remove isEmpty to impro

[GitHub] [hudi] satishkotha merged pull request #2502: [HUDI-1555] Remove isEmpty to improve clustering execution performance

2021-01-29 Thread GitBox
satishkotha merged pull request #2502: URL: https://github.com/apache/hudi/pull/2502

[GitHub] [hudi] jtmzheng commented on issue #2470: [SUPPORT] Heavy skew in ListingBasedRollbackHelper

2021-01-29 Thread GitBox
jtmzheng commented on issue #2470: URL: https://github.com/apache/hudi/issues/2470#issuecomment-769948718 Sorry for the delay, I believe the slowness was because compaction wasn't keeping up with the number of files (we partition by date and we have many partitions updated with a small num

[GitHub] [hudi] vburenin commented on a change in pull request #2476: [HUDI-1538] Try to init class trying different signatures instead of checking its name

2021-01-29 Thread GitBox
vburenin commented on a change in pull request #2476: URL: https://github.com/apache/hudi/pull/2476#discussion_r566977565 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/UtilHelpers.java ## @@ -96,19 +94,21 @@ private static final Logger LOG = LogManage

[GitHub] [hudi] nsivabalan commented on a change in pull request #2497: [HUDI-1550] Incorrect query result for MOR table when merge base data…

2021-01-29 Thread GitBox
nsivabalan commented on a change in pull request #2497: URL: https://github.com/apache/hudi/pull/2497#discussion_r566968882 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/HoodieTableMetaClient.java ## @@ -328,38 +328,62 @@ public synchronized HoodieArchiv

[GitHub] [hudi] nsivabalan commented on a change in pull request #2497: [HUDI-1550] Incorrect query result for MOR table when merge base data…

2021-01-29 Thread GitBox
nsivabalan commented on a change in pull request #2497: URL: https://github.com/apache/hudi/pull/2497#discussion_r566961281 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/HoodieTableMetaClient.java ## @@ -328,38 +328,62 @@ public synchronized HoodieArchiv

[GitHub] [hudi] codecov-io edited a comment on pull request #2505: [MINOR] Quickstart.generateUpdates method add check

2021-01-29 Thread GitBox
codecov-io edited a comment on pull request #2505: URL: https://github.com/apache/hudi/pull/2505#issuecomment-769557231 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2505?src=pr&el=h1) Report > Merging [#2505](https://codecov.io/gh/apache/hudi/pull/2505?src=pr&el=desc) (453c184) in

[GitHub] [hudi] nsivabalan commented on issue #1737: [SUPPORT]spark streaming create small parquet files

2021-01-29 Thread GitBox
nsivabalan commented on issue #1737: URL: https://github.com/apache/hudi/issues/1737#issuecomment-769920342 Cool. @kimberlyamandalu: can you please close this ticket out if you don't have any more questions?

[GitHub] [hudi] nsivabalan edited a comment on issue #2498: [SUPPORT] Hudi MERGE_ON_READ load to dataframe fails for the versions [0.6.0],[0.7.0] and runs for [0.5.3]

2021-01-29 Thread GitBox
nsivabalan edited a comment on issue #2498: URL: https://github.com/apache/hudi/issues/2498#issuecomment-769918590 @zafer-sahin: not sure if it's some env issue. Were you able to run the pyspark examples given in the [quick start](https://hudi.apache.org/docs/quick-start-guide.html)? If that w

[GitHub] [hudi] nsivabalan commented on issue #2498: [SUPPORT] Hudi MERGE_ON_READ load to dataframe fails for the versions [0.6.0],[0.7.0] and runs for [0.5.3]

2021-01-29 Thread GitBox
nsivabalan commented on issue #2498: URL: https://github.com/apache/hudi/issues/2498#issuecomment-769918590 @zafer-sahin: not sure if it's some env issue. Were you able to run the pyspark examples given in the [quick start](https://hudi.apache.org/docs/quick-start-guide.html)? If that works, b

[jira] [Updated] (HUDI-1565) Document all maven commands

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1565: -- Labels: user-support-issues (was: ) > Document all maven commands > ---

[jira] [Created] (HUDI-1565) Document all maven commands

2021-01-29 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-1565: - Summary: Document all maven commands Key: HUDI-1565 URL: https://issues.apache.org/jira/browse/HUDI-1565 Project: Apache Hudi Issue Type: Improveme

[jira] [Created] (HUDI-1564) Blog: Dfs -> Hudi followed by Kafka to Hudi

2021-01-29 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-1564: - Summary: Blog: Dfs -> Hudi followed by Kafka to Hudi Key: HUDI-1564 URL: https://issues.apache.org/jira/browse/HUDI-1564 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-1564) Blog: Dfs -> Hudi followed by Kafka to Hudi

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1564: -- Labels: user-support-issues (was: ) > Blog: Dfs -> Hudi followed by Kafka to Hudi > ---

[jira] [Updated] (HUDI-1563) Documentation on small file handling

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1563: -- Labels: user-support-issues (was: ) > Documentation on small file handling > --

[jira] [Created] (HUDI-1563) Documentation on small file handling

2021-01-29 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-1563: - Summary: Documentation on small file handling Key: HUDI-1563 URL: https://issues.apache.org/jira/browse/HUDI-1563 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-1562) Delta streamer checkpointing documentation

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1562: -- Labels: user-support-issues (was: ) > Delta streamer checkpointing documentation >

[jira] [Created] (HUDI-1562) Delta streamer checkpointing documentation

2021-01-29 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-1562: - Summary: Delta streamer checkpointing documentation Key: HUDI-1562 URL: https://issues.apache.org/jira/browse/HUDI-1562 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-1561) Documentation on every hudi-cli command

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1561: -- Labels: user-support-issues (was: ) > Documentation on every hudi-cli command > ---

[jira] [Created] (HUDI-1561) Documentation on every hudi-cli command

2021-01-29 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-1561: - Summary: Documentation on every hudi-cli command Key: HUDI-1561 URL: https://issues.apache.org/jira/browse/HUDI-1561 Project: Apache Hudi Issue Typ

[jira] [Updated] (HUDI-1560) How to deduce bloom filter configs

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1560: -- Labels: user-support-issues (was: ) > How to deduce bloom filter configs >

[jira] [Created] (HUDI-1560) How to deduce bloom filter configs

2021-01-29 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-1560: - Summary: How to deduce bloom filter configs Key: HUDI-1560 URL: https://issues.apache.org/jira/browse/HUDI-1560 Project: Apache Hudi Issue Type: Im

[jira] [Updated] (HUDI-1559) Document how to detect record size

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1559: -- Labels: user-support-issues (was: ) > Document how to detect record size >

[jira] [Created] (HUDI-1559) Document how to detect record size

2021-01-29 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-1559: - Summary: Document how to detect record size Key: HUDI-1559 URL: https://issues.apache.org/jira/browse/HUDI-1559 Project: Apache Hudi Issue Type: Im

[jira] [Resolved] (HUDI-825) Write a small blog on how to use hudi-spark with pyspark

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-825. -- Fix Version/s: 0.5.2 Resolution: Fixed > Write a small blog on how to use hudi-spa

[jira] [Updated] (HUDI-259) Hadoop 3 support for Hudi writing

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-259: - Status: In Progress (was: Open) > Hadoop 3 support for Hudi writing >

[jira] [Resolved] (HUDI-259) Hadoop 3 support for Hudi writing

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-259. -- Fix Version/s: (was: 0.8.0) 0.7.0 Resolution: Fixed > Hadoo

[jira] [Resolved] (HUDI-776) Document community support triage process

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-776. -- Fix Version/s: 0.5.2 Resolution: Fixed > Document community support triage process

[jira] [Updated] (HUDI-776) Document community support triage process

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-776: - Status: In Progress (was: Open) > Document community support triage process > ---

[jira] [Updated] (HUDI-776) Document community support triage process

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-776: - Status: Open (was: New) > Document community support triage process > ---

[jira] [Updated] (HUDI-825) Write a small blog on how to use hudi-spark with pyspark

2021-01-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-825: - Status: In Progress (was: Open) > Write a small blog on how to use hudi-spark with pyspark

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2497: [HUDI-1550] Incorrect query result for MOR table when merge base data…

2021-01-29 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2497: URL: https://github.com/apache/hudi/pull/2497#discussion_r566917341 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/MergeOnReadSnapshotRelation.scala ## @@ -50,7 +50,8 @@ case class Hoodi

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2497: [HUDI-1550] Incorrect query result for MOR table when merge base data…

2021-01-29 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2497: URL: https://github.com/apache/hudi/pull/2497#discussion_r566917206 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/MergeOnReadIncrementalRelation.scala ## @@ -78,7 +78,16 @@ class MergeO

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2497: [HUDI-1550] Incorrect query result for MOR table when merge base data…

2021-01-29 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2497: URL: https://github.com/apache/hudi/pull/2497#discussion_r566913898 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/HoodieMergeOnReadRDD.scala ## @@ -18,6 +18,8 @@ package org.apache.h

[GitHub] [hudi] kimberlyamandalu commented on issue #1737: [SUPPORT]spark streaming create small parquet files

2021-01-29 Thread GitBox
kimberlyamandalu commented on issue #1737: URL: https://github.com/apache/hudi/issues/1737#issuecomment-769882987 @vinothchandar @nsivabalan I just tested cleaner in Hudi 0.7.0 and can confirm that I see it being triggered and cleaning old file slices. Thank you!

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2497: [HUDI-1550] Incorrect query result for MOR table when merge base data…

2021-01-29 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2497: URL: https://github.com/apache/hudi/pull/2497#discussion_r566877440 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/MergeOnReadIncrementalRelation.scala ## @@ -78,7 +78,16 @@ class MergeO

[GitHub] [hudi] codecov-io edited a comment on pull request #2506: [HUDI-1557] Make Flink write pipeline write task scalable

2021-01-29 Thread GitBox
codecov-io edited a comment on pull request #2506: URL: https://github.com/apache/hudi/pull/2506#issuecomment-769390677 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2506?src=pr&el=h1) Report > Merging [#2506](https://codecov.io/gh/apache/hudi/pull/2506?src=pr&el=desc) (fb4e0f8) in

[GitHub] [hudi] yanghua commented on pull request #2506: [HUDI-1557] Make Flink write pipeline write task scalable

2021-01-29 Thread GitBox
yanghua commented on pull request #2506: URL: https://github.com/apache/hudi/pull/2506#issuecomment-769764823 @danny0405 ping us, when you are ready to review.

[GitHub] [hudi] yanghua commented on pull request #2505: [MINOR] Quickstart.generateUpdates method add check

2021-01-29 Thread GitBox
yanghua commented on pull request #2505: URL: https://github.com/apache/hudi/pull/2505#issuecomment-769762431 @wangxianghu You can try to merge after the CI is OK.

[GitHub] [hudi] codecov-io edited a comment on pull request #2485: [HUDI-1109] Support Spark Structured Streaming read from Hudi table

2021-01-29 Thread GitBox
codecov-io edited a comment on pull request #2485: URL: https://github.com/apache/hudi/pull/2485#issuecomment-766519181 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2485?src=pr&el=h1) Report > Merging [#2485](https://codecov.io/gh/apache/hudi/pull/2485?src=pr&el=desc) (f9a4121) in

[GitHub] [hudi] zafer-sahin edited a comment on issue #2498: [SUPPORT] Hudi MERGE_ON_READ load to dataframe fails for the versions [0.6.0],[0.7.0] and runs for [0.5.3]

2021-01-29 Thread GitBox
zafer-sahin edited a comment on issue #2498: URL: https://github.com/apache/hudi/issues/2498#issuecomment-769721260 Hi, I am still getting a similar error at the time of reading. >>> hudi_options_insert = { ... "hoodie.table.name": "the_table_name", ... "hoodie.d

[GitHub] [hudi] zafer-sahin commented on issue #2498: [SUPPORT] Hudi MERGE_ON_READ load to dataframe fails for the versions [0.6.0],[0.7.0] and runs for [0.5.3]

2021-01-29 Thread GitBox
zafer-sahin commented on issue #2498: URL: https://github.com/apache/hudi/issues/2498#issuecomment-769721260 Hi, I am still getting a similar error. >>> hudi_options_insert = { ... "hoodie.table.name": "the_table_name", ... "hoodie.datasource.write.storage.type":

[GitHub] [hudi] lw309637554 commented on pull request #2502: [HUDI-1555] Remove isEmpty to improve clustering execution performance

2021-01-29 Thread GitBox
lw309637554 commented on pull request #2502: URL: https://github.com/apache/hudi/pull/2502#issuecomment-769697170 LGTM

[GitHub] [hudi] lw309637554 commented on a change in pull request #2502: [HUDI-1555] Remove isEmpty to improve clustering execution performance

2021-01-29 Thread GitBox
lw309637554 commented on a change in pull request #2502: URL: https://github.com/apache/hudi/pull/2502#discussion_r566695943 ## File path: hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/hudi/functional/TestStructuredStreaming.scala ## @@ -243,17 +243,24 @@ class Te

[GitHub] [hudi] kirkuz commented on issue #2323: [SUPPORT] GLOBAL_BLOOM index significantly slowing down processing time

2021-01-29 Thread GitBox
kirkuz commented on issue #2323: URL: https://github.com/apache/hudi/issues/2323#issuecomment-769682139 Yes, feel free to close it. I'll check that in the new release.

[GitHub] [hudi] kirkuz closed issue #2323: [SUPPORT] GLOBAL_BLOOM index significantly slowing down processing time

2021-01-29 Thread GitBox
kirkuz closed issue #2323: URL: https://github.com/apache/hudi/issues/2323