[GitHub] [hudi] Liulietong commented on a change in pull request #2584: [Hudi-1583]: Fix bug that Hudi will skip remaining log files if there is logFile with zero size in logFileList when merge on rea

2021-02-25 Thread GitBox
Liulietong commented on a change in pull request #2584: URL: https://github.com/apache/hudi/pull/2584#discussion_r583434679 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/log/HoodieLogFormatReader.java ## @@ -104,7 +104,7 @@ public boolean hasNext() {

[GitHub] [hudi] Liulietong commented on a change in pull request #2584: [Hudi-1583]: Fix bug that Hudi will skip remaining log files if there is logFile with zero size in logFileList when merge on rea

2021-02-25 Thread GitBox
Liulietong commented on a change in pull request #2584: URL: https://github.com/apache/hudi/pull/2584#discussion_r583434679 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/log/HoodieLogFormatReader.java ## @@ -104,7 +104,7 @@ public boolean hasNext() {

[GitHub] [hudi] codecov-io edited a comment on pull request #2607: [HUDI-1643] Hudi observability - framework to report stats from execu…

2021-02-25 Thread GitBox
codecov-io edited a comment on pull request #2607: URL: https://github.com/apache/hudi/pull/2607#issuecomment-786454326 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2607?src=pr&el=h1) Report > Merging [#2607](https://codecov.io/gh/apache/hudi/pull/2607?src=pr&el=desc) (ab93c26) in

[GitHub] [hudi] codecov-io commented on pull request #2607: [HUDI-1643] Hudi observability - framework to report stats from execu…

2021-02-25 Thread GitBox
codecov-io commented on pull request #2607: URL: https://github.com/apache/hudi/pull/2607#issuecomment-786454326 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2607?src=pr&el=h1) Report > Merging [#2607](https://codecov.io/gh/apache/hudi/pull/2607?src=pr&el=desc) (ab93c26) into [ma

[GitHub] [hudi] t0il3ts0ap edited a comment on issue #2589: [SUPPORT] Issue with adding column while running deltastreamer with kafka source.

2021-02-25 Thread GitBox
t0il3ts0ap edited a comment on issue #2589: URL: https://github.com/apache/hudi/issues/2589#issuecomment-786029402 @satishkotha Ran again on fresh table, still same issue. SparkSubmit: ``` spark-submit --master yarn --packages org.apache.spark:spark-avro_2.12:3.0.1,org

[GitHub] [hudi] liujinhui1994 commented on pull request #2438: [HUDI-1447] DeltaStreamer kafka source supports consuming from specified timestamp

2021-02-25 Thread GitBox
liujinhui1994 commented on pull request #2438: URL: https://github.com/apache/hudi/pull/2438#issuecomment-786452099 I will add the unit test, and then please review This is an automated message from the Apache Git Service. To

[jira] [Updated] (HUDI-1643) [Hudi Observability] Framework for reporting stats from executors

2021-02-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1643: - Labels: pull-request-available (was: ) > [Hudi Observability] Framework for reporting stats from

[GitHub] [hudi] yanghua commented on a change in pull request #2596: [HUDI-1636] Support Builder Pattern To Build Table Properties For Hoo…

2021-02-25 Thread GitBox
yanghua commented on a change in pull request #2596: URL: https://github.com/apache/hudi/pull/2596#discussion_r583407438 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/HoodieTableConfig.java ## @@ -258,4 +260,142 @@ public String getArchivelogFolder() {

[GitHub] [hudi] nbalajee opened a new pull request #2607: [HUDI-1643] Hudi observability - framework to report stats from execu…

2021-02-25 Thread GitBox
nbalajee opened a new pull request #2607: URL: https://github.com/apache/hudi/pull/2607 …tors ## What is the purpose of the pull request Frame work for collecting Hudi Observability stats from the executors. ## Brief change log - Using distributed registry, report sta

[GitHub] [hudi] xushiyan commented on a change in pull request #2541: [HUDI-1587] Add latency and freshness support

2021-02-25 Thread GitBox
xushiyan commented on a change in pull request #2541: URL: https://github.com/apache/hudi/pull/2541#discussion_r583403910 ## File path: hudi-common/src/main/java/org/apache/hudi/common/util/DateTimeUtils.java ## @@ -0,0 +1,41 @@ +/* + * Licensed to the Apache Software Foundati

[hudi] branch asf-site updated: Travis CI build asf-site

2021-02-25 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new c5d50f0 Travis CI build asf-site c5d50f0 is d

[jira] [Created] (HUDI-1643) [Hudi Observability] Framework for reporting stats from executors

2021-02-25 Thread Balajee Nagasubramaniam (Jira)
Balajee Nagasubramaniam created HUDI-1643: - Summary: [Hudi Observability] Framework for reporting stats from executors Key: HUDI-1643 URL: https://issues.apache.org/jira/browse/HUDI-1643 Proje

[hudi] branch asf-site updated: [MINOR] Fixing slideshare link (#2606)

2021-02-25 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 1dd7a41 [MINOR] Fixing slideshare link (#2606

[GitHub] [hudi] vinothchandar merged pull request #2606: [MINOR] Fixing slideshare link

2021-02-25 Thread GitBox
vinothchandar merged pull request #2606: URL: https://github.com/apache/hudi/pull/2606 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [hudi] xushiyan commented on a change in pull request #2541: [HUDI-1587] Add latency and freshness support

2021-02-25 Thread GitBox
xushiyan commented on a change in pull request #2541: URL: https://github.com/apache/hudi/pull/2541#discussion_r583402182 ## File path: hudi-common/src/main/java/org/apache/hudi/common/model/DefaultHoodieRecordPayload.java ## @@ -37,6 +39,9 @@ */ public class DefaultHoodieR

[GitHub] [hudi] satishkotha commented on a change in pull request #2584: [Hudi-1583]: Fix bug that Hudi will skip remaining log files if there is logFile with zero size in logFileList when merge on re

2021-02-25 Thread GitBox
satishkotha commented on a change in pull request #2584: URL: https://github.com/apache/hudi/pull/2584#discussion_r583398027 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/log/HoodieLogFormatReader.java ## @@ -104,7 +104,7 @@ public boolean hasNext() {

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2596: [HUDI-1636] Support Builder Pattern To Build Table Properties For Hoo…

2021-02-25 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2596: URL: https://github.com/apache/hudi/pull/2596#discussion_r583385132 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/TableCommand.java ## @@ -106,10 +106,13 @@ public String createTable( throw n

[GitHub] [hudi] n3nash commented on pull request #2606: [MINOR] Fixing slideshare link

2021-02-25 Thread GitBox
n3nash commented on pull request #2606: URL: https://github.com/apache/hudi/pull/2606#issuecomment-786408716 Verified locally that it works. This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [hudi] n3nash opened a new pull request #2606: [MINOR] Fixing slideshare link

2021-02-25 Thread GitBox
n3nash opened a new pull request #2606: URL: https://github.com/apache/hudi/pull/2606 Fixing broken link ## Committer checklist - [ ] Has a corresponding JIRA in PR title & commit - [ ] Commit message is descriptive of the change - [ ] CI is green - [

[GitHub] [hudi] danny0405 commented on a change in pull request #2600: [HUDI-1638] Some improvements to BucketAssignFunction

2021-02-25 Thread GitBox
danny0405 commented on a change in pull request #2600: URL: https://github.com/apache/hudi/pull/2600#discussion_r583369329 ## File path: hudi-flink/src/main/java/org/apache/hudi/operator/partitioner/BucketAssignFunction.java ## @@ -136,15 +137,10 @@ public void open(Configurat

[GitHub] [hudi] hk-lrzy commented on a change in pull request #2600: [HUDI-1638] Some improvements to BucketAssignFunction

2021-02-25 Thread GitBox
hk-lrzy commented on a change in pull request #2600: URL: https://github.com/apache/hudi/pull/2600#discussion_r583366344 ## File path: hudi-flink/src/main/java/org/apache/hudi/operator/partitioner/BucketAssignFunction.java ## @@ -136,15 +137,10 @@ public void open(Configuratio

[GitHub] [hudi] hk-lrzy closed pull request #2604: [hudi-1639][hudi-flink] fix BucketAssigner npe

2021-02-25 Thread GitBox
hk-lrzy closed pull request #2604: URL: https://github.com/apache/hudi/pull/2604 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[hudi] branch asf-site updated: Travis CI build asf-site

2021-02-25 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 1f101fe Travis CI build asf-site 1f101fe is d

[GitHub] [hudi] codejoyan commented on issue #2592: [SUPPORT] Does latest versions of Hudi (0.7.0, 0.6.0) work with Spark 2.3.0 when reading orc files?

2021-02-25 Thread GitBox
codejoyan commented on issue #2592: URL: https://github.com/apache/hudi/issues/2592#issuecomment-786379571 This is the spark version of the cluster being used at work so I will have to use Spark 2.3 until there is an upgrade. Since the documentation says: **Hudi works with Spark-2.x**, I w

[hudi] branch asf-site updated: [HUDI 1642] Adding Hudi Learning series presentation & Uber eng blog links (#2602)

2021-02-25 Thread lamberken
This is an automated email from the ASF dual-hosted git repository. lamberken pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new eeb146a [HUDI 1642] Adding Hudi Learning s

[GitHub] [hudi] lamber-ken merged pull request #2602: [HUDI 1642] Adding Hudi Learning series presentation & Uber eng blog links

2021-02-25 Thread GitBox
lamber-ken merged pull request #2602: URL: https://github.com/apache/hudi/pull/2602 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] lamber-ken edited a comment on pull request #2602: [HUDI 1642] Adding Hudi Learning series presentation & Uber eng blog links

2021-02-25 Thread GitBox
lamber-ken edited a comment on pull request #2602: URL: https://github.com/apache/hudi/pull/2602#issuecomment-786378892 Thanks @n3nash 👍 LGTM This is an automated message from the Apache Git Service. To respond to the message

[GitHub] [hudi] lamber-ken commented on pull request #2602: [HUDI 1642] Adding Hudi Learning series presentation & Uber eng blog links

2021-02-25 Thread GitBox
lamber-ken commented on pull request #2602: URL: https://github.com/apache/hudi/pull/2602#issuecomment-786378892 Thanks @n3nash 👍 This is an automated message from the Apache Git Service. To respond to the message, please lo

[GitHub] [hudi] lamber-ken commented on a change in pull request #2602: [HUDI 1642] Adding Hudi Learning series presentation & Uber eng blog links

2021-02-25 Thread GitBox
lamber-ken commented on a change in pull request #2602: URL: https://github.com/apache/hudi/pull/2602#discussion_r583352619 ## File path: content/docs/0.5.3-powered_by.html ## @@ -462,13 +462,17 @@ Talks & Presentations https://drive.google.com/open?id=1Pk_WdFxfEZxMMf

[GitHub] [hudi] codecov-io edited a comment on pull request #2596: [HUDI-1636] Support Builder Pattern To Build Table Properties For Hoo…

2021-02-25 Thread GitBox
codecov-io edited a comment on pull request #2596: URL: https://github.com/apache/hudi/pull/2596#issuecomment-784717451 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2596?src=pr&el=h1) Report > Merging [#2596](https://codecov.io/gh/apache/hudi/pull/2596?src=pr&el=desc) (c71fe74) in

[GitHub] [hudi] garyli1019 commented on issue #2498: [SUPPORT] Hudi MERGE_ON_READ load to dataframe fails for the versions [0.6.0],[0.7.0] and runs for [0.5.3]

2021-02-25 Thread GitBox
garyli1019 commented on issue #2498: URL: https://github.com/apache/hudi/issues/2498#issuecomment-786364265 I am seeing the same problem when the compiled spark distribution is different from the runtime spark distribution. Compile hudi jar against the runtime spark distribution should fix

[GitHub] [hudi] garyli1019 commented on pull request #2584: [Hudi-1583]: Fix bug that Hudi will skip remaining log files if there is logFile with zero size in logFileList when merge on read.

2021-02-25 Thread GitBox
garyli1019 commented on pull request #2584: URL: https://github.com/apache/hudi/pull/2584#issuecomment-786361769 hi @satishkotha , this PR seems related to #2583 , would you take a look? This is an automated message from the

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2596: [HUDI-1636] Support Builder Pattern To Build Table Properties For Hoo…

2021-02-25 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2596: URL: https://github.com/apache/hudi/pull/2596#discussion_r583334728 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/TableCommand.java ## @@ -106,10 +106,13 @@ public String createTable( throw n

[GitHub] [hudi] yanghua commented on a change in pull request #2596: [HUDI-1636] Support Builder Pattern To Build Table Properties For Hoo…

2021-02-25 Thread GitBox
yanghua commented on a change in pull request #2596: URL: https://github.com/apache/hudi/pull/2596#discussion_r583313615 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/TableCommand.java ## @@ -106,10 +106,13 @@ public String createTable( throw new Ille

[GitHub] [hudi] n3nash merged pull request #2565: [HUDI-1611] Added a configuration to allow specific directories to be filtered out during Metadata Table bootstrap.

2021-02-25 Thread GitBox
n3nash merged pull request #2565: URL: https://github.com/apache/hudi/pull/2565 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[hudi] branch master updated: [HUDI-1611] Added a configuration to allow specific directories to be filtered out during Metadata Table bootstrap. (#2565)

2021-02-25 Thread nagarwal
This is an automated email from the ASF dual-hosted git repository. nagarwal pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 022df0d [HUDI-1611] Added a configuration to al

[hudi] branch master updated: Fixing README for hudi test suite long running job (#2578)

2021-02-25 Thread nagarwal
This is an automated email from the ASF dual-hosted git repository. nagarwal pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 9f5e8cc Fixing README for hudi test suite long

[GitHub] [hudi] n3nash merged pull request #2578: [MINOR] Fixing Hudi Test suite readme for long running job

2021-02-25 Thread GitBox
n3nash merged pull request #2578: URL: https://github.com/apache/hudi/pull/2578 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[jira] [Created] (HUDI-1642) Add Links to Uber engineering blog and meet up slides

2021-02-25 Thread Nishith Agarwal (Jira)
Nishith Agarwal created HUDI-1642: - Summary: Add Links to Uber engineering blog and meet up slides Key: HUDI-1642 URL: https://issues.apache.org/jira/browse/HUDI-1642 Project: Apache Hudi Iss

[GitHub] [hudi] n3nash commented on a change in pull request #2602: Adding Hudi Learning series presentation & Uber eng blog links

2021-02-25 Thread GitBox
n3nash commented on a change in pull request #2602: URL: https://github.com/apache/hudi/pull/2602#discussion_r583286829 ## File path: content/docs/0.5.3-powered_by.html ## @@ -462,13 +462,17 @@ Talks & Presentations https://drive.google.com/open?id=1Pk_WdFxfEZxMMfAOn0

[GitHub] [hudi] afeldman1 closed issue #2399: [SUPPORT] Hudi deletes not being properly commited

2021-02-25 Thread GitBox
afeldman1 closed issue #2399: URL: https://github.com/apache/hudi/issues/2399 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp

[GitHub] [hudi] afeldman1 commented on issue #2399: [SUPPORT] Hudi deletes not being properly commited

2021-02-25 Thread GitBox
afeldman1 commented on issue #2399: URL: https://github.com/apache/hudi/issues/2399#issuecomment-786204481 Apologies for the delayed response. And thank you to @bvaradar for the initial hint. The issue turned out to be caused not by the keys but another one of the configuration properties.

[GitHub] [hudi] kpurella commented on issue #2240: [SUPPORT] Performance Issue : HUDI MOR ,UPSERT Job running forever

2021-02-25 Thread GitBox
kpurella commented on issue #2240: URL: https://github.com/apache/hudi/issues/2240#issuecomment-786183675 @vinothchandar Sure ,i will !! This is an automated message from the Apache Git Service. To respond to the message, ple

[GitHub] [hudi] toninis commented on issue #2149: Help with Reading Kafka topic written using Debezium Connector - Deltastreamer

2021-02-25 Thread GitBox
toninis commented on issue #2149: URL: https://github.com/apache/hudi/issues/2149#issuecomment-786183618 @vinothchandar Sorry I took so long to respond . It had worked and compiled successfully . I probably had missed something at the time . Thanks for your response at the time .

[jira] [Updated] (HUDI-1641) Issue for Integrating Hudi with Kafka using Avro Schema

2021-02-25 Thread PRASHANT BHOSALE (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] PRASHANT BHOSALE updated HUDI-1641: --- Description: I am trying to integrate Hudi with Kafka topic. teps followed : # Created Kafka

[GitHub] [hudi] t0il3ts0ap edited a comment on issue #2589: [SUPPORT] Issue with adding column while running deltastreamer with kafka source.

2021-02-25 Thread GitBox
t0il3ts0ap edited a comment on issue #2589: URL: https://github.com/apache/hudi/issues/2589#issuecomment-786029402 @satishkotha Ran again on fresh table, still same issue. SparkSubmit: ``` spark-submit --master yarn --packages org.apache.spark:spark-avro_2.12:3.0.1,org

[GitHub] [hudi] t0il3ts0ap commented on issue #2589: [SUPPORT] Issue with adding column while running deltastreamer with kafka source.

2021-02-25 Thread GitBox
t0il3ts0ap commented on issue #2589: URL: https://github.com/apache/hudi/issues/2589#issuecomment-786029402 Ran again on fresh table, still same issue. SparkSubmit: ``` spark-submit --master yarn --packages org.apache.spark:spark-avro_2.12:3.0.1,org.apache.hudi:hudi-ut

[jira] [Resolved] (HUDI-1269) Make whether the failure of connect hive affects hudi ingest process configurable

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-1269. --- Fix Version/s: 0.8.0 Resolution: Fixed > Make whether the failure of connect hi

[hudi] branch master updated: [HUDI-1269] Make whether the failure of connect hive affects hudi ingest process configurable (#2443)

2021-02-25 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 8c2197a [HUDI-1269] Make whether the failure o

[jira] [Updated] (HUDI-1269) Make whether the failure of connect hive affects hudi ingest process configurable

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1269: -- Status: Open (was: New) > Make whether the failure of connect hive affects hudi ingest

[jira] [Updated] (HUDI-1269) Make whether the failure of connect hive affects hudi ingest process configurable

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1269: -- Status: In Progress (was: Open) > Make whether the failure of connect hive affects hudi

[GitHub] [hudi] nsivabalan merged pull request #2443: [HUDI-1269] Make whether the failure of connect hive affects hudi ingest process configurable

2021-02-25 Thread GitBox
nsivabalan merged pull request #2443: URL: https://github.com/apache/hudi/pull/2443 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] Rap70r commented on issue #2586: [SUPPORT] - How to guarantee snapshot isolation when reading Hudi tables in S3?

2021-02-25 Thread GitBox
Rap70r commented on issue #2586: URL: https://github.com/apache/hudi/issues/2586#issuecomment-785941366 Hi nsivabalan, Thank you for your reply. * Incremental updates include both inserts and updates. Mostly updates. * We can try increasing retention version to a higher valu

[GitHub] [hudi] rubenssoto commented on issue #2588: [SUPPORT] Cannot create hive connection

2021-02-25 Thread GitBox
rubenssoto commented on issue #2588: URL: https://github.com/apache/hudi/issues/2588#issuecomment-785933427 @bvaradar I think it is a hive issue, I'm trying to increase hive heap size, I hope it helps. I process the tables in threads, so I have almost 20 hive connections open.

[GitHub] [hudi] bvaradar commented on issue #2448: [SUPPORT] deltacommit for client 172.16.116.102 already exists

2021-02-25 Thread GitBox
bvaradar commented on issue #2448: URL: https://github.com/apache/hudi/issues/2448#issuecomment-785932137 @root18039532923 : Please look at https://hudi.apache.org/blog/async-compaction-deployment-model/ for running async compactions --

[GitHub] [hudi] bvaradar commented on issue #2555: [SUPPORT] Trying and Understanding Clustering

2021-02-25 Thread GitBox
bvaradar commented on issue #2555: URL: https://github.com/apache/hudi/issues/2555#issuecomment-785930358 For bulkinsert, you need to size the parallelism to control. Please see https://cwiki.apache.org/confluence/display/HUDI/FAQ#FAQ-Whatperformance/ingestlatencycanIexpectforHudiwriting

[GitHub] [hudi] bvaradar commented on issue #2588: [SUPPORT] Cannot create hive connection

2021-02-25 Thread GitBox
bvaradar commented on issue #2588: URL: https://github.com/apache/hudi/issues/2588#issuecomment-785928870 @rubenssoto : The stack-trace does not contain Hudi in it. So, I dont know how to help in this regard. Regarding high cpu load on hive server, Are you also running hive queries apart f

[jira] [Created] (HUDI-1641) Issue for Integrating Hudi with Kafka using Avro Schema

2021-02-25 Thread PRASHANT BHOSALE (Jira)
PRASHANT BHOSALE created HUDI-1641: -- Summary: Issue for Integrating Hudi with Kafka using Avro Schema Key: HUDI-1641 URL: https://issues.apache.org/jira/browse/HUDI-1641 Project: Apache Hudi

[GitHub] [hudi] bvaradar commented on a change in pull request #2520: [HUDI-1446] Support skip bootstrapIndex's init in abstract fs view init

2021-02-25 Thread GitBox
bvaradar commented on a change in pull request #2520: URL: https://github.com/apache/hudi/pull/2520#discussion_r582853801 ## File path: hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/DefaultBootstrapIndex.java ## @@ -0,0 +1,82 @@ +/* + * Licensed to the Apach

[GitHub] [hudi] bvaradar commented on issue #2605: [SUPPORT] How to reload a writeConfig from a existed hudi path ?

2021-02-25 Thread GitBox
bvaradar commented on issue #2605: URL: https://github.com/apache/hudi/issues/2605#issuecomment-785906565 Ideally, spark data-source should provide that option (like optionFromFile(...). Not sure if there is anything like that. Created : https://issues.apache.org/jira/browse/HUDI-16

[jira] [Commented] (HUDI-1640) Implement Spark Datasource option to read hudi configs from properties file

2021-02-25 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290922#comment-17290922 ] Balaji Varadarajan commented on HUDI-1640: -- [~shivnarayan]: Can you vet this and

[jira] [Created] (HUDI-1640) Implement Spark Datasource option to read hudi configs from properties file

2021-02-25 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1640: Summary: Implement Spark Datasource option to read hudi configs from properties file Key: HUDI-1640 URL: https://issues.apache.org/jira/browse/HUDI-1640 Proje

[GitHub] [hudi] bvaradar commented on issue #2592: [SUPPORT] Does latest versions of Hudi (0.7.0, 0.6.0) work with Spark 2.3.0 when reading orc files?

2021-02-25 Thread GitBox
bvaradar commented on issue #2592: URL: https://github.com/apache/hudi/issues/2592#issuecomment-785896778 I was unable to setup spark-2.3.0 in my setup. But,with spark-2.4.4, this works fine as below. Can you use spark-2.4.x version. spark-2.3 seems too old though ? `21/02/25 05:14

[GitHub] [hudi] codecov-io edited a comment on pull request #2443: [HUDI-1269] Make whether the failure of connect hive affects hudi ingest process configurable

2021-02-25 Thread GitBox
codecov-io edited a comment on pull request #2443: URL: https://github.com/apache/hudi/pull/2443#issuecomment-760147630 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2443?src=pr&el=h1) Report > Merging [#2443](https://codecov.io/gh/apache/hudi/pull/2443?src=pr&el=desc) (7baf5de) in

[GitHub] [hudi] codecov-io edited a comment on pull request #2378: [HUDI-1491] Support partition pruning for MOR snapshot query

2021-02-25 Thread GitBox
codecov-io edited a comment on pull request #2378: URL: https://github.com/apache/hudi/pull/2378#issuecomment-751218636 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2378?src=pr&el=h1) Report > Merging [#2378](https://codecov.io/gh/apache/hudi/pull/2378?src=pr&el=desc) (ef2107f) in

[jira] [Resolved] (HUDI-1367) Make delastreamer transition from dfsSouce to kafkasouce

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-1367. --- Fix Version/s: 0.8.0 Resolution: Fixed > Make delastreamer transition from dfsS

[jira] [Reopened] (HUDI-1367) Make delastreamer transition from dfsSouce to kafkasouce

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reopened HUDI-1367: --- > Make delastreamer transition from dfsSouce to kafkasouce > -

[jira] [Updated] (HUDI-1367) Make delastreamer transition from dfsSouce to kafkasouce

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1367: -- Status: Patch Available (was: In Progress) > Make delastreamer transition from dfsSouce

[jira] [Updated] (HUDI-1367) Make delastreamer transition from dfsSouce to kafkasouce

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1367: -- Status: Closed (was: Patch Available) > Make delastreamer transition from dfsSouce to k

[hudi] branch master updated: [HUDI-1367] Make deltaStreamer transition from dfsSouce to kafkasouce (#2227)

2021-02-25 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 617cc24 [HUDI-1367] Make deltaStreamer transit

[GitHub] [hudi] nsivabalan merged pull request #2227: [HUDI-1367] Make deltaStreamer transition from dfsSouce to kafkasouce

2021-02-25 Thread GitBox
nsivabalan merged pull request #2227: URL: https://github.com/apache/hudi/pull/2227 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] codecov-io edited a comment on pull request #2519: [HUDI-1573] Spark Sql Writer support Multi preCmp Field

2021-02-25 Thread GitBox
codecov-io edited a comment on pull request #2519: URL: https://github.com/apache/hudi/pull/2519#issuecomment-771782258 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2519?src=pr&el=h1) Report > Merging [#2519](https://codecov.io/gh/apache/hudi/pull/2519?src=pr&el=desc) (6f0fd84) in

[GitHub] [hudi] rakeshramakrishnan edited a comment on issue #2439: [SUPPORT] Unable to sync with external hive metastore via metastore uris in the thrift protocol

2021-02-25 Thread GitBox
rakeshramakrishnan edited a comment on issue #2439: URL: https://github.com/apache/hudi/issues/2439#issuecomment-785795722 @nsivabalan : There are no errors, however through hudi, the connection is made to the local hive metastore (from spark). It doesn't connect to the external hive metas

[GitHub] [hudi] rakeshramakrishnan commented on issue #2439: [SUPPORT] Unable to sync with external hive metastore via metastore uris in the thrift protocol

2021-02-25 Thread GitBox
rakeshramakrishnan commented on issue #2439: URL: https://github.com/apache/hudi/issues/2439#issuecomment-785795722 @nsivabalan : There are no errors, however through hudi, the connection is made to the local hive metastore (from spark). It doesn't connect to the external hive metastore.

[GitHub] [hudi] Xoln commented on a change in pull request #2520: [HUDI-1446] Support skip bootstrapIndex's init in abstract fs view init

2021-02-25 Thread GitBox
Xoln commented on a change in pull request #2520: URL: https://github.com/apache/hudi/pull/2520#discussion_r582717002 ## File path: hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/DefaultBootstrapIndex.java ## @@ -0,0 +1,61 @@ +package org.apache.hudi.common.b

[GitHub] [hudi] codecov-io commented on pull request #2604: [hudi-1639][hudi-flink] fix BucketAssigner npe

2021-02-25 Thread GitBox
codecov-io commented on pull request #2604: URL: https://github.com/apache/hudi/pull/2604#issuecomment-785704660 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2604?src=pr&el=h1) Report > Merging [#2604](https://codecov.io/gh/apache/hudi/pull/2604?src=pr&el=desc) (fed6575) into [ma