[GitHub] [hudi] pratyakshsharma commented on a diff in pull request #5598: [HUDI-4107] Updated documentation for 0.11.0 - DeltaStreamer

2022-05-17 Thread GitBox
pratyakshsharma commented on code in PR #5598: URL: https://github.com/apache/hudi/pull/5598#discussion_r875534409 ## website/versioned_docs/version-0.11.0/hoodie_deltastreamer.md: ## @@ -48,8 +48,15 @@ Options: Compaction is enabled for MoR table by default. This flag di

[GitHub] [hudi] xiarixiaoyao commented on a diff in pull request #3391: [HUDI-83] Fix Timestamp/Date type read by Hive3

2022-05-17 Thread GitBox
xiarixiaoyao commented on code in PR #3391: URL: https://github.com/apache/hudi/pull/3391#discussion_r875532837 ## hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/avro/HudiAvroParquetReader.java: ## @@ -0,0 +1,104 @@ +/* + * Licensed to the Apache Software Foundation (ASF) u

[GitHub] [hudi] xiarixiaoyao commented on a diff in pull request #3391: [HUDI-83] Fix Timestamp/Date type read by Hive3

2022-05-17 Thread GitBox
xiarixiaoyao commented on code in PR #3391: URL: https://github.com/apache/hudi/pull/3391#discussion_r875529876 ## hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/HoodieParquetInputFormat.java: ## @@ -49,12 +54,32 @@ public class HoodieParquetInputFormat extends HoodieParqu

[GitHub] [hudi] xiarixiaoyao commented on a diff in pull request #3391: [HUDI-83] Fix Timestamp/Date type read by Hive3

2022-05-17 Thread GitBox
xiarixiaoyao commented on code in PR #3391: URL: https://github.com/apache/hudi/pull/3391#discussion_r875529666 ## hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/HoodieColumnProjectionUtils.java: ## @@ -109,4 +109,25 @@ public static List> getIOColumnNameAndTypes(Configura

[GitHub] [hudi] gtwuser commented on issue #5612: Exception while upserting records - An error occurred while calling o155.pyWriteDynamicFrame. Expected instance of group converter but got "org.apache

2022-05-17 Thread GitBox
gtwuser commented on issue #5612: URL: https://github.com/apache/hudi/issues/5612#issuecomment-1129635743 Have added a sample of payload sent in bulk insert and also during the next upsert. Basic scenario is the increments may have only empty arrays for properties which had data during bulk

[GitHub] [hudi] gtwuser commented on issue #5612: Exception while upserting records - An error occurred while calling o155.pyWriteDynamicFrame. Expected instance of group converter but got "org.apache

2022-05-17 Thread GitBox
gtwuser commented on issue #5612: URL: https://github.com/apache/hudi/issues/5612#issuecomment-1129634916 > Hi All, > > Getting this error `An error occurred while calling o155.pyWriteDynamicFrame. Expected instance of group converter but got "org.apache.parquet.avro.AvroConverters$F

[GitHub] [hudi] hudi-bot commented on pull request #5618: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-17 Thread GitBox
hudi-bot commented on PR #5618: URL: https://github.com/apache/hudi/pull/5618#issuecomment-1129631140 ## CI report: * 70f0fad3ac6cc3431c0b4206b5b6f99d390b10de Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8731

[GitHub] [hudi] hudi-bot commented on pull request #5392: [HUDI-3942] [RFC-50] Improve Timeline Server

2022-05-17 Thread GitBox
hudi-bot commented on PR #5392: URL: https://github.com/apache/hudi/pull/5392#issuecomment-1129626251 ## CI report: * 349862d416acd0d6ca77046669f21c077f21ee38 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8724

[GitHub] [hudi] hudi-bot commented on pull request #5392: [HUDI-3942] [RFC-50] Improve Timeline Server

2022-05-17 Thread GitBox
hudi-bot commented on PR #5392: URL: https://github.com/apache/hudi/pull/5392#issuecomment-1129624012 ## CI report: * 349862d416acd0d6ca77046669f21c077f21ee38 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8724

[GitHub] [hudi] jinxing64 commented on pull request #5588: [HUDI-4100] CTAS failed to clean up when given an illegal MANAGED table definition

2022-05-17 Thread GitBox
jinxing64 commented on PR #5588: URL: https://github.com/apache/hudi/pull/5588#issuecomment-1129621249 hi @leesf Thanks a lot looking into this ~ It's great to get your feedback on my points sometime. -- This is an automated message from the Apache Git Service. To respond to the messa

[GitHub] [hudi] hudi-bot commented on pull request #5618: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-17 Thread GitBox
hudi-bot commented on PR #5618: URL: https://github.com/apache/hudi/pull/5618#issuecomment-1129619687 ## CI report: * ce2934b6ee43c711eb6b232e1b6b55ae64892032 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8728

[GitHub] [hudi] jinxing64 commented on a diff in pull request #5588: [HUDI-4100] CTAS failed to clean up when given an illegal MANAGED table definition

2022-05-17 Thread GitBox
jinxing64 commented on code in PR #5588: URL: https://github.com/apache/hudi/pull/5588#discussion_r875510464 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestCreateTable.scala: ## @@ -763,4 +763,22 @@ class TestCreateTable extends HoodieSparkSqlTe

[GitHub] [hudi] jinxing64 commented on a diff in pull request #5588: [HUDI-4100] CTAS failed to clean up when given an illegal MANAGED table definition

2022-05-17 Thread GitBox
jinxing64 commented on code in PR #5588: URL: https://github.com/apache/hudi/pull/5588#discussion_r875509658 ## hudi-spark-datasource/hudi-spark3/src/main/scala/org/apache/spark/sql/hudi/catalog/HoodieCatalog.scala: ## @@ -206,29 +236,17 @@ class HoodieCatalog extends Delegating

[GitHub] [hudi] hudi-bot commented on pull request #5618: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-17 Thread GitBox
hudi-bot commented on PR #5618: URL: https://github.com/apache/hudi/pull/5618#issuecomment-1129617085 ## CI report: * ce2934b6ee43c711eb6b232e1b6b55ae64892032 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8728

[GitHub] [hudi] jinxing64 commented on a diff in pull request #5588: [HUDI-4100] CTAS failed to clean up when given an illegal MANAGED table definition

2022-05-17 Thread GitBox
jinxing64 commented on code in PR #5588: URL: https://github.com/apache/hudi/pull/5588#discussion_r875507389 ## hudi-spark-datasource/hudi-spark3/src/main/scala/org/apache/spark/sql/hudi/catalog/HoodieCatalog.scala: ## @@ -206,29 +236,17 @@ class HoodieCatalog extends Delegating

[GitHub] [hudi] SabyasachiDasTR commented on issue #5422: [SUPPORT] Enabling metadata on MOR table causes FileNotFound exception

2022-05-17 Thread GitBox
SabyasachiDasTR commented on issue #5422: URL: https://github.com/apache/hudi/issues/5422#issuecomment-1129594263 Hello , After upgrading hudi version to the latest available we are still facing the same issue on enabling metadata. Tried with EMR 6.6.0 hudi 0.10.0 spark 3.2.0. This is

[GitHub] [hudi] gnailJC commented on a diff in pull request #5616: [MINOR]: Optimize the judgment logic of `SparkDataSourceOptions Key`

2022-05-17 Thread GitBox
gnailJC commented on code in PR #5616: URL: https://github.com/apache/hudi/pull/5616#discussion_r875489218 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/spark/sql/hudi/HoodieSqlCommonUtils.scala: ## @@ -250,7 +250,7 @@ object HoodieSqlCommonUtils extends S

[GitHub] [hudi] hudi-bot commented on pull request #5606: [HUDI-4111] Bump ANTLR runtime version to 4.8 in Spark 3.2

2022-05-17 Thread GitBox
hudi-bot commented on PR #5606: URL: https://github.com/apache/hudi/pull/5606#issuecomment-1129580493 ## CI report: * dcec5b77d66a83d20f9c8ffc97ab645a0dbcc1f2 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8717

[jira] [Resolved] (HUDI-4108) Clean the marker files before starting new flink compaction

2022-05-17 Thread loukey_j (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] loukey_j resolved HUDI-4108. > Clean the marker files before starting new flink compaction >

[GitHub] [hudi] hudi-bot commented on pull request #5608: [HUDI-2150] Rename/Restructure configs for better modularity

2022-05-17 Thread GitBox
hudi-bot commented on PR #5608: URL: https://github.com/apache/hudi/pull/5608#issuecomment-1129578770 ## CI report: * 97320f8acd66532c570ad34e45aafd6a51b5a186 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8725

[GitHub] [hudi] hudi-bot commented on pull request #5606: [HUDI-4111] Bump ANTLR runtime version to 4.8 in Spark 3.2

2022-05-17 Thread GitBox
hudi-bot commented on PR #5606: URL: https://github.com/apache/hudi/pull/5606#issuecomment-1129578745 ## CI report: * dcec5b77d66a83d20f9c8ffc97ab645a0dbcc1f2 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8717

[GitHub] [hudi] hudi-bot commented on pull request #5618: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-17 Thread GitBox
hudi-bot commented on PR #5618: URL: https://github.com/apache/hudi/pull/5618#issuecomment-1129577018 ## CI report: * ce2934b6ee43c711eb6b232e1b6b55ae64892032 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8728

[GitHub] [hudi] hudi-bot commented on pull request #5616: [MINOR]: Optimize the judgment logic of `SparkDataSourceOptions Key`

2022-05-17 Thread GitBox
hudi-bot commented on PR #5616: URL: https://github.com/apache/hudi/pull/5616#issuecomment-1129576985 ## CI report: * beb60e0e8dd7b36d2fdc09c234eac5ea7f5a830b Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8726

[GitHub] [hudi] leesf commented on pull request #5606: [HUDI-4111] Bump ANTLR runtime version to 4.8 in Spark 3.2

2022-05-17 Thread GitBox
leesf commented on PR #5606: URL: https://github.com/apache/hudi/pull/5606#issuecomment-1129566212 > > I am wondering if we would upgrade ANTLR to 4.8 for all spark versions? that means not only for spark 3.2.0 > > At present, it is also possible to directly upgrade to 4.8. Now the Sp

[GitHub] [hudi] anth0ny-x commented on issue #2247: [SUPPORT] org.eclipse.jetty.util.thread.ThreadPool$SizedThreadPool.getThreadPoolBudget()

2022-05-17 Thread GitBox
anth0ny-x commented on issue #2247: URL: https://github.com/apache/hudi/issues/2247#issuecomment-1129565992 Ran into the same issue, the issue is conflicting jars in the CDH spark stack and Hudi. Resolve it by getting spark-shell to load the Hudi jar before the CDH jars using --driv

[GitHub] [hudi] leesf commented on a diff in pull request #5588: [HUDI-4100] CTAS failed to clean up when given an illegal MANAGED table definition

2022-05-17 Thread GitBox
leesf commented on code in PR #5588: URL: https://github.com/apache/hudi/pull/5588#discussion_r875466366 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestCreateTable.scala: ## @@ -763,4 +763,22 @@ class TestCreateTable extends HoodieSparkSqlTestBa

[GitHub] [hudi] leesf commented on a diff in pull request #5588: [HUDI-4100] CTAS failed to clean up when given an illegal MANAGED table definition

2022-05-17 Thread GitBox
leesf commented on code in PR #5588: URL: https://github.com/apache/hudi/pull/5588#discussion_r875465444 ## hudi-spark-datasource/hudi-spark3/src/main/scala/org/apache/spark/sql/hudi/catalog/HoodieCatalog.scala: ## @@ -206,29 +236,17 @@ class HoodieCatalog extends DelegatingCata

[GitHub] [hudi] cxzl25 commented on pull request #5606: [HUDI-4111] Bump ANTLR runtime version to 4.8 in Spark 3.2

2022-05-17 Thread GitBox
cxzl25 commented on PR #5606: URL: https://github.com/apache/hudi/pull/5606#issuecomment-1129563909 > I am wondering if we would upgrade ANTLR to 4.8 for all spark versions? that means not only for spark 3.2.0 At present, it is also possible to directly upgrade to 4.8. Now the Spa

[GitHub] [hudi] leesf commented on pull request #5606: [HUDI-4111] Bump ANTLR runtime version to 4.8 in Spark 3.2

2022-05-17 Thread GitBox
leesf commented on PR #5606: URL: https://github.com/apache/hudi/pull/5606#issuecomment-1129561716 > The antlr runtime version used by Spark3.1 is `4.8-1`, maybe we can also bump antlr version in Spark3.1 profile? @leesf > > [[SPARK-33475](https://issues.apache.org/jira/browse/SPARK-3

[GitHub] [hudi] hudi-bot commented on pull request #5392: [HUDI-3942] [RFC-50] Improve Timeline Server

2022-05-17 Thread GitBox
hudi-bot commented on PR #5392: URL: https://github.com/apache/hudi/pull/5392#issuecomment-1129558647 ## CI report: * 349862d416acd0d6ca77046669f21c077f21ee38 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8724

[GitHub] [hudi] hudi-bot commented on pull request #5620: [HUDI-4116] Unify clustering/compaction related procedures' output type

2022-05-17 Thread GitBox
hudi-bot commented on PR #5620: URL: https://github.com/apache/hudi/pull/5620#issuecomment-1129552502 ## CI report: * 1378f6ce813b4fea31f1408c615e2449198ac970 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8732

[GitHub] [hudi] hudi-bot commented on pull request #5618: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-17 Thread GitBox
hudi-bot commented on PR #5618: URL: https://github.com/apache/hudi/pull/5618#issuecomment-1129552477 ## CI report: * ce2934b6ee43c711eb6b232e1b6b55ae64892032 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8728

[jira] [Updated] (HUDI-4116) Unify clustering/compaction related procedures' output type

2022-05-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4116: - Labels: pull-request-available (was: ) > Unify clustering/compaction related procedures' output t

[GitHub] [hudi] hudi-bot commented on pull request #5620: [HUDI-4116] Unify clustering/compaction related procedures' output type

2022-05-17 Thread GitBox
hudi-bot commented on PR #5620: URL: https://github.com/apache/hudi/pull/5620#issuecomment-1129550844 ## CI report: * 1378f6ce813b4fea31f1408c615e2449198ac970 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[jira] [Updated] (HUDI-4116) Unify clustering/compaction related procedures' output type

2022-05-17 Thread shibei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shibei updated HUDI-4116: - Status: In Progress (was: Open) > Unify clustering/compaction related procedures' output type > -

[GitHub] [hudi] yuzhaojing commented on a diff in pull request #5392: [HUDI-3942] [RFC-50] Improve Timeline Server

2022-05-17 Thread GitBox
yuzhaojing commented on code in PR #5392: URL: https://github.com/apache/hudi/pull/5392#discussion_r875446437 ## rfc/rfc-50/rfc-50.md: ## @@ -0,0 +1,94 @@ + + +# RFC-50: Improve Timeline Server + +## Proposers +- @yuzhaojing + +## Approvers + - @xushiyan + - @danny0405 + +## Abs

[GitHub] [hudi] huberylee opened a new pull request, #5620: [HUDI-4416] Unify clustering/compaction related procedures' output type

2022-05-17 Thread GitBox
huberylee opened a new pull request, #5620: URL: https://github.com/apache/hudi/pull/5620 ## What is the purpose of the pull request Unify clustering/compaction related procedures' output type to provide more useful info ## Brief change log - Modify the output type of

[jira] [Assigned] (HUDI-4116) Unify clustering/compaction related procedures' output type

2022-05-17 Thread shibei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shibei reassigned HUDI-4116: Assignee: shibei > Unify clustering/compaction related procedures' output type > --

[jira] [Created] (HUDI-4116) Unify clustering/compaction related procedures' output type

2022-05-17 Thread shibei (Jira)
shibei created HUDI-4116: Summary: Unify clustering/compaction related procedures' output type Key: HUDI-4116 URL: https://issues.apache.org/jira/browse/HUDI-4116 Project: Apache Hudi Issue Type: Im

[GitHub] [hudi] hudi-bot commented on pull request #5619: [HUDI-3889] Do not validate table config if save mode is set to Overwrite

2022-05-17 Thread GitBox
hudi-bot commented on PR #5619: URL: https://github.com/apache/hudi/pull/5619#issuecomment-1129532013 ## CI report: * 558ab873cd0380e142c46a188be8151363279d2a Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8730

[GitHub] [hudi] hudi-bot commented on pull request #5619: [HUDI-3889] Do not validate table config if save mode is set to Overwrite

2022-05-17 Thread GitBox
hudi-bot commented on PR #5619: URL: https://github.com/apache/hudi/pull/5619#issuecomment-1129530773 ## CI report: * 558ab873cd0380e142c46a188be8151363279d2a UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #5618: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-17 Thread GitBox
hudi-bot commented on PR #5618: URL: https://github.com/apache/hudi/pull/5618#issuecomment-1129530752 ## CI report: * ce2934b6ee43c711eb6b232e1b6b55ae64892032 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8728

[jira] [Updated] (HUDI-3889) Do not validate table config if save mode is set to Overwrite

2022-05-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3889: - Labels: pull-request-available (was: ) > Do not validate table config if save mode is set to Over

[GitHub] [hudi] xicm opened a new pull request, #5619: [HUDI-3889] Do not validate table config if save mode is set to Overwrite

2022-05-17 Thread GitBox
xicm opened a new pull request, #5619: URL: https://github.com/apache/hudi/pull/5619 ## What is the purpose of the pull request Skip table config validatiton if save mode is set to Overwrite ## Brief change log Check the save mode when doing validating ## Verify th

[GitHub] [hudi] cxzl25 commented on pull request #5606: [HUDI-4111] Bump ANTLR runtime version to 4.8 in Spark 3.2

2022-05-17 Thread GitBox
cxzl25 commented on PR #5606: URL: https://github.com/apache/hudi/pull/5606#issuecomment-1129527461 The antlr runtime version used by Spark3.1 is `4.8-1`, maybe we can also bump antlr version in Spark3.1 profile? @leesf [[SPARK-33475](https://issues.apache.org/jira/browse/SPARK-33475

[GitHub] [hudi] liujinhui1994 closed pull request #5618: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-17 Thread GitBox
liujinhui1994 closed pull request #5618: [HUDI-3555] Re-use spark config for parquet timestamp format URL: https://github.com/apache/hudi/pull/5618 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] hudi-bot commented on pull request #5618: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-17 Thread GitBox
hudi-bot commented on PR #5618: URL: https://github.com/apache/hudi/pull/5618#issuecomment-1129526384 ## CI report: * ce2934b6ee43c711eb6b232e1b6b55ae64892032 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8728

[GitHub] [hudi] hudi-bot commented on pull request #3748: [HUDI-2516] Upgrade JUnit to 5.8.1

2022-05-17 Thread GitBox
hudi-bot commented on PR #3748: URL: https://github.com/apache/hudi/pull/3748#issuecomment-1129525674 ## CI report: * 4fbc75d8510edab1397afa5e84e7afa8d87e2f75 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=3236

[hudi] branch master updated: Clean the marker files for flink compaction (#5611)

2022-05-17 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new a1017c66aa Clean the marker files for flink com

[GitHub] [hudi] danny0405 merged pull request #5611: [HUDI-4108] Clean the marker files before starting new flink compaction

2022-05-17 Thread GitBox
danny0405 merged PR #5611: URL: https://github.com/apache/hudi/pull/5611 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache

[GitHub] [hudi] hudi-bot commented on pull request #5618: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-17 Thread GitBox
hudi-bot commented on PR #5618: URL: https://github.com/apache/hudi/pull/5618#issuecomment-1129524809 ## CI report: * ce2934b6ee43c711eb6b232e1b6b55ae64892032 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[jira] [Created] (HUDI-4115) Split FlinkOptions to HiveConfigProperty

2022-05-17 Thread liujinhui (Jira)
liujinhui created HUDI-4115: --- Summary: Split FlinkOptions to HiveConfigProperty Key: HUDI-4115 URL: https://issues.apache.org/jira/browse/HUDI-4115 Project: Apache Hudi Issue Type: Sub-task

[GitHub] [hudi] hudi-bot commented on pull request #5611: [HUDI-4108] Clean the marker files before starting new flink compaction

2022-05-17 Thread GitBox
hudi-bot commented on PR #5611: URL: https://github.com/apache/hudi/pull/5611#issuecomment-1129524789 ## CI report: * ef7caaeffd437110c8811dce4845a3635c01a473 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8718

[GitHub] [hudi] danny0405 commented on pull request #5611: [HUDI-4108] Clean the marker files before starting new flink compaction

2022-05-17 Thread GitBox
danny0405 commented on PR #5611: URL: https://github.com/apache/hudi/pull/5611#issuecomment-1129524631 The test failure is not caused by this, so i would just merge it soon ~ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[GitHub] [hudi] hudi-bot commented on pull request #3748: [HUDI-2516] Upgrade JUnit to 5.8.1

2022-05-17 Thread GitBox
hudi-bot commented on PR #3748: URL: https://github.com/apache/hudi/pull/3748#issuecomment-1129524192 ## CI report: * 4fbc75d8510edab1397afa5e84e7afa8d87e2f75 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=3236

[GitHub] [hudi] hudi-bot commented on pull request #5611: [HUDI-4108] Clean the marker files before starting new flink compaction

2022-05-17 Thread GitBox
hudi-bot commented on PR #5611: URL: https://github.com/apache/hudi/pull/5611#issuecomment-1129523271 ## CI report: * ef7caaeffd437110c8811dce4845a3635c01a473 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8718

[GitHub] [hudi] danny0405 commented on a diff in pull request #5392: [HUDI-3942] [RFC-50] Improve Timeline Server

2022-05-17 Thread GitBox
danny0405 commented on code in PR #5392: URL: https://github.com/apache/hudi/pull/5392#discussion_r875427334 ## rfc/rfc-50/rfc-50.md: ## @@ -0,0 +1,93 @@ + + +# RFC-50: Improve Timeline Server + +## Proposers +- @yuzhaojing + +## Approvers + - @xushiyan + - @danny0405 + +## Abst

[GitHub] [hudi] liujinhui1994 commented on pull request #3748: [HUDI-2516] Upgrade JUnit to 5.8.1

2022-05-17 Thread GitBox
liujinhui1994 commented on PR #3748: URL: https://github.com/apache/hudi/pull/3748#issuecomment-1129517523 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

[GitHub] [hudi] danny0405 commented on a diff in pull request #5392: [HUDI-3942] [RFC-50] Improve Timeline Server

2022-05-17 Thread GitBox
danny0405 commented on code in PR #5392: URL: https://github.com/apache/hudi/pull/5392#discussion_r875426679 ## rfc/rfc-50/rfc-50.md: ## @@ -0,0 +1,94 @@ + + +# RFC-50: Improve Timeline Server + +## Proposers +- @yuzhaojing + +## Approvers + - @xushiyan + - @danny0405 + +## Abst

[jira] [Updated] (HUDI-3555) re-use spark config for parquet timestamp format instead of having our own config

2022-05-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3555: - Labels: pull-request-available (was: ) > re-use spark config for parquet timestamp format instead

[GitHub] [hudi] liujinhui1994 opened a new pull request, #5618: [HUDI-3555] Re-use spark config for parquet timestamp format

2022-05-17 Thread GitBox
liujinhui1994 opened a new pull request, #5618: URL: https://github.com/apache/hudi/pull/5618 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the p

[GitHub] [hudi] danny0405 commented on pull request #5611: [HUDI-4108] Clean the marker files before starting new flink compaction

2022-05-17 Thread GitBox
danny0405 commented on PR #5611: URL: https://github.com/apache/hudi/pull/5611#issuecomment-1129509290 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. T

[GitHub] [hudi] hudi-bot commented on pull request #5611: [HUDI-4108] Clean the marker files before starting new flink compaction

2022-05-17 Thread GitBox
hudi-bot commented on PR #5611: URL: https://github.com/apache/hudi/pull/5611#issuecomment-1129507333 ## CI report: * ef7caaeffd437110c8811dce4845a3635c01a473 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8718

[GitHub] [hudi] hudi-bot commented on pull request #5611: [HUDI-4108] Clean the marker files before starting new flink compaction

2022-05-17 Thread GitBox
hudi-bot commented on PR #5611: URL: https://github.com/apache/hudi/pull/5611#issuecomment-1129505726 ## CI report: * ef7caaeffd437110c8811dce4845a3635c01a473 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8718

[GitHub] [hudi] danny0405 commented on pull request #5611: [HUDI-4108] Clean the marker files before starting new flink compaction

2022-05-17 Thread GitBox
danny0405 commented on PR #5611: URL: https://github.com/apache/hudi/pull/5611#issuecomment-1129505512 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. T

[GitHub] [hudi] hudi-bot commented on pull request #4309: [HUDI-3016][RFC-43] Proposal to implement Table Management Service

2022-05-17 Thread GitBox
hudi-bot commented on PR #4309: URL: https://github.com/apache/hudi/pull/4309#issuecomment-1129505116 ## CI report: * fbe27691b5d9de58128cc58158047a4df2b53750 UNKNOWN * 5125110c970ec0e22d3497e4bb3b65a8216a9f8d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #5617: [HUDI-4114] Remove the unnecessary fs view sync for BaseWriteClient#i…

2022-05-17 Thread GitBox
hudi-bot commented on PR #5617: URL: https://github.com/apache/hudi/pull/5617#issuecomment-1129504115 ## CI report: * 5608887388d04850d78a0fe322eb6130f269da7c Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8727

[GitHub] [hudi] danny0405 commented on a diff in pull request #5616: [MINOR]: Optimize the judgment logic of `SparkDataSourceOptions Key`

2022-05-17 Thread GitBox
danny0405 commented on code in PR #5616: URL: https://github.com/apache/hudi/pull/5616#discussion_r875415160 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/spark/sql/hudi/HoodieSqlCommonUtils.scala: ## @@ -250,7 +250,7 @@ object HoodieSqlCommonUtils extends

[GitHub] [hudi] yuzhaojing commented on a diff in pull request #5392: [HUDI-3942] [RFC-50] Improve Timeline Server

2022-05-17 Thread GitBox
yuzhaojing commented on code in PR #5392: URL: https://github.com/apache/hudi/pull/5392#discussion_r875414766 ## rfc/rfc-50/rfc-50.md: ## @@ -0,0 +1,94 @@ + + +# RFC-50: Improve Timeline Server + +## Proposers +- @yuzhaojing + +## Approvers + - @xushiyan + - @danny0405 + +## Abs

[GitHub] [hudi] hudi-bot commented on pull request #5617: [HUDI-4114] Remove the unnecessary fs view sync for BaseWriteClient#i…

2022-05-17 Thread GitBox
hudi-bot commented on PR #5617: URL: https://github.com/apache/hudi/pull/5617#issuecomment-1129502253 ## CI report: * 5608887388d04850d78a0fe322eb6130f269da7c UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[jira] [Updated] (HUDI-4114) Remove the unnecessary fs view sync for BaseWriteClient#initTable

2022-05-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4114: - Labels: pull-request-available (was: ) > Remove the unnecessary fs view sync for BaseWriteClient#

[GitHub] [hudi] danny0405 opened a new pull request, #5617: [HUDI-4114] Remove the unnecessary fs view sync for BaseWriteClient#i…

2022-05-17 Thread GitBox
danny0405 opened a new pull request, #5617: URL: https://github.com/apache/hudi/pull/5617 …nitTable No need to #sync actively because the table instance is instantiated freshly, its view manager has empty fiew instantces, the fs view would be synced lazily when is it requested.

[GitHub] [hudi] hudi-bot commented on pull request #5616: [MINOR]: Optimize the judgment logic of `SparkDataSourceOptions Key`

2022-05-17 Thread GitBox
hudi-bot commented on PR #5616: URL: https://github.com/apache/hudi/pull/5616#issuecomment-1129500566 ## CI report: * beb60e0e8dd7b36d2fdc09c234eac5ea7f5a830b Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8726

[GitHub] [hudi] hudi-bot commented on pull request #5608: [HUDI-2150] Rename/Restructure configs for better modularity

2022-05-17 Thread GitBox
hudi-bot commented on PR #5608: URL: https://github.com/apache/hudi/pull/5608#issuecomment-1129500546 ## CI report: * 2e35c7e1235096366b120e451700f2d331f58b2d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8712

[GitHub] [hudi] hudi-bot commented on pull request #5392: [HUDI-3942] [RFC-50] Improve Timeline Server

2022-05-17 Thread GitBox
hudi-bot commented on PR #5392: URL: https://github.com/apache/hudi/pull/5392#issuecomment-1129500323 ## CI report: * 0b65ac83fdefc35f8f627c2cce6887a16487f528 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8226

[jira] [Created] (HUDI-4114) Remove the unnecessary fs view sync for BaseWriteClient#initTable

2022-05-17 Thread Danny Chen (Jira)
Danny Chen created HUDI-4114: Summary: Remove the unnecessary fs view sync for BaseWriteClient#initTable Key: HUDI-4114 URL: https://issues.apache.org/jira/browse/HUDI-4114 Project: Apache Hudi

[GitHub] [hudi] hudi-bot commented on pull request #5616: [MINOR]: Optimize the judgment logic of `SparkDataSourceOptions Key`

2022-05-17 Thread GitBox
hudi-bot commented on PR #5616: URL: https://github.com/apache/hudi/pull/5616#issuecomment-1129498948 ## CI report: * beb60e0e8dd7b36d2fdc09c234eac5ea7f5a830b UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #5608: [HUDI-2150] Rename/Restructure configs for better modularity

2022-05-17 Thread GitBox
hudi-bot commented on PR #5608: URL: https://github.com/apache/hudi/pull/5608#issuecomment-1129498925 ## CI report: * 2e35c7e1235096366b120e451700f2d331f58b2d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8712

[GitHub] [hudi] hudi-bot commented on pull request #5392: [HUDI-3942] [RFC-50] Improve Timeline Server

2022-05-17 Thread GitBox
hudi-bot commented on PR #5392: URL: https://github.com/apache/hudi/pull/5392#issuecomment-1129498708 ## CI report: * 0b65ac83fdefc35f8f627c2cce6887a16487f528 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8226

[jira] [Resolved] (HUDI-4109) Copy the old record directly when it is chosen for merging

2022-05-17 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen resolved HUDI-4109. -- > Copy the old record directly when it is chosen for merging > -

[jira] [Commented] (HUDI-4109) Copy the old record directly when it is chosen for merging

2022-05-17 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17538544#comment-17538544 ] Danny Chen commented on HUDI-4109: -- Fixed via master branch: f1f8a1abb7636b43463ee86bf6db

[GitHub] [hudi] hudi-bot commented on pull request #4309: [HUDI-3016][RFC-43] Proposal to implement Table Management Service

2022-05-17 Thread GitBox
hudi-bot commented on PR #4309: URL: https://github.com/apache/hudi/pull/4309#issuecomment-1129496807 ## CI report: * fbe27691b5d9de58128cc58158047a4df2b53750 UNKNOWN * 9f70edff7df1d467b06c00147c5cc128f4ce4c9d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[hudi] branch master updated (ebbe56e862 -> f1f8a1abb7)

2022-05-17 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from ebbe56e862 [minor] Some code refactoring for LogFileComparator and Instant instantiation (#5600) add f1f8a1abb7

[GitHub] [hudi] danny0405 merged pull request #5603: [HUDI-4109] Copy the old record directly when it is chosen for merging

2022-05-17 Thread GitBox
danny0405 merged PR #5603: URL: https://github.com/apache/hudi/pull/5603 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache

[GitHub] [hudi] yuzhaojing commented on a diff in pull request #5392: [HUDI-3942] [RFC-50] Improve Timeline Server

2022-05-17 Thread GitBox
yuzhaojing commented on code in PR #5392: URL: https://github.com/apache/hudi/pull/5392#discussion_r875408492 ## rfc/rfc-50/rfc-50.md: ## @@ -0,0 +1,94 @@ + + +# RFC-50: Improve Timeline Server + +## Proposers +- @yuzhaojing + +## Approvers + - @xushiyan + - @danny0405 + +## Abs

[GitHub] [hudi] gnailJC opened a new pull request, #5616: [MINOR]: Optimize the judgment logic of `SparkDataSourceOptions Key`

2022-05-17 Thread GitBox
gnailJC opened a new pull request, #5616: URL: https://github.com/apache/hudi/pull/5616 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the purpose

[GitHub] [hudi] yanghua commented on pull request #5587: [HUDI-3890] Fix rat plugin issue

2022-05-17 Thread GitBox
yanghua commented on PR #5587: URL: https://github.com/apache/hudi/pull/5587#issuecomment-1129486406 @uday08bce Would you please rebase this PR to see if the flaky test has been fixed? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [hudi] loukey-lj commented on pull request #5611: [HUDI-4108] Clean the marker files before starting new flink compaction

2022-05-17 Thread GitBox
loukey-lj commented on PR #5611: URL: https://github.com/apache/hudi/pull/5611#issuecomment-1129485161 @danny0405 please take a look when free. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] hudi-bot commented on pull request #4309: [HUDI-3016][RFC-43] Proposal to implement Table Management Service

2022-05-17 Thread GitBox
hudi-bot commented on PR #4309: URL: https://github.com/apache/hudi/pull/4309#issuecomment-1129476068 ## CI report: * fbe27691b5d9de58128cc58158047a4df2b53750 UNKNOWN * 9f70edff7df1d467b06c00147c5cc128f4ce4c9d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[hudi] branch master updated (f8b9399615 -> ebbe56e862)

2022-05-17 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from f8b9399615 [MINOR] Fixing spark long running yaml for non-partitioned (#5607) add ebbe56e862 [minor] Some code

[GitHub] [hudi] danny0405 merged pull request #5600: [minor] Some code refactoring for LogFileComparator and Instant insta…

2022-05-17 Thread GitBox
danny0405 merged PR #5600: URL: https://github.com/apache/hudi/pull/5600 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache

[GitHub] [hudi] xiarixiaoyao commented on issue #5519: [SUPPORT] Schema Evolution - Error with datatype promotion

2022-05-17 Thread GitBox
xiarixiaoyao commented on issue #5519: URL: https://github.com/apache/hudi/issues/5519#issuecomment-1129473840 @gy19951022 @JoshuaZhuCN if you have webchat you can add me '1037817390' -- This is an automated message from the Apache Git Service. To respond to the message, please log on t

[GitHub] [hudi] xiarixiaoyao commented on issue #5519: [SUPPORT] Schema Evolution - Error with datatype promotion

2022-05-17 Thread GitBox
xiarixiaoyao commented on issue #5519: URL: https://github.com/apache/hudi/issues/5519#issuecomment-1129472182 @neerajpadarthi Sorry for the late reply. yes, this is an issue with with 0.9. since when spark read parquet, if we use long type to read a int type cloumn spark will throw

[GitHub] [hudi] hudi-bot commented on pull request #5600: [minor] Some code refactoring for LogFileComparator and Instant insta…

2022-05-17 Thread GitBox
hudi-bot commented on PR #5600: URL: https://github.com/apache/hudi/pull/5600#issuecomment-1129468338 ## CI report: * f4fd3cb0e45201006f333cb708154417d78863e3 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8704

[GitHub] [hudi] hudi-bot commented on pull request #5600: [minor] Some code refactoring for LogFileComparator and Instant insta…

2022-05-17 Thread GitBox
hudi-bot commented on PR #5600: URL: https://github.com/apache/hudi/pull/5600#issuecomment-1129442456 ## CI report: * f4fd3cb0e45201006f333cb708154417d78863e3 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8704

[GitHub] [hudi] hudi-bot commented on pull request #5600: [minor] Some code refactoring for LogFileComparator and Instant insta…

2022-05-17 Thread GitBox
hudi-bot commented on PR #5600: URL: https://github.com/apache/hudi/pull/5600#issuecomment-1129440649 ## CI report: * f4fd3cb0e45201006f333cb708154417d78863e3 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=8704

[GitHub] [hudi] hudi-bot commented on pull request #5600: [minor] Some code refactoring for LogFileComparator and Instant insta…

2022-05-17 Thread GitBox
hudi-bot commented on PR #5600: URL: https://github.com/apache/hudi/pull/5600#issuecomment-1129438960 ## CI report: * f4fd3cb0e45201006f333cb708154417d78863e3 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[hudi] branch asf-site updated: [DOCS] Add images to more blogs (#5615)

2022-05-17 Thread bhavanisudha
This is an automated email from the ASF dual-hosted git repository. bhavanisudha pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 8d61bc2ff1 [DOCS] Add images to more blo

[GitHub] [hudi] bhasudha merged pull request #5615: [DOCS] Add images to more blogs

2022-05-17 Thread GitBox
bhasudha merged PR #5615: URL: https://github.com/apache/hudi/pull/5615 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

[GitHub] [hudi] bhasudha opened a new pull request, #5615: [DOCS] Add images to more blogs

2022-05-17 Thread GitBox
bhasudha opened a new pull request, #5615: URL: https://github.com/apache/hudi/pull/5615 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the purpos

  1   2   3   >