[jira] [Updated] (HUDI-1465) Optimize InstantGenerateOperator of Flink's integration on HUDI

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1465: -- Affects Version/s: 0.9.0 > Optimize InstantGenerateOperator of Flink's integration on HUDI > ---

[jira] [Updated] (HUDI-1658) [UMBRELLA] Spark Sql Support For Hudi

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1658: -- Affects Version/s: 0.9.0 > [UMBRELLA] Spark Sql Support For Hudi > - > >

[jira] [Updated] (HUDI-1588) Support multiple ordering fields via payload class config

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1588: -- Affects Version/s: 0.9.0 > Support multiple ordering fields via payload class config > -

[jira] [Updated] (HUDI-1605) Add more documentation around archival process and configs

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1605: -- Affects Version/s: 0.9.0 > Add more documentation around archival process and configs >

[jira] [Updated] (HUDI-1242) Clean up all warnings during compilation

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1242: -- Affects Version/s: 0.9.0 > Clean up all warnings during compilation > >

[jira] [Updated] (HUDI-1505) Allow pluggable option to write error records to side table, queue

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1505: -- Affects Version/s: 0.9.0 > Allow pluggable option to write error records to side table, queue >

[jira] [Updated] (HUDI-1690) Fix StackOverflowError while running clustering with large number of partitions

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1690: -- Affects Version/s: 0.9.0 > Fix StackOverflowError while running clustering with large number of > partitions >

[jira] [Updated] (HUDI-1468) incremental read support with clustering

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1468: -- Affects Version/s: 0.9.0 > incremental read support with clustering > >

[jira] [Updated] (HUDI-251) JDBC incremental load to HUDI with DeltaStreamer

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-251: - Affects Version/s: 0.9.0 > JDBC incremental load to HUDI with DeltaStreamer > -

[jira] [Updated] (HUDI-767) Support transformation when export to Hudi

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-767: - Affects Version/s: 0.9.0 > Support transformation when export to Hudi > --

[jira] [Updated] (HUDI-1414) HoodieInputFormat support for bucketed partitions

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1414: -- Affects Version/s: 0.9.0 > HoodieInputFormat support for bucketed partitions > -

[jira] [Updated] (HUDI-1456) Concurrent Writing to Hudi tables

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1456: -- Affects Version/s: 0.9.0 > Concurrent Writing to Hudi tables > - > >

[jira] [Updated] (HUDI-512) Decouple logical partitioning from physical one.

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-512: - Affects Version/s: 0.9.0 > Decouple logical partitioning from physical one. >

[jira] [Updated] (HUDI-1602) Corrupted Avro schema extracted from parquet file

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1602: -- Affects Version/s: 0.9.0 > Corrupted Avro schema extracted from parquet file > -

[jira] [Updated] (HUDI-1079) Cannot upsert on schema with Array of Record with single field

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1079: -- Affects Version/s: (was: 0.5.3) 0.9.0 > Cannot upsert on schema with Array of Record

[jira] [Updated] (HUDI-1010) Fix the memory leak for hudi-client unit tests

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1010: -- Affects Version/s: 0.9.0 > Fix the memory leak for hudi-client unit tests >

[jira] [Updated] (HUDI-1491) Support partition pruning for MOR snapshot query

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1491: -- Affects Version/s: 0.9.0 > Support partition pruning for MOR snapshot query > --

[jira] [Updated] (HUDI-1172) Use OverwriteWithLatestAvroPayload as default payload class everywhere

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1172: -- Affects Version/s: 0.9.0 > Use OverwriteWithLatestAvroPayload as default payload class everywhere >

[jira] [Updated] (HUDI-1108) Allow parallel listing of dataset partitions for various actions during write

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1108: -- Affects Version/s: 0.9.0 > Allow parallel listing of dataset partitions for various actions during write > -

[jira] [Updated] (HUDI-1337) Deduplicate data in one batch for flink engine

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1337: -- Affects Version/s: 0.9.0 > Deduplicate data in one batch for flink engine >

[jira] [Updated] (HUDI-52) Implement Savepoints for Merge On Read table #88

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-52?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-52: Affects Version/s: 0.9.0 > Implement Savepoints for Merge On Read table #88 >

[jira] [Updated] (HUDI-1413) Need binary release of Hudi to distribute tools like hudi-cli.sh and hudi-sync

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1413: -- Affects Version/s: 0.9.0 > Need binary release of Hudi to distribute tools like hudi-cli.sh and hudi-sync >

[jira] [Updated] (HUDI-686) Implement BloomIndexV2 that does not depend on memory caching

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-686: - Affects Version/s: 0.9.0 > Implement BloomIndexV2 that does not depend on memory caching >

[jira] [Updated] (HUDI-1591) Improve Hoodie Table Query Performance And Ease Of Use For Spark

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1591: -- Affects Version/s: 0.9.0 > Improve Hoodie Table Query Performance And Ease Of Use For Spark > --

[jira] [Updated] (HUDI-1165) Audit Partition Listing : Compaction Scheduling

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1165: -- Affects Version/s: 0.9.0 > Audit Partition Listing : Compaction Scheduling > ---

[jira] [Updated] (HUDI-1558) Struct Stream Source Support Spark3

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1558: -- Affects Version/s: 0.9.0 > Struct Stream Source Support Spark3 > > >

[jira] [Updated] (HUDI-904) Segregate metrics configs by reporter type

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-904: - Affects Version/s: 0.9.0 > Segregate metrics configs by reporter type > --

[jira] [Updated] (HUDI-1021) [Bug] Unable to update bootstrapped table using rows from the written bootstrapped table

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1021: -- Affects Version/s: 0.9.0 > [Bug] Unable to update bootstrapped table using rows from the written > bootstrapped

[jira] [Updated] (HUDI-1415) Read Hoodie Table As Spark DataSource Table

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1415: -- Affects Version/s: 0.9.0 > Read Hoodie Table As Spark DataSource Table > --

[jira] [Updated] (HUDI-1038) Adding perf benchmark using jmh to Hudi

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1038: -- Affects Version/s: 0.9.0 > Adding perf benchmark using jmh to Hudi > --- > >

[jira] [Updated] (HUDI-1444) fix the error when rollback commit that belong to a non partition table

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1444: -- Affects Version/s: (was: 0.8.0) 0.9.0 > fix the error when rollback commit that belon

[jira] [Updated] (HUDI-1683) When using hudi on flink write data to the HDFS ClassCastException: scala. Tuple2 always be cast to org.apache.hudi.com mon. Util. Collection. The Pair

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1683: -- Affects Version/s: (was: 0.7.0) 0.9.0 > When using hudi on flink write data to the HD

[jira] [Updated] (HUDI-1464) Make DefaultHoodieRecordPayload default payload class

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1464: -- Affects Version/s: 0.9.0 > Make DefaultHoodieRecordPayload default payload class > -

[jira] [Updated] (HUDI-1185) KeyGenerator class/interfaces need to be decoupled from Spark

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1185: -- Affects Version/s: 0.9.0 > KeyGenerator class/interfaces need to be decoupled from Spark > -

[jira] [Updated] (HUDI-940) Audit bad/dangling configs and code

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-940: - Affects Version/s: 0.9.0 > Audit bad/dangling configs and code > > >

[jira] [Updated] (HUDI-1271) Add utility scripts to perform Restores

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1271: -- Affects Version/s: 0.9.0 > Add utility scripts to perform Restores > --- > >

[jira] [Updated] (HUDI-1220) Leverage java docs for publishing to config pages in site

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1220: -- Affects Version/s: 0.9.0 > Leverage java docs for publishing to config pages in site > -

[jira] [Updated] (HUDI-1355) Allowing multipleSourceOrdering fields for doing the preCombine on payload

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1355: -- Affects Version/s: 0.9.0 > Allowing multipleSourceOrdering fields for doing the preCombine on payload >

[jira] [Updated] (HUDI-1138) Re-implement marker files via timeline server

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1138: -- Affects Version/s: 0.9.0 > Re-implement marker files via timeline server > -

[jira] [Updated] (HUDI-1207) Add kafka implementation of write commit callback to Spark datasources

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1207: -- Affects Version/s: 0.9.0 > Add kafka implementation of write commit callback to Spark datasources >

[jira] [Updated] (HUDI-747) Implement Rollback like API in HoodieWriteClient which can revert all actions

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-747: - Affects Version/s: 0.9.0 > Implement Rollback like API in HoodieWriteClient which can revert all actions > ---

[jira] [Updated] (HUDI-1606) HoodieJavaWriteClientExample fail with exception

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1606: -- Affects Version/s: 0.9.0 > HoodieJavaWriteClientExample fail with exception > --

[jira] [Updated] (HUDI-1593) Add support for "show restore" in hudi-cli

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1593: -- Affects Version/s: 0.9.0 > Add support for "show restore" in hudi-cli >

[jira] [Updated] (HUDI-922) [UMBRELLA] Transfer out of the Incubator

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-922: - Affects Version/s: 0.9.0 > [UMBRELLA] Transfer out of the Incubator > > >

[jira] [Updated] (HUDI-1219) Code optimization on hudi-common moudle

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1219: -- Affects Version/s: 0.9.0 > Code optimization on hudi-common moudle > --- > >

[jira] [Updated] (HUDI-1267) Additional Metadata Details for Hudi Transactions

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1267: -- Affects Version/s: 0.9.0 > Additional Metadata Details for Hudi Transactions > -

[jira] [Updated] (HUDI-1261) CLI tools update to support REPLACE and insert overwrite

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1261: -- Affects Version/s: 0.9.0 > CLI tools update to support REPLACE and insert overwrite > --

[jira] [Updated] (HUDI-1093) Add support for COW tables from Prestosql

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1093: -- Affects Version/s: 0.9.0 > Add support for COW tables from Prestosql > -

[jira] [Updated] (HUDI-857) Overhaul unit-tests for Cleaner and ROllbacks

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-857: - Affects Version/s: 0.9.0 > Overhaul unit-tests for Cleaner and ROllbacks >

[jira] [Updated] (HUDI-1430) Support Dataset write w/o conversion to RDD

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1430: -- Affects Version/s: 0.9.0 > Support Dataset write w/o conversion to RDD > ---

[jira] [Updated] (HUDI-1676) Support SQL with spark3

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1676: -- Affects Version/s: 0.9.0 > Support SQL with spark3 > --- > > Key: HUDI-1676

[jira] [Updated] (HUDI-1425) Performance loss with the additional hoodieRecords.isEmpty() in HoodieSparkSqlWriter#write

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1425: -- Affects Version/s: 0.9.0 > Performance loss with the additional hoodieRecords.isEmpty() in > HoodieSparkSqlWrit

[jira] [Updated] (HUDI-1499) Support configuration to let user override record-size estimate

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1499: -- Affects Version/s: 0.9.0 > Support configuration to let user override record-size estimate > -

[jira] [Updated] (HUDI-483) Fix unit test for Archiving to reflect empty instant files for requested commit/deltacommits

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-483: - Affects Version/s: 0.9.0 > Fix unit test for Archiving to reflect empty instant files for requested > commit/delta

[jira] [Updated] (HUDI-1096) MOR queries support from Prestosql

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1096: -- Affects Version/s: 0.9.0 > MOR queries support from Prestosql > -- > >

[jira] [Updated] (HUDI-1600) Fix document to reflect Hudi supports MOR for spark datasource for incremental queries

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1600: -- Affects Version/s: 0.9.0 > Fix document to reflect Hudi supports MOR for spark datasource for > incremental que

[jira] [Updated] (HUDI-1142) Complete remaining code review comments/follow ups

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1142: -- Affects Version/s: 0.9.0 > Complete remaining code review comments/follow ups > ---

[jira] [Updated] (HUDI-1440) Allow option to override schema when doing spark.write

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1440: -- Affects Version/s: 0.9.0 > Allow option to override schema when doing spark.write >

[jira] [Updated] (HUDI-1453) Throw Exception when input data schema is not equal to the hoodie table schema

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1453: -- Affects Version/s: 0.9.0 > Throw Exception when input data schema is not equal to the hoodie table schema >

[jira] [Updated] (HUDI-619) Investigate and implement mechanism to have hive/presto/sparksql queries avoid stitching and return null values for hoodie columns

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-619: - Affects Version/s: 0.9.0 > Investigate and implement mechanism to have hive/presto/sparksql queries > avoid stitch

[jira] [Updated] (HUDI-1556) Add App Id and App name to HoodieDeltaStreamerMetrics

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1556: -- Affects Version/s: 0.9.0 > Add App Id and App name to HoodieDeltaStreamerMetrics > -

[jira] [Updated] (HUDI-1179) Add Row tests to all key generator test classes

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1179: -- Affects Version/s: 0.9.0 > Add Row tests to all key generator test classes > ---

[jira] [Updated] (HUDI-847) Umbrella ticket for tuning default configs for 0.6.0

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-847: - Affects Version/s: 0.9.0 > Umbrella ticket for tuning default configs for 0.6.0 > -

[jira] [Updated] (HUDI-1104) Bulk insert Dataset - UserDefinedPartitioner

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1104: -- Fix Version/s: (was: 0.8.0) 0.9.0 > Bulk insert Dataset - UserDefinedPartitioner > --

[jira] [Updated] (HUDI-1512) Fix hudi-spark2 unit tests failure with Spark 3.0.0

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1512: -- Fix Version/s: (was: 0.7.0) 0.8.0 > Fix hudi-spark2 unit tests failure with Spark 3.0.0

[jira] [Closed] (HUDI-1512) Fix hudi-spark2 unit tests failure with Spark 3.0.0

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1512. - > Fix hudi-spark2 unit tests failure with Spark 3.0.0 > > >

[jira] [Updated] (HUDI-1511) InstantGenerateOperator support multiple parallelism

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1511: -- Fix Version/s: 0.8.0 > InstantGenerateOperator support multiple parallelism > --

[jira] [Updated] (HUDI-1476) Introduce unit test infra for java client

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1476: -- Fix Version/s: 0.8.0 > Introduce unit test infra for java client > - > >

[jira] [Closed] (HUDI-1476) Introduce unit test infra for java client

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1476. - > Introduce unit test infra for java client > - > > Key: HUDI-

[jira] [Closed] (HUDI-623) Remove UpgradePayloadFromUberToApache

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-623. Assignee: Xianghu Wang (was: wangxianghu#1) > Remove UpgradePayloadFromUberToApache > --

[jira] [Closed] (HUDI-1234) Insert new records regardless of small file when using insert operation

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1234. - > Insert new records regardless of small file when using insert operation > --

[jira] [Closed] (HUDI-1266) Add e2e integration tests for replace and insert-overwrite

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1266. - > Add e2e integration tests for replace and insert-overwrite > ---

[jira] [Resolved] (HUDI-1266) Add e2e integration tests for replace and insert-overwrite

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li resolved HUDI-1266. --- Resolution: Resolved > Add e2e integration tests for replace and insert-overwrite > --

[jira] [Updated] (HUDI-1555) clustering bugs from large scale testing

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1555: -- Fix Version/s: 0.8.0 > clustering bugs from large scale testing > > >

[jira] [Updated] (HUDI-1266) Add e2e integration tests for replace and insert-overwrite

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1266: -- Status: Open (was: New) > Add e2e integration tests for replace and insert-overwrite >

[jira] [Assigned] (HUDI-1519) Improve minKey/maxKey compute in HoodieHFileWriter

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li reassigned HUDI-1519: - Assignee: steven zhang > Improve minKey/maxKey compute in HoodieHFileWriter > ---

[jira] [Resolved] (HUDI-1519) Improve minKey/maxKey compute in HoodieHFileWriter

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li resolved HUDI-1519. --- Resolution: Resolved > Improve minKey/maxKey compute in HoodieHFileWriter > --

[jira] [Closed] (HUDI-1519) Improve minKey/maxKey compute in HoodieHFileWriter

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1519. - > Improve minKey/maxKey compute in HoodieHFileWriter > -- > >

[jira] [Closed] (HUDI-1523) Avoid excessive mkdir calls when creating new files

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1523. - > Avoid excessive mkdir calls when creating new files > --- > >

[jira] [Closed] (HUDI-1547) CI intermittent failure: TestJsonStringToHoodieRecordMapFunction.testMapFunction

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1547. - > CI intermittent failure: > TestJsonStringToHoodieRecordMapFunction.testMapFunction > -

[jira] [Closed] (HUDI-1538) UtilHelpers.createSource has hardcoded class name checks

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1538. - > UtilHelpers.createSource has hardcoded class name checks >

[jira] [Closed] (HUDI-1420) HoodieTableMetaClient.getMarkerFolderPath works incorrectly on windows client with hdfs server for wrong file seperator

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1420. - > HoodieTableMetaClient.getMarkerFolderPath works incorrectly on windows client > with hdfs server for wrong file sep

[jira] [Closed] (HUDI-1571) Expose record size info for commits w/ hudi-cli

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1571. - > Expose record size info for commits w/ hudi-cli > --- > >

[jira] [Resolved] (HUDI-1571) Expose record size info for commits w/ hudi-cli

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li resolved HUDI-1571. --- Resolution: Resolved > Expose record size info for commits w/ hudi-cli > -

[jira] [Assigned] (HUDI-1571) Expose record size info for commits w/ hudi-cli

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li reassigned HUDI-1571: - Assignee: sivabalan narayanan > Expose record size info for commits w/ hudi-cli > ---

[jira] [Updated] (HUDI-1589) Rollback metadata is not backwards compatible

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1589: -- Fix Version/s: 0.8.0 > Rollback metadata is not backwards compatible > -

[jira] [Resolved] (HUDI-1589) Rollback metadata is not backwards compatible

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li resolved HUDI-1589. --- Resolution: Fixed > Rollback metadata is not backwards compatible > --

[jira] [Closed] (HUDI-1589) Rollback metadata is not backwards compatible

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1589. - > Rollback metadata is not backwards compatible > - > > Ke

[jira] [Closed] (HUDI-1545) Add test cases for INSERT_OVERWRITE Operation

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1545. - [~slamke] thanks for your contribution. I can't assign this ticket to you since you don't have contributor access yet

[jira] [Resolved] (HUDI-1545) Add test cases for INSERT_OVERWRITE Operation

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li resolved HUDI-1545. --- Resolution: Fixed > Add test cases for INSERT_OVERWRITE Operation > --

[jira] [Updated] (HUDI-1545) Add test cases for INSERT_OVERWRITE Operation

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1545: -- Fix Version/s: 0.8.0 > Add test cases for INSERT_OVERWRITE Operation > -

[jira] [Closed] (HUDI-1526) Translate the spark api partitionBy to hoodie.datasource.write.partitionpath.field

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1526. - > Translate the spark api partitionBy to > hoodie.datasource.write.partitionpath.field >

[jira] [Closed] (HUDI-1603) DefaultHoodieRecordPayload Serialization Failed

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1603. - > DefaultHoodieRecordPayload Serialization Failed > --- > >

[jira] [Closed] (HUDI-1109) Support Spark Structured Streaming read from Hudi table

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1109. - > Support Spark Structured Streaming read from Hudi table > --- >

[jira] [Resolved] (HUDI-1109) Support Spark Structured Streaming read from Hudi table

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li resolved HUDI-1109. --- Resolution: Implemented > Support Spark Structured Streaming read from Hudi table > --

[jira] [Updated] (HUDI-1109) Support Spark Structured Streaming read from Hudi table

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1109: -- Status: Open (was: New) > Support Spark Structured Streaming read from Hudi table > ---

[jira] [Updated] (HUDI-1109) Support Spark Structured Streaming read from Hudi table

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1109: -- Fix Version/s: 0.8.0 > Support Spark Structured Streaming read from Hudi table > ---

[jira] [Closed] (HUDI-1381) Schedule compaction based on time elapsed

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1381. - > Schedule compaction based on time elapsed > -- > > Key: HUD

[jira] [Resolved] (HUDI-1582) HiveSyncTool - silently fails (RuntimeException is swallowed)

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li resolved HUDI-1582. --- Resolution: Fixed > HiveSyncTool - silently fails (RuntimeException is swallowed) > -

[jira] [Closed] (HUDI-1582) HiveSyncTool - silently fails (RuntimeException is swallowed)

2021-03-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li closed HUDI-1582. - > HiveSyncTool - silently fails (RuntimeException is swallowed) > ---
