[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3035: - Epic Link: HUDI-6243 > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Improvement > Components: code-quality, writer-core >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Major > Fix For: 1.0.0, 0.15.0 > > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3035: - Fix Version/s: 1.0.0 0.15.0 > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Improvement > Components: code-quality, writer-core >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Major > Fix For: 1.0.0, 0.15.0 > > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3035: -- Fix Version/s: (was: 0.13.0) > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Improvement > Components: code-quality, writer-core >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Blocker > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3035: -- Priority: Major (was: Blocker) > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Improvement > Components: code-quality, writer-core >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Major > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3035: -- Fix Version/s: 0.13.0 (was: 0.12.0) > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Improvement > Components: code-quality, writer-core >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Blocker > Fix For: 0.13.0 > > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3035: - Fix Version/s: 0.12.0 (was: 0.11.0) > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Improvement > Components: code-quality, writer-core >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Major > Fix For: 0.12.0 > > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3035: - Component/s: code-quality > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Improvement > Components: code-quality, writer-core >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Major > Fix For: 0.11.0 > > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3035: - Issue Type: Improvement (was: Bug) > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Improvement > Components: writer-core >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Critical > Fix For: 0.11.0 > > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3035: - Priority: Blocker (was: Major) > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Improvement > Components: code-quality, writer-core >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Blocker > Fix For: 0.12.0 > > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3035: - Priority: Major (was: Critical) > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Improvement > Components: writer-core >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Major > Fix For: 0.11.0 > > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-3035: -- Component/s: writer-core > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Bug > Components: writer-core >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Critical > Fix For: 0.11.0 > > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3035: -- Priority: Critical (was: Blocker) > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Bug >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Critical > Fix For: 0.11.0 > > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3035: -- Priority: Blocker (was: Major) > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Bug >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Blocker > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-3035) Unify Parquet writers
[ https://issues.apache.org/jira/browse/HUDI-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3035: -- Fix Version/s: 0.11.0 > Unify Parquet writers > - > > Key: HUDI-3035 > URL: https://issues.apache.org/jira/browse/HUDI-3035 > Project: Apache Hudi > Issue Type: Bug >Reporter: Alexey Kudinkin >Assignee: Alexey Kudinkin >Priority: Blocker > Fix For: 0.11.0 > > > Currently we have at least 3 implementations of the ParquetWriters (which is > 3x more than we actually need): > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/io/storage/row/HoodieRowDataParquetWriter.java] > [https://github.com/apache/hudi/blob/master/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/row/HoodieInternalRowParquetWriter.java] > > Implementations (while identical in principle) have diverged, essentially > living their own lifecycle. -- This message was sent by Atlassian Jira (v8.20.1#820001)