This is an automated email from the ASF dual-hosted git repository.
danny0405 pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git
from 59a5d889bbe9 fix: add all fields in HoodieSourceSplitSerializer
(#18243)
add 363f41acbbda fix: [HUDI-CLUSTERING] Optimize binary copy performance
with lazy loading, bulk reads, and double buffering (#18241)
No new revisions were added by this update.
Summary of changes:
.../parquet/io/ByteArraySeekableInputStream.java | 125 +++++++
.../parquet/io/HoodieParquetBinaryCopyBase.java | 388 ++++++++-------------
.../parquet/io/HoodieParquetFileBinaryCopier.java | 316 +++++++++++++++--
.../io/TestByteArraySeekableInputStream.java | 195 +++++++++++
...HoodieParquetBinaryCopyBaseSchemaEvolution.java | 132 ++++---
.../io/TestHoodieParquetFileBinaryCopier.java | 101 +++++-
.../TestHoodieParquetFileBinaryCopierPrefetch.java | 142 ++++++++
.../io/TestOutputStreamBackedOutputFile.java | 83 +++++
style/checkstyle-suppressions.xml | 1 +
9 files changed, 1145 insertions(+), 338 deletions(-)
create mode 100644
hudi-hadoop-common/src/main/java/org/apache/hudi/parquet/io/ByteArraySeekableInputStream.java
create mode 100644
hudi-hadoop-common/src/test/java/org/apache/hudi/parquet/io/TestByteArraySeekableInputStream.java
create mode 100644
hudi-hadoop-common/src/test/java/org/apache/hudi/parquet/io/TestHoodieParquetFileBinaryCopierPrefetch.java
create mode 100644
hudi-hadoop-common/src/test/java/org/apache/hudi/parquet/io/TestOutputStreamBackedOutputFile.java