[
https://issues.apache.org/jira/browse/HBASE-28100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17767008#comment-17767008
]
alan.zhao commented on HBASE-28100:
-----------------------------------
hi,My idea is to determine whether to open a new file by obtaining the
compressed size during HFile generation. However, after reviewing the code flow
for writing HFile, I find this approach to be quite challenging. Could anyone
provide me with some suggestions? Thanks.
> The size that is checked against the maxfilesize threshold is the
> uncompressed size of the HFile
> -------------------------------------------------------------------------------------------------
>
> Key: HBASE-28100
> URL: https://issues.apache.org/jira/browse/HBASE-28100
> Project: HBase
> Issue Type: Bug
> Environment: HBase 2.x
> Reporter: alan.zhao
> Assignee: alan.zhao
> Priority: Major
> Attachments: image-2023-09-20-12-09-49-959.png
>
>
> HBase server is configured to use Snappy compression.when doing bulkload in
> HBase, the size that is checked against the maxfilesize threshold is the
> uncompressed size of the HFile, not the compressed size.
> HFileOutputFormat2.class
>
> {code:java}
> //代码占位符
> new RecordWriter<ImmutableBytesWritable, V>() {
> ....
> @Override
> public void write(ImmutableBytesWritable row, V cell) throws IOException {
> ...
> }
> }
> {code}
> !image-2023-09-20-12-09-49-959.png!
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)