[ 
https://issues.apache.org/jira/browse/HBASE-28100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17767008#comment-17767008
 ] 

alan.zhao commented on HBASE-28100:
-----------------------------------

hi,My idea is to determine whether to open a new file by obtaining the 
compressed size during HFile generation. However, after reviewing the code flow 
for writing HFile, I find this approach to be quite challenging. Could anyone 
provide me with some suggestions? Thanks.

>  The size that is checked against the maxfilesize threshold is the 
> uncompressed size of the HFile
> -------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-28100
>                 URL: https://issues.apache.org/jira/browse/HBASE-28100
>             Project: HBase
>          Issue Type: Bug
>         Environment: HBase 2.x
>            Reporter: alan.zhao
>            Assignee: alan.zhao
>            Priority: Major
>         Attachments: image-2023-09-20-12-09-49-959.png
>
>
> HBase server is configured to use Snappy compression.when doing bulkload in 
> HBase, the size that is checked against the maxfilesize threshold is the 
> uncompressed size of the HFile, not the compressed size.
> HFileOutputFormat2.class
>  
> {code:java}
> //代码占位符
> new RecordWriter<ImmutableBytesWritable, V>() {
> ....
> @Override
> public void write(ImmutableBytesWritable row, V cell) throws IOException {
> ...
> }
> }
>  {code}
> !image-2023-09-20-12-09-49-959.png!
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to