Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/20482 )

Change subject: IMPALA-12431: Support reading compressed JSON file
......................................................................


Patch Set 4:

(4 comments)

Sorry for the late review; I have been busy with other work. Good to see the 
patch passed all the tests. I just have some minor comments.

http://gerrit.cloudera.org:8080/#/c/20482/4/be/src/exec/json/hdfs-json-scanner.cc
File be/src/exec/json/hdfs-json-scanner.cc:

http://gerrit.cloudera.org:8080/#/c/20482/4/be/src/exec/json/hdfs-json-scanner.cc@283
PS4, Line 283: Status HdfsJsonScanner::DecompressFileToBuffer(uint8** buffer, 
int64_t* bytes_read) {
Most of the code in this method is copied from 
HdfsTextScanner::FillByteBufferCompressedFile(). Can we consolidate them into 
one?
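
To illustrate the kind of consolidation meant here, a rough sketch follows. 
All names in it (SketchStatus, SharedDecompressHelper, TextScannerSketch, 
JsonScannerSketch) are made up for illustration and are not the real Impala 
classes, and the "codec" is a toy run-length decoder rather than a real 
decompressor. The only point is that both scanners delegate to one shared 
implementation instead of each carrying a copy.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a status type; not Impala's Status class.
struct SketchStatus {
  bool ok;
  std::string msg;
};

// Hypothetical shared helper. The "codec" is a toy run-length decoder over
// (count, byte) pairs, used only so the sketch has something to decompress.
class SharedDecompressHelper {
 public:
  explicit SharedDecompressHelper(std::vector<uint8_t> compressed)
      : compressed_(std::move(compressed)) {}

  // Decodes the whole input into an internal buffer and exposes it through
  // the same (buffer, bytes_read) out-parameter shape as the reviewed method.
  SketchStatus DecompressFileToBuffer(uint8_t** buffer, int64_t* bytes_read) {
    assert(buffer != nullptr && bytes_read != nullptr);
    if (compressed_.size() % 2 != 0) return {false, "truncated input"};
    out_.clear();
    for (std::size_t i = 0; i < compressed_.size(); i += 2) {
      // Append compressed_[i] copies of the byte compressed_[i + 1].
      out_.insert(out_.end(), static_cast<std::size_t>(compressed_[i]),
                  compressed_[i + 1]);
    }
    *buffer = out_.data();
    *bytes_read = static_cast<int64_t>(out_.size());
    return {true, ""};
  }

 private:
  std::vector<uint8_t> compressed_;
  std::vector<uint8_t> out_;
};

// Both scanner sketches forward to the shared helper, so the logic lives in
// exactly one place.
class TextScannerSketch {
 public:
  explicit TextScannerSketch(std::vector<uint8_t> c) : helper_(std::move(c)) {}
  SketchStatus FillByteBufferCompressedFile(uint8_t** b, int64_t* n) {
    return helper_.DecompressFileToBuffer(b, n);
  }

 private:
  SharedDecompressHelper helper_;
};

class JsonScannerSketch {
 public:
  explicit JsonScannerSketch(std::vector<uint8_t> c) : helper_(std::move(c)) {}
  SketchStatus DecompressFileToBuffer(uint8_t** b, int64_t* n) {
    return helper_.DecompressFileToBuffer(b, n);
  }

 private:
  SharedDecompressHelper helper_;
};
```

Whether the shared code should live in a common base class, a helper object 
like this, or a free function is a design choice for the patch author; the 
sketch does not argue for one over the others.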


http://gerrit.cloudera.org:8080/#/c/20482/4/be/src/exec/json/hdfs-json-scanner.cc@329
PS4, Line 329: Status HdfsJsonScanner::DecompressStreamToBuffer(uint8** buffer, 
int64_t* bytes_read) {
Most of the code in this method comes from 
HdfsTextScanner::FillByteBufferCompressedStream(). It'd be nice if we could 
consolidate them into one.


http://gerrit.cloudera.org:8080/#/c/20482/4/be/src/exec/json/hdfs-json-scanner.cc@333
PS4, Line 333:     data_buffer_pool_->FreeAll();
Is it safe to free all? Shouldn't we attach the memory to the previous row 
batch (its tuple_data_pool)?
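
To make the concern concrete, here is a toy model of the two options. ToyPool 
and ToyRowBatch are hypothetical stand-ins written for this comment, not 
Impala's MemPool or RowBatch, and they only model chunk ownership. The sketch 
assumes a row batch may still hold pointers into buffers allocated from the 
scanner's pool; under that assumption, freeing leaves the pointers dangling, 
while transferring ownership keeps them valid.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <utility>
#include <vector>

// Toy pool: NOT Impala's MemPool. It only tracks which pool owns each chunk.
class ToyPool {
 public:
  // Allocates a zero-initialized chunk owned by this pool.
  uint8_t* Allocate(std::size_t n) {
    chunks_.push_back(std::make_unique<uint8_t[]>(n));
    return chunks_.back().get();
  }

  // Releases every chunk; pointers handed out earlier become dangling.
  void FreeAll() { chunks_.clear(); }

  // Moves all chunks from src into this pool; earlier pointers stay valid
  // for as long as this pool lives.
  void AcquireData(ToyPool* src) {
    assert(src != nullptr && src != this);
    for (auto& chunk : src->chunks_) chunks_.push_back(std::move(chunk));
    src->chunks_.clear();
  }

  std::size_t num_chunks() const { return chunks_.size(); }

 private:
  std::vector<std::unique_ptr<uint8_t[]>> chunks_;
};

// Toy row batch: owns a pool and may reference memory it did not allocate.
struct ToyRowBatch {
  ToyPool tuple_data_pool;
  const uint8_t* referenced = nullptr;
};
```

With this model, calling batch.tuple_data_pool.AcquireData(&data_buffer_pool) 
before reusing the scanner's pool keeps batch.referenced readable, whereas 
data_buffer_pool.FreeAll() would not.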


http://gerrit.cloudera.org:8080/#/c/20482/4/be/src/exec/json/hdfs-json-scanner.cc@367
PS4, Line 367:     uint8_t** decompressed_buffer, int64_t* decompressed_len, 
bool *eosr) {
This method seems to be exactly the same as 
HdfsTextScanner::DecompressBufferStream(). Can we use that instead? Or did I 
miss something?



--
To view, visit http://gerrit.cloudera.org:8080/20482
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2471855d97d4cdd51363b321055e6b06aa6d81e8
Gerrit-Change-Number: 20482
Gerrit-PatchSet: 4
Gerrit-Owner: Zihao Ye <eyiz...@163.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Quanlong Huang <huangquanl...@gmail.com>
Gerrit-Reviewer: Zihao Ye <eyiz...@163.com>
Gerrit-Comment-Date: Tue, 05 Dec 2023 08:55:19 +0000
Gerrit-HasComments: Yes
