Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
......................................................................


Patch Set 3:

(5 comments)

http://gerrit.cloudera.org:8080/#/c/17062/3//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17062/3//COMMIT_MSG@9
PS3, Line 9: first unexpected remote read of each
           : scanrange
Does this happen for large numbers of scan ranges simultaneously or is it more 
sporadic?

I'm just trying to get a sense of how much output this could produce.


http://gerrit.cloudera.org:8080/#/c/17062/3//COMMIT_MSG@13
PS3, Line 13: Reviewed-by: Aman Sinha <amsi...@cloudera.com>
Nit: Remove this line (gerrit will add these lines when this merges).


http://gerrit.cloudera.org:8080/#/c/17062/3/be/src/runtime/io/hdfs-file-reader.cc
File be/src/runtime/io/hdfs-file-reader.cc:

http://gerrit.cloudera.org:8080/#/c/17062/3/be/src/runtime/io/hdfs-file-reader.cc@81
PS3, Line 81: appendHostList(const hdfsFS hdfs_fs, const char* file_name,
            :     int64_t file_offset, int64_t bytes_to_read, ostream& ostr) {
Nit: Couple style things

It may make sense to make this a private method on HdfsFileReader rather than a 
regular C-style function. The hdfs_fs and file_name both come from the class 
anyway. Either way, it would be nice to have a comment describing the expected 
behavior.

For this, I think it would also be cleaner not to have to pass in the ostream. 
One way to do it would be to use a std::ostringstream to collect the individual 
logging statements and then have this function return a std::string (i.e. 
ostreamstream's str() method). Then the logging would look like:

LOG(INFO) << "Normal log stuff"
          << appendHostList(fs, filename, offset, ...);


http://gerrit.cloudera.org:8080/#/c/17062/3/be/src/runtime/io/hdfs-file-reader.cc@232
PS3, Line 232:       int64_t save_num_remote_bytes = num_remote_bytes_;
Nit: Since we are using this like a boolean, does it make sense to have a 
boolean saying whether we have printed the unexpected remote logging already?


http://gerrit.cloudera.org:8080/#/c/17062/3/be/src/runtime/io/hdfs-file-reader.cc@242
PS3, Line 242: num_remote_bytes_-save_num_remote_bytes
Nit: We only enter this if save_num_remote_bytes==0, so we could drop this.



--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 3
Gerrit-Owner: Kurt Deschler <kdesc...@cloudera.com>
Gerrit-Reviewer: Aman Sinha <amsi...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Joe McDonnell <joemcdonn...@cloudera.com>
Gerrit-Comment-Date: Wed, 17 Feb 2021 01:38:20 +0000
Gerrit-HasComments: Yes

Reply via email to