[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 7: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 7
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 24 Feb 2021 00:23:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..

IMPALA-10504: Add tracing for remote block reads

This patch logs metadata for the first unexpected remote read of each
scanrange when the flag fs_trace_remote_reads is set to true. This
logging is intended to help diagnose the root cause of remote reads.

Since a message may be logged for each scan range, there could be
several hundred lines of output in a degenerate case. However, the
remote read condition is not expected and verbose output may be needed
to diagnose the root cause.

Reviewed-by: Aman Sinha 
Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Reviewed-on: http://gerrit.cloudera.org:8080/17062
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/runtime/io/hdfs-file-reader.cc
M be/src/runtime/io/hdfs-file-reader.h
2 files changed, 44 insertions(+), 0 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 8
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 7: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 7
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Tue, 23 Feb 2021 18:38:55 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 7:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6915/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 7
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Tue, 23 Feb 2021 18:38:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-23 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 6: Code-Review+2

Looks good. Thanks for adding this!


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 6
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Tue, 23 Feb 2021 16:26:08 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-22 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/8201/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 6
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Tue, 23 Feb 2021 04:25:31 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-22 Thread Kurt Deschler (Code Review)
Hello Aman Sinha, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17062

to look at the new patch set (#6).

Change subject: IMPALA-10504: Add tracing for remote block reads
..

IMPALA-10504: Add tracing for remote block reads

This patch logs metadata for the first unexpected remote read of each
scanrange when the flag fs_trace_remote_reads is set to true. This
logging is intended to help diagnose the root cause of remote reads.

Since a message may be logged for each scan range, there could be
several hundred lines of output in a degenerate case. However, the
remote read condition is not expected and verbose output may be needed
to diagnose the root cause.

Reviewed-by: Aman Sinha 
Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
---
M be/src/runtime/io/hdfs-file-reader.cc
M be/src/runtime/io/hdfs-file-reader.h
2 files changed, 44 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/62/17062/6
--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 6
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-22 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 5:

This is looking good to me. Please wrap the long lines, then I'll bump to +2.


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 5
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Mon, 22 Feb 2021 18:54:45 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-22 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 5: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 5
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Mon, 22 Feb 2021 18:54:49 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-21 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/8183/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 5
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Sun, 21 Feb 2021 21:24:00 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-21 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 5:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/17062/5/be/src/runtime/io/hdfs-file-reader.h
File be/src/runtime/io/hdfs-file-reader.h:

http://gerrit.cloudera.org:8080/#/c/17062/5/be/src/runtime/io/hdfs-file-reader.h@81
PS5, Line 81:   /// Return a string that contains the block indexes and list of 
hosts where each block resides
line too long (96 > 90)


http://gerrit.cloudera.org:8080/#/c/17062/5/be/src/runtime/io/hdfs-file-reader.cc
File be/src/runtime/io/hdfs-file-reader.cc:

http://gerrit.cloudera.org:8080/#/c/17062/5/be/src/runtime/io/hdfs-file-reader.cc@84
PS5, Line 84: std::string HdfsFileReader::GetHostList(int64_t file_offset, 
int64_t bytes_to_read) const {
line too long (91 > 90)


http://gerrit.cloudera.org:8080/#/c/17062/5/be/src/runtime/io/hdfs-file-reader.cc@85
PS5, Line 85:   char*** hosts = hdfsGetHosts(hdfs_fs_, 
scan_range_->file_string()->c_str(), file_offset, bytes_to_read);
line too long (106 > 90)


http://gerrit.cloudera.org:8080/#/c/17062/5/be/src/runtime/io/hdfs-file-reader.cc@240
PS5, Line 240:   if (FLAGS_fs_trace_remote_reads && expected_local_ && 
num_remote_bytes_ > 0 && is_first_read) {
line too long (101 > 90)



--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 5
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Sun, 21 Feb 2021 21:02:31 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-21 Thread Kurt Deschler (Code Review)
Hello Aman Sinha, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17062

to look at the new patch set (#5).

Change subject: IMPALA-10504: Add tracing for remote block reads
..

IMPALA-10504: Add tracing for remote block reads

This patch logs metadata for the first unexpected remote read of each
scanrange when the flag fs_trace_remote_reads is set to true. This
logging is intended to help diagnose the root cause of remote reads.

Since a message may be logged for each scan range, there could be
several hundred lines of output in a degenerate case. However, the
remote read condition is not expected and verbose output may be needed
to diagnose the root cause.

Reviewed-by: Aman Sinha 
Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
---
M be/src/runtime/io/hdfs-file-reader.cc
M be/src/runtime/io/hdfs-file-reader.h
2 files changed, 41 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/62/17062/5
--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 5
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-17 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 4:

(2 comments)

Thanks for fixing the last round of comments. Code-wise, this is close.

We've had issues with logging verbosity in the past, and we may want to limit 
this or have it off by default.

http://gerrit.cloudera.org:8080/#/c/17062/4/be/src/runtime/io/hdfs-file-reader.h
File be/src/runtime/io/hdfs-file-reader.h:

http://gerrit.cloudera.org:8080/#/c/17062/4/be/src/runtime/io/hdfs-file-reader.h@82
PS4, Line 82: vb1540.halxg.cloudera.com
Nit: we would rather not have cloudera.com hostnames in the source code.


http://gerrit.cloudera.org:8080/#/c/17062/4/be/src/runtime/io/hdfs-file-reader.cc
File be/src/runtime/io/hdfs-file-reader.cc:

http://gerrit.cloudera.org:8080/#/c/17062/4/be/src/runtime/io/hdfs-file-reader.cc@239
PS4, Line 239: LOG(INFO)
 : << "First remote read of scan range on file "
 : << *scan_range_->file_string()
 : << " " << num_remote_bytes_
 : << " bytes. offsets " << file_offset
 : << "-" << file_offset+bytes_to_read-1
 : << GetHostList(file_offset, bytes_to_read);
The only remaining thought is that we may want a way to throttle or disable 
this. If we are reading parquet, the columns are usually on the same HDFS 
block, so if one column is hitting it, the others may also hit it. Does this 
need to be enabled by default? Could it be enabled when someone sees a problem?

One option is to add a startup flag to control whether to produce this output. 
On startup, there can be some logic to turn it on for configurations where it 
is especially useful.



--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 4
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 17 Feb 2021 23:24:15 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-17 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 4:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/8147/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 4
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 17 Feb 2021 18:26:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-17 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 4:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/17062/4/be/src/runtime/io/hdfs-file-reader.h
File be/src/runtime/io/hdfs-file-reader.h:

http://gerrit.cloudera.org:8080/#/c/17062/4/be/src/runtime/io/hdfs-file-reader.h@81
PS4, Line 81:   /// Return a string that contains the block indexes and list of 
hosts where each block resides
line too long (96 > 90)


http://gerrit.cloudera.org:8080/#/c/17062/4/be/src/runtime/io/hdfs-file-reader.h@82
PS4, Line 82:   /// i.e. [0] { vb1540.halxg.cloudera.com, 
vb1541.halxg.cloudera.com, vb1539.halxg.cloudera.com }
line too long (98 > 90)


http://gerrit.cloudera.org:8080/#/c/17062/4/be/src/runtime/io/hdfs-file-reader.cc
File be/src/runtime/io/hdfs-file-reader.cc:

http://gerrit.cloudera.org:8080/#/c/17062/4/be/src/runtime/io/hdfs-file-reader.cc@81
PS4, Line 81: std::string HdfsFileReader::GetHostList(int64_t file_offset, 
int64_t bytes_to_read) const {
line too long (91 > 90)


http://gerrit.cloudera.org:8080/#/c/17062/4/be/src/runtime/io/hdfs-file-reader.cc@82
PS4, Line 82:   char*** hosts = hdfsGetHosts(hdfs_fs_, 
scan_range_->file_string()->c_str(), file_offset, bytes_to_read);
line too long (106 > 90)



--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 4
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 17 Feb 2021 18:06:51 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-17 Thread Kurt Deschler (Code Review)
Hello Aman Sinha, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17062

to look at the new patch set (#4).

Change subject: IMPALA-10504: Add tracing for remote block reads
..

IMPALA-10504: Add tracing for remote block reads

This patch logs metadata for the first unexpected remote read of each
scanrange. This logging is intended to help diagnose the root cause of
remote reads. Since a message may be logged for each scan range,
there could be several hundred lines of output in a degenerate case.
However, the remote read condition is not expected and verbose
output may be needed to diagnose the root cause.

Reviewed-by: Aman Sinha 
Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
---
M be/src/runtime/io/hdfs-file-reader.cc
M be/src/runtime/io/hdfs-file-reader.h
2 files changed, 38 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/62/17062/4
--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 4
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-16 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 3:

(5 comments)

http://gerrit.cloudera.org:8080/#/c/17062/3//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17062/3//COMMIT_MSG@9
PS3, Line 9: first unexpected remote read of each
   : scanrange
Does this happen for large numbers of scan ranges simultaneously or is it more 
sporadic?

I'm just trying to get a sense of how much output this could produce.


http://gerrit.cloudera.org:8080/#/c/17062/3//COMMIT_MSG@13
PS3, Line 13: Reviewed-by: Aman Sinha 
Nit: Remove this line (gerrit will add these lines when this merges).


http://gerrit.cloudera.org:8080/#/c/17062/3/be/src/runtime/io/hdfs-file-reader.cc
File be/src/runtime/io/hdfs-file-reader.cc:

http://gerrit.cloudera.org:8080/#/c/17062/3/be/src/runtime/io/hdfs-file-reader.cc@81
PS3, Line 81: appendHostList(const hdfsFS hdfs_fs, const char* file_name,
: int64_t file_offset, int64_t bytes_to_read, ostream& ostr) {
Nit: Couple style things

It may make sense to make this a private method on HdfsFileReader rather than a 
regular C-style function. The hdfs_fs and file_name both come from the class 
anyway. Either way, it would be nice to have a comment describing the expected 
behavior.

For this, I think it would also be cleaner not to have to pass in the ostream. 
One way to do it would be to use a std::ostringstream to collect the individual 
logging statements and then have this function return a std::string (i.e. 
ostreamstream's str() method). Then the logging would look like:

LOG(INFO) << "Normal log stuff"
  << appendHostList(fs, filename, offset, ...);


http://gerrit.cloudera.org:8080/#/c/17062/3/be/src/runtime/io/hdfs-file-reader.cc@232
PS3, Line 232:   int64_t save_num_remote_bytes = num_remote_bytes_;
Nit: Since we are using this like a boolean, does it make sense to have a 
boolean saying whether we have printed the unexpected remote logging already?


http://gerrit.cloudera.org:8080/#/c/17062/3/be/src/runtime/io/hdfs-file-reader.cc@242
PS3, Line 242: num_remote_bytes_-save_num_remote_bytes
Nit: We only enter this if save_num_remote_bytes==0, so we could drop this.



--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 3
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 17 Feb 2021 01:38:20 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-12 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/8131/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 3
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 12 Feb 2021 18:11:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-12 Thread Kurt Deschler (Code Review)
Hello Aman Sinha, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17062

to look at the new patch set (#3).

Change subject: IMPALA-10504: Add tracing for remote block reads
..

IMPALA-10504: Add tracing for remote block reads

This patch logs metadata for the first unexpected remote read of each
scanrange. This logging is intended to help diagnose the root cause of
remote reads.

Reviewed-by: Aman Sinha 
Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
---
M be/src/runtime/io/hdfs-file-reader.cc
1 file changed, 32 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/62/17062/3
--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 3
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-12 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/8130/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 2
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 12 Feb 2021 17:47:52 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-12 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/8129/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 1
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 12 Feb 2021 17:45:34 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-12 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 2:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/17062/2/be/src/runtime/io/hdfs-file-reader.cc
File be/src/runtime/io/hdfs-file-reader.cc:

http://gerrit.cloudera.org:8080/#/c/17062/2/be/src/runtime/io/hdfs-file-reader.cc@81
PS2, Line 81: void appendHostList(const hdfsFS hdfs_fs, const char* file_name, 
int64_t file_offset, int64_t bytes_to_read, ostream& ostr) {
line too long (125 > 90)


http://gerrit.cloudera.org:8080/#/c/17062/2/be/src/runtime/io/hdfs-file-reader.cc@236
PS2, Line 236: appendHostList(hdfs_fs_, 
scan_range_->file_string()->c_str(), file_offset, bytes_to_read,
line too long (97 > 90)



--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 2
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 12 Feb 2021 17:29:08 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-12 Thread Kurt Deschler (Code Review)
Hello Aman Sinha, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17062

to look at the new patch set (#2).

Change subject: IMPALA-10504: Add tracing for remote block reads
..

IMPALA-10504: Add tracing for remote block reads

This patch logs metadata for the first unexpected remote read of each
scanrange. This logging is intended to help diagnose the root cause of
remote reads.

Reviewed-by: Aman Sinha 
Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
---
M be/src/runtime/io/hdfs-file-reader.cc
1 file changed, 30 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/62/17062/2
--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 2
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-12 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17062 )

Change subject: IMPALA-10504: Add tracing for remote block reads
..


Patch Set 1:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/17062/1/be/src/runtime/io/hdfs-file-reader.cc
File be/src/runtime/io/hdfs-file-reader.cc:

http://gerrit.cloudera.org:8080/#/c/17062/1/be/src/runtime/io/hdfs-file-reader.cc@81
PS1, Line 81: void appendHostList(const hdfsFS hdfs_fs, const char* file_name, 
int64_t file_offset, int64_t bytes_to_read, ostream& ostr) {
line too long (125 > 90)


http://gerrit.cloudera.org:8080/#/c/17062/1/be/src/runtime/io/hdfs-file-reader.cc@236
PS1, Line 236: appendHostList(hdfs_fs_, 
scan_range_->file_string()->c_str(), file_offset, bytes_to_read,
line too long (97 > 90)



--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 1
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 12 Feb 2021 17:27:21 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10504: Add tracing for remote block reads

2021-02-12 Thread Kurt Deschler (Code Review)
Hello Aman Sinha,

I'd like you to do a code review. Please visit

http://gerrit.cloudera.org:8080/17062

to review the following change.


Change subject: IMPALA-10504: Add tracing for remote block reads
..

IMPALA-10504: Add tracing for remote block reads

This patch logs metadata for the first unexpected remote read of each
scanrange. This logging is intended to help diagnose the root cause of
remote reads.

Reviewed-by: Aman Sinha 
Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
---
M be/src/runtime/io/hdfs-file-reader.cc
1 file changed, 30 insertions(+), 0 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/62/17062/1
--
To view, visit http://gerrit.cloudera.org:8080/17062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I8c6a3e92f44813048022edf2b91299b3b0a20257
Gerrit-Change-Number: 17062
Gerrit-PatchSet: 1
Gerrit-Owner: Kurt Deschler 
Gerrit-Reviewer: Aman Sinha