Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17771 )

Change subject: WiP: IMPALA-10798 : Prototype for JSON reader
......................................................................


Patch Set 22:

(33 comments)

http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-json-scanner.cc
File be/src/exec/hdfs-json-scanner.cc:

http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-json-scanner.cc@171
PS22, Line 171:     // VLOG_QUERY << "decimal128" << 
arrow::decimal128(ct.precision, ct.scale)->ToString();
line too long (91 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc
File be/src/exec/hdfs-scan-node-base.cc:

http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@70
PS22, Line 70:                                           "for all reads, 
regardless of whether the read is local or remote. By default, the "
line too long (126 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@71
PS22, Line 71:                                           "IO data cache is only 
used if the data is expected to be remote. Used by tests.");
line too long (125 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@79
PS22, Line 79:                                                        " across 
all Disk I/O threads in HDFS read operations.");
line too long (112 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@81
PS22, Line 81:                                                            " 
spent across all Disk I/O threads in HDFS open operations.");
line too long (122 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@84
PS22, Line 84:                              " while it is executing I/O 
operations on behalf of a scan.");
line too long (91 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@91
PS22, Line 91:                                                                  
 "disks accessed by HDFS scan. Each local disk is counted as a disk and each 
type of"
line too long (150 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@92
PS22, Line 92:                                                                  
 " remote filesystem (e.g. HDFS remote reads, S3) is counted as a distinct 
disk.");
line too long (148 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@94
PS22, Line 94:                                                                  
             " average number of HDFS read threads executing read operations on 
behalf of this "
line too long (161 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@95
PS22, Line 95:                                                                  
             "scan. Higher values (i.e. close to the aggregate number of I/O 
threads across "
line too long (158 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@96
PS22, Line 96:                                                                  
             "all disks accessed) show that this scan is using a larger 
proportion of the I/O "
line too long (160 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@97
PS22, Line 97:                                                                  
             "capacity of the system. Lower values show that either this scan 
is not I/O bound"
line too long (160 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@98
PS22, Line 98:                                                                  
             " or that it is getting a small share of the I/O capacity of the 
system.");
line too long (153 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@106
PS22, Line 106:                   "Use this to determine if the scan got all of 
the reservation it wanted. Does not "
line too long (101 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@107
PS22, Line 107:                   "include subsequent reservation increases 
done by scanner implementation "
line too long (92 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@127
PS22, Line 127:                                                     "threads 
spent waiting for I/O. This value can be compared to the value of "
line too long (128 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@128
PS22, Line 128:                                                     
"ScannerThreadsTotalWallClockTime of MT_DOP = 0 scan nodes or otherwise 
compared "
line too long (134 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@129
PS22, Line 129:                                                     "to the 
total time reported for MT_DOP > 0 scan nodes. High values show that "
line too long (130 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@130
PS22, Line 130:                                                     "scanner 
threads are spending significant time waiting for I/O instead of "
line too long (127 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@131
PS22, Line 131:                                                     "processing 
data. Note that this includes the time when the thread is runnable "
line too long (132 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@135
PS22, Line 135:                   "Each sample in the counter is the size of a 
single column that is scanned by the "
line too long (101 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@139
PS22, Line 139:                   "Each sample in the counter is the size of a 
single column that is scanned by the "
line too long (101 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@747
PS22, Line 747:                  metadata->partition_id, 
FilterStats::FILES_KEY, filter_ctxs, file, state)) {
line too long (93 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@1231
PS22, Line 1231:                                                                
  "Read $0 of data across network that was expected to be local. Block locality 
"
line too long (144 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@1232
PS22, Line 1232:                                                                
  "metadata for table '$1.$2' may be stale. This only affects query performance 
"
line too long (144 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@1233
PS22, Line 1233:                                                                
  "and not result correctness. One of the common causes for this warning is 
HDFS "
line too long (145 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@1234
PS22, Line 1234:                                                                
  "rebalancer moving some of the file's blocks. If the issue persists, consider 
"
line too long (144 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@1235
PS22, Line 1235:                                                                
  "running \"INVALIDATE METADATA `$1`.`$2`\".",
line too long (110 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@1236
PS22, Line 1236:                                                                
  PrettyPrinter::Print(unexpected_remote_bytes_->value(), TUnit::BYTES),
line too long (135 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/be/src/exec/hdfs-scan-node-base.cc@1237
PS22, Line 1237:                                                                
  hdfs_table_->database(), hdfs_table_->name())));
line too long (113 > 90)


http://gerrit.cloudera.org:8080/#/c/17771/22/bin/bootstrap_toolchain.py
File bin/bootstrap_toolchain.py:

http://gerrit.cloudera.org:8080/#/c/17771/22/bin/bootstrap_toolchain.py@489
PS22, Line 489: "
flake8: E501 line too long (98 > 90 characters)


http://gerrit.cloudera.org:8080/#/c/17771/22/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
File fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java:

http://gerrit.cloudera.org:8080/#/c/17771/22/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java@475
PS22, Line 475:    
line has trailing whitespace


http://gerrit.cloudera.org:8080/#/c/17771/22/tests/query_test/test_tpch_queries.py
File tests/query_test/test_tpch_queries.py:

http://gerrit.cloudera.org:8080/#/c/17771/22/tests/query_test/test_tpch_queries.py@39
PS22, Line 39: s
flake8: E501 line too long (96 > 90 characters)



--
To view, visit http://gerrit.cloudera.org:8080/17771
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If79364a421d862d0d837f9be694911e388d4d629
Gerrit-Change-Number: 17771
Gerrit-PatchSet: 22
Gerrit-Owner: Anonymous Coward <shikha.asran...@gmail.com>
Gerrit-Reviewer: Aman Sinha <amsi...@cloudera.com>
Gerrit-Reviewer: Anonymous Coward <pranav.lo...@cloudera.com>
Gerrit-Reviewer: Anonymous Coward <shikha.asran...@gmail.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Quanlong Huang <huangquanl...@gmail.com>
Gerrit-Comment-Date: Thu, 10 Nov 2022 06:18:08 +0000
Gerrit-HasComments: Yes

Reply via email to