Alex Behm has posted comments on this change.

Change subject: IMPALA-4172/IMPALA-3653: Improvements to block metadata loading
......................................................................


Patch Set 12: Code-Review+1

(7 comments)

I'm pretty happy with the changes. Let's let Dimitris take a look.

http://gerrit.cloudera.org:8080/#/c/5148/12//COMMIT_MSG
Commit Message:

Line 15: We loop throuh each and every file in the table/partition directories
through


Line 17: This results in large no. of RPC calls to namenode, especially with
no. -> number

to the NameNode

especially for large tables


Line 35: 
mention the behavior change of REFRESH


http://gerrit.cloudera.org:8080/#/c/5148/12/fe/src/main/java/org/apache/impala/catalog/DiskIdMapper.java
File fe/src/main/java/org/apache/impala/catalog/DiskIdMapper.java:

Line 31:  * - To maintain consistent mapping across all the table instances so 
that the disk thread
... a consistent mapping ... so that the assignment of scan ranges to I/O 
threads is balanced and consistent for all scans on the same host.


http://gerrit.cloudera.org:8080/#/c/5148/12/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
File fe/src/main/java/org/apache/impala/catalog/HdfsTable.java:

Line 288:    * Queries the filesystem to load the file block metadata (e.g. DFS 
blocks) for the
Suggest rephrasing:

Drops and re-loads the block metadata for all partitions in 'partsByPath' whose 
location is under the given 'dirPath'. It involves the following steps:
...


Line 296:    *   and enumerate all its blocks and their corresponding hosts and 
disk IDs.
remove part about disk ids, I think that's the next point


Line 778:     LOG.debug("partsByPath size: " + partsByPath.size());
check log lvl


-- 
To view, visit http://gerrit.cloudera.org:8080/5148
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: comment
Gerrit-Change-Id: Ie127658172e6e70dae441374530674a4ac9d5d26
Gerrit-PatchSet: 12
Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-Owner: Bharath Vissapragada <bhara...@cloudera.com>
Gerrit-Reviewer: Alex Behm <alex.b...@cloudera.com>
Gerrit-Reviewer: Bharath Vissapragada <bhara...@cloudera.com>
Gerrit-Reviewer: Mostafa Mokhtar <mmokh...@cloudera.com>
Gerrit-HasComments: Yes

Reply via email to