Paul Rogers has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11227 )

Change subject: IMPALA-7047. Refreshing partitions should not make an RPC per 
file
......................................................................


Patch Set 4:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/11227/4//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11227/4//COMMIT_MSG@12
PS4, Line 12: since the prior load, it was instead calling it on every file.
General question. In HDFS, we can tell if a file was added or removed. Once 
written, HDFS files are immutable, so we don't have to worry about any file 
"changing."

Still, HDFS could rebalance blocks. In the present code, we have no way to know 
that block assignments changed and will retain stale block info in the cache.

Do we know if HDFS migrates blocks in any of our supported configurations? 
Since stale blocks stay in the cache, do we have a way in the query to detect 
stale blocks and fetch new block info? Say, at scan time?



--
To view, visit http://gerrit.cloudera.org:8080/11227
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2051b96599206164aaa06ecbdf64374c46eda956
Gerrit-Change-Number: 11227
Gerrit-PatchSet: 4
Gerrit-Owner: Todd Lipcon <t...@apache.org>
Gerrit-Reviewer: Bharath Vissapragada <bhara...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Paul Rogers <par0...@yahoo.com>
Gerrit-Reviewer: Todd Lipcon <t...@apache.org>
Gerrit-Reviewer: Vuk Ercegovac <vercego...@cloudera.com>
Gerrit-Comment-Date: Tue, 27 Nov 2018 06:14:21 +0000
Gerrit-HasComments: Yes

Reply via email to