Sahil Takiar has posted comments on this change. ( http://gerrit.cloudera.org:8080/13357 )

Change subject: IMPALA-8490: [DOCS] Describe the S3 file handle caching feature
......................................................................

Patch Set 3:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/13357/3/docs/topics/impala_scalability.xml
File docs/topics/impala_scalability.xml:

http://gerrit.cloudera.org:8080/#/c/13357/3/docs/topics/impala_scalability.xml@1022
PS3, Line 1022: One scalability aspect that affects heavily loaded clusters is the number of calls made
I think we want to explicitly mention the HDFS NameNode here, as NameNode scalability is a big issue for Impala. Would revise to something like:

"One scalability aspect that affects heavily loaded clusters is the load on the metadata layer from looking up the details as each file is opened. On HDFS that can lead to increased load on the NameNode, and on S3 this can lead to an excessive number of S3 metadata requests."

http://gerrit.cloudera.org:8080/#/c/13357/3/docs/topics/impala_scalability.xml@1032
PS3, Line 1032: You can reduce the number of calls made to your files system by enabling the file handle
"made to your files system" --> "made to your file system's metadata layer"

--
To view, visit http://gerrit.cloudera.org:8080/13357
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I304a0a033475f2289d8a620448d70b90447e4ee1
Gerrit-Change-Number: 13357
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni <arod...@cloudera.com>
Gerrit-Reviewer: Alex Rodoni <arod...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Joe McDonnell <joemcdonn...@cloudera.com>
Gerrit-Reviewer: Sahil Takiar <stak...@cloudera.com>
Gerrit-Comment-Date: Fri, 17 May 2019 00:33:57 +0000
Gerrit-HasComments: Yes