I configured HDFS to cache file in HDFS's cache, like following:
hdfs cacheadmin -addPool hibench
hdfs cacheadmin -addDirective -path /HiBench/Kmeans/Input -pool hibench
But I didn't see much performance impacts, no matter how I configure
dfs.datanode.max.locked.memory
Is it possible that
Have you read this thread ?
http://search-hadoop.com/m/uOzYttXZcg1M6oKf2/HDFS+cache=RE+hadoop+hdfs+cache+question+do+client+processes+share+cache+
Cheers
On Mon, Jan 25, 2016 at 1:23 PM, Jia Zou wrote:
> I configured HDFS to cache file in HDFS's cache, like following:
Please see also:
http://hadoop.apache.org/docs/r2.7.1/hadoop-project-dist/hadoop-hdfs/CentralizedCacheManagement.html
According to Chris Nauroth, an hdfs committer, it's extremely difficult to
use the feature correctly.
The feature also brings operational complexity. Since off-heap memory is