When you write to HDFS, you write N replicas. By default, the first
replica is written to the local datanode. When reading, the DFSClient will
try to read from the most local replica first.
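
To make the write and read behavior above concrete, here is a toy model in Python. It is only a sketch: real HDFS placement is rack-aware and more involved, and the names here (place_replicas, pick_replica, dn1..dn4) are made up for illustration and are not HDFS APIs.

```python
import random

def place_replicas(writer, nodes, n, rng):
    """Toy placement: first replica on the writer's own node, the
    remaining n-1 on other nodes chosen at random (assumption: no
    rack awareness, unlike real HDFS)."""
    others = [node for node in nodes if node != writer]
    return [writer] + rng.sample(others, n - 1)

def pick_replica(reader, replicas):
    """Toy read choice: use a replica on the reader's own node if one
    exists, otherwise fall back to the first replica listed."""
    return reader if reader in replicas else replicas[0]

nodes = ["dn1", "dn2", "dn3", "dn4"]   # hypothetical datanodes
rng = random.Random(0)

# A writer running on dn2 writes a block with 3 replicas.
replicas = place_replicas("dn2", nodes, 3, rng)

# A reader on dn2 finds a local replica.
local_read = pick_replica("dn2", replicas)

# The one node holding no replica has to read remotely.
outsider = next(node for node in nodes if node not in replicas)
remote_read = pick_replica(outsider, replicas)
```

In this model the writer's node always holds a copy, so a reader on that same node reads locally, while a reader on a node without a replica is forced to go over the network.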
Compactions read from multiple files and write out a single merged
file. This newly written file's blocks will al
Ryan, can you elaborate on how compactions create data locality?
-Jack
On Oct 11, 2010, at 10:12 PM, Ryan Rawson wrote:
> We don't attempt to optimize region placement with HDFS locations yet.
> One reason is that on a long-lived cluster, compactions create the
> locality you are looking for.
We don't attempt to optimize region placement with HDFS locations yet.
One reason is that on a long-lived cluster, compactions create the
locality you are looking for. Furthermore, in the old master such an
optimization was really hard to do. The new master should make it easier to
write such 1
Hi all,
I set the HDFS replication factor to 1 when running HBase, and a DN and an
RS coexist on each slave node. So the data in the regions managed by an RS
will be stored on its local datanode, right?
But when I restart HBase and an HBase client does gets on an RS, the
datanode will read data from remote datanodes. Does that mea