Re: Question regarding data location in hdfs after hbase restarts

2010-10-12 Thread Stack
When you write HDFS, you write N replicas. By default, the first replica is written to the local datanode. Reading, the DFSClient will try to read from the most local replica first. Compactions read from multiple files and write out a single merged file. This newly written files' blocks will al

Re: Question regarding data location in hdfs after hbase restarts

2010-10-12 Thread Jack Levin
Ryan, can you elaborate how compactions create data locality? -Jack On Oct 11, 2010, at 10:12 PM, Ryan Rawson wrote: > We don't attempt to optimize region placement with hdfs locations yet. A > reason why is because on a long lived cluster compactions create the > locality you are looking for.

Re: Question regarding data location in hdfs after hbase restarts

2010-10-11 Thread Ryan Rawson
We don't attempt to optimize region placement with hdfs locations yet. A reason why is because on a long lived cluster compactions create the locality you are looking for. Furthermore, in the old master such an optimization was really hard to do. The new master should make it easier to write such 1

Question regarding data location in hdfs after hbase restarts

2010-10-11 Thread Tao Xie
hi, all I set hdfs replica=1 when running hbase. And DN and RS co-exists on each slave node. So the data in the regions managed by RS will be stored on its local data node, rite? But when I restart hbase and hbase client does gets on RS, datanode will read data from remote data nodes. Does that mea