Re: Controlling the block placement and the file placement in HDFS writes

2014-12-19 Thread Ted Yu
Interesting - HDFS-6133 would directly help HBase data locality use case. On Fri, Dec 19, 2014 at 2:20 PM, Yongjun Zhang wrote: > Hi, > > FYI, > > A relevant jira HDFS-6133 tries to tell Balancer not to move around the > blocks stored at the favored nodes that application selected. I reviewed >

Re: Controlling the block placement and the file placement in HDFS writes

2014-12-19 Thread Yongjun Zhang
Hi, FYI, A relevant jira HDFS-6133 tries to tell Balancer not to move around the blocks stored at the favored nodes that application selected. I reviewed the patch, and the latest on looks good to me. Hope some committers can pick it up and push it forward. Thanks. --Yongjun On Fri, Dec 19, 2

Re: Controlling the block placement and the file placement in HDFS writes

2014-12-19 Thread Ananth Gundabattula
Hello Zhe, Thanks a lot for the inputs. Storage policies is really what I was looking for one of the problems. @Nick: I agree that it would be a nice feature to have. Thanks for the info. Regards, Ananth On Fri, Dec 19, 2014 at 10:49 AM, Nick Dimiduk wrote: > HBase would enjoy a similar funct

Re: Controlling the block placement and the file placement in HDFS writes

2014-12-18 Thread Nick Dimiduk
HBase would enjoy a similar functionality. In our case, we'd like all replicas for all files in a given HDFS path to land on the same set of machines. That way, in the event of a failover, regions can be assigned to one of these other machines that has local access to all blocks for all region file

Re: Controlling the block placement and the file placement in HDFS writes

2014-12-18 Thread Zhe Zhang
> The second aspect is that our queries are time based and this time window > follows a familiar pattern of old data not being queried much. Hence we > would like to preserve the most recent data in the HDFS cache ( impala is > helping us manage this aspect via their command set ) but we would like

Controlling the block placement and the file placement in HDFS writes

2014-12-15 Thread Ananth Gundabattula
Hello All, I was wondering if the following issues can be solved by extending hdfs classes with custom implementations if possible. Here are my requirements : 1. Is there a way to control that all file blocks belonging to a particular hdfs directory & file go to the same physical datanode ( an