1804 looks promising for the longer term. But yes, I suspect a
rebalancer would be useful. I'll work on an offline one first.

Kevin

On Thu, Apr 4, 2013 at 10:45 AM, Harsh J <ha...@cloudera.com> wrote:
> If you're looking for the online solution, Aaron's just posted a
> working implementation of it at
> https://issues.apache.org/jira/browse/HDFS-1804.
>
> For the offline or asynchronous disk balancer discussed by
> https://issues.apache.org/jira/browse/HDFS-1312, if you want your tool
> to be part of the upstream project, I'd encourage first posting your
> design for vetting/comments followed by the implementation, so that
> all finer points get covered. The offline tool is the easiest to
> write, and can also exist in Python (outside of HDFS, hosted over some
> GitHub repo perhaps) as it doesn't really have to work with the DN or
> NN's protocol calls. Understanding the block data directory structure
> (ls -l one of your dfs.data.dirs/dfs.datanode.data.dirs and follow)
> should let you write one up easily.
>
> On Wed, Apr 3, 2013 at 6:36 PM, Kevin Lyda <ke...@ie.suberic.net> wrote:
>> I've been following https://issues.apache.org/jira/browse/HDFS-1312
>> and really need the balancing tool described therein. I'd be
>> interested in writing it, but am not sure where to start. I'm more
>> comfortable in Python, but I suspect it has a better chance of being
>> integrated if I do it in Java.
>>
>> Is hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop the
>> place to look for interfaces to manipulate the filesystem?
>>
>> Kevin
>>
>> --
>> Kevin Lyda
>> Galway, Ireland
>> US Citizen overseas? We can vote.
>> Register now: http://www.votefromabroad.org/
>
>
>
> --
> Harsh J



-- 
Kevin Lyda
Galway, Ireland
US Citizen overseas? We can vote.
Register now: http://www.votefromabroad.org/

Reply via email to