Copy on write was discussed in HADOOP-334 in the context of periodic checkpointing of the name-space. Other than that, I remember a discussion about a file clone() operation, which makes a new inode but uses the same blocks as the original file; the blocks are copied only once they are modified or appended to. But this functionality would be possible only if we had at least appends. Since HDFS does not support modifications, the only purpose of COW would be to support grouping of blocks from different (or is it just one?) files.
I think it is possible, but very non-POSIX.
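
(For concreteness, a purely conceptual sketch of that clone()-style COW at the metadata level: plain Java, nothing from the actual NameNode code, with made-up inode/block bookkeeping. The clone shares the original's block IDs, and a shared block only gets a private copy when one of the files goes to modify it.)

import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy model: clone() shares block IDs between inodes; a shared block is
// copied to a fresh ID only when one of the files goes to modify it.
public class CowNamespaceSketch {
    private final Map<String, List<Long>> inodeBlocks = new HashMap<String, List<Long>>();
    private final Map<Long, Integer> refCount = new HashMap<Long, Integer>();
    private long nextBlockId = 1;

    public void create(String file, int numBlocks) {
        List<Long> blocks = new ArrayList<Long>();
        for (int i = 0; i < numBlocks; i++) {
            long b = nextBlockId++;
            refCount.put(b, 1);
            blocks.add(b);
        }
        inodeBlocks.put(file, blocks);
    }

    // New inode, same blocks: just bump each block's reference count.
    public void clone(String src, String dst) {
        List<Long> shared = new ArrayList<Long>(inodeBlocks.get(src));
        for (Long b : shared) {
            refCount.put(b, refCount.get(b) + 1);
        }
        inodeBlocks.put(dst, shared);
    }

    // Called before a block of 'file' is modified or appended to.
    public long copyOnWrite(String file, int blockIndex) {
        List<Long> blocks = inodeBlocks.get(file);
        long b = blocks.get(blockIndex);
        if (refCount.get(b) > 1) {              // still shared: copy it first
            refCount.put(b, refCount.get(b) - 1);
            long fresh = nextBlockId++;         // datanodes would copy the bytes
            refCount.put(fresh, 1);
            blocks.set(blockIndex, fresh);
            return fresh;
        }
        return b;                               // sole owner, modify in place
    }
}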
And you can always create one-block files and group them in directories instead.
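
(Roughly, the "grouped" object is then just a directory of one-block part files that a reader streams in name order. Something like the following against the FileSystem API; the directory layout and part naming are made up for the example.)

import java.io.IOException;
import java.io.OutputStream;
import java.util.Arrays;
import java.util.Comparator;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IOUtils;

// Streams a directory of one-block part files as if it were a single file.
public class GroupedObjectReader {
    public static void readTo(Configuration conf, Path groupDir, OutputStream out)
            throws IOException {
        FileSystem fs = FileSystem.get(conf);
        FileStatus[] parts = fs.listStatus(groupDir);
        Arrays.sort(parts, new Comparator<FileStatus>() {  // part-00000, part-00001, ...
            public int compare(FileStatus a, FileStatus b) {
                return a.getPath().getName().compareTo(b.getPath().getName());
            }
        });
        for (FileStatus part : parts) {
            FSDataInputStream in = fs.open(part.getPath());
            try {
                IOUtils.copyBytes(in, out, 64 * 1024, false);  // append this part
            } finally {
                in.close();
            }
        }
    }
}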

--Konstantin

Benjamin Reed wrote:

I need to implement COW for HDFS for a project I'm working on. I vaguely remember it being discussed before, but I can't find any threads about it. I wanted to at least check for interest/previous work before proceeding. Hard links would work for me as well, but they are harder to implement. I was thinking of adding the following to the client protocol:

public void cow(String src, String clientName, boolean overwrite, LocatedBlocks blocks) throws IOException;

The call would simply create a new file and populate its contents with the blocks contained in the LocatedBlocks.

Apart from fast copies, it would also allow fast truncations and extensions of existing files.

(This is not a hard link because it is possible that the set of blocks may not correspond to any other file.)
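
(For illustration, here is roughly how a client might drive it, assuming the cow() above were added to ClientProtocol next to the existing getBlockLocations() call; the helper class and package names are only for the sketch and may differ by version.)

import java.io.IOException;

import org.apache.hadoop.hdfs.protocol.ClientProtocol;
import org.apache.hadoop.hdfs.protocol.LocatedBlocks;

// Sketch only: assumes the proposed cow() exists on ClientProtocol.
public class CowClientSketch {

    // Fast copy: point dst at every block src already has.
    public static void fastCopy(ClientProtocol namenode, String src, String dst,
                                String clientName) throws IOException {
        LocatedBlocks all = namenode.getBlockLocations(src, 0, Long.MAX_VALUE);
        namenode.cow(dst, clientName, true, all);
    }

    // Fast truncation: fetch only the blocks covering the first newLength bytes
    // and build dst from that prefix (block granularity only; a cut inside a
    // block would still need the data rewritten).
    public static void fastTruncate(ClientProtocol namenode, String src, String dst,
                                    String clientName, long newLength) throws IOException {
        LocatedBlocks prefix = namenode.getBlockLocations(src, 0, newLength);
        namenode.cow(dst, clientName, true, prefix);
    }
}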

Has such a thing been discussed before?

thanx
ben

