Re: raid5 + HDFS

2015-10-04 Thread Jim Dowling
On 2015-10-05 00:00, Martin Tippmann wrote: 2015-10-03 16:50 GMT+02:00 Jim Dowling <jdowl...@sics.se>: As you point out, HDFS does its own checksumming of blocks, which is needed as blocks are transferred over the network. So, yes, it is double checksumming if

raid5 + HDFS

2015-10-03 Thread Jim Dowling
On Saturday, October 3, 2015, Martin Tippmann wrote: 2015-10-03 0:07 GMT+02:00 Jim Dowling: > Hi Hi, I'm not a btrfs developer but we run HDFS on top of btrfs (mainly due to other use-cases that profit from checksumming data) > I am interested in combinin

raid5 + HDFS

2015-10-02 Thread Jim Dowling
Hi, I am interested in combining Btrfs RAID-5 with erasure-coded replication for HDFS. We have an implementation of Reed-Solomon replication for our HDFS distribution called HopsFS (www.hops.io). Some of the nice features of HDFS that make it suitable are: * not many small files * not excessive