Boris Epstein sent a missive onĀ 2010-06-16:

> Hi all,
> 
> I am just trying to consider my options for storing a large mass of
> data (tens of terrabytes of files) and one idea is to build a
> clustered FS of some kind. Has anybody had any experience with that?
> Any recommendations?
> 
> Thanks in advance for any and all advice.

Take a look at hadoop http://hadoop.apache.org and specifically HDFS (hadoop
distributed file system) http://hadoop.apache.org/hdfs/ I've used it in
conjunction with nutch across 20 odd servers (circa 10TB). When I used it
the down side was a single metadata node, but this may have changed by now.
The data is stored redundantly across the nodes and doesn't seem to require
any special hardware (I ran it on dell 1425's).

HTH

Simon.




_______________________________________________
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos

Reply via email to