Les Mikesell wrote at about 10:06:40 -0500 on Tuesday, June 2, 2009:
 > Peter Walter wrote:
 > > 
 > >> Still, it would be awesome to combine the simplicity and pooling
 > >> structure of BackupPC with the flexibility of a database
 > >> architecture...
 > >>   
 > > I, for one, would be willing to contribute financially and with my very 
 > > limited skills if Craig, or others, were willing to undertake such an 
 > > effort. Perhaps Craig would care to comment.
 > 
 > The first thing needed would be to demonstrate that there would be an 
 > advantage to a database approach - like some benchmarks showing an 
 > improvement in throughput in the TB size range and measurements of the 
 > bandwidth needed for remote replication.

No one ever claimed that the primary advantages of a database
approach is throughput. The advantages are really more about
extensibility, flexibility, and transportability. If you don't value
any of the 7 or so advantages I listed before, then I guess a database
approach is not for you.

Also, while clearly, a database approach would in general have more
computational overhead (at least for backups), from my experience the
bottlenecks are network bandwidth and disk speed. In fact, some people
have implemented BackupPC to run native on a 500MHz ARM processor
without effective slowdown. (On the other hand, restore-like
operations would likely be faster since it would be simpler to walk
down the hierarchy of incremental backups) So, I don't think you would
find any significant slowdowns from a database approach. If anything a
database approach could allow significantly *faster* backups since the
file transfers could be split across multiple disks which is not
possible under BackupPC unless you use LVM.

 > Personally I think the way to make things better would be to have a 
 > filesystem that does block-level de-duplication internally. Then most of 
 > what backuppc does won't even be necessary.   There were some 
 > indications that this would be added to ZFS at some point, but I don't 
 > know how the Oracle acquisition will affect those plans.

Ideally, I don't think that the backup approach should depend on the
underlying filesystem architecture. Such a restriction limits the
transportability of the backup solution just as currently BackupPC
really only works on *nix systems with hard links. A database approach
allows one to get away from dependence on specific filesystem features.
That doesn't mean there isn't room for specialized filesystem
approaches but just that such a requirement limits the audience for
the backup solution since it will be a while before we all start
running ZFS-type filesystems and then we will have the issue of
requiring different optimizations and code for different filesystem
approaches.

------------------------------------------------------------------------------
OpenSolaris 2009.06 is a cutting edge operating system for enterprises 
looking to deploy the next generation of Solaris that includes the latest 
innovations from Sun and the OpenSource community. Download a copy and 
enjoy capabilities such as Networking, Storage and Virtualization. 
Go to: http://p.sf.net/sfu/opensolaris-get
_______________________________________________
BackupPC-users mailing list
BackupPC-users@lists.sourceforge.net
List:    https://lists.sourceforge.net/lists/listinfo/backuppc-users
Wiki:    http://backuppc.wiki.sourceforge.net
Project: http://backuppc.sourceforge.net/

Reply via email to