First off... thanks for all the input. I need to research a couple of things before I dive too deep into this project ;)

The rest of my response is inline:


On Sun, 24 Jul 2005, Brian Chrisman wrote:

James Washer wrote:

The big issues with backups IMHO is getting a consistent snapshot. If you start copying a large data file, and it changes after you've copied the beginning, you're screwed.


An exact snapshot will be one of the hardest qualities to achieve. It's not my intention to modify the current rsync algorithm to add this feature (if it isn't already part of the system... I don't know all the details of the rsync algorithm), so I may have to develop a method for freezing a snapshot of a file. Anyone know of any good algorithms?
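One crude userland approach (just a sketch of the idea, not rsync's algorithm, and the function name is my own invention) is to copy the file and then verify that its size and mtime didn't change underneath you, retrying a few times if they did:

```python
import os
import shutil

def copy_stable(src, dst, retries=3):
    """Copy src to dst, retrying if the source changes mid-copy.

    Records size and mtime before the copy, copies, then checks
    again.  If they differ, the copy may be torn, so try again.
    Not a true snapshot -- just a cheap consistency check.
    """
    for _ in range(retries):
        before = os.stat(src)
        shutil.copy2(src, dst)
        after = os.stat(src)
        if (before.st_size, before.st_mtime) == (after.st_size, after.st_mtime):
            return True   # file was quiet for the whole copy
    return False          # still changing; caller must decide what to do
```

Of course this only narrows the race window; a file that changes constantly (a busy database, say) would still need filesystem- or application-level snapshot support.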

Other backup schemes I've used involved BCLs (block change logging), which allow the admin to back up only those blocks that have been updated since the last master. This is a GREAT space saver for large data farms with low update rates. I tend to work on larger multi-terabyte systems, so minimizing backup times is very important.


Rsync uses a method similar to BCLs insofar as it will only sync changed files -- based on modification time and size, or a file checksum. Clearly BCLs sit at a considerably lower filesystem abstraction "level" (and are considerably faster), but I'd prefer to stick to a userland process operating on top of the complete filesystem abstraction for portability reasons (and ease of programming).
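The idea behind that "quick check" can be sketched in a few lines (this is my own illustration of the concept, not rsync's actual code; rsync with -c would compare full checksums instead):

```python
import os

def needs_sync(src, dst):
    """Decide whether a file needs to be transferred, rsync-style:
    transfer when the destination is missing, or differs in size
    or modification time.  Whole-second mtime comparison, since
    many filesystems and archive formats truncate timestamps.
    """
    if not os.path.exists(dst):
        return True
    s, d = os.stat(src), os.stat(dst)
    return s.st_size != d.st_size or int(s.st_mtime) != int(d.st_mtime)
```

A file that is rewritten in place with the same size and a restored mtime would slip past a check like this, which is exactly why the checksum fallback exists.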

It's not my intention to create a complete enterprise backup solution... at least not at this stage. I just need to back up my users' files, databases, etc. when they go home at 5 every night. But, at the same time, I'd like a backup system that is flexible enough to create hourly incremental backups, back up to any media I want at any time I want, and is scriptable. So... perhaps it hits in the "middle ground" -- it's not ntbackup or Veritas, but somewhere in between.
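For the hourly incrementals, one space-efficient scheme is to hard-link unchanged files to the previous snapshot and copy only what changed -- the same trick as `rsync --link-dest` or `cp -al` followed by rsync. A minimal sketch (my own illustration: single directory only, no recursion, permissions, or error handling):

```python
import os
import shutil

def incremental_backup(src_dir, prev_snap, new_snap):
    """Snapshot src_dir into new_snap.  Files unchanged since
    prev_snap (same size and mtime) are hard-linked, so they cost
    no extra disk space; changed files get a fresh copy.
    """
    os.makedirs(new_snap, exist_ok=True)
    for name in os.listdir(src_dir):
        src = os.path.join(src_dir, name)
        if not os.path.isfile(src):
            continue
        prev = os.path.join(prev_snap, name)
        dst = os.path.join(new_snap, name)
        s = os.stat(src)
        if (os.path.isfile(prev)
                and os.stat(prev).st_size == s.st_size
                and int(os.stat(prev).st_mtime) == int(s.st_mtime)):
            os.link(prev, dst)      # unchanged: zero extra space
        else:
            shutil.copy2(src, dst)  # changed: store a fresh copy
```

Because every snapshot directory looks like a full backup, restoring is just a plain copy -- which ties in with the point below about not needing the backup software to get your files back.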


Are you planning on your backup scheme being able to handle "hot" backups?


Yes... but functionality will be limited. I'm probably going to make use of the Ostrich Algorithm -- stick my head in the sand, and pretend the problem doesn't exist -- when it comes to filesystem locks, databases, and other complex backup issues. At least for the prototype. Again, this comes back to the portability of the code IMHO. If I bloat it too much it becomes difficult to modify, and will only work with specific systems.

I guess what I'm trying for is a "core" backup utility that can perform advanced backup operations -- like incremental backups -- very well, but that can be easily supplemented for your specific installation via scripting, modification of code, or add-on software.


This makes particular sense in open source software. With proprietary backup solutions, I always leaned against block level incrementals such as provided by veritas, because without their software, there was pretty much no way to recover your data... This meant that things like license management or lame bugs could end up biting you in the butt at the very worst possible time. There have always been file/tar-based products out there, but like you said, not so efficient.


Great point Jim! This is by far my most hated feature of proprietary backup software. It costs $1000 for the software, and I have to have it loaded to get to my files?! Ridiculous! This will not be a feature of my software. By default, all files will be stored as they appear in your filesystem. Of course, you'll have the option to compress, encrypt, and do whatever else people do to files these days. Because files are stored this way, if /etc and /home are backed up to a USB hard disk, you could boot from a live disk and configure it to serve your files until your server is back online.

Thanks again for all of the ideas!  Keep them coming!

- Sebastian

_______________________________________________
RLUG mailing list
[email protected]
http://lists.rlug.org/mailman/listinfo/rlug