Re: [rdiff-backup-users] Feature requests questions/discussion

2005-11-03 Thread Fran Firman
I think the hashing of the files is worth the overhead. At work we
currently keep 3 months of backups on our backup server ~ 1.4TBytes of
data.

I would like the hash values to be available for reporting against, so
that I can also do the equ of tripwire on the servers.

ie these files changed, these files were added and these files were
deleted.

At the moment I find a lot of false positives. ie a file is backed up
when it has not changed at all. When I was first reading about
rdiff-backup, and that it used rsync-lib, it seemed to be a side affect
of the rsync library. It had to do with the method that rsync figured
out what had changed.

F.


On Thu, 2005-10-27 at 13:44 +0200, Wiebe Cazemier wrote:
 Ben Escoto wrote:
  Another problem is that we would have to rewrite the mirror_metadata
  file each time.  If there was a bug in rdiff-backup or a disk
  corruption, and the main file was lost, we would lose all the metadata
  information.  I don't have any specific worry in mind here, it just
  makes me uneasy.
 
 This is a genuine concern. Remember I corrupted my external HD severly with 
 USB 
 errors? That sort of thing can happen.
 
 Just two days ago, I had a similair situation. My computer suddenly became 
 very 
 unstable, probably because I did rrmod -f on a module which I couldn't get 
 un-used. At first I suspected my HD, so I made a backup as soon as I could. 
 It 
 crashed as well. Later, when I diagnosed my disc and restarted my system, I 
 ran 
 a new backup. I was very glad the failed one got rolled back.
 
 I know you can save a temporary mirror_metadata as a working version, but if 
 rdiff-backup had crashed writing the temp version over the orignal one, you 
 are 
 bust. Or, power can fail suddenly. Anything can happen...
 
 Anyway, it's good to guard against the unexpected and unprobable.
 
 
 ___
 rdiff-backup-users mailing list at rdiff-backup-users@nongnu.org
 http://lists.nongnu.org/mailman/listinfo/rdiff-backup-users
 Wiki URL: http://rdiff-backup.solutionsfirst.com.au/index.php/RdiffBackupWiki
 
 


signature.asc
Description: This is a digitally signed message part
___
rdiff-backup-users mailing list at rdiff-backup-users@nongnu.org
http://lists.nongnu.org/mailman/listinfo/rdiff-backup-users
Wiki URL: http://rdiff-backup.solutionsfirst.com.au/index.php/RdiffBackupWiki

Re: [rdiff-backup-users] Feature requests questions/discussion

2005-11-03 Thread dean gaudet
On Fri, 4 Nov 2005, Fran Firman wrote:

 At the moment I find a lot of false positives. ie a file is backed up
 when it has not changed at all. When I was first reading about
 rdiff-backup, and that it used rsync-lib, it seemed to be a side affect
 of the rsync library. It had to do with the method that rsync figured
 out what had changed.

there was a point in the past, with some specific version of librsync i 
think, that i was having troubles with lots of increments for files which 
hadn't changed at all... an upgrade made the problem disappear and i 
didn't bother tracking it further.

-dean


___
rdiff-backup-users mailing list at rdiff-backup-users@nongnu.org
http://lists.nongnu.org/mailman/listinfo/rdiff-backup-users
Wiki URL: http://rdiff-backup.solutionsfirst.com.au/index.php/RdiffBackupWiki


Re: [rdiff-backup-users] Feature requests questions/discussion

2005-10-25 Thread Ben Escoto
 Wiebe Cazemier [EMAIL PROTECTED]
 wrote the following on Fri, 21 Oct 2005 17:48:47 +0200

 First, I would really like an option like --store-checksums so that
 rdiff-backup calculates md5 hashes when doing a backup, and that
 that checksum it used for integrity checks upon restoration. But at
 restoration time, the check should not be an option, it should
 always be done, if the file being restored has hash info. This to
 prevent user-mistake. I once severly corrupted a partition on an
 external HD because of USB2 transfer errors (which I haven't been
 able to solve BTW). A lot was damaged, but of every file that
 resided in compressed files I was told about the corruption,
 beceause of the hashing usually done in zips/gzips/etc. The
 rdiff-backup repository became useless, because it had no idea the
 files were damaged. Such an option would of course be annoying to
 most people, because it's quite slow, but most of my backups are
 done through cron, so it doesn't matter for me. I think as an
 option, this could be very valuable. It doesn't even have to be that
 slow, if it's cleverly integrated with the copy routine.

It's a good idea, and one that someone else has suggested before.  The
checksums would be stored in the mirror-metadata file.  I don't even
think it would be hard to implement.  And there could be a --verify
switch to go through the repository and make sure everything checksums
correctly.

 Summarized, --store-checksums would calculate checksum info for
 integrity checks, and --checksum-diffs would use checksums for
 change-detections, instead of mtime+size.

Another good suggestion I think, which has come up before.  You
mentioned ctime before, I was going to add in ctime checking but there
was some complication (I forget what) and it never got in.

Does anyone else think they would use Wiebe's --checksum-diff option?


-- 
Ben Escoto


pgpMNV7oncYxd.pgp
Description: PGP signature
___
rdiff-backup-users mailing list at rdiff-backup-users@nongnu.org
http://lists.nongnu.org/mailman/listinfo/rdiff-backup-users
Wiki URL: http://rdiff-backup.solutionsfirst.com.au/index.php/RdiffBackupWiki

Re: [rdiff-backup-users] Feature requests questions/discussion

2005-10-25 Thread Wiebe Cazemier

Ben Escoto wrote:

It's a good idea, and one that someone else has suggested before.  The
checksums would be stored in the mirror-metadata file.  I don't even
think it would be hard to implement.  


Do you think it's possible to combine it with the copy syscall/API-call 
rdiff-backup probably uses so that the data which is read by copying is 
checksummed at the same time? If it is, it wouldn't even take more time to 
backup a new file than it would without checksums.



And there could be a --verify
switch to go through the repository and make sure everything checksums
correctly.


Indeed, I even forgot this.


You
mentioned ctime before, I was going to add in ctime checking but there
was some complication (I forget what) and it never got in.


If --checksum-diffs would make it, I wouldn't need ctime checking. But I guess 
there is value for such an option (speed being a definite advantage), as a third 
change-detection method. But it won't be widely useable I guess. I wouldn't be 
surprised if NFS or SAMBA don't supply correct ctimes. And what about FAT32? 
Doing ls -lc does reveil different times than a normal ls -l, but does that FS, 
and the windows driver which controlles it, properly set ctimes?



___
rdiff-backup-users mailing list at rdiff-backup-users@nongnu.org
http://lists.nongnu.org/mailman/listinfo/rdiff-backup-users
Wiki URL: http://rdiff-backup.solutionsfirst.com.au/index.php/RdiffBackupWiki


Re: [rdiff-backup-users] Feature requests questions/discussion

2005-10-25 Thread dean gaudet
On Tue, 25 Oct 2005, Ben Escoto wrote:

 It's a good idea, and one that someone else has suggested before.  The
 checksums would be stored in the mirror-metadata file.  I don't even
 think it would be hard to implement.  And there could be a --verify
 switch to go through the repository and make sure everything checksums
 correctly.

that would be cool... i'd use that.


  Summarized, --store-checksums would calculate checksum info for
  integrity checks, and --checksum-diffs would use checksums for
  change-detections, instead of mtime+size.
 
 Another good suggestion I think, which has come up before.  You
 mentioned ctime before, I was going to add in ctime checking but there
 was some complication (I forget what) and it never got in.
 
 Does anyone else think they would use Wiebe's --checksum-diff option?

in many cases for me this would cost too much on the machine being backed 
up (lots of disk seeks and cpu time) ... but maybe i could use it 
periodically on weekends...

hey, did you know there's actually nanosecond resolution to [acm]time on 
linux 2.6?  (and on several BSDs i think)  i don't know if the interfaces 
show up in python -- but the C structure elements are 
st_atimensec/st_ctimensec/st_mtimensec, and the utimes(2) syscall can set 
nanosecond resolution timestamps.

-dean




___
rdiff-backup-users mailing list at rdiff-backup-users@nongnu.org
http://lists.nongnu.org/mailman/listinfo/rdiff-backup-users
Wiki URL: http://rdiff-backup.solutionsfirst.com.au/index.php/RdiffBackupWiki