Thanks for the explanation Vincent.  I had a similar experience when I did the 
full backup because my BACKUPDIR was in the fossy user directory.  My full 
backup (60GB) also took about 6 hours with a single rsync thread.

I will look at the instructions and add something specific about excluding 
BACKUPDIR using the include a&b file.

Mary

________________________________
From: fossology-boun...@fossology.org [mailto:fossology-boun...@fossology.org] 
On Behalf Of Ma, Dong (vinc...@gdcc-bj-most)
Sent: Thursday, January 28, 2010 8:34 PM
To: Furosh One
Cc: fossology@fossology.org
Subject: Re: [FOSSology] Backups take a very, very long time... Possible 
culprit?

Hi Gibran,

I think your point will be right, if BACKUPDIR is located in an area that is 
being backed up, there may be causing a loop.  Why you mounted BACKUPDIR 
directory share with other servers in the repository, I didn't suggest you do 
this. Anyway, if you want to exclude BACKUPDIR use include a&b files, just add 
like this:
+ /repository/
+ /db/
- /*
+ /repository/clear[1-3]/
- /repository/*
-/repository/clear1/backup/

The BACKUPDIR will not be backup.

Also I should share my experience about backup whole repository time, there 
will be a long time to cost, in my test my repository upload a Fedora11 source 
ISO, the gold files is about 4.2G, and unupacked files cost more than 55G 
storage, I backup this more than 60G repository use 2 rsync thread, it cost 
about 4 hours to finished. This test can give you for a reference.
If cross NFS, I think will be slower than this.

So we also provide a only backup gold files solution here: 
http://fossology.org/task:fossology_data_backup_and_restore_instruction_-_gold_files

Hope helps for you.

Thanks,
Vincent



From: Furosh One [mailto:fur...@gmail.com]
Sent: Friday, January 29, 2010 7:19 AM
To: Ma, Dong (vinc...@gdcc-bj-most)
Subject: Fwd: Backups take a very, very long time... Possible culprit?

Hello Ma Dong,

I know you gave me some good help last time with the include/exclude files in 
the backup-scripts for fossology.
Just wanted to forward this to you directly in case you might not see the 
mailing list ones for a while.

Any help is appreciated.

-Regards,
Gibran
---------- Forwarded message ----------
From: Furosh One <fur...@gmail.com<mailto:fur...@gmail.com>>
Date: Thu, Jan 28, 2010 at 1:30 PM
Subject: Backups take a very, very long time... Possible culprit?
To: fossology@fossology.org<mailto:fossology@fossology.org>


I've played with the backups of our multi-setup repositories and had a couple 
of questions.

Although the script seems to work that I downloaded from : 
http://fossology.svn.sourceforge.net/viewvc/fossology/trunk/fossology/utils/backup/

Our repos are fairly big in size and we are directly connected to a 
giga-switch, I don't think the bottleneck is the network here.
It is using network because all hosts are mounted via NFS and the 
BACKUP_LOCATION is on one of those mounted shares (the one with the most space 
avail).
But all 6 machines are on the same subnet, same switch, and shouldn't really 
need to go out to the router and then back again. I would think it would be 
quick as far as network.

However, its possible that the BACKUPDIR is located in an area that is also 
being backed up, which may be causing a "loop".

BACKUPDIR=/srv/fossology/repository/clear1/BACKUPS

There's a small debate here at the office whether this is a network issue or a 
backup issue, or simply just a lot of data and processing (compression) of the 
entire backup-script that is taking such a long time. I just caught the fact 
that include has +/repository/ (which may also include our BACKUPDIR).

To give you an idea of how long it was taking I started the script at 9 AM and 
by 3 PM it still hadn't finished. It was probably just 50% completed. We needed 
to do a scan so we had to kill the backup job.

Exisintg include a&b files:
===================
gmarq...@san-clear5:~/Backup_Scripts/fossy_BackupScripts$ cat 
backup_include_a.mod
+ /repository/
+ /db/
- /*
+ /repository/clear[1-3]/
- /repository/*
gmarq...@san-clear5:~/Backup_Scripts/fossy_BackupScripts$ cat 
backup_include_b.mod
+ /repository/
- /*
+ /repository/clear[4-6]/
- /repository/*
===================

Do I need to add my BACKUPDIR to "-" exclude? How would I write that to be able 
to still "include" the repo in clear1?
-/repository/clear1/BACKUPS/*     <-- Would this be the way to do it? And does 
the order in the file matter?

Thanks!

-Regards,
Gibran

_______________________________________________
fossology mailing list
fossology@fossology.org
http://fossology.org/mailman/listinfo/fossology

Reply via email to