Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
On Sun, Apr 11, 2010 at 09:32:43AM -0500, Jon Schewe wrote: I got it to work again last night. Changing the firewall time outs didn't help. What fixed it was turning off Accurate backups. Ah, so possibly bacula spent long enough stuck doing an accurate query in the catalog that the firewall connection timed out. Are you using mysql and bacula-5.0.1? -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] Problem with display backuped files in View Browser in Bat
I use Bacula 5.0.0 on Ubuntu 9.10 and bacula-bat 5.0.0-2 on Ubuntu 9.10 and Windows XP computers. On both computers I have a problem with View Browser - it don't display any files or display part of files from some Jobs. If I pushed button Refresh in the View Bowser's form I had different from first set of files but no all again. I have this problem very often. In Restore View mode I can see all files without any problems! But in Restore View I can see and restore files only from LAST backup and I can't choose volume from which I want restore files. In View Browser I can see all versions of files and choose which I want to restore but View browser don't work correct always. How can I solve this problem? How can I restore files from choosen volume without BAT? Thanks! -- View this message in context: http://old.nabble.com/Problem-with-display-backuped-files-in-View-Browser-in-Bat-tp28214833p28214833.html Sent from the Bacula - Users mailing list archive at Nabble.com. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Max Run Time exceede with 0s run time!
Harald Schmalzbauer schrieb am 08.04.2010 20:18 (localtime): Am 08.04.2010 14:20, schrieb Matija Nalis: On Mon, Apr 05, 2010 at 12:46:25PM +0200, Harald Schmalzbauer wrote: Absurdly canceled job 47: Fatal error: Max run time exceeded. Job canceled. Scheduled time: 04-Apr-2010 21:01:03 Start time: 04-Apr-2010 21:39:41 End time: 04-Apr-2010 21:39:41 Elapsed time: 0 secs ... Here's my conf regarding max times: Max Start Delay = 14400 # 4h to wait after scheduled start Max Run Time = 1800 # Half an hour to run after beeing really started Incremental Max Run Time = 900 # 15 Minutes for incrementals after beeing started Max Run Sched Time = 36000 # 10 hours to wait to start job as planned Max Wait Time = 7200 # 2h to wait for resources after job really started Which version of bacula is that ? There were bugs not too far ago where 'Max Wait Time' wrongly acted like 'Max Run Time'; maybe it had similar problems with other related directives too. I'm running 5.0.1. First job took longer than 30 minutes, so it was canceled. Second job took 9 minutes, so start time of third job is 39minutes after scheduled. There's no other time limit which could fit, Max Wait Time is 2 hours. Hello, this weekend the same thing happened again. I intentionally set the Max Run Time to 30 mins, but it is not working. It hoses all other subsequent jobs but the one following. Again to visualize my timetable: at 21:00h: startjob1 - startjob2 - startjob3 - startjob4 - startjob5 -- runing waiting waiting waiting waiting -- 30 mins canceled running waiting waiting waiting -- -finisehdcanceled ok (8min) (0s runtime) waiting waiting -- - - -canceled (0s runtime) waiting -- - - -- canceled (0s runtime) Why do job 3-5 get cancelled with 0s runtime? How can I file a bug report? Another thing: The canceled (due to runtime longer than 30mins) job reports 0 Bytes written, but in fact it should have been writing for 30 minutes. Which seems to be tha case if I compare Volume Bytes: At last volume usage: 162,598,409,235 (162.5 GB) Canceled Job reports: Elapsed time: 30 mins 22 secs Priority: 10 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None VSS:no Encryption: no Accurate: no Volume name(s): MonthA Volume Session Id: 73 Volume Session Time:1270146159 Last Volume Bytes: 174,597,640,742 (174.5 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Running Termination:Backup Canceled So it has written 12GB. Are the reports only valid for correctly terminated jobs? I think even for canceled or other error termination it should report as many correct values as possible. Thanks, -Harry signature.asc Description: OpenPGP digital signature -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
On Fri, Apr 09, 2010 at 07:30:19PM -0500, Jon Schewe wrote: I have heartbeat intervals set at the following: bacula-dir.conf: client { Heartbeat interval = 15 Seconds } storage { Heartbeat interval = 1 minutes } bacula-sd.conf storage { Heartbeat interval = 1 minute } bacula-fd.conf FileDaemon { Heartbeat Interval = 5 seconds } Strange. Are you running GNU/Linux system on all the machines (FD, SD, DIR) ? IIRC, it might not be supported on other systems, and/or it may need additional tuning on them. I've updated the docs at http://tinyurl.com/y8wapdu -- Matija Nalis Odjel racunalno-informacijskih sustava i servisa Hrvatska akademska i istrazivacka mreza - CARNet Josipa Marohnica 5, 1 Zagreb tel. +385 1 6661 616, fax. +385 1 6661 766 www.CARNet.hr -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
On 04/12/2010 04:17 AM, Matija Nalis wrote: On Fri, Apr 09, 2010 at 07:30:19PM -0500, Jon Schewe wrote: I have heartbeat intervals set at the following: bacula-dir.conf: client { Heartbeat interval = 15 Seconds } storage { Heartbeat interval = 1 minutes } bacula-sd.conf storage { Heartbeat interval = 1 minute } bacula-fd.conf FileDaemon { Heartbeat Interval = 5 seconds } Strange. Are you running GNU/Linux system on all the machines (FD, SD, DIR) ? IIRC, it might not be supported on other systems, and/or it may need additional tuning on them. I'm running opensuse Linux for the director and storage daemon and Debian Linux for the file daemon. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] TR: Getting auto-label feature to work properly
One up on this question please. I cant get the auto-labeling function to work as expected Does anyone use the same functions : backup to disk, auto-label of volumes, and auto-recycling ? It seems to me quite a standard way of using Bacula. What am I doing wrong ? De : Dominique Jeannerod [mailto:dominique.jeanne...@interact-iv.com] Envoyé : lundi 22 mars 2010 18:12 À : 'bacula-users@lists.sourceforge.net' Objet : Getting auto-label feature to work properly Hello, Im currently working on Bacula 5.0.1 to upgrade our current production backup system. We are backuping on disk files, and thus implement the auto-label feature, using counters. On our first tests, everything was running as planned : volumes were created and labeled automatically, and then recycled. What happens now is that old volumes are recycled as they should be, but new volumes are not created when needed, and Im getting the famous message : Please use the label command to create a new Volume for: Storage: fr-201-sd-2 (/backups) Pool: PS_Default Media type: File I really dont understand why ? Our version : Bacula 3.0.3 on CentOS 5.4 Compilated from source with all pre-requisites: yum install httpd mysql-server mysql-devel php mtx gcc gcc-c++ readline-devel make openssl-devel cd /root/bacula-3.0.3 make distclean ./configure --sbindir=/opt/bacula/bin --sysconfdir=/opt/bacula/etc --with-pid-dir=/opt/bacula/working --with-subsys-dir=/opt/bacula/working --enable-smartalloc --with-mysql --with-working-dir=/opt/bacula/working --with-dump-email=xx...@x.fr --with-job-email=xx...@x.fr with-smtp-host=localhost make make install make install-autostart Director Configuration Director {# define myself Name = sv-ics-tst-bkp-fr-201-dir DIRport = 9101# where we listen for UA connections QueryFile = /opt/bacula/opt/etc/query.sql WorkingDirectory = /opt/bacula/working PidDirectory = /opt/bacula/working Maximum Concurrent Jobs = 60 Password = XX # Console password Messages = Daemon FD Connect Timeout = 5 minutes# Default : 30 minutes SD Connect Timeout = 5 minutes# Default : 30 minutes } Storage { Name = fr-201-sd-1 Address = 10.3.1.254 SDPort = 9103 Maximum Concurrent Jobs = 1 Password = XX Device = fr-201-sd-1 Media Type = File } Storage { Name = fr-201-sd-2 Address = 10.3.1.254 SDPort = 9103 Maximum Concurrent Jobs = 1 Password = XX Device = fr-201-sd-2 Media Type = File } JobDefs { Name = JS_Default Type = Backup Level = Incremental Client = sv-ics-tst-bkp-fr-201 FileSet = FS_Default Schedule = SS_Default Storage = fr-201-sd-1 Messages = Standard Pool = PS_Default Accurate = Yes Max Wait Time = 14400 # 4 hours Max Run Sched Time = 27000 } FileSet { Name = FS_Default Include { Options { signature = MD5 onefs=no # on ne reste pas sur un seul file-system fstype=ext2 # pour ext3 : fstype=ext2 aussi ... compression = GZIP } File = / } Exclude { File = /tmp File = /.journal File = /.fsck File = /opt } } Schedule { Name = SS_PRD_1 Run = Level=Full Pool=PS_Default thu at 23:45 Run = Level=Full Pool=PS_Default fri-wed at 23:45 } Pool { Name = PS_Default Pool Type = Backup Recycle = yes AutoPrune = yes VolumeRetention = 3 days Use Volume Once = yes Label Format = SYS-SD-FR-2-${C_SYS+} NextPool = PS_VFull } Pool { Name = PS_VFull Pool Type = Backup Recycle = yes AutoPrune = yes VolumeRetention = 3 days Use Volume Once = yes Label Format = SYS-VFULL-SD-FR-2-${C_SYSVFULL+} Storage = fr-201-sd-vfull } Counter { Name = C_SYS Catalog = MyCatalog } Counter { Name = C_SYSVFULL Catalog = MyCatalog } #CLIENTS : All clients have the same conf Client { Name = sv-ics-tst-bkp-fr-201 Address = 10.3.1.254 FDPort = 9102 Catalog = MyCatalog Maximum Concurrent Jobs = 2 Password = XX # password for FileDaemon File Retention = 3 months Job Retention = 3 months AutoPrune = yes # Prune expired Jobs/Files } # JOBS : All Jobs have the same conf except, using different storages (fr-201-sd-1, fr-201-sd-2, ) Job { Name = JS_sv-ics-tst-bkp-fr-201 Enabled = yes JobDefs = JS_Default Client = sv-ics-tst-bkp-fr-201 Write Bootstrap = /opt/bacula/bootstrap/sv-ics-tst-bkp-fr-201.sys.bsr Storage = fr-201-sd-1 Schedule = SS_PRD_1 } STORAGE CONFIGURATION (SD) Storage {
Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
On Mon, Apr 12, 2010 at 05:41:51AM -0500, Jon Schewe wrote: Strange. Are you running GNU/Linux system on all the machines (FD, SD, DIR) ? IIRC, it might not be supported on other systems, and/or it may need additional tuning on them. I'm running opensuse Linux for the director and storage daemon and Debian Linux for the file daemon. that is strange... can you check what are your default SO_KEEPALIVE values with: grep '' /proc/sys/net/ipv4/tcp_keepalive_* and what bacula is using for running connections - start backup first, then check if keepalive is enabled (and with what timers) with: netstat -to -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
On 4/12/10 7:21 AM, Matija Nalis wrote: On Mon, Apr 12, 2010 at 05:41:51AM -0500, Jon Schewe wrote: Strange. Are you running GNU/Linux system on all the machines (FD, SD, DIR) ? IIRC, it might not be supported on other systems, and/or it may need additional tuning on them. I'm running opensuse Linux for the director and storage daemon and Debian Linux for the file daemon. that is strange... can you check what are your default SO_KEEPALIVE values with: grep '' /proc/sys/net/ipv4/tcp_keepalive_* Server: /proc/sys/net/ipv4/tcp_keepalive_intvl:75 /proc/sys/net/ipv4/tcp_keepalive_probes:9 /proc/sys/net/ipv4/tcp_keepalive_time:7200 Client: /proc/sys/net/ipv4/tcp_keepalive_intvl:75 /proc/sys/net/ipv4/tcp_keepalive_probes:9 /proc/sys/net/ipv4/tcp_keepalive_time:7200 bacula 3.0.3 on both systems and what bacula is using for running connections - start backup first, then check if keepalive is enabled (and with what timers) with: netstat -to Client: tcp0 0 client:9102 server:54043 ESTABLISHED keepalive (7196.36/0/0) tcp0 0 client:43628 server:9103 ESTABLISHED keepalive (7197.26/0/0) Server (behind NAT): tcp0 0 192.168.42.2:9103 client:43628 ESTABLISHED keepalive (7199.10/0/0) tcp0 0 127.0.0.2:9103 127.0.0.2:33218 ESTABLISHED keepalive (7197.84/0/0) tcp0 0 127.0.0.2:36664 127.0.0.2:9101 TIME_WAIT timewait (56.31/0/0) tcp0 0 192.168.42.2:54043 client:9102 ESTABLISHED keepalive (7198.18/0/0) -- Jon Schewe | http://mtu.net/~jpschewe If you see an attachment named signature.asc, this is my digital signature. See http://www.gnupg.org for more information. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
On Mon, Apr 12, 2010 at 07:59:53AM -0500, Jon Schewe wrote: /proc/sys/net/ipv4/tcp_keepalive_time:7200 netstat -to Client: tcp0 0 client:9102 server:54043 ESTABLISHED keepalive (7196.36/0/0) That's strange. It should've been the timeouts you specified in config files, not 7200 seconds (two hours) which is system default. It looks like bacula does not use TCP_KEEPIDLE setsockopt(2) on your system. You might want to report a bug on http://bugs.bacula.org/ IMHO, it should work there. Or if not, it should probably throw a warning if you try to use it and it is not supported or fails. Apart from fixing bacula, you can override system default, for example (on both server and client) do : echo 60 /proc/sys/net/ipv4/tcp_keepalive_time (or edit /etc/sysctl.d/* or /etc/sysctl.conf to retain value across reboots). Can you try what netstat -to says after you lower that limit and rerun backups ? If netstat -to then reports smaller timers (60 or less), than it should fix your problem, so you can try turning accurate back to yes. Does that help ? -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
On 4/12/10 8:39 AM, Matija Nalis wrote: On Mon, Apr 12, 2010 at 07:59:53AM -0500, Jon Schewe wrote: /proc/sys/net/ipv4/tcp_keepalive_time:7200 netstat -to Client: tcp0 0 client:9102 server:54043 ESTABLISHED keepalive (7196.36/0/0) That's strange. It should've been the timeouts you specified in config files, not 7200 seconds (two hours) which is system default. It looks like bacula does not use TCP_KEEPIDLE setsockopt(2) on your system. You might want to report a bug on http://bugs.bacula.org/ IMHO, it should work there. Or if not, it should probably throw a warning if you try to use it and it is not supported or fails. Apart from fixing bacula, you can override system default, for example (on both server and client) do : echo 60 /proc/sys/net/ipv4/tcp_keepalive_time (or edit /etc/sysctl.d/* or /etc/sysctl.conf to retain value across reboots). Can you try what netstat -to says after you lower that limit and rerun backups ? Now I see the timer down where I expect it. Should I only need this on the client? If netstat -to then reports smaller timers (60 or less), than it should fix your problem, so you can try turning accurate back to yes. Does that help ? It's running, I'll know in a couple of hours. -- Jon Schewe | http://mtu.net/~jpschewe If you see an attachment named signature.asc, this is my digital signature. See http://www.gnupg.org for more information. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
On 4/12/10 9:00 AM, Matija Nalis wrote: On Mon, Apr 12, 2010 at 08:45:36AM -0500, Jon Schewe wrote: On 4/12/10 8:39 AM, Matija Nalis wrote: echo 60 /proc/sys/net/ipv4/tcp_keepalive_time (or edit /etc/sysctl.d/* or /etc/sysctl.conf to retain value across reboots). Can you try what netstat -to says after you lower that limit and rerun backups ? Now I see the timer down where I expect it. Should I only need this on the client? If only that client is having timeout timeout problems, than yes (as I understand your Director and SD are on same server, so you should not have timeout issues there as no networking is involved). (SO_KEEPALIVE will work even with only one side of connection having it enabled). So I should only need the heartbeat on that client's setup as well, right? Getting rid of extra heart beats would be nice. If netstat -to then reports smaller timers (60 or less), than it should fix your problem, so you can try turning accurate back to yes. Does that help ? It's running, I'll know in a couple of hours. Good, let us know how it fares. It seems to be running, but I've run into a problem with bconsole. Once I started the job, if I run bconsole and then status dir, the console hangs. If I strace the bconsole process it's stuck in a select call. strace -p 18452 Process 18452 attached - interrupt to quit select(4, [3], NULL, NULL, {9, 461287}) = 0 (Timeout) read(3, 0x655d80, 5)= -1 EAGAIN (Resource temporarily unavailable) select(4, [3], NULL, NULL, {10, 0}) = 0 (Timeout) read(3, 0x655d80, 5)= -1 EAGAIN (Resource temporarily unavailable) select(4, [3], NULL, NULL, {10, 0} -- Jon Schewe | http://mtu.net/~jpschewe If you see an attachment named signature.asc, this is my digital signature. See http://www.gnupg.org for more information. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
On Mon, Apr 12, 2010 at 09:23:51AM -0500, Jon Schewe wrote: On 4/12/10 9:00 AM, Matija Nalis wrote: (SO_KEEPALIVE will work even with only one side of connection having it enabled). So I should only need the heartbeat on that client's setup as well, right? Getting rid of extra heart beats would be nice. Yes, it should be enough. Note that there is no real need to get rid of extra heartbeats, they are not really expensive (so biggest gain is cleaner config files). Good, let us know how it fares. It seems to be running, but I've run into a problem with bconsole. Once I started the job, if I run bconsole and then status dir, the console hangs. If I strace the bconsole process it's stuck in a select call. strace -p 18452 Process 18452 attached - interrupt to quit select(4, [3], NULL, NULL, {9, 461287}) = 0 (Timeout) read(3, 0x655d80, 5)= -1 EAGAIN (Resource temporarily unavailable) That should not be related to SO_KEEPALIVE - it should be completly transparent to the applications if the network is working (and even when it is not working, it should differ only in always terminating the connection instead of sometimes terminating connection and sometimes hanging idefinitely). Anyway, it may be few issues with directory hanging. Most common is you are too eager. For example, is SQL server is busy, status dir will hang until it completes. It is especially problem with bigger databases and MySQL instead of PostgreSQL, see http://bugs.bacula.org/view.php?id=1472, where it can take even several hours! (note that while it talks about restore speed, it is also related to accurate backups which employ similar SQL queries) You can check for this with show processlist in MySQL (if you are running MySQL for database, of course) if that is the case (or simply wait). Or you might be unlucky enough to hit a real director bug in 5.0.1, see http://bugs.bacula.org/view.php?id=1528, but that is unlikely. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Using removable SATA disks as media
On 04/11/10 23:14, Joseph Spenner wrote: Phil: Thanks for the help! Getting closer. It's not breaking up the backup, but now it uses the same file/device every night. Maybe the Volume Use Duration is too short? Did you fix the retention period yet? If it's immediately reusing the first volume, it probably means your retention is too short. If you're trying to use these volumes in daily rotation, then the volume use duration should be set so that it expires just before the next day's backup (23 hours being a good number for that), and your retention period needs to be set such that the first volume becomes available again just after the last volume is used (for a ten-day rotation, nine days should be right). Make sure that after you update the Pool resource, you FIRST update the Pool from the resource, THEN update ALL of the Volumes from the Pool. -- Phil Stracchino, CDK#2 DoD#299792458 ICBM: 43.5607, -71.355 ala...@caerllewys.net ala...@metrocast.net p...@co.ordinate.org Renaissance Man, Unix ronin, Perl hacker, Free Stater It's not the years, it's the mileage. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Using removable SATA disks as media
--- On Mon, 4/12/10, Phil Stracchino ala...@metrocast.net wrote: Did you fix the retention period yet? If it's immediately reusing the first volume, it probably means your retention is too short. If you're trying to use these volumes in daily rotation, then the volume use duration should be set so that it expires just before the next day's backup (23 hours being a good number for that), and your retention period needs to be set such that the first volume becomes available again just after the last volume is used (for a ten-day rotation, nine days should be right). Make sure that after you update the Pool resource, you FIRST update the Pool from the resource, THEN update ALL of the Volumes from the Pool. Phil: Here are my retention settings: === Client { Name = backula-fd Address = backula File Retention = 1 days Job Retention = 6 months } Pool { Name = Default Pool Type = Backup Recycle = yes AutoPrune = yes Volume Retention = 1 days } Pool { Name = File Pool Type = Backup Recycle = yes AutoPrune = yes Volume Retention = 9 days Volume Use Duration = 23h Maximum Volume Bytes = 1400G Maximum Volumes = 10 } Client { Name = tycho-fd Address = tycho File Retention = 1 days Job Retention = 6 months } === Is the client specific File Retention causing my problem? I suspect since I'm making 'retention' definitions withing the Client {} it might be the root of my problem. I know all my clients will be backed up the same, so I'd like to have global settings inherited by all clients. I'm also a bit confused (obviously) with: File Retention Job Retention Volume Retention Volume Use Duration Thanks again! -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bacula don't read all my files
Hi everyone... That tip of John works. Thanks John.. :)... I made the full backup OK. But my job is making the copy of all the files and directories, and I have to exclude the copy of my virtual machines and the hidden folder .gvfs If I put the option Exclude = yes, the bacula server does not copy any file, but if I exclude that line, it makes the copy of the full directory... the manual is written that the option exclude = yes should be placed, otherwise the system does not ignore the files in the section exclude. So, what is the problem? I am sending the copy of the FileSet configuration FileSet { Name = User File Set Include { Options { # Exclude = yes ## here is the problem.. :( If I left that way, the backup make the copy of all files and not ignore the directory /home/user/.VirtualBox ## If I delete the comment, the system does not make the copy of any file or folder } File = /home/user/ } Exclude { File = /home/user/.VirtualBox/ File = /home/user/.gvfs } } Thanks John Drescher escreveu: On Thu, Apr 8, 2010 at 2:33 PM, Daniel dpi...@gmail.com wrote: John Drescher escreveu: On Thu, Apr 8, 2010 at 1:32 PM, Daniel dpi...@gmail.com wrote: Hi... Well, in the first place, English is not my first language, so sorry for some errors in my e-mail... :) I have a bacula server with ubuntu 9.10 working fine. I can connect with 4 others servers and make all the backups with no errors. But, in my personal machine, I install the last version off bacula (5.0.1) with the option client-only like I did with all others machines and make the config in the bacula server to make the backup of my /home/user (my personal machine has installed the ubuntu 9.10). When I try to make the backup, the bacula server return no errors, but don't copy any files from my computer. If I do the estimate of backup, it returns: Using Catalog Mycatalog Conecting client XX-fd in 192.168.1.67:9102 2000 OK estimate files=1 bytes=0 My /home/user has more than 10Gb of files, but bacula don't find any file or folder. Does anyone has a suggestion?? What does your backup set look like? Is /home on a different filesystem as / John I don't really understand what you mean with backup set look likes. I think you want to know how my director is config to access my computer, am I right? If yes, that is the configuration of the FileSet FileSet { Name = User File Set Include { Options { Exclude = yes } File = /home/user/ } Exclude { File = /home/user/.VirtualBox/ File = /home/user/.gvfs } } Get rid of the Options { Exclude = yes } part. John -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Using removable SATA disks as media
On 04/12/10 11:58, Joseph Spenner wrote: --- On Mon, 4/12/10, Phil Stracchino ala...@metrocast.net wrote: Did you fix the retention period yet? If it's immediately reusing the first volume, it probably means your retention is too short. If you're trying to use these volumes in daily rotation, then the volume use duration should be set so that it expires just before the next day's backup (23 hours being a good number for that), and your retention period needs to be set such that the first volume becomes available again just after the last volume is used (for a ten-day rotation, nine days should be right). Make sure that after you update the Pool resource, you FIRST update the Pool from the resource, THEN update ALL of the Volumes from the Pool. Phil: Here are my retention settings: === Client { Name = backula-fd Address = backula File Retention = 1 days Job Retention = 6 months } Pool { Name = Default Pool Type = Backup Recycle = yes AutoPrune = yes Volume Retention = 1 days } Pool { Name = File Pool Type = Backup Recycle = yes AutoPrune = yes Volume Retention = 9 days Volume Use Duration = 23h Maximum Volume Bytes = 1400G Maximum Volumes = 10 } Client { Name = tycho-fd Address = tycho File Retention = 1 days Job Retention = 6 months } === Is the client specific File Retention causing my problem? Yes, that's exactly the problem. I'm also a bit confused (obviously) with: File Retention Job Retention Volume Retention Volume Use Duration Basically, whichever is the SHORTEST of the retention settings will dictate when data begins getting pruned. In most cases, you probably want all three retention settings to be the same. Volume Use Duration is not a retention setting at all; it is the time window during which data may be written to the volume, starting from when it is first written after creation or recycling. When that window ends, the volume will be marked Used even if not full, and no new jobs not already running will be allowed to write to it. (I'm honestly not certain what happens to the running job if the use duration expires while a job is still writing to the volume; I've never tried it.) If all your clients will have the same settings, then I would remofe the File Retention setting from your clients altogether. Unless a specific client NEEDS its retention settings to be different from the Pool defaults, there's no reason to have retention settings in the client resource at all. -- Phil Stracchino, CDK#2 DoD#299792458 ICBM: 43.5607, -71.355 ala...@caerllewys.net ala...@metrocast.net p...@co.ordinate.org Renaissance Man, Unix ronin, Perl hacker, Free Stater It's not the years, it's the mileage. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] bacula won't backup files with apostrophes
Hi; I'm backing up files from a Windows share connecting with cifs (samba) using autofs. I'm getting an error with files with apostrophes that the files doesn't exist and it lists the file path with a ? instead of the apostrophe. Any ideas? -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bacula don't read all my files
But my job is making the copy of all the files and directories, and I have to exclude the copy of my virtual machines and the hidden folder .gvfs If I put the option Exclude = yes, the bacula server does not copy any file, but if I exclude that line, it makes the copy of the full directory... the manual is written that the option exclude = yes should be placed, otherwise the system does not ignore the files in the section exclude. So, what is the problem? I am sending the copy of the FileSet configuration FileSet { Name = User File Set Include { Options { # Exclude = yes ## here is the problem.. :( If I left that way, the backup make the copy of all files and not ignore the directory /home/user/.VirtualBox ## If I delete the comment, the system does not make the copy of any file or folder } File = /home/user/ } This means exclude all files in the File = part which in this case is /home/user not the Exclude lines below. Exclude { File = /home/user/.VirtualBox/ File = /home/user/.gvfs } } Look at the following example taken from the manual for an example of the usage of exclude=yes : http://bacula.org/5.0.x-manuals/en/main/main/Configuring_Director.html#SECTION00178 FileSet { Name = Full Set Include { Options { Compression=GZIP signature=SHA1 Sparse = yes } @/etc/backup.list } Include { Options { wildfile = *.o wildfile = *.exe Exclude = yes } File = /root/myfile File = /usr/lib/another_file } Exclude { File = /tmp } } John -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bacula don't read all my files
On Mon, Apr 12, 2010 at 2:07 PM, John Drescher dresche...@gmail.com wrote: But my job is making the copy of all the files and directories, and I have to exclude the copy of my virtual machines and the hidden folder .gvfs If I put the option Exclude = yes, the bacula server does not copy any file, but if I exclude that line, it makes the copy of the full directory... the manual is written that the option exclude = yes should be placed, otherwise the system does not ignore the files in the section exclude. So, what is the problem? I am sending the copy of the FileSet configuration FileSet { Name = User File Set Include { Options { # Exclude = yes ## here is the problem.. :( If I left that way, the backup make the copy of all files and not ignore the directory /home/user/.VirtualBox ## If I delete the comment, the system does not make the copy of any file or folder } File = /home/user/ } This means exclude all files in the File = part which in this case is /home/user not the Exclude lines below. Exclude { File = /home/user/.VirtualBox/ File = /home/user/.gvfs } } Look at the following example taken from the manual for an example of the usage of exclude=yes : http://bacula.org/5.0.x-manuals/en/main/main/Configuring_Director.html#SECTION00178 FileSet { Name = Full Set Include { Options { Compression=GZIP signature=SHA1 Sparse = yes } @/etc/backup.list } Include { Options { wildfile = *.o wildfile = *.exe Exclude = yes } File = /root/myfile File = /usr/lib/another_file } Exclude { File = /tmp } } Okay. I am wrong about this. See the explanation in the example but you do need something in the Options to exclude when you use Exclude=yes. John -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] bacula won't backup files with apostrophes
On 04/12/10 14:05, Mark Coolen wrote: Hi; I'm backing up files from a Windows share connecting with cifs (samba) using autofs. I'm getting an error with files with apostrophes that the files doesn't exist and it lists the file path with a ? instead of the apostrophe. Any ideas? Is there some sound reason why you're backing up this share indirectly via CIFS, rather than directly via a native Windows client? Indirect backup of this type is not recommended, and will heavily impact your backup throughput, as every byte backed up has to cross your network twice, to and from the client mounting the CIFS share, and therefore limiting your backup speed for that share to AT MOST half the client's available bandwidth. -- Phil Stracchino, CDK#2 DoD#299792458 ICBM: 43.5607, -71.355 ala...@caerllewys.net ala...@metrocast.net p...@co.ordinate.org Renaissance Man, Unix ronin, Perl hacker, Free Stater It's not the years, it's the mileage. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
On 4/12/10 9:40 AM, Matija Nalis wrote: On Mon, Apr 12, 2010 at 09:23:51AM -0500, Jon Schewe wrote: On 4/12/10 9:00 AM, Matija Nalis wrote: Good, let us know how it fares. It seems to be running, but I've run into a problem with bconsole. Once I started the job, if I run bconsole and then status dir, the console hangs. If I strace the bconsole process it's stuck in a select call. strace -p 18452 Process 18452 attached - interrupt to quit select(4, [3], NULL, NULL, {9, 461287}) = 0 (Timeout) read(3, 0x655d80, 5)= -1 EAGAIN (Resource temporarily unavailable) That should not be related to SO_KEEPALIVE - it should be completly transparent to the applications if the network is working (and even when it is not working, it should differ only in always terminating the connection instead of sometimes terminating connection and sometimes hanging idefinitely). Anyway, it may be few issues with directory hanging. Most common is you are too eager. For example, is SQL server is busy, status dir will hang until it completes. It is especially problem with bigger databases and MySQL instead of PostgreSQL, see http://bugs.bacula.org/view.php?id=1472, where it can take even several hours! (note that while it talks about restore speed, it is also related to accurate backups which employ similar SQL queries) Must be what it is then. I've been thinking about switching to postgres, but haven't because the opensuse packages for bacula are only for mysql. This may motivate me more. The backup finished, so it seems that in version 3.0.3 bacula does NOT set the socket option SO_KEEPALIVE. -- Jon Schewe | http://mtu.net/~jpschewe If you see an attachment named signature.asc, this is my digital signature. See http://www.gnupg.org for more information. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] Problems getting restore to work
Hi, I am still tweaking a new installation of bacula 5.0.1 on Centos 5.4. The backups work fine but I am trying to get the restore to work and I keep getting the following errors: Volume hardware-0014 to file:block 7:2933114700. 10-Apr 12:51 swift-fd JobId 118: Error: restore.c:1133 Write error on /backup0/bacula-restores/home/hardware/pdf/altera/quartus/81_linux/81_nios2eds_linux.tar: No space left on device The volume is a disk drive and I am trying to restore it to a dedicated restore directory on a different disk. I have checked the config files with an old set that I was running under 1.38 and it look very similar( ie names of pools and disks were changed). thanks Here are my config files ( directory and storage): # # Standard Restore template, to be changed by Console program # Only one such job is needed for all Jobs/Clients/Storage ... # Job { Name = RestoreHardware Type = Restore Client=distress-fd FileSet=Swift Hardware Set Storage = File1 Pool = Restore Messages = Standard Where = /backup0/bacula-restores } # List of files to be backed up FileSet { Name = Swift Hardware Set Include { Options { signature = MD5 } # # Put your list of files here, preceded by 'File =', one per line #or include an external list with: # #File = file-name # # Note: / backs up everything on the root partition. #if you have other partitions such as /usr or /home #you will probably want to add them too. # File = /home/hardware } # # If you backup the root directory, the following two excluded # files can be useful # Exclude { #File = /var/run/bacula/working #File = /tmp #File = /proc #File = /tmp #File = /.journal #File = /.fsck } } Storage { Name = File0 # used for database and restores only # Do not use localhost here Address = distress# N.B. Use a fully qualified name here SDPort = 9103 Password = Device = FileStorage0 Media Type = File } # Definition of file storage device Storage { Name = File1 # used for home/hardware # Do not use localhost here Address = distress# N.B. Use a fully qualified name here SDPort = 9103 Password = Device = FileStorage1 Media Type = File } # Default tape pool definition Pool { Name = Restore Pool Type = Backup Recycle = yes # Bacula can automatically recycle Volumes AutoPrune = yes # Prune expired volumes Volume Retention = 30 days # one month } # File Pool definition Pool { Name = Pool0 Pool Type = Backup Recycle = yes # Bacula can automatically recycle Volumes AutoPrune = yes # Prune expired volumes Volume Retention = 7 days # one week Maximum Volume Bytes = 500G # Limit Volume size to something reasonable Maximum Volumes = 2 # Limit number of Volumes in Pool } # File Pool definition Pool { Name = Pool1 Pool Type = Backup Recycle = yes # Bacula can automatically recycle Volumes AutoPrune = yes # Prune expired volumes Volume Retention = 7 days # one week Maximum Volume Bytes = 500G # Limit Volume size to something reasonable Maximum Volumes = 2 # Limit number of Volumes in Pool Label Format = hardware- } ** bacula-sd Device { Name = Restore Media Type = File Archive Device = /backup0/bacula-restores LabelMedia = yes; # lets Bacula label unlabeled media Random Access = Yes; AutomaticMount = yes; # when device opened, read it RemovableMedia = no; AlwaysOpen = no; } Device { Name = FileStorage0 Media Type = File Archive Device = /backup0/DBB LabelMedia = yes; # lets Bacula label unlabeled media Random Access = Yes; AutomaticMount = yes; # when device opened, read it RemovableMedia = no; AlwaysOpen = no; } Device { Name = FileStorage1 Media Type = File Archive Device = /backup1 LabelMedia = yes; # lets Bacula label unlabeled media Random Access = Yes; AutomaticMount = yes; # when device opened, read it RemovableMedia = no; AlwaysOpen = no; } -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] bacula-dir dies on 3.0.3 version
The problem is certainly in version 3.0.3. In version 3.0.2 this problem does not happen. John, I'm sure the problem is not in the database. The job that I perform, the bacula-dir shows me running, but not running on the bacula-sd and not in the bacula-fd too. John Drescher escreveu: On Sat, Apr 10, 2010 at 11:55 AM, Gustavo Nunes Freire Ribeiro gust...@linconet.com.br wrote: Hello Mehma, I've done these 2 tests: I commented out messages section and still going the same problem. Guilt bstmp's really not because I am not using it. I had restarted the bacula-dir after running the first job and it runs the first job, when I run the second job it simply stop. When I ask the status of the bacula-dir, it shows that the job is running, but when I check in bacula-sd and bacula-fd does not there is no job running. Sorry my english =) How long have you waited? Was there activity on the database server? A job will end on the FD side but still be running on the director for sometimes several minutes. The reason is it may take a long time to despool the database entries. John -- Gustavo N. F. Ribeiro Analista de Redes LINCONET Soluções em Informática LTDA +55 84 9996-3582 -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] bacula-dir dies on 3.0.3 version
On Mon, Apr 12, 2010 at 5:36 PM, Gustavo Nunes Freire Ribeiro gust...@linconet.com.br wrote: The problem is certainly in version 3.0.3. In version 3.0.2 this problem does not happen. John, I'm sure the problem is not in the database. The job that I perform, the bacula-dir shows me running, but not running on the bacula-sd and not in the bacula-fd too. What I mean is it is normal for the client and sd to say the job is finished before the director says its finished. The reason is if you have attribute spooling on this happens after all data is backed up and after the client is done. This may also happen if you are running an accurate backup. John -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] bacula-dir dies on 3.0.3 version
Gustavo, You are probably over-thinking this. If the first job runs every time the services are re-started and you are not getting any messages via email - that's the problem. Mehma --- -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users