[Bacula-users] Bacula SD Broken Pipe after 16 minutes after ...
Hi!

I'm back from vacation :-) Thanks for your tips. Unfortunately, the heartbeat doesn't help. In the meantime I have upgraded to 7.0.4. I just came into the office to see that the weekend backup failed again, with the heartbeat set to 300 on all daemons on both the client and the server, as suggested.

- It only happens when the MaxSpoolCache size is hit.
- It despools for exactly 16:12 min and then breaks. (What kind of timeout would that be?)

I also:
- Switched the network card on the Bacula server
- Removed one LTO drive (running a single drive now)
- Switched the SAS port (on the library)

05-Jul 15:42 srv-bacula-dir JobId 9856: Start Backup JobId 9856, Job=cli-bacula-data.2014-07-05_15.42.00_34
05-Jul 15:42 srv-bacula-dir JobId 9856: Using Device tapelib-drive0 to write.
05-Jul 15:42 srv-bacula-sd JobId 9856: Spooling data ...
05-Jul 23:10 srv-bacula-sd JobId 9856: User specified Device spool size reached: DevSpoolSize=800,000,016,969 MaxDevSpoolSize=800,000,000,000
05-Jul 23:10 srv-bacula-sd JobId 9856: Writing spooled data to Volume. Despooling 800,000,016,969 bytes ...
05-Jul 23:26 srv-client-fd JobId 9856: Error: bsock.c:428 Write error sending 65540 bytes to Storage daemon:srv-bacula:9103: ERR=Broken pipe
05-Jul 23:26 srv-client-fd JobId 9856: Fatal error: backup.c:1200 Network send error to SD. ERR=Broken pipe
05-Jul 23:26 srv-bacula-sd JobId 9856: Despooling elapsed time = 00:16:12, Transfer rate = 823.0 M Bytes/second
05-Jul 23:26 srv-bacula-dir JobId 9856: Error: Director's connection to SD for this Job was lost.
05-Jul 23:26 srv-bacula-dir JobId 9856: Error: Bacula srv-bacula-dir 7.0.4 (04Jun14):

Again, I am desperate. I have no clue what else to do to get it running.

- Why is this happening when it starts despooling after the MaxSpoolCache size is reached?
- What does the client have to do with despooling (05-Jul 23:26)? The data is already in the cache on the server.
- Why after 16:12 min?

After restarting the job, the backup completes in 95% of the retries.
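For reference, a minimal Python sketch that cross-checks the numbers the SD reports in the log above. It is not part of the original setup: the function names are hypothetical, and it assumes that "M Bytes" in the Bacula output means 10^6 bytes. It converts the despooling elapsed time to seconds and recomputes the transfer rate from the despooled byte count.

```python
def hms_to_seconds(hms: str) -> int:
    """Convert an 'HH:MM:SS' string, as printed by the SD, to seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def transfer_rate_mb_per_s(byte_count: int, elapsed: str) -> float:
    """Average rate in units of 10^6 bytes per second (assumed unit)."""
    return byte_count / hms_to_seconds(elapsed) / 1_000_000


# Values copied from the JobId 9856 log lines.
despooled_bytes = 800_000_016_969
despool_elapsed = "00:16:12"

elapsed_seconds = hms_to_seconds(despool_elapsed)
rate = transfer_rate_mb_per_s(despooled_bytes, despool_elapsed)
print(f"{elapsed_seconds} s, {rate:.1f} MB/s")
```

The recomputed rate agrees with the logged 823.0 M Bytes/second, so the 16:12 figure is at least consistent with being the time the despool itself took. Whether a separate timeout is also involved cannot be told from these numbers alone.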
Many thanks
-- David

+--
|This was sent by d...@espros.ch via Backup Central.
|Forward SPAM to ab...@backupcentral.com.
+--
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] FW: Bacula: Backup OK of sphad01-fd Differential
I'm using Bacula to back up a shop that's about half Linux, half Windows. Three of the Windows machines are located in another network.

A few times per week, I find a result like the one below for a backup of one of these three. As it stands now, I have a total of 4 such messages for 3 machines over the last 7 days.

Traffic is routed through two firewalls that regard each other as trusted (each network has its own firewall, each had a spare port, and I just connected a cable between those spare ports and configured routing so everything passes; there are no restricting firewall rules for that connection).

Because traffic is going through those firewalls, I had already configured keepalive packets (heartbeat) at 300 seconds. In my first tests, backups *did* fail because that was missing. Now they don't seem to fail anymore, but there's that socket terminated message every now and then that doesn't belong there.

Director and SD are 5.2.5 (default version found in Ubuntu 12.04 LTS). All Windows clients are using the enterprise FD version 6.0.6. The Windows clients that are exhibiting these symptoms are Server 2008 R2. A Server 2003 that was previously located in the same network also exhibited the problem, but neither version in the local (to bacula-dir and bacula-fd) network ever does.

Does anyone have an idea what might cause this?

[The pre- and post-backup jobs you see are just empty files on this machine. I have them configured everywhere and fill in the files where necessary.]

04-Jul 22:09 bacula-dir JobId 19317: Start Backup JobId 19317, Job=sphad01.2014-07-04_20.05.00_38
04-Jul 22:09 bacula-dir JobId 19317: Using Device FileStorage
04-Jul 22:09 sphad01-fd JobId 19317: shell command: run ClientBeforeJob C:/Program Files/Eurautomat/BaculaPreBackup.cmd Differential
04-Jul 22:09 sphad01-fd JobId 19317: Generate VSS snapshots. Driver=Win64 VSS, Drive(s)=CE
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Task Scheduler Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): VSS Metadata Store Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Performance Counters Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): System Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): ASR Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): FRS Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): WMI Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Shadow Copy Optimization Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Registry Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): COM+ REGDB Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Dhcp Jet Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): NTDS, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 bacula-sd-sd JobId 19317: Job write elapsed time = 00:15:26, Transfer rate = 10.46 M Bytes/second
04-Jul 22:24 sphad01-fd JobId 19317: Error: lib/bsock.c:350 Socket is terminated=1 on call to client:10.9.0.89:9102
04-Jul 22:24 sphad01-fd JobId 19317: shell command: run ClientAfterJob C:/Program Files/Eurautomat/BaculaPostBackup.cmd Differential
04-Jul 22:24 bacula-dir JobId 19317: Bacula bacula-dir 5.2.5 (26Jan12):
  Build OS:               x86_64-pc-linux-gnu ubuntu 12.04
  JobId:                  19317
  Job:                    sphad01.2014-07-04_20.05.00_38
  Backup Level:           Differential, since=2014-06-14 10:53:31
  Client:                 sphad01-fd 6.0.6 (30Sep12) Microsoft Windows Server 2008 R2 Standard Edition Service Pack 1 (build 7601), 64-bit,Cross-compile,Win64
  FileSet:                sphad01-set 2014-04-29 09:34:07
  Pool:                   File (From Job DiffPool override)
  Catalog:                MyCatalog (From Client resource)
  Storage:                File (From Pool resource)
  Scheduled time:         04-Jul-2014 20:05:00
  Start time:             04-Jul-2014 22:09:07
  End time:               04-Jul-2014 22:24:34
  Elapsed time:           15 mins 27 secs
  Priority:               10
  FD Files Written:       7,470
  SD Files Written:       7,470
  FD Bytes Written:       9,690,701,338 (9.690 GB)
  SD Bytes Written:       9,692,463,456 (9.692 GB)
  Rate:                   10453.8 KB/s
  Software Compression:   35.0 %
  VSS:                    yes
  Encryption:             no
  Accurate:               no
  Volume name(s):         FileStorage0003
  Volume Session Id:      1001
  Volume Session Time:    1401794541
  Last Volume Bytes:      53,009,601,334 (53.00 GB)
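To keep track of how often the message shows up per client, something like the following Python sketch could be run over saved job logs. It is only an illustration: the line layout (date, time, daemon name, JobId, message) is inferred from the excerpt above, and the function and pattern names are hypothetical, not anything Bacula provides.

```python
import re
from collections import Counter

# Layout assumed from the job log excerpt: "04-Jul 22:24 <daemon> JobId <n>: <message>"
LINE_RE = re.compile(
    r"^(?P<date>\d{2}-\w{3}) (?P<time>\d{2}:\d{2}) (?P<daemon>\S+) "
    r"JobId (?P<jobid>\d+): (?P<msg>.*)$"
)


def count_socket_terminated(lines):
    """Count 'Socket is terminated' messages per reporting daemon."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match and "Socket is terminated" in match.group("msg"):
            counts[match.group("daemon")] += 1
    return counts


# Two lines copied from the JobId 19317 log.
sample = [
    "04-Jul 22:24 bacula-sd-sd JobId 19317: Job write elapsed time = 00:15:26, "
    "Transfer rate = 10.46 M Bytes/second",
    "04-Jul 22:24 sphad01-fd JobId 19317: Error: lib/bsock.c:350 "
    "Socket is terminated=1 on call to client:10.9.0.89:9102",
]

counts = count_socket_terminated(sample)
print(dict(counts))
```

Counting by daemon name rather than by JobId makes it easy to see whether only the clients behind the firewalls are affected.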
[Bacula-users] Socket terminated message after backup complete
Sorry for the subject line, I forgot to replace it with something more descriptive :(

-----Original Message-----
From: Luc Van der Veken [mailto:luc...@wimionline.com]
Sent: 07 July 2014 8:54
To: bacula-users@lists.sourceforge.net
Subject: [Bacula-users] FW: Bacula: Backup OK of sphad01-fd Differential

[...]
Re: [Bacula-users] Socket terminated message after backup complete
> Because traffic is going through those firewalls, I had already configured keepalive packets (heartbeat) at 300 seconds. In my first tests, backups *did* fail because that was missing. Now they don't seem to fail anymore, but there's that socket terminated message every now and then that doesn't belong there.

Hi,

This seems like the problem that you're having:

http://bugs.bacula.org/view.php?id=1925

I believe this was fixed in community client version 5.2.12, and I can verify that we no longer see these warning/error messages on clients that have been upgraded to >= 5.2.12. We still see them on Windows machines that are running 5.2.10. I don't know which version of the Enterprise client has this fix in it.

The messages themselves are mainly harmless, so you can ignore them if you want to.

--tom
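As a small illustration of the version check described above, here is a hedged Python sketch that tests whether a community client version is at or above 5.2.12. The helper names are hypothetical, and the check deliberately says nothing about Enterprise client versions such as 6.0.6, since it is not known here which Enterprise release carries the fix.

```python
# First community client release believed to contain the fix, per the message above.
FIXED_IN = (5, 2, 12)


def parse_version(version: str) -> tuple:
    """Turn a dotted version string such as '5.2.10' into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def community_client_has_fix(version: str) -> bool:
    """True if a community FD version is at or above the fixed release."""
    return parse_version(version) >= FIXED_IN


for client_version in ("5.2.10", "5.2.12", "5.2.13"):
    status = "fixed" if community_client_has_fix(client_version) else "affected"
    print(client_version, status)
```

Comparing integer tuples rather than raw strings avoids the pitfall where "5.2.9" would sort after "5.2.12" in a plain string comparison.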