[Bacula-users] Bacula SD Broken Pipe after 16 minutes after ...

2014-07-07 Thread dave
Hi !

I'm back from vacation. Smile

thanks for your tips. Unfortunately the Heartbeat won't help. I have upgraded 
meanwhile to 7.0.4. Just came into office to see that the weekend backup failed 
again with Heartbeat set on the client/server on all daemons to 300. As 
suggested.

- It just happens wenn MaxSpoolCache Size gets hist.
- It despools for exactly 16:12 min and then breaks. (what kind of timeout 
would that be ?)

Also did 
- Switched network card on bacula server
- Removed on LTO drive (running single now)
- Switched SAS Port (on Library)

05-Jul 15:42 srv-bacula-dir JobId 9856: Start Backup JobId 9856, 
Job=cli-bacula-data.2014-07-05_15.42.00_34
05-Jul 15:42 srv-bacula-dir JobId 9856: Using Device tapelib-drive0 to write.
05-Jul 15:42 srv-bacula-sd JobId 9856: Spooling data ...
05-Jul 23:10 srv-bacula-sd JobId 9856: User specified Device spool size 
reached: DevSpoolSize=800,000,016,969 MaxDevSpoolSize=800,000,000,000
05-Jul 23:10 srv-bacula-sd JobId 9856: Writing spooled data to Volume. 
Despooling 800,000,016,969 bytes ...
05-Jul 23:26 srv-client-fd JobId 9856: Error: bsock.c:428 Write error sending 
65540 bytes to Storage daemon:srv-bacula:9103: ERR=Broken pipe
05-Jul 23:26 srv-client-fd JobId 9856: Fatal error: backup.c:1200 Network send 
error to SD. ERR=Broken pipe
05-Jul 23:26 srv-bacula-sd JobId 9856: Despooling elapsed time = 00:16:12, 
Transfer rate = 823.0 M Bytes/second
05-Jul 23:26 srv-bacula-dir JobId 9856: Error: Director's connection to SD for 
this Job was lost.
05-Jul 23:26 srv-bacula-dir JobId 9856: Error: Bacula srv-bacula-dir 7.0.4 
(04Jun14):


Again - I am desperate. No clue what else todo to get it running.
- Why is this happening when it starts despooling from MaxSpoolCache size ?
- What has the client to do with despooling ? (05-Jul 23:26) cause the data is 
in the cache on the server.
- Why after 16:12min ?

After restarting the job - in 95% of the retries the backup completes.

Many thanks
-- David

+--
|This was sent by d...@espros.ch via Backup Central.
|Forward SPAM to ab...@backupcentral.com.
+--



--
Open source business process management suite built on Java and Eclipse
Turn processes into business applications with Bonita BPM Community Edition
Quickly connect people, data, and systems into organized workflows
Winner of BOSSIE, CODIE, OW2 and Gartner awards
http://p.sf.net/sfu/Bonitasoft
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


[Bacula-users] FW: Bacula: Backup OK of sphad01-fd Differential

2014-07-07 Thread Luc Van der Veken
I'm using Bacula to back up a shop that's about half linux, half windows.

Three of the Windows machines are located in another network.
A few times per week, I find a result like the one below for a back-up of one 
of these three.
As it stands now, I have a total of 4 such messages for 3 machines over the 
last 7 days.

Traffic is routed through two firewalls that regard each other as trusted (each 
network has its own firewall, each had a spare port in it, I just connected a 
cable between those spare ports and configured routing so everything passes, no 
restricting firewall rules for that connection).

Because traffic is going through those firewalls, I had already configured 
keepalive packets (heartbeat) at 300 seconds.
In my first tests, backups *did* fail because that was missing.  Now they don't 
seem to fail anymore, but there's that socket terminated message every now 
and then that doesn't belong there.

Director and SD are 5.2.5 (default version found in Ubuntu 12.04 LTS).
All Windows clients are using the enterprise FD version 6.0.6.
The windows clients that are exposing these symptoms are Server 2008 R2.  A 
server 2003 that was previously located in the same network also exhibited the 
problem, but neither version in the local (to bacula-dir and bacula-fd) network 
ever does.


Does anyone have an idea what might cause this?


[The Pre and post backup jobs you see are just empty files on this machine, I 
have them configured everywhere and fill in the files where necessary].


04-Jul 22:09 bacula-dir JobId 19317: Start Backup JobId 19317, 
Job=sphad01.2014-07-04_20.05.00_38
04-Jul 22:09 bacula-dir JobId 19317: Using Device FileStorage
04-Jul 22:09 sphad01-fd JobId 19317: shell command: run ClientBeforeJob 
C:/Program Files/Eurautomat/BaculaPreBackup.cmd Differential
04-Jul 22:09 sphad01-fd JobId 19317: Generate VSS snapshots. Driver=Win64 
VSS, Drive(s)=CE
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Task 
Scheduler Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): VSS Metadata 
Store Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Performance 
Counters Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): System 
Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): ASR Writer, 
State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): FRS Writer, 
State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): WMI Writer, 
State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Shadow Copy 
Optimization Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Registry 
Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): COM+ REGDB 
Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Dhcp Jet 
Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): NTDS, 
State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 bacula-sd-sd JobId 19317: Job write elapsed time = 00:15:26, 
Transfer rate = 10.46 M Bytes/second
04-Jul 22:24 sphad01-fd JobId 19317: Error: lib/bsock.c:350 Socket is 
terminated=1 on call to client:10.9.0.89:9102
04-Jul 22:24 sphad01-fd JobId 19317: shell command: run ClientAfterJob 
C:/Program Files/Eurautomat/BaculaPostBackup.cmd Differential
04-Jul 22:24 bacula-dir JobId 19317: Bacula bacula-dir 5.2.5 (26Jan12):
  Build OS:   x86_64-pc-linux-gnu ubuntu 12.04
  JobId:  19317
  Job:sphad01.2014-07-04_20.05.00_38
  Backup Level:   Differential, since=2014-06-14 10:53:31
  Client: sphad01-fd 6.0.6 (30Sep12) Microsoft Windows Server 
2008 R2 Standard Edition Service Pack 1 (build 7601), 64-bit,Cross-compile,Win64
  FileSet:sphad01-set 2014-04-29 09:34:07
  Pool:   File (From Job DiffPool override)
  Catalog:MyCatalog (From Client resource)
  Storage:File (From Pool resource)
  Scheduled time: 04-Jul-2014 20:05:00
  Start time: 04-Jul-2014 22:09:07
  End time:   04-Jul-2014 22:24:34
  Elapsed time:   15 mins 27 secs
  Priority:   10
  FD Files Written:   7,470
  SD Files Written:   7,470
  FD Bytes Written:   9,690,701,338 (9.690 GB)
  SD Bytes Written:   9,692,463,456 (9.692 GB)
  Rate:   10453.8 KB/s
  Software Compression:   35.0 %
  VSS:yes
  Encryption: no
  Accurate:   no
  Volume name(s): FileStorage0003
  Volume Session Id:  1001
  Volume Session Time:1401794541
  Last Volume Bytes:  53,009,601,334 (53.00 GB)
  

[Bacula-users] Socket terminated message after backup complete

2014-07-07 Thread Luc Van der Veken
Sorry for the subject line, forgot to replace it by something more descriptive 
:(


-Original Message-
From: Luc Van der Veken [mailto:luc...@wimionline.com] 
Sent: 07 July 2014 8:54
To: bacula-users@lists.sourceforge.net
Subject: [Bacula-users] FW: Bacula: Backup OK of sphad01-fd Differential

I'm using Bacula to back up a shop that's about half linux, half windows.

Three of the Windows machines are located in another network.
A few times per week, I find a result like the one below for a back-up of one 
of these three.
As it stands now, I have a total of 4 such messages for 3 machines over the 
last 7 days.

Traffic is routed through two firewalls that regard each other as trusted (each 
network has its own firewall, each had a spare port in it, I just connected a 
cable between those spare ports and configured routing so everything passes, no 
restricting firewall rules for that connection).

Because traffic is going through those firewalls, I had already configured 
keepalive packets (heartbeat) at 300 seconds.
In my first tests, backups *did* fail because that was missing.  Now they don't 
seem to fail anymore, but there's that socket terminated message every now 
and then that doesn't belong there.

Director and SD are 5.2.5 (default version found in Ubuntu 12.04 LTS).
All Windows clients are using the enterprise FD version 6.0.6.
The windows clients that are exposing these symptoms are Server 2008 R2.  A 
server 2003 that was previously located in the same network also exhibited the 
problem, but neither version in the local (to bacula-dir and bacula-fd) network 
ever does.


Does anyone have an idea what might cause this?


[The Pre and post backup jobs you see are just empty files on this machine, I 
have them configured everywhere and fill in the files where necessary].


04-Jul 22:09 bacula-dir JobId 19317: Start Backup JobId 19317, 
Job=sphad01.2014-07-04_20.05.00_38
04-Jul 22:09 bacula-dir JobId 19317: Using Device FileStorage
04-Jul 22:09 sphad01-fd JobId 19317: shell command: run ClientBeforeJob 
C:/Program Files/Eurautomat/BaculaPreBackup.cmd Differential
04-Jul 22:09 sphad01-fd JobId 19317: Generate VSS snapshots. Driver=Win64 
VSS, Drive(s)=CE
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Task 
Scheduler Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): VSS Metadata 
Store Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Performance 
Counters Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): System 
Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): ASR Writer, 
State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): FRS Writer, 
State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): WMI Writer, 
State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Shadow Copy 
Optimization Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Registry 
Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): COM+ REGDB 
Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): Dhcp Jet 
Writer, State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 sphad01-fd JobId 19317: VSS Writer (BackupComplete): NTDS, 
State: 0x1 (VSS_WS_STABLE)
04-Jul 22:24 bacula-sd-sd JobId 19317: Job write elapsed time = 00:15:26, 
Transfer rate = 10.46 M Bytes/second
04-Jul 22:24 sphad01-fd JobId 19317: Error: lib/bsock.c:350 Socket is 
terminated=1 on call to client:10.9.0.89:9102
04-Jul 22:24 sphad01-fd JobId 19317: shell command: run ClientAfterJob 
C:/Program Files/Eurautomat/BaculaPostBackup.cmd Differential
04-Jul 22:24 bacula-dir JobId 19317: Bacula bacula-dir 5.2.5 (26Jan12):
  Build OS:   x86_64-pc-linux-gnu ubuntu 12.04
  JobId:  19317
  Job:sphad01.2014-07-04_20.05.00_38
  Backup Level:   Differential, since=2014-06-14 10:53:31
  Client: sphad01-fd 6.0.6 (30Sep12) Microsoft Windows Server 
2008 R2 Standard Edition Service Pack 1 (build 7601), 64-bit,Cross-compile,Win64
  FileSet:sphad01-set 2014-04-29 09:34:07
  Pool:   File (From Job DiffPool override)
  Catalog:MyCatalog (From Client resource)
  Storage:File (From Pool resource)
  Scheduled time: 04-Jul-2014 20:05:00
  Start time: 04-Jul-2014 22:09:07
  End time:   04-Jul-2014 22:24:34
  Elapsed time:   15 mins 27 secs
  Priority:   10
  FD Files Written:   7,470
  SD Files Written:   7,470
  FD Bytes Written:   9,690,701,338 (9.690 GB)
  SD Bytes Written:   9,692,463,456 (9.692 GB)
  Rate:

Re: [Bacula-users] Socket terminated message after backup complete

2014-07-07 Thread Thomas Lohman

 Because traffic is going through those firewalls, I had already
 configured keepalive packets (heartbeat) at 300 seconds. In my first
 tests, backups *did* fail because that was missing.  Now they don't
 seem to fail anymore, but there's that socket terminated message
 every now and then that doesn't belong there.


Hi,

This seems like the problem that you're having.

http://bugs.bacula.org/view.php?id=1925

I believe this was fixed in community client version 5.2.12 and I can 
verify that we no longer see these warning/error messages on clients 
that have been upgraded to = 5.2.12.  We still see it on Windows 
machines that are running 5.2.10.  I don't know which version of the 
Enterprise client has this fix in it.

The messages themselves are mainly harmless so you can ignore them if 
you want to.


--tom

--
Open source business process management suite built on Java and Eclipse
Turn processes into business applications with Bonita BPM Community Edition
Quickly connect people, data, and systems into organized workflows
Winner of BOSSIE, CODIE, OW2 and Gartner awards
http://p.sf.net/sfu/Bonitasoft
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users