Hi,

10.07.2007 09:43,, Daniel J. Priem wrote::
> Hi,
> i have a job wich runs forever in the view of the DIR.
> 
> if i make the ammount of data smaller (reducing fileset to /root as
> example ) then backup works.
> 
> So. any hints?
> 
> Regards
> Daniel
> 
> 
> 
> *status dir
> baculadir01 Version: 1.38.11 (28 June 2006) i486-pc-linux-gnu debian 4.0
> Daemon started 10-Jul-07 08:53, 0 Jobs run since started.
> 
> Scheduled Jobs:
> Level          Type     Pri  Scheduled          Name
> Volume
> ===================================================================================
> Differential   Backup    10  10-Jul-07 10:00
> dmz-bs1-hmailweb1-taeglich disy0018
> ====
> 
> Running Jobs:
> Console connected at 10-Jul-07 09:07
>  JobId Level   Name                       Status
> ======================================================================
>     37 Full    dmz-bs1-hmailweb1-taeglich.2007-07-10_09.00.00 is running
> ====

DIR thinks this job is running.

> No Terminated Jobs.
> ====
> *status stor
> Automatically selected Storage: stor01
> Connecting to Storage daemon stor01 at
> baculastor01.sts.ffm.tc.cust.disy.net:9103
> 
> baculastor01 Version: 1.38.11 (28 June 2006) x86_64-unknown-linux-gnu
> suse 10
> Daemon started 10-Jul-07 07:30, 3 Jobs run since started.
> 
> Running Jobs:
> No Jobs running.
> ====
> 
> Jobs waiting to reserve a drive:
> ====
> 
> Terminated Jobs:
>  JobId  Level   Files          Bytes Status   Finished        Name
> ======================================================================
>     22  Full          0              0 Other    09-Jul-07 19:52
>  dmz-bs1-hmailweb1-taeglich
>     24  Full    122,972  3,188,798,869 OK       09-Jul-07 21:42
>  dmz-bs1-hmailweb1-taeglich
>     25  Full    137,592 10,987,144,277 Error    10-Jul-07 06:34
>  dmz-bs1-hmailweb1-taeglich
>     26  Full    180,232 18,950,880,155 OK       10-Jul-07 06:55
>  dmz-bs1-hmailweb1-taeglich
>     31  Full     17,490     18,990,882 Error    10-Jul-07 07:11
>  dmz-bs1-hmailweb1-taeglich
>     32  Full      4,795    794,824,620 OK       10-Jul-07 07:12
>  dmz-bs1-hmailweb1-taeglich
>     33  Full     11,644        938,011 Cancel   10-Jul-07 07:27
>  dmz-bs1-hmailweb1-taeglich
>     34  Full    180,233 18,952,941,935 OK       10-Jul-07 07:49
>  dmz-bs1-hmailweb1-taeglich
>     36  Full    180,233 18,954,682,347 OK       10-Jul-07 08:38
>  dmz-bs1-hmailweb1-taeglich
>     37  Full    180,233 18,956,285,343 OK       10-Jul-07 09:16
>  dmz-bs1-hmailweb1-taeglich

SD thinks this job already finished.

> ====
> 
> Device status:
> Device "FileStorage" (/raid/bacula) is not open or does not exist.
> ====
> 
> In Use Volume status:
> ====
> 
> *status client=dmz-bs1-hmailweb1
> Connecting to Client dmz-bs1-hmailweb1 at
> dmz-bs1-hmailweb1.sts.ffm.tc.cust.disy.net:9102
> 
> dmz-bs1-hmailweb1 Version: 1.38.11 (28 June 2006)
> x86_64-unknown-linux-gnu suse 9
> Daemon started 09-Jul-07 19:15, 10 Jobs run since started.
> 
> Terminated Jobs:
>  JobId  Level     Files         Bytes  Status   Finished        Name
> ======================================================================
>     23  Full     31,273    417,142,763 Cancel   09-Jul-07 19:24
>  dmz-bs1-hmailweb1-taeglich
>     24  Full    122,972  3,173,801,755 OK       09-Jul-07 21:42
>  dmz-bs1-hmailweb1-taeglich
>     25  Full    137,592 10,970,433,480 Error    10-Jul-07 06:33
>  dmz-bs1-hmailweb1-taeglich
>     26  Full    180,232 18,926,850,782 OK       10-Jul-07 06:55
>  dmz-bs1-hmailweb1-taeglich
>     31  Full     17,492     17,511,532 Error    10-Jul-07 07:10
>  dmz-bs1-hmailweb1-taeglich
>     32  Full      4,795    794,216,527 OK       10-Jul-07 07:12
>  dmz-bs1-hmailweb1-taeglich
>     33  Full     12,899              0 Error    10-Jul-07 07:27
>  dmz-bs1-hmailweb1-taeglich
>     34  Full    180,233 18,928,912,435 OK       10-Jul-07 07:49
>  dmz-bs1-hmailweb1-taeglich
>     36  Full    180,233 18,930,652,847 OK       10-Jul-07 08:37
>  dmz-bs1-hmailweb1-taeglich
>     37  Full    180,233 18,932,255,840 OK       10-Jul-07 09:15
>  dmz-bs1-hmailweb1-taeglich

FD also thinks the job is finished.

> ====
> Running Jobs:
> Director connected at: 10-Jul-07 09:24
> No Jobs running.
> ====
> *

Either you are having connectivity problems between DIR and SD and/or 
FD (quite possible, guessing from the host name with dmz in it - 
firewalls can cause this sort of problem) or there's something wrong 
with the DIR.

I'd suggest setting the heartbeat interval for the FD first, and 
observing network traffic between FD and DIR during backup. I suppose 
the firewall in between terminates that session while the 
FD-SD-connection is up. I wonder why the problem is not detected by 
the DIR. After two hours, it should register a failure, if not earlier.

Arno

-- 
Arno Lehmann
IT-Service Lehmann
www.its-lehmann.de

-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to