Dan Langille wrote:
> On 24 Apr 2006 at 12:05, Martin Horcicka wrote:
>
>> I run a lot of jobs in parallel (with spooling) and a few of them (different
>> ones every day) usually fail with this message from the client daemons:
>>
>> Fatal error: Bad response from stored to open command
>
> Is everthing on the same bacula version? Are the clients all 1.38.8?
> Are the storage daemons all on 1.38.8?
The director and the storage daemon are both 1.38.8 (it's the same server).
The clients are on versions from 1.38.2 to 1.38.8.
>> It seems to happen after elapsing of the job's MaxRunTime or MaxWaitTime and
>> the jobs don't have any destination volume assigned by the storage daemon -
>> which is strange. When I run the job manually immediately after the failure,
>> it works well.
>>
>> Does anyone know what the message really means and where should I look for a
>> cause of the problem?
Right now I'm running another test backup - from 120 jobs run in parallel 118
jobs finished OK and 2 jobs are in a strange state that will likely result in
the problem described above:
>From "status dir":
Director Version: 1.38.8 (14 April 2006) i386-portbld-freebsd5.4 freebsd
5.4-RELEASE-p14
...
Running Jobs:
JobId Level Name Status
======================================================================
2389 Differe b2--backup.2006-04-24_13.08.59 is running
2404 Differe b4--backup.2006-04-24_13.09.14 is running
====
>From "status storage":
Storage Version: 1.38.8 (14 April 2006) i386-portbld-freebsd5.4 freebsd
5.4-RELEASE-p14
...
Running Jobs:
Writing: Differential Backup job b2--backup JobId=2389 Volume=""
pool="Daily" device=""Tape-Library-1-Drive-0" (/dev/nsa0)"
Files=0 Bytes=0 Bytes/sec=0
FDReadSeqNo=4 in_msg=4 out_msg=3 fd=130
Writing: Differential Backup job b4--backup JobId=2404 Volume=""
pool="Daily" device=""Tape-Library-1-Drive-0" (/dev/nsa0)"
Files=0 Bytes=0 Bytes/sec=0
FDReadSeqNo=4 in_msg=4 out_msg=3 fd=161
====
Notice the strange Volume="" above.
>From "status client" (machine b2):
Client Version: 1.38.6 (28 March 2006) i386-portbld-freebsd4.11 freebsd
4.11-RELEASE-p11
...
Running Jobs:
JobId 2389 Job b2--backup.2006-04-24_13.08.59 is running.
Backup Job started: 24-dub-06 13:09
Files=0 Bytes=0 Bytes/sec=0
Files Examined=0
SDReadSeqNo=4 fd=7
Director connected at: 24-dub-06 15:50
====
>From "status client" (machine b4):
Client Version: 1.38.6 (28 March 2006) i386-portbld-freebsd4.11 freebsd
4.11-RELEASE-p11
...
Running Jobs:
JobId 2404 Job b4--backup.2006-04-24_13.09.14 is running.
Backup Job started: 24-dub-06 13:09
Files=0 Bytes=0 Bytes/sec=0
Files Examined=0
SDReadSeqNo=4 fd=7
Director connected at: 24-dub-06 15:52
====
Unfortunately, I don't know how to find out what the system is doing right now
in more detail but it seems it does not do anything.
Martin
-------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Bacula-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/bacula-users