Kern Sibbald wrote:

On Tuesday 13 December 2005 17:54, Richard W. Knight wrote:
Kern Sibbald wrote:
On Tuesday 13 December 2005 13:43, Rick Knight wrote:
Kern Sibbald wrote:
Hello,

On Monday 12 December 2005 19:35, Richard W. Knight wrote:
Hi all,

A couple of weeks ago I upgraded my Bacula installation from 1.34 to
1.38.1. I made a few other changes at the same time and everything has
been working well since. Yesterday I decided to upgrade to 1.38.3. I
built from source using the same configuration options that I used to
build 1.38.1 with the addition of "--with-python". The build went  OK,
no errors. I stopped 1.38.1 and started 1.38.3. Everything seemed to be
OK. I ran a couple of small test backups and there were no errors so I
assumed that the upgrade went fine. Last night the scheduled backup ran
and after the first job, instead of continuing on to the next job, I
got this message ...

12-Dec 07:29 MyJob-SMB-sd: Job MyJob.2005-12-12_01.05.01 waiting to
reserve a device.

This morning, when I first saw this message, I just did a mount from
bconsole and the job continued, I have bacula configured for 6 jobs per
media and the tape wasn't full so the job should have just started as
it hallways has. Now it's time to backup up the catalog, to file, and
I'm getting the same message.

I have an HP DDS2 drive, no changer, running on Slackware 10. I'm using
the same conf files the worked fine on 1.38.1. Can anyone tell me why
I'm now getting this error message?
You are not by any chance running on a 2.4 kernel with /lib/tls?

Could you send me your bacula-dir.conf and bacula-sd.conf along with the
job report that shows the jobs blocking?

Thanks,
RickKnight

Thanks Kern,

I am running a 2.4.26 kernel, but I don't know about /lib/tls. I don't
have a /lib/tls on my system. How can I tell?
Try:

ls -l /lib/tls

If it exists, then that could explain why it *appears* that Bacula is not
seeing some pthread broadcasts that would allow it to continue. This is a
bit of a long shot, but at this point, I need to consider all
possibilities ...

In the mean time, I'll take a careful look at your config.  Perhaps I have
missed something important that I can add to my test cases.  All my tests
here succeeded perfectly ...

By the way, getting the message that a job is waiting to reserve a drive
is not in itself bad. This happens in my test case. However, at some
point when the drive is available, the job should continue as it does in
my test case.

Also, the .conf files are attached along with the log.

Thanks again,
RickKnight

Hello Rick,

Kern,

I just got another job log email. Something I noticed is a clock
difference between the Director and File daemons. The two lines below
are from this mornings log (also attached). Could that be causing a
problem?

No, that cannot be causing the problem. For some reason the signals that are sent to wake up the waiting thread are not properly delivered, or there is something else going on.

Thanks for your files and the log.
Your welcome Kern, I hope they help.

I would recommend that you put double quotes arount the "HP SureStore DAT-8" name *everywhere* it appears in your conf file.
Will do.

Can you tell me a few more things?

1. Did you finally have to use the "mount command to free up the Knight-Linux_SMB job? -- if so, I'm impressed because you had to be up at 4am to do it -- if not, where is the problem?
I had to use the mount command to get bacula to start the job. Knight-Linux-SMB runs first, then Knight-Linux runs, followed by Catalog. After Knight-Linux-SMB finished, instead of just continuing on with Knight-Linux, bacula just sits with the "waiting to reserve a device" message. If I just mount the device, the job does continue. But, I just realized something else. I have a "run after" job that rewinds the tape and then ejects it. It didn't seem to be a problem with 1.38.1, but it may be causing the pause now? I'm at work now, so I can't check it but I will when I get home tonight. And yes, I am up at about 4:00 most mornings. I have a long commute to work so I get up, early, spend about 15 minutes in my home office with coffee and email. If a tape needs attention, I do that then too.

2. Can you show me the log output of the job that Knight-Linux_SMB was waiting for?
I'll attach a log that shows both jobs, before and after. Oops, I can't access the log files from work, so I'll send them tonight.

Thanks again Kern,
Rick Knight

I was a bit surprised to see the waiting to reserver a device message printed a number of times, but after looking at the code, it was perfectly OK.
12-Dec 19:29 knight-linux-SMB-sd: Job Knight-Linux.2005-12-12_01.05.01
waiting to reserve a device. 13-Dec 04:33 knight-linux-fd: DIR and FD
clocks differ by 676 seconds, FD automatically adjusting.


Thanks again,
Rick Knight




-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to