[Bacula-users] Director Crash

2015-11-11 Thread Adam Pielak
Hello,
if we add double clients config (IP oraz name) on Bacula 7.2 and we make
reload or just bacula-dir restart then director crash.
I don't know how look like on 7.0.x, but on 5.x works fine.
Before You make some changes, 1st check config :
$ sudo -u bacula /sbin/bacula-dir -t

Regards,
Adam



smime.p7s
Description: Kryptograficzna sygnatura S/MIME
--
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] director crash when an backup starts

2012-10-24 Thread ZIT-BB Büttner , Frank
Yes, version 5.2.12 do not crash.
The website of the project seems to be some obsolete.
Under news only 5.2.11 is presented and under current files
5.2.6. 

But now all is fine again.:))

Thanks


-Ursprüngliche Nachricht-
Von: Simone Caronni [mailto:negativ...@gmail.com] 
Gesendet: Mittwoch, 24. Oktober 2012 09:51
An: ZIT-BB Büttner, Frank
Cc: bacula-users@lists.sourceforge.net
Betreff: Re: [Bacula-users] director crash when an backup starts

There has been a fix applied in 5.2.12 related to a Director crash,
could you please update?

If you want a binary package you can have a look here:

http://repos.fedorapeople.org/repos/slaanesh/bacula/README.txt

Regards,
--Simone
--
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_sfd2d_oct
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] director crash when an backup starts

2012-10-24 Thread Simone Caronni
There has been a fix applied in 5.2.12 related to a Director crash,
could you please update?

If you want a binary package you can have a look here:

http://repos.fedorapeople.org/repos/slaanesh/bacula/README.txt

Regards,
--Simone


On 24 October 2012 09:14, ZIT-BB Büttner, Frank
 wrote:
> Hello,
> I use bacula 5.2.11 on an SL 6 system(RHEL 6 clone).
> As far I start an backup job, the director crash.
> dbcheck have not find any errors.
> Here the debug output:
>
> system-name: next_vol.c:71-1227 find_next_vol_for_append: JobId=1227 
> PoolId=1, MediaType=File
> system-name: sql_find.c:462-1227 Rtn numrows=1
> system-name: next_vol.c:183-1227 VolJobs=24 FirstWritten=1347350543
> system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
> system-name: next_vol.c:198-1227 return ok=1 find_next_vol
> system-name: catreq.c:146-1227 find_media ok=1 idx=1 vol=vBand-0001
> system-name: fd_cmds.c:109-1227 Opened connection with File daemon
> system-name: authenticate.c:196-1227 Sent: Hello Director system-name calling
> system-name: cram-md5.c:150-1227 sending resp to challenge:
> system-name: cram-md5.c:79-1227 send: auth cram-md5 4@system-name> ssl=0
> system-name: cram-md5.c:98-1227 Authenticate OK
> system-name: sql_get.c:1271-1227 
> db_accurate_get_jobids=1106,1117,1127,1137,1147,1157,1167,1210
> system-name: backup.c:226-1227 Checksum will be sent to FD
> system-name: mysql.c:177-1227 mysql_init done
> system-name: mysql.c:202-1227 mysql_real_connect done
> system-name: mysql.c:204-1227 db_user=bacula db_name=bacula db_password=
> system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
> system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
> system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
> system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
> system-name: catreq.c:285-1227 Update StorageId old=1 new=1
> system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
> system-name: sql_create.c:802-1227 db_create_file_record changes=7
> system-name: ua_prune.c:499-1227 select sql=INSERT INTO DelCandidates SELECT 
> JobId,PurgedFiles,FileSetId,JobFiles,JobStatus FROM Job  JOIN Client USING 
> (ClientId)  JOIN Pool ON (Job.PoolId = Pool.PoolId)  WHERE Type IN ('B', 'C', 
> 'M', 'V',  'D', 'R', 'c', 'm', 'g')   AND JobTDate < 1335510589  AND 
> Client.Name = 'Datenbank_Produktion'  AND Pool.Name = 'Default'
> system-name: ua_prune.c:306-1227 select sql=SELECT COUNT(1) FROM Job  JOIN 
> Client USING (ClientId)  JOIN Pool ON (Job.PoolId = Pool.PoolId)  WHERE 
> PurgedFiles=0  AND JobTDate < 1348470589  AND Client.Name = 
> 'Datenbank_Produktion'  AND Pool.Name = 'Default'
> system-name: job.c:362-1227  End Job stat=T ==
> Bacula interrupted by signal 11: Segmentation violation
> Kaboom! bacula-dir, system-name got signal 11 - Segmentation violation. 
> Attempting traceback.
> Kaboom! exepath=/usr/sbin
> Calling: /usr/sbin/btraceback /usr/sbin/bacula-dir 1761 /var/spool/bacula
> It looks like the traceback worked ...
> Dumping: /var/spool/bacula/system-name.1761.bactrace
>
> here the contains of system-name.1761.bactrace:
> Attempt to dump locks
> threadid=0x7f22121fc700 max=1 current=-1
> threadid=0x7f2212bfd700 max=1 current=-1
> threadid=0x7f22135fe700 max=1 current=-1
> threadid=0x7f2213fff700 max=2 current=-1
> threadid=0x7f221922a700 max=0 current=-1
> threadid=0x7f2219c2b700 max=0 current=-1
> threadid=0x7f22246877e0 max=1 current=-1
> Attempt to dump current JCRs. njcrs=3
> threadid=(nil) JobId=0 JobStatus=R jcr=0x155cf98 
> name=*JobMonitor*.2012-10-24_09.07.31_01
> threadid=(nil) killable=0 JobId=0 JobStatus=R jcr=0x155cf98 
> name=*JobMonitor*.2012-10-24_09.07.31_01
> use_count=1
> JobType=I JobLevel=
> sched_time=24-Okt-2012 09:07 start_time=24-Okt-2012 09:07
> end_time=01-Jan-1970 01:00 wait_time=01-Jan-1970 01:00
> db=(nil) db_batch=(nil) batch_started=0
> threadid=(nil) JobId=0 JobStatus=R jcr=0x7f221078 
> name=-Console-.2012-10-24_09.07.55_02
> threadid=(nil) killable=0 JobId=0 JobStatus=R jcr=0x7f221078 
> name=-Console-.2012-10-24_09.07.55_02
> use_count=1
> JobType=U JobLevel=
> sched_time=24-Okt-2012 09:07 start_time=24-Okt-2012 09:07
> end_time=01-Jan-1970 01:00 wait_time=01-Jan-1970 01:00
> db=0x7f224578 db_batch=(nil) batch_started=0
> B_DB=0x7f224578 db_name=bacula db_user=bacula connected=true
> cmd="SELECT VolumeName,MAX(VolIndex) FROM JobMedia,Media WHERE 
> JobMedia.JobId=1227 AND JobMedia.MediaId=Media.MediaId GROUP BY VolumeName 
> ORDER BY 2 ASC" changes=9
> RWLOCK=0x7f224580 w_active=0 w_wait=0
> threadid=(nil) JobId=0 JobStatus=R jcr=0x7f2204001078 
> name=-Console-.2012-10-24_09.09.40_03
> threadid=(nil) killable=0 JobId=0 JobStatus=R jcr=0x7f2204001078 
> name=-Console-.2012-10-24_09.09.40_03
> use_count=1
> JobType=U JobLevel=
> sched_time=24-Okt-2012 09:09 start_time=24-Okt-2012 09:09
> end_time=0

[Bacula-users] director crash when an backup starts

2012-10-24 Thread ZIT-BB Büttner , Frank
Hello,
I use bacula 5.2.11 on an SL 6 system(RHEL 6 clone).
As far I start an backup job, the director crash.
dbcheck have not find any errors.
Here the debug output:

system-name: next_vol.c:71-1227 find_next_vol_for_append: JobId=1227 PoolId=1, 
MediaType=File
system-name: sql_find.c:462-1227 Rtn numrows=1
system-name: next_vol.c:183-1227 VolJobs=24 FirstWritten=1347350543
system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
system-name: next_vol.c:198-1227 return ok=1 find_next_vol
system-name: catreq.c:146-1227 find_media ok=1 idx=1 vol=vBand-0001
system-name: fd_cmds.c:109-1227 Opened connection with File daemon
system-name: authenticate.c:196-1227 Sent: Hello Director system-name calling
system-name: cram-md5.c:150-1227 sending resp to challenge: 
system-name: cram-md5.c:79-1227 send: auth cram-md5 4@system-name> ssl=0
system-name: cram-md5.c:98-1227 Authenticate OK 
system-name: sql_get.c:1271-1227 
db_accurate_get_jobids=1106,1117,1127,1137,1147,1157,1167,1210
system-name: backup.c:226-1227 Checksum will be sent to FD
system-name: mysql.c:177-1227 mysql_init done
system-name: mysql.c:202-1227 mysql_real_connect done
system-name: mysql.c:204-1227 db_user=bacula db_name=bacula db_password=
system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
system-name: catreq.c:285-1227 Update StorageId old=1 new=1
system-name: next_vol.c:271-1227 Vol=vBand-0001 expired=0
system-name: sql_create.c:802-1227 db_create_file_record changes=7
system-name: ua_prune.c:499-1227 select sql=INSERT INTO DelCandidates SELECT 
JobId,PurgedFiles,FileSetId,JobFiles,JobStatus FROM Job  JOIN Client USING 
(ClientId)  JOIN Pool ON (Job.PoolId = Pool.PoolId)  WHERE Type IN ('B', 'C', 
'M', 'V',  'D', 'R', 'c', 'm', 'g')   AND JobTDate < 1335510589  AND 
Client.Name = 'Datenbank_Produktion'  AND Pool.Name = 'Default'
system-name: ua_prune.c:306-1227 select sql=SELECT COUNT(1) FROM Job  JOIN 
Client USING (ClientId)  JOIN Pool ON (Job.PoolId = Pool.PoolId)  WHERE 
PurgedFiles=0  AND JobTDate < 1348470589  AND Client.Name = 
'Datenbank_Produktion'  AND Pool.Name = 'Default'
system-name: job.c:362-1227  End Job stat=T ==
Bacula interrupted by signal 11: Segmentation violation
Kaboom! bacula-dir, system-name got signal 11 - Segmentation violation. 
Attempting traceback.
Kaboom! exepath=/usr/sbin
Calling: /usr/sbin/btraceback /usr/sbin/bacula-dir 1761 /var/spool/bacula
It looks like the traceback worked ...
Dumping: /var/spool/bacula/system-name.1761.bactrace

here the contains of system-name.1761.bactrace:
Attempt to dump locks
threadid=0x7f22121fc700 max=1 current=-1
threadid=0x7f2212bfd700 max=1 current=-1
threadid=0x7f22135fe700 max=1 current=-1
threadid=0x7f2213fff700 max=2 current=-1
threadid=0x7f221922a700 max=0 current=-1
threadid=0x7f2219c2b700 max=0 current=-1
threadid=0x7f22246877e0 max=1 current=-1
Attempt to dump current JCRs. njcrs=3
threadid=(nil) JobId=0 JobStatus=R jcr=0x155cf98 
name=*JobMonitor*.2012-10-24_09.07.31_01
threadid=(nil) killable=0 JobId=0 JobStatus=R jcr=0x155cf98 
name=*JobMonitor*.2012-10-24_09.07.31_01
use_count=1
JobType=I JobLevel=
sched_time=24-Okt-2012 09:07 start_time=24-Okt-2012 09:07
end_time=01-Jan-1970 01:00 wait_time=01-Jan-1970 01:00
db=(nil) db_batch=(nil) batch_started=0
threadid=(nil) JobId=0 JobStatus=R jcr=0x7f221078 
name=-Console-.2012-10-24_09.07.55_02
threadid=(nil) killable=0 JobId=0 JobStatus=R jcr=0x7f221078 
name=-Console-.2012-10-24_09.07.55_02
use_count=1
JobType=U JobLevel=
sched_time=24-Okt-2012 09:07 start_time=24-Okt-2012 09:07
end_time=01-Jan-1970 01:00 wait_time=01-Jan-1970 01:00
db=0x7f224578 db_batch=(nil) batch_started=0
B_DB=0x7f224578 db_name=bacula db_user=bacula connected=true
cmd="SELECT VolumeName,MAX(VolIndex) FROM JobMedia,Media WHERE 
JobMedia.JobId=1227 AND JobMedia.MediaId=Media.MediaId GROUP BY VolumeName 
ORDER BY 2 ASC" changes=9
RWLOCK=0x7f224580 w_active=0 w_wait=0
threadid=(nil) JobId=0 JobStatus=R jcr=0x7f2204001078 
name=-Console-.2012-10-24_09.09.40_03
threadid=(nil) killable=0 JobId=0 JobStatus=R jcr=0x7f2204001078 
name=-Console-.2012-10-24_09.09.40_03
use_count=1
JobType=U JobLevel=
sched_time=24-Okt-2012 09:09 start_time=24-Okt-2012 09:09
end_time=01-Jan-1970 01:00 wait_time=01-Jan-1970 01:00
db=0x7f224578 db_batch=(nil) batch_started=0
B_DB=0x7f224578 db_name=bacula db_user=bacula connected=true
cmd="SELECT VolumeName,MAX(VolIndex) FROM JobMedia,Media WHERE 
JobMedia.JobId=1227 AND JobMedia.MediaId=Media.MediaId GROUP BY VolumeName 
ORDER BY 2 ASC" changes=9
RWLOCK=0x7f224580 w_active=0 w_wait=0
Attempt to dump plugins. Hook count=0


Re: [Bacula-users] Director crash- again with traceback

2011-01-14 Thread jerry lowry
No one has any ideas on what would have caused this.  Based on the trace 
dump it looks like there is a problem with the scheduler.  Any pointers 
as to what I can look at?


thanks,

---
Jerold Lowry
IT Manager / Software Engineer
Engineering Design Team (EDT), Inc. a HEICO company
1400 NW Compton Drive, Suite 315
Beaverton, Oregon 97006 (U.S.A.)
Phone: 503-690-1234 / 800-435-4320
Fax: 503-690-1243
Web: _www.edt.com _



On 1/11/2011 9:12 AM, jerry lowry wrote:

I really hate when I do that!!!

[?1034h[Thread debugging using libthread_db enabled]
[New Thread 0x7f8362bfd710 (LWP 9002)]
[New Thread 0x7f8363fff710 (LWP 3111)]
[New Thread 0x7f8368c49710 (LWP 3110)]
0x003377a0e91d in nanosleep () from /lib64/libpthread.so.0
$1 = '\000'
$2 = 0x1fe2068 "bacula-dir"
$3 = 0x1fe20a8 "/usr/bacula/bin/bacula-dir"
$4 = 0x7f834c004328 "MySQL"
$5 = 0x7f836eadbd9e "5.0.1 (24 February 2010)"
$6 = 0x7f836eadbdb7 "x86_64-unknown-linux-gnu"
$7 = 0x7f836eadbdd0 "redhat"
$8 = 0x7f836eadba7c ""
$9 = "distress", '\000'
#0  0x003377a0e91d in nanosleep () from /lib64/libpthread.so.0
#1  0x7f836eaae6f7 in bmicrosleep (sec=60, usec=0) at bsys.c:61
#2  0x0042e1d5 in wait_for_next_job (
 one_shot_job_to_run=) at scheduler.c:131
#3  0x0040d93d in main (argc=,
 argv=) at dird.c:338

Thread 4 (Thread 0x7f8368c49710 (LWP 3110)):
#0  0x0033772d7393 in select () from /lib64/libc.so.6
#1  0x7f836eab0ad4 in bnet_thread_server (addrs=,
 max_clients=, client_wq=,
 handle_client_request=) at bnet_server.c:161
#2  0x004468fc in connect_thread (arg=0x1fe3ee8) at ua_server.c:82
#3  0x003377a06a3a in start_thread () from /lib64/libpthread.so.0
#4  0x0033772de62d in clone () from /lib64/libc.so.6
#5  0x in ?? ()

Thread 3 (Thread 0x7f8363fff710 (LWP 3111)):
#0  0x003377a0b3b9 inpthread_cond_timedwait@@GLIBC_2.3.2  ()
from /lib64/libpthread.so.0
#1  0x7f836ead402c in watchdog_thread (arg=)
 at watchdog.c:308
#2  0x003377a06a3a in start_thread () from /lib64/libpthread.so.0
#3  0x0033772de62d in clone () from /lib64/libc.so.6
#4  0x in ?? ()

Thread 2 (Thread 0x7f8362bfd710 (LWP 9002)):
#0  0x003377a0ec8d in waitpid () from /lib64/libpthread.so.0
#1  0x7f836eacb7ad in signal_handler (sig=11) at signal.c:229
#2
#3  0x003377a0c280 in pthread_kill () from /lib64/libpthread.so.0
#4  0x00420eba in cancel_storage_daemon_job (jcr=0x7f834c01c2f8)
 at job.c:515
#5  0x00410b50 in wait_for_job_termination (jcr=0x7f834c01c2f8,
 timeout=) at backup.c:538
#6  0x004116f0 in do_backup (jcr=0x7f834c01c2f8) at backup.c:456
#7  0x00421fd4 in job_thread (arg=0x7f834c01c2f8) at job.c:314
#8  0x00423624 in jobq_server (arg=0x673b40) at jobq.c:450
#9  0x003377a06a3a in start_thread () from /lib64/libpthread.so.0
#10 0x0033772de62d in clone () from /lib64/libc.so.6
#11 0x in ?? ()

Thread 1 (Thread 0x7f836ea7b7e0 (LWP 3106)):
#0  0x003377a0e91d in nanosleep () from /lib64/libpthread.so.0
#1  0x7f836eaae6f7 in bmicrosleep (sec=60, usec=0) at bsys.c:61
#2  0x0042e1d5 in wait_for_next_job (
 one_shot_job_to_run=) at scheduler.c:131
#3  0x0040d93d in main (argc=,
 argv=) at dird.c:338
#0  0x003377a0e91d in nanosleep () from /lib64/libpthread.so.0
No symbol table info available.
#1  0x7f836eaae6f7 in bmicrosleep (sec=60, usec=0) at bsys.c:61
61 stat = nanosleep(&timeout, NULL);
timeout = {tv_sec = 60, tv_nsec = 0}
tv = {tv_sec = 90194313216, tv_usec = 140202474247679}
tz = {tz_minuteswest = 372, tz_dsttime = 0}
stat =
#2  0x0042e1d5 in wait_for_next_job (
 one_shot_job_to_run=) at scheduler.c:131
131   bmicrosleep(next_check_secs, 0); /* recheck once per minute */
jcr =
job =
run =
now =
prev =
first = false
next_job =
#3  0x0040d93d in main (argc=,
 argv=) at dird.c:338
338while ( (jcr = wait_for_next_job(runjob)) ) {
jcr =
test_config = false
ch =
no_signals = false
uid = 0x0
gid = 0x0
mode =
#0  0x in ?? ()
No symbol table info available.
#0  0x in ?? ()
No symbol table info available.
#0  0x in ?? ()
No symbol table info available.
#0  0x in ?? ()
No symbol table info available.


 Original Message 
Subject:Director crash
Date:   Tue, 11 Jan 2011 09:11:17 -0800
From:   jerry lowry 
To: bacula-users@lists.sourceforge.net



Hi list,

I came in this morning and found that my director had died last night 
after doing two of the backups.  The traceback follows at the end.

This is the scenario:

I noticed yesterday that the only two jobs that were scheduled to 
be performed last night were a monthly backup and the catalog backup.  
Given that I did not have the time to research why the other 5 backups 
w

[Bacula-users] Director crash- again with traceback

2011-01-11 Thread jerry lowry

I really hate when I do that!!!

[?1034h[Thread debugging using libthread_db enabled]
[New Thread 0x7f8362bfd710 (LWP 9002)]
[New Thread 0x7f8363fff710 (LWP 3111)]
[New Thread 0x7f8368c49710 (LWP 3110)]
0x003377a0e91d in nanosleep () from /lib64/libpthread.so.0
$1 = '\000'
$2 = 0x1fe2068 "bacula-dir"
$3 = 0x1fe20a8 "/usr/bacula/bin/bacula-dir"
$4 = 0x7f834c004328 "MySQL"
$5 = 0x7f836eadbd9e "5.0.1 (24 February 2010)"
$6 = 0x7f836eadbdb7 "x86_64-unknown-linux-gnu"
$7 = 0x7f836eadbdd0 "redhat"
$8 = 0x7f836eadba7c ""
$9 = "distress", '\000'
#0  0x003377a0e91d in nanosleep () from /lib64/libpthread.so.0
#1  0x7f836eaae6f7 in bmicrosleep (sec=60, usec=0) at bsys.c:61
#2  0x0042e1d5 in wait_for_next_job (
one_shot_job_to_run=) at scheduler.c:131
#3  0x0040d93d in main (argc=,
argv=) at dird.c:338

Thread 4 (Thread 0x7f8368c49710 (LWP 3110)):
#0  0x0033772d7393 in select () from /lib64/libc.so.6
#1  0x7f836eab0ad4 in bnet_thread_server (addrs=,
max_clients=, client_wq=,
handle_client_request=) at bnet_server.c:161
#2  0x004468fc in connect_thread (arg=0x1fe3ee8) at ua_server.c:82
#3  0x003377a06a3a in start_thread () from /lib64/libpthread.so.0
#4  0x0033772de62d in clone () from /lib64/libc.so.6
#5  0x in ?? ()

Thread 3 (Thread 0x7f8363fff710 (LWP 3111)):
#0  0x003377a0b3b9 inpthread_cond_timedwait@@GLIBC_2.3.2  ()
   from /lib64/libpthread.so.0
#1  0x7f836ead402c in watchdog_thread (arg=)
at watchdog.c:308
#2  0x003377a06a3a in start_thread () from /lib64/libpthread.so.0
#3  0x0033772de62d in clone () from /lib64/libc.so.6
#4  0x in ?? ()

Thread 2 (Thread 0x7f8362bfd710 (LWP 9002)):
#0  0x003377a0ec8d in waitpid () from /lib64/libpthread.so.0
#1  0x7f836eacb7ad in signal_handler (sig=11) at signal.c:229
#2
#3  0x003377a0c280 in pthread_kill () from /lib64/libpthread.so.0
#4  0x00420eba in cancel_storage_daemon_job (jcr=0x7f834c01c2f8)
at job.c:515
#5  0x00410b50 in wait_for_job_termination (jcr=0x7f834c01c2f8,
timeout=) at backup.c:538
#6  0x004116f0 in do_backup (jcr=0x7f834c01c2f8) at backup.c:456
#7  0x00421fd4 in job_thread (arg=0x7f834c01c2f8) at job.c:314
#8  0x00423624 in jobq_server (arg=0x673b40) at jobq.c:450
#9  0x003377a06a3a in start_thread () from /lib64/libpthread.so.0
#10 0x0033772de62d in clone () from /lib64/libc.so.6
#11 0x in ?? ()

Thread 1 (Thread 0x7f836ea7b7e0 (LWP 3106)):
#0  0x003377a0e91d in nanosleep () from /lib64/libpthread.so.0
#1  0x7f836eaae6f7 in bmicrosleep (sec=60, usec=0) at bsys.c:61
#2  0x0042e1d5 in wait_for_next_job (
one_shot_job_to_run=) at scheduler.c:131
#3  0x0040d93d in main (argc=,
argv=) at dird.c:338
#0  0x003377a0e91d in nanosleep () from /lib64/libpthread.so.0
No symbol table info available.
#1  0x7f836eaae6f7 in bmicrosleep (sec=60, usec=0) at bsys.c:61
61 stat = nanosleep(&timeout, NULL);
timeout = {tv_sec = 60, tv_nsec = 0}
tv = {tv_sec = 90194313216, tv_usec = 140202474247679}
tz = {tz_minuteswest = 372, tz_dsttime = 0}
stat =
#2  0x0042e1d5 in wait_for_next_job (
one_shot_job_to_run=) at scheduler.c:131
131   bmicrosleep(next_check_secs, 0); /* recheck once per minute */
jcr =
job =
run =
now =
prev =
first = false
next_job =
#3  0x0040d93d in main (argc=,
argv=) at dird.c:338
338while ( (jcr = wait_for_next_job(runjob)) ) {
jcr =
test_config = false
ch =
no_signals = false
uid = 0x0
gid = 0x0
mode =
#0  0x in ?? ()
No symbol table info available.
#0  0x in ?? ()
No symbol table info available.
#0  0x in ?? ()
No symbol table info available.
#0  0x in ?? ()
No symbol table info available.



 Original Message 
Subject:Director crash
Date:   Tue, 11 Jan 2011 09:11:17 -0800
From:   jerry lowry 
To: bacula-users@lists.sourceforge.net



Hi list,

I came in this morning and found that my director had died last night 
after doing two of the backups.  The traceback follows at the end.

This is the scenario:

I noticed yesterday that the only two jobs that were scheduled to 
be performed last night were a monthly backup and the catalog backup.  
Given that I did not have the time to research why the other 5 backups 
were not scheduled I started BAT and selected the jobs to run at the 
appropriate times they normally run each night ( supposed to anyway ).  
So, when I looked at the director status I saw the two that were 
scheduled and 5 jobs that were waiting for the selected time to run.


The two jobs that were scheduled ran without any errors.  The director 
crashed when running the first job that I selected to run from BAT.  
From BAT
I selected the JOBS tab and then selected the job which I wanted to 
run.  I modified only the "when" ( or start time ) by highlighting the 

[Bacula-users] Director crash

2011-01-11 Thread jerry lowry

Hi list,

I came in this morning and found that my director had died last night 
after doing two of the backups.  The traceback follows at the end.

This is the scenario:

I noticed yesterday that the only two jobs that were scheduled to 
be performed last night were a monthly backup and the catalog backup.  
Given that I did not have the time to research why the other 5 backups 
were not scheduled I started BAT and selected the jobs to run at the 
appropriate times they normally run each night ( supposed to anyway ).  
So, when I looked at the director status I saw the two that were 
scheduled and 5 jobs that were waiting for the selected time to run.


The two jobs that were scheduled ran without any errors.  The director 
crashed when running the first job that I selected to run from BAT.  
From BAT
I selected the JOBS tab and then selected the job which I wanted to 
run.  I modified only the "when" ( or start time ) by highlighting the 
hour and minute
and inserting the time I wanted the job to run.  Did this for each of 
the jobs that did not get scheduled.


Made sure they were all showing up in the DIRECTOR tab and went on home.

Restarted bacula this morning and all the jobs were scheduled as normal.

Any clues or ideas as to the problem would be great.

OS:  Fedora 12 ( 2.6.32.11-99.fc12)
MySQL: 5.1.45 ( source distribution )
Bacula: 5.0.1

--

---
Jerold Lowry
IT Manager / Software Engineer
Engineering Design Team (EDT), Inc. a HEICO company
1400 NW Compton Drive, Suite 315
Beaverton, Oregon 97006 (U.S.A.)
Phone: 503-690-1234 / 800-435-4320
Fax: 503-690-1243
Web: _www.edt.com _


--
Protect Your Site and Customers from Malware Attacks
Learn about various malware tactics and how to avoid them. Understand 
malware threats, the impact they can have on your business, and how you 
can protect your company and customers by using code signing.
http://p.sf.net/sfu/oracle-sfdevnl___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Director crash on reload command

2010-08-17 Thread John Drescher
> Ok, I've missed that option. Thank's John.
>

I missed it about 4 years ago and the list helped me with that. Ever
since I always test the config first.

John

--
This SF.net email is sponsored by 

Make an app they can't live without
Enter the BlackBerry Developer Challenge
http://p.sf.net/sfu/RIM-dev2dev 
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Director crash on reload command

2010-08-17 Thread John Drescher
> The director crash on reload command if syntax is broken. Do you know a
> workarround. ? This is really annoying as all running job failed for a so
> little syntax error ... can't be that way for a professional tool like
> Bacula.
>

Yes. Always test the configuration before you reload it.

bacula-dir -t /etc/bacula/bacula-dir.conf

John

--
This SF.net email is sponsored by 

Make an app they can't live without
Enter the BlackBerry Developer Challenge
http://p.sf.net/sfu/RIM-dev2dev 
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


[Bacula-users] Director crash on reload command

2010-08-17 Thread Pierre Lavoisier
Hi,

The director crash on reload command if syntax is broken. Do you know a
workarround. ? This is really annoying as all running job failed for a so
little syntax error ... can't be that way for a professional tool like
Bacula.


Best,
P.Lavoisier.
--
This SF.net email is sponsored by 

Make an app they can't live without
Enter the BlackBerry Developer Challenge
http://p.sf.net/sfu/RIM-dev2dev ___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] director crash(?) and empty report email

2006-05-24 Thread Kern Sibbald

> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA1
>
> I don't believe so, but I could be wrong. I don't think there's
> presently any trapping for config problems. In my experience, it will
> die and you can get the error by attempting to restart it.

In 1.38.x you should test the changed conf with "bacula-dir -t -c
bacula-dir.conf" prior to reloading it.

In 1.39.x hopefully *all* the reload problems are resolved and handled
gracefully -- more user testing will confirm this ...

>
> Bill Moran wrote:
>
>> On Wed, 24 May 2006 11:23:07 +0200 "Robert Wirth"
>> <[EMAIL PROTECTED]> wrote:
>>
>>> Hi!
>>>
>>> I'm using bacula 1.38.7. Today, when I changed the director's
>>> configuration a bit --just changed a MaxWaitTime entry in a
>>> JobDefs resource--, the director daemon terminated after the
>>> reload and sent the email attached here.
>>>
>>> The email has a subject which I don't understand, and an empty
>>> body. Thus, I can't figure out what was going wrong.
>>>
>>> Can anybody give me a hint? Around the same time, there was a
>>> backup job running that saved the catalog database. I wonder if
>>> the reload itself, or the reload while backup, or what else was
>>> the cause for the crash.
>>
>>
>> Don't know for sure if this is your problem, but that version of
>> Bacula would crash if you did a reload and the permissions on its
>> config files didn't allow the bacula user to read the config.
>> Check the permissions on the config files.
>>
>> I think this is fixed in newer versions? Anyone know?
>>
> -BEGIN PGP SIGNATURE-
> Version: GnuPG v1.4.1 (GNU/Linux)
> Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org
>
> iD8DBQFEdQermb+gadEcsb4RAlnGAKDBLmbI3NL/zP3kyB1BphuLDLw0GQCeNLDH
> +tQ8z8Qr2B9hivx18W42j54=
> =fAdN
> -END PGP SIGNATURE-
>
>
>
> ---
> All the advantages of Linux Managed Hosting--Without the Cost and Risk!
> Fully trained technicians. The highest number of Red Hat certifications in
> the hosting industry. Fanatical Support. Click to learn more
> http://sel.as-us.falkag.net/sel?cmd=lnk&kid=107521&bid=248729&dat=121642
> ___
> Bacula-users mailing list
> Bacula-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/bacula-users
>


Best regards, Kern


---
All the advantages of Linux Managed Hosting--Without the Cost and Risk!
Fully trained technicians. The highest number of Red Hat certifications in
the hosting industry. Fanatical Support. Click to learn more
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=107521&bid=248729&dat=121642
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] director crash(?) and empty report email

2006-05-24 Thread Ryan Novosielski
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

I don't believe so, but I could be wrong. I don't think there's
presently any trapping for config problems. In my experience, it will
die and you can get the error by attempting to restart it.

Bill Moran wrote:

> On Wed, 24 May 2006 11:23:07 +0200 "Robert Wirth"
> <[EMAIL PROTECTED]> wrote:
>
>> Hi!
>>
>> I'm using bacula 1.38.7. Today, when I changed the director's
>> configuration a bit --just changed a MaxWaitTime entry in a
>> JobDefs resource--, the director daemon terminated after the
>> reload and sent the email attached here.
>>
>> The email has a subject which I don't understand, and an empty
>> body. Thus, I can't figure out what was going wrong.
>>
>> Can anybody give me a hint? Around the same time, there was a
>> backup job running that saved the catalog database. I wonder if
>> the reload itself, or the reload while backup, or what else was
>> the cause for the crash.
>
>
> Don't know for sure if this is your problem, but that version of
> Bacula would crash if you did a reload and the permissions on its
> config files didn't allow the bacula user to read the config.
> Check the permissions on the config files.
>
> I think this is fixed in newer versions? Anyone know?
>
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.1 (GNU/Linux)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iD8DBQFEdQermb+gadEcsb4RAlnGAKDBLmbI3NL/zP3kyB1BphuLDLw0GQCeNLDH
+tQ8z8Qr2B9hivx18W42j54=
=fAdN
-END PGP SIGNATURE-



---
All the advantages of Linux Managed Hosting--Without the Cost and Risk!
Fully trained technicians. The highest number of Red Hat certifications in
the hosting industry. Fanatical Support. Click to learn more
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=107521&bid=248729&dat=121642
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] director crash(?) and empty report email

2006-05-24 Thread Bill Moran
On Wed, 24 May 2006 11:23:07 +0200
"Robert Wirth" <[EMAIL PROTECTED]> wrote:

> Hi!
> 
> I'm using bacula 1.38.7.  Today, when I changed the director's 
> configuration a bit --just changed a MaxWaitTime entry in a JobDefs 
> resource--, the director daemon terminated after the reload and sent
> the email attached here. 
> 
> The email has a subject which I don't understand, and an empty body.
> Thus, I can't figure out what was going wrong.
> 
> Can anybody give me a hint?  Around the same time, there was a backup job
> running that saved the catalog database.  I wonder if the reload itself, 
> or the reload while backup, or what else was the cause for the crash.

Don't know for sure if this is your problem, but that version of Bacula
would crash if you did a reload and the permissions on its config files
didn't allow the bacula user to read the config.  Check the permissions
on the config files.

I think this is fixed in newer versions?  Anyone know?

-- 
Bill Moran
Collaborative Fusion Inc.


---
All the advantages of Linux Managed Hosting--Without the Cost and Risk!
Fully trained technicians. The highest number of Red Hat certifications in
the hosting industry. Fanatical Support. Click to learn more
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=107521&bid=248729&dat=121642
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


[Bacula-users] director crash(?) and empty report email

2006-05-24 Thread Robert Wirth
Hi!

I'm using bacula 1.38.7.  Today, when I changed the director's 
configuration a bit --just changed a MaxWaitTime entry in a JobDefs 
resource--, the director daemon terminated after the reload and sent
the email attached here. 

The email has a subject which I don't understand, and an empty body.
Thus, I can't figure out what was going wrong.

Can anybody give me a hint?  Around the same time, there was a backup job
running that saved the catalog database.  I wonder if the reload itself, 
or the reload while backup, or what else was the cause for the crash.


Regards,

Robert

--- Begin Message ---
--- End Message ---

++ German Research Center for Artificial Intelligence ++

Robert Wirth, Stuhlsatzenhausweg 3, D-66123 Saarbruecken
@office: +49-681-302-5078/5572 ++ @fax: +49-681-302-5341 
mailto:[EMAIL PROTECTED] ++ http://www.dfki.de/~wirth



[Bacula-users] Director crash again...sigh

2006-04-14 Thread Joshua Kugler
Well, my solution (compiling with -O0) seemed to work for a while.  I came in 
this morning to no bacula reports, and I find a director hung.  Tracebacks 
for -dir and -sd are below.  This is 1.36.3.  I don't *really* want to 
upgrade at the moment, but I can if need be.

Netstat shows:

tcp0  0 *:9101  *:* LISTEN  
7883/bacula-dir 
tcp0  0 *:9102  *:* LISTEN  
5688/bacula-fd  
tcp0  0 *:9103  *:* LISTEN  
5703/bacula-sd  
tcp0  0 herodotus.cde.uaf:48028 locus.dist-ed.uaf.:9102 
ESTABLISHED 7883/bacula-dir 
tcp   31  0 herodotus.cde.uaf.:9101 herodotus.cde.uaf:55934 CLOSE_WAIT  
7883/bacula-dir 
tcp 1248  0 herodotus.cde.uaf:60839 herodotus.cde.uaf.:9103 
ESTABLISHED 7883/bacula-dir 
tcp0  0 herodotus.cde.uaf:60841 herodotus.cde.uaf.:9103 
ESTABLISHED 7883/bacula-dir 
tcp0  0 herodotus.cde.uaf:60847 herodotus.cde.uaf.:9103 
ESTABLISHED 7883/bacula-dir 
tcp0  0 herodotus.cde.uaf.:9103 herodotus.cde.uaf:60839 
ESTABLISHED 5703/bacula-sd  
tcp0  0 herodotus.cde.uaf.:9103 herodotus.cde.uaf:60841 
ESTABLISHED 5703/bacula-sd  
tcp0  0 herodotus.cde.uaf.:9103 herodotus.cde.uaf:60847 
ESTABLISHED 5703/bacula-sd  
tcp   31  0 herodotus.cde.uaf.:9101 herodotus.cde.uaf:54717 CLOSE_WAIT  
7883/bacula-dir 
tcp0  0 herodotus.cde.uaf:37018 warbucks.cde.uaf.e:9102 
ESTABLISHED 7883/bacula-dir 
tcp0  0 herodotus.cde.uaf:58673 fmpserver.cde.uaf.:9102 
ESTABLISHED 7883/bacula-dir 
tcp83962  0 herodotus.cde.uaf.:9103 warbucks.cde.uaf.:39126 
ESTABLISHED 5703/bacula-sd  
tcp0  0 herodotus.cde.uaf.:9103 fmpserver.cde.uaf:48286 
ESTABLISHED 5703/bacula-sd

There are jobs in progress, but no activity.  Load is at 0, and I can read the 
devices to which it is trying to write (/backupvault, a disk).  There is only 
on bacula-dir instance showing up on on ps, so the other instances may have 
crashed.  I cannot connect via bconsole.  Here is the traceback it gave me 
when I ran btraceback.

Using host libthread_db library "/lib/tls/libthread_db.so.1".
[Thread debugging using libthread_db enabled]
[New Thread -1213032256 (LWP 7883)]
[New Thread -1318069328 (LWP 8664)]
[Thread debugging using libthread_db enabled]
[New Thread -1213032256 (LWP 7883)]
[New Thread -1318069328 (LWP 8664)]
[Thread debugging using libthread_db enabled]
[New Thread -1213032256 (LWP 7883)]
[New Thread -1318069328 (LWP 8664)]
[New Thread -1309676624 (LWP 8632)]
[New Thread -1301283920 (LWP 703)]
[New Thread -1272034384 (LWP 696)]
[New Thread -1263641680 (LWP 694)]
[New Thread -1255248976 (LWP 693)]
[New Thread -1246671952 (LWP 690)]
[New Thread -1288827984 (LWP 689)]
[New Thread -1238258768 (LWP 687)]
[New Thread -1280435280 (LWP 676)]
[New Thread -1221469264 (LWP 7888)]
[New Thread -1213076560 (LWP 7887)]
0xe410 in ?? ()
$1 = "herodotus-dir", '\0' 
$2 = 0x80cb020 "bacula-dir"
$3 = 0x80cb048 "/usr/local/bacula/sbin/"
$4 = "MySQL"
$5 = 0x80b8280 "1.36.3 (22 April 2005)"
$6 = 0x80b8297 "i686-redhat-linux-gnu"
$7 = 0x80b82ad "mandrake"
$8 = 0x80b82b6 "for"
#0  0xe410 in ?? ()
#1  0xb728 in ?? ()
#2  0x0002 in ?? ()
#3  0xb7ec50fe in __lll_mutex_lock_wait () from /lib/tls/libpthread.so.0

Thread 13 (Thread -1213076560 (LWP 7887)):
#0  0xe410 in ?? ()
#1  0xb7b1ea88 in ?? ()
#2  0x in ?? ()
#3  0xb7b1deb0 in ?? ()
#4  0xb7d56991 in select () from /lib/tls/libc.so.6
#5  0x080898cc in bnet_thread_server (addrs=0x80cb808, max_clients=10, 
client_wq=0x80c9f00, handle_client_request=0x8076442 
) at bnet_server.c:154
#6  0x08076335 in connect_thread (arg=0x80cb808) at ua_server.c:79
#7  0xb7ec0b3c in start_thread () from /lib/tls/libpthread.so.0
#8  0xb7d5d93a in clone () from /lib/tls/libc.so.6

Thread 12 (Thread -1221469264 (LWP 7888)):
#0  0xe410 in ?? ()
#1  0xb731da08 in ?? ()
#2  0x0002 in ?? ()
#3  0xb7ec50fe in __lll_mutex_lock_wait () from /lib/tls/libpthread.so.0

Thread 11 (Thread -1280435280 (LWP 676)):
#0  0xe410 in ?? ()
#1  0xb3ae1128 in ?? ()
#2  0x0006 in ?? ()
#3  0x080a14db in wd_lock () at watchdog.c:305
#4  0x080a109a in register_watchdog (wd=0x80f6b50) at watchdog.c:180
#5  0x080a234c in start_bsock_timer (bsock=0x80f6ba8, wait=600) at 
btimers.c:166
#6  0x0804ca28 in authenticate_storage_daemon (jcr=0x8102198, store=0x80cc968) 
at authenticate.c:68
#7  0x0805e02a in connect_to_storage_daemon (jcr=0x8102198, retry_interval=10, 
max_retry_time=1800, verbose=1) at msgchan.c:89
#8  0x0804d9cb in do_backup (jcr=0x8102198) at backup.c:145
#9  0x08058dc5 in job_thread (arg=0x8102198) at job.c:215
#10 0x0805be4c in jobq_server (arg=0x80c9d80) at jobq.c:444
#11 0xb7ec0b3c in start_thread () from /lib/tls/libpthread.so.0
#12 0xb7d5d93a in clone () from /lib/tls/libc.

[Bacula-users] director crash - bug no 375

2005-12-15 Thread Steen . L . Meyer

FYI

I had some crashes - and then it is good to be able to find it in the bug
system, so you know how to avoid it again

In my case it was also everytime after a change in dir.conf, but I did not
change schedules, but added client, job and fileset.

Is it maybe a coincidence that the crash happened at the exact time when
the catalog job was scheduled?

13-Dec 23:00 adm-backup-sd: New volume "Full-0002" mounted on device
"FileStorage" (/home/bckp/data/ibsen) at 13-Dec-2005 23:00.
13-Dec 23:10 adm-backup-dir: Fatal Error because: Bacula interrupted by
signal 11: Segmentation violation

Cheers

Steen

-- 
Steen L Meyer - IT Manager -  Ibsen Photonics A/S
Ryttermarken 15 - 21, DK-3520 Farum, Denmark
Tel.: (+45) 44 34 70 00 - Fax.: (+45) 44 34 70 01
[EMAIL PROTECTED] - http://www.ibsenphotonics.com



---
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users