The traceback suggests some problem in the is_cleaning_tape function.

It might be useful to do

gdb /opt/bacula/bin/bacula-dir
disass is_cleaning_tape

__Martin


>>>>> On Sun, 20 May 2018 17:15:40 +0200, Rasmus Linden said:
> 
> Hey All !!
> 
> I have a strange issue, I hope some of you can help me with. I am currently 
> running bacula version: 9.0.7, recently migrated from mysql to postgres.
> Everything is working as expected, except when i try to table a tape and 
> stick it in our scratch pool, bacula-dir crashes with a bang !
> I have set the debug level to 10, and pasted the traceback.
> The last thing happening before the crash, according to the stack is a sql 
> query :
> 
> SELECT 
> MediaId,VolumeName,VolJobs,VolFiles,VolBlocks,VolBytes,VolABytes,VolHoleBytes,VolHoles,VolMounts,VolErrors,VolWrites,MaxVolBytes,VolCapacityBytes,MediaType,VolStatus,PoolId,VolRetention,VolUseDuration,MaxVolJ
> obs,MaxVolFiles,Recycle,Slot,FirstWritten,LastWritten,InChanger,EndFile,EndBlock,VolType,VolParts,VolCloudParts,LastPartBytes,LabelType,LabelDate,StorageId,Enabled,LocationId,RecycleCount,InitialWrite,ScratchPoolId,RecyclePoolId
> ,VolReadTime,VolWriteTime,ActionOnPurge,CacheRetention FROM Media WHERE 
> VolumeName=‘B00110L6'
> 
> I have tried running the query by hand, and it does not fail *screenshot from 
> pgadmin attached.
> 
> If i label the volume and stick it into a none scratch pool, it completes 
> with no error.
> 
> Any suggestion on how to fix is mostly appreciated !
> 
> All the best
> 
> Ras
> 
> 
> 
> BCONSOLE FAILING IN SCRATCH POOL:
> [root@technicos ~]# bconsole 
> Connecting to Director technicos:9101
> 1000 OK: 103 director.xxx.xxxVersion: 9.0.7 (18 April 2018)
> Enter a period to cancel a command.
> *label barcodes slots=3
> Automatically selected Catalog: MyCatalog
> Using Catalog "MyCatalog"
> The defined Storage resources are:
>      1: Autochanger
>      2: LTO6-01
> Select Storage resource (1-2): 1
> Connecting to Storage daemon Autochanger at technicos.xxx:9103 ...
> Connecting to Storage daemon Autochanger at technicos.xxx:9103 ...
> 3306 Issuing autochanger "slots" command.
> Device "Autochanger" has 48 slots.
> Connecting to Storage daemon Autochanger at technicos.xxx:9103 ...
> 3306 Issuing autochanger "list" command.
> The following Volumes will be labeled:
> Slot  Volume
> ==============
>    3  B00110L6
> Do you want to label these Volumes? (yes|no): yes
> Defined Pools:
>      1: Archive
>      2: Default
>      3: File
>      4: ProjectsArchive
>      5: Scratch
> Select the Pool (1-5): 5
> [root@technicos ~]# 
> 
> 
> CONSOLE SUCCESS WITH ANOTHER POOL:
> *label barcodes slots=3
> Automatically selected Catalog: MyCatalog
> Using Catalog "MyCatalog"
> The defined Storage resources are:
>      1: Autochanger
>      2: LTO6-01
> Select Storage resource (1-2): 1
> Connecting to Storage daemon Autochanger at technicos.xxx:9103 ...
> Connecting to Storage daemon Autochanger at technicos.xxx:9103 ...
> 3306 Issuing autochanger "slots" command.
> Device "Autochanger" has 48 slots.
> Connecting to Storage daemon Autochanger at technicos.xxx:9103 ...
> 3306 Issuing autochanger "list" command.
> The following Volumes will be labeled:
> Slot  Volume
> ==============
>    3  B00110L6
> Do you want to label these Volumes? (yes|no): yes
> Defined Pools:
>      1: Archive
>      2: Default
>      3: File
>      4: ProjectsArchive
>      5: Scratch
> Select the Pool (1-5): 4
> Connecting to Storage daemon Autochanger at technicos.xxx:9103 ...
> Sending label command for Volume "B00110L6" Slot 3 ...
> 3307 Issuing autochanger "unload Volume *Unknown*, Slot 2, Drive 0" command.
> 3304 Issuing autochanger "load Volume B00110L6, Slot 3, Drive 0" command.
> 3305 Autochanger "load Volume B00110L6, Slot 3, Drive 0", status is OK.
> 3000 OK label. VolBytes=64512 VolABytes=0 VolType=0 Volume="B00110L6" 
> Device="LTO6-01" (/dev/nst0)
> Catalog record for Volume "B00110L6", Slot 3  successfully created.
> *
> 
> 
> 
> TRACEBACK
>  more  /opt/bacula/working/bacula.16939.traceback
> [New LWP 23316]
> [New LWP 22394]
> [New LWP 16942]
> [New LWP 16941]
> [Thread debugging using libthread_db enabled]
> Using host libthread_db library "/lib64/libthread_db.so.1".
> 0x00007fb138e9ef3d in nanosleep () from /usr/lib64/libpthread.so.0
> $1 = "20-May-2018 16:50:22\000\000\000\000\000\000\000\000\000"
> $2 = '\000' <repeats 127 times>
> $3 = 0xae1048 "bacula-dir"
> $4 = 0xae1088 "/opt/bacula/bin/bacula-dir"
> $5 = 0x0
> $6 = "PostgreSQL", '\000' <repeats 39 times>
> $7 = 0x7fb1390feeee "9.0.7 (18 April 2018)"
> $8 = 0x7fb1390feecc "x86_64-pc-linux-gnu"
> $9 = 0x7fb1390feec5 "redhat"
> $10 = 0x7fb1390feee7 "(Core)"
> $11 = "technicos", '\000' <repeats 40 times>
> $12 = 0x7fb1390feee0 "redhat (Core)"
> Environment variable "TestName" not defined.
> #0  0x00007fb138e9ef3d in nanosleep () from /usr/lib64/libpthread.so.0
> #1  0x00007fb1390c0688 in bmicrosleep (sec=sec@entry=60, usec=usec@entry=0) 
> at bsys.c:178
> #2  0x0000000000437c77 in wait_for_next_job (one_shot_job_to_run=<optimized 
> out>) at scheduler.c:121
> #3  0x000000000040dde7 in main (argc=<optimized out>, argv=<optimized out>) 
> at dird.c:382
> 
> Thread 5 (Thread 0x7fb12e16e700 (LWP 16941)):
> #0  0x00007fb138194c73 in select () from /usr/lib64/libc.so.6
> #1  0x00007fb1390c3d08 in bnet_thread_server (addrs=addrs@entry=0xae5d98, 
> max_clients=<optimized out>, client_wq=client_wq@entry=0x6988c0 <ua_workq>, 
> handle_client_request=handle_client_request@entry=0x45a800 <handle_UA_client_r
> equest(void*)>) at bnet_server.c:166
> #2  0x000000000045a43c in connect_thread (arg=0xae5d98) at ua_server.c:72
> #3  0x00007fb138e97e25 in start_thread () from /usr/lib64/libpthread.so.0
> #4  0x00007fb13819dbad in clone () from /usr/lib64/libc.so.6
> 
> Thread 4 (Thread 0x7fb12d96d700 (LWP 16942)):
> #0  0x00007fb138e9bd42 in pthread_cond_timedwait@@GLIBC_2.3.2 () from 
> /usr/lib64/libpthread.so.0
> #1  0x00007fb1390f31b5 in watchdog_thread (arg=<optimized out>) at 
> watchdog.c:299
> #2  0x00007fb138e97e25 in start_thread () from /usr/lib64/libpthread.so.0
> #3  0x00007fb13819dbad in clone () from /usr/lib64/libc.so.6
> 
> Thread 3 (Thread 0x7fb12d16c700 (LWP 22394)):
> #0  0x00007fb138e9e7fd in read () from /usr/lib64/libpthread.so.0
> #1  0x00007fb1390c297e in read_nbytes (bsock=bsock@entry=0x7fb1280008e8, 
> ptr=ptr@entry=0x7fb12d16bcdc "", nbytes=nbytes@entry=4) at bnet.c:74
> #2  0x00007fb1390c6464 in BSOCK::recv (this=this@entry=0x7fb1280008e8) at 
> bsock.c:753
> #3  0x000000000045a92a in handle_UA_client_request (arg=0x7fb1280008e8) at 
> ua_server.c:131
> #4  0x00007fb1390f3a83 in workq_server (arg=0x6988c0 <ua_workq>) at 
> workq.c:372
> #5  0x00007fb138e97e25 in start_thread () from /usr/lib64/libpthread.so.0
> #6  0x00007fb13819dbad in clone () from /usr/lib64/libc.so.6
> 
> Thread 2 (Thread 0x7fb127565700 (LWP 23316)):
> #0  0x00007fb138e9f279 in waitpid () from /usr/lib64/libpthread.so.0
> #1  0x00007fb1390ea74a in signal_handler (sig=11) at signal.c:233
> #2  <signal handler called>
> #3  0x00007fb13820dff1 in __strlen_sse2_pminub () from /usr/lib64/libc.so.6
> #4  0x0000000000444dae in is_cleaning_tape (pr=0x7fb127563e80, 
> mr=0x7fb127564680, ua=0x7fb11c001f48) at ua_label.c:1005
> #5  label_from_barcodes (drive=0, ua=0x7fb11c001f48) at ua_label.c:583
> #6  do_label (ua=0x7fb11c001f48, relabel=<optimized out>, cmd=<optimized 
> out>) at ua_label.c:361
> #7  0x000000000043a145 in do_a_command (ua=0x7fb11c001f48) at ua_cmds.c:253
> #8  0x000000000045a8c3 in handle_UA_client_request (arg=0x7fb128003108) at 
> ua_server.c:138
> #9  0x00007fb1390f3a83 in workq_server (arg=0x6988c0 <ua_workq>) at 
> workq.c:372
> #10 0x00007fb138e97e25 in start_thread () from /usr/lib64/libpthread.so.0
> #11 0x00007fb13819dbad in clone () from /usr/lib64/libc.so.6
> 
> Thread 1 (Thread 0x7fb139f8c880 (LWP 16939)):
> #0  0x00007fb138e9ef3d in nanosleep () from /usr/lib64/libpthread.so.0
> #1  0x00007fb1390c0688 in bmicrosleep (sec=sec@entry=60, usec=usec@entry=0) 
> at bsys.c:178
> #2  0x0000000000437c77 in wait_for_next_job (one_shot_job_to_run=<optimized 
> out>) at scheduler.c:121
> #3  0x000000000040dde7 in main (argc=<optimized out>, argv=<optimized out>) 
> at dird.c:382
> #0  0x00007fb138e9ef3d in nanosleep () from /usr/lib64/libpthread.so.0
> No symbol table info available.
> #1  0x00007fb1390c0688 in bmicrosleep (sec=sec@entry=60, usec=usec@entry=0) 
> at bsys.c:178
> 178      stat = nanosleep(&timeout, NULL);
> timeout = {tv_sec = 60, tv_nsec = 0}
> tv = {tv_sec = 210453397560, tv_usec = 140399143188541}
> tz = {tz_minuteswest = 24, tz_dsttime = 0}
> stat = <optimized out>
> #2  0x0000000000437c77 in wait_for_next_job (one_shot_job_to_run=<optimized 
> out>) at scheduler.c:121
> 121         bmicrosleep(next_check_secs, 0); /* recheck once per minute */
> jcr = <optimized out>
> job = <optimized out>
> run = <optimized out>
> now = <optimized out>
> prev = <optimized out>
> first = false
> next_job = <optimized out>
> #3  0x000000000040dde7 in main (argc=<optimized out>, argv=<optimized out>) 
> at dird.c:382
> 382      while ( (jcr = wait_for_next_job(runjob)) ) {
> jcr = <optimized out>
> ch = <optimized out>
> no_signals = false
> uid = 0x0
> gid = 0x0
> mode = <optimized out>
> #0  0x0000000000000000 in ?? ()
> No symbol table info available.
> #0  0x0000000000000000 in ?? ()
> No symbol table info available.
> #0  0x0000000000000000 in ?? ()
> No symbol table info available.
> #0  0x0000000000000000 in ?? ()
> No symbol table info available.
> Attempt to dump current JCRs. njcrs=3
> threadid=0x7fb139f8c880 JobId=0 JobStatus=R jcr=0xaeb758 
> name=*JobMonitor*.2018-05-20_16.46.56_01
>       use_count=1 killable=0
>       JobType=I JobLevel= 
>       sched_time=20-May-2018 16:46 start_time=20-May-2018 16:46
>       end_time=01-Jan-1970 01:00 wait_time=01-Jan-1970 01:00
>       db=(nil) db_batch=(nil) batch_started=0
>       wstore=0xae8368 rstore=(nil) wjcr=(nil) client=0xae7918 
> reschedule_count=0 SD_msg_chan_started=0
> threadid=0x7fb12d16c700 JobId=0 JobStatus=R jcr=0x7fb1200008e8 
> name=-Console-.2018-05-20_16.49.43_03
>       use_count=1 killable=0
>       JobType=U JobLevel= 
>       sched_time=20-May-2018 16:49 start_time=20-May-2018 16:49
>       end_time=01-Jan-1970 01:00 wait_time=01-Jan-1970 01:00
>       db=0x7fb120002618 db_batch=(nil) batch_started=0
>       wstore=0xae8368 rstore=(nil) wjcr=(nil) client=0xae7918 
> reschedule_count=0 SD_msg_chan_started=0
> BDB=0x7fb120002618 db_name=bacula db_user=bacula connected=true
>       cmd="sql_get.c:1213 Media record for Volume name "B00112L6" not found.
> " changes=0
>       RWLOCK=0x7fb120002630 w_active=0 w_wait=0
> threadid=0x7fb127565700 JobId=0 JobStatus=R jcr=0x7fb11c0008e8 
> name=-Console-.2018-05-20_16.50.10_04
>       use_count=1 killable=0
>       JobType=U JobLevel= 
>       sched_time=20-May-2018 16:50 start_time=20-May-2018 16:50
>       end_time=01-Jan-1970 01:00 wait_time=01-Jan-1970 01:00
>       db=0x7fb120002e58 db_batch=(nil) batch_started=0
>       wstore=0xae8368 rstore=(nil) wjcr=(nil) client=0xae7918 
> reschedule_count=0 SD_msg_chan_started=0
> BDB=0x7fb120002e58 db_name=bacula db_user=bacula connected=true
>       cmd="SELECT 
> MediaId,VolumeName,VolJobs,VolFiles,VolBlocks,VolBytes,VolABytes,VolHoleBytes,VolHoles,VolMounts,VolErrors,VolWrites,MaxVolBytes,VolCapacityBytes,MediaType,VolStatus,PoolId,VolRetention,VolUseDuration,MaxVolJ
> obs,MaxVolFiles,Recycle,Slot,FirstWritten,LastWritten,InChanger,EndFile,EndBlock,VolType,VolParts,VolCloudParts,LastPartBytes,LabelType,LabelDate,StorageId,Enabled,LocationId,RecycleCount,InitialWrite,ScratchPoolId,RecyclePoolId
> ,VolReadTime,VolWriteTime,ActionOnPurge,CacheRetention FROM Media WHERE 
> VolumeName='B00110L6'" changes=0
>       RWLOCK=0x7fb120002e70 w_active=0 w_wait=0
> List plugins. Hook count=0
> 
> 
> 
> 
> 

------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to