Re: [Samba] Signal 11 in smbd 3.0.2rc2 on printer operation!

2004-02-10 Thread Jerry Haltom
Follow up on this. I have SAMBA_3_0 from CVS, checked out about 4 days
ago (last time we discussed this). I have been able to cause hte
problem, or another similar problem, im not sure, to be reproduced.

What I was doing was adding a new printer, by editing smb.conf, adding
the new printer entry, and then SIGHUPping all smbd processes. After
wards, the new printer began working, but after printing the first page,
all printers immediatly ceased working. The following messages in no
particular order are now appearing in my logs:

tdb(/opt/samba/var/locks/printing/pdfwriter.tdb): tdb_lock failed on
list 4128 ltype=0 (Bad file descriptor)

and

[2004/02/10 13:27:44, 0] tdb/tdbutil.c:tdb_log(724)
  tdb(/opt/samba/var/locks/printing/hpljet8100n.tdb): tdb_oob len
842018636 beyond eof at 40960

Perhaps the action of adding a new printer and SIGHUPping smbd is
causing these problems? I will once again attempt to fix this once
people leave the office by clearing all the .tdb files and restarting
samba.

=)

On Thu, 2004-02-05 at 11:29, Jerry Haltom wrote:
 Nope. In fact, I do suspect bad ram. I don't know why it would be
 consistant though. I would think that bad ram would effect more than
 just samba's printing. And you'd think reacreating the process
 (different location in memory) would get around it. The problem is
 pretty consistant, it's happening for every computer in the office now.
 
 On Thu, 2004-02-05 at 10:08, Gerald (Jerry) Carter wrote:
  -BEGIN PGP SIGNED MESSAGE-
  Hash: SHA1
  
  Jerry Haltom wrote:
  |
  | Could a corrupt .tdb of any sort be causing this? It
  | is touching a variable named pdb (maybe printer db?),
  | but I'm not sure what that is.
  
  It could I guess, but I agree with Jeremy.  That's a
  strange place to crash.  How many machines did you
  see this on?  If only one, have you ruled out bad RAM ?
  
  On Wed, 2004-02-04 at 19:08, Jeremy Allison wrote:
  
  | #7  0x0820b54b in print_queue_status (snum=1, ppqueue=0xb4d0,
  |status=0xb4d4) at printing/printing.c:2283
  | keystr = STATUS/hpljet8100n\0\0\001\0\0\0 öÿ¿ [EMAIL PROTECTED]\bò\006
  [EMAIL 
  PROTECTED]:´\0\0\0\0\0\0\0\0Döÿ¿,ôÿ¿Q0\026\b\004\0\0\0Döÿ¿\024\0\0\0\0\0\0\0\001\0\0\0\0\0\0\0\004\0\0\0\0\0\0\0\002\0\0\08U=\bLôÿ¿\0361\026\b\220\026=\bDöÿ¿xôÿ¿\004\0\0\0\0\0\0\0\001\0\0\0|ôÿ¿ð\214!\b\001\0\0\0Döÿ¿xôÿ¿,L\023\b\0\0\0\0\224ýÿ¿\214ôÿ¿Kð\017\b...
  | data = {dptr = 0x0, dsize = 264}
  | key = {dptr = 0xb3ac STATUS/hpljet8100n, dsize = 18}
  | printername = 0x82f6c40 hpljet8100n
  | pdb = (struct tdb_print_db *) 0x83d4ac8
  | count = 0
  |
  | This is a strange crash location. It appears to be in the call here :
  |
  |if (!get_stored_queue_info(pdb, snum, count, ppqueue)) {
  |release_print_db(pdb);
  |return 0;
  |
  | I don't immediately see any bad pointers etc. in this call...
  
  
  
  
  cheers, jerry
  -BEGIN PGP SIGNATURE-
  Version: GnuPG v1.2.1 (GNU/Linux)
  Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
  
  iD8DBQFAImqUIR7qMdg1EfYRAikcAJ43erXruQhRuLE7arSDsDUROwnVuQCgyLwU
  8T0AGaPERFSb0WPYVczJEKE=
  =G3Fk
  -END PGP SIGNATURE-
-- 
Jerry Haltom [EMAIL PROTECTED]
Feedback Plus, Inc.


signature.asc
Description: This is a digitally signed message part
-- 
To unsubscribe from this list go to the following URL and read the
instructions:  http://lists.samba.org/mailman/listinfo/samba

Re: [Samba] Signal 11 in smbd 3.0.2rc2 on printer operation!

2004-02-05 Thread Gerald (Jerry) Carter
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Jerry Haltom wrote:
|
| Could a corrupt .tdb of any sort be causing this? It
| is touching a variable named pdb (maybe printer db?),
| but I'm not sure what that is.
It could I guess, but I agree with Jeremy.  That's a
strange place to crash.  How many machines did you
see this on?  If only one, have you ruled out bad RAM ?
On Wed, 2004-02-04 at 19:08, Jeremy Allison wrote:

| #7  0x0820b54b in print_queue_status (snum=1, ppqueue=0xb4d0,
|status=0xb4d4) at printing/printing.c:2283
|   keystr = STATUS/hpljet8100n\0\0\001\0\0\0 öÿ¿ [EMAIL PROTECTED]\bò\006
[EMAIL 
PROTECTED]:´\0\0\0\0\0\0\0\0Döÿ¿,ôÿ¿Q0\026\b\004\0\0\0Döÿ¿\024\0\0\0\0\0\0\0\001\0\0\0\0\0\0\0\004\0\0\0\0\0\0\0\002\0\0\08U=\bLôÿ¿\0361\026\b\220\026=\bDöÿ¿xôÿ¿\004\0\0\0\0\0\0\0\001\0\0\0|ôÿ¿ð\214!\b\001\0\0\0Döÿ¿xôÿ¿,L\023\b\0\0\0\0\224ýÿ¿\214ôÿ¿Kð\017\b...
|   data = {dptr = 0x0, dsize = 264}
|   key = {dptr = 0xb3ac STATUS/hpljet8100n, dsize = 18}
|   printername = 0x82f6c40 hpljet8100n
|   pdb = (struct tdb_print_db *) 0x83d4ac8
|   count = 0
|
| This is a strange crash location. It appears to be in the call here :
|
|if (!get_stored_queue_info(pdb, snum, count, ppqueue)) {
|release_print_db(pdb);
|return 0;
|
| I don't immediately see any bad pointers etc. in this call...


cheers, jerry
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.2.1 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iD8DBQFAImqUIR7qMdg1EfYRAikcAJ43erXruQhRuLE7arSDsDUROwnVuQCgyLwU
8T0AGaPERFSb0WPYVczJEKE=
=G3Fk
-END PGP SIGNATURE-
--
To unsubscribe from this list go to the following URL and read the
instructions:  http://lists.samba.org/mailman/listinfo/samba


Re: [Samba] Signal 11 in smbd 3.0.2rc2 on printer operation!

2004-02-05 Thread Jerry Haltom
Nope. In fact, I do suspect bad ram. I don't know why it would be
consistant though. I would think that bad ram would effect more than
just samba's printing. And you'd think reacreating the process
(different location in memory) would get around it. The problem is
pretty consistant, it's happening for every computer in the office now.

On Thu, 2004-02-05 at 10:08, Gerald (Jerry) Carter wrote:
 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1
 
 Jerry Haltom wrote:
 |
 | Could a corrupt .tdb of any sort be causing this? It
 | is touching a variable named pdb (maybe printer db?),
 | but I'm not sure what that is.
 
 It could I guess, but I agree with Jeremy.  That's a
 strange place to crash.  How many machines did you
 see this on?  If only one, have you ruled out bad RAM ?
 
 On Wed, 2004-02-04 at 19:08, Jeremy Allison wrote:
 
 | #7  0x0820b54b in print_queue_status (snum=1, ppqueue=0xb4d0,
 |status=0xb4d4) at printing/printing.c:2283
 |   keystr = STATUS/hpljet8100n\0\0\001\0\0\0 öÿ¿ [EMAIL PROTECTED]\bò\006
 [EMAIL 
 PROTECTED]:´\0\0\0\0\0\0\0\0Döÿ¿,ôÿ¿Q0\026\b\004\0\0\0Döÿ¿\024\0\0\0\0\0\0\0\001\0\0\0\0\0\0\0\004\0\0\0\0\0\0\0\002\0\0\08U=\bLôÿ¿\0361\026\b\220\026=\bDöÿ¿xôÿ¿\004\0\0\0\0\0\0\0\001\0\0\0|ôÿ¿ð\214!\b\001\0\0\0Döÿ¿xôÿ¿,L\023\b\0\0\0\0\224ýÿ¿\214ôÿ¿Kð\017\b...
 |   data = {dptr = 0x0, dsize = 264}
 |   key = {dptr = 0xb3ac STATUS/hpljet8100n, dsize = 18}
 |   printername = 0x82f6c40 hpljet8100n
 |   pdb = (struct tdb_print_db *) 0x83d4ac8
 |   count = 0
 |
 | This is a strange crash location. It appears to be in the call here :
 |
 |if (!get_stored_queue_info(pdb, snum, count, ppqueue)) {
 |release_print_db(pdb);
 |return 0;
 |
 | I don't immediately see any bad pointers etc. in this call...
 
 
 
 
 cheers, jerry
 -BEGIN PGP SIGNATURE-
 Version: GnuPG v1.2.1 (GNU/Linux)
 Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
 
 iD8DBQFAImqUIR7qMdg1EfYRAikcAJ43erXruQhRuLE7arSDsDUROwnVuQCgyLwU
 8T0AGaPERFSb0WPYVczJEKE=
 =G3Fk
 -END PGP SIGNATURE-
-- 
Jerry Haltom [EMAIL PROTECTED]
Feedback Plus, Inc.


signature.asc
Description: This is a digitally signed message part
-- 
To unsubscribe from this list go to the following URL and read the
instructions:  http://lists.samba.org/mailman/listinfo/samba

Re: [Samba] Signal 11 in smbd 3.0.2rc2 on printer operation!

2004-02-05 Thread Jeremy Allison
On Thu, Feb 05, 2004 at 11:29:16AM -0600, Jerry Haltom wrote:
 Nope. In fact, I do suspect bad ram. I don't know why it would be
 consistant though. I would think that bad ram would effect more than
 just samba's printing. And you'd think reacreating the process
 (different location in memory) would get around it. The problem is
 pretty consistant, it's happening for every computer in the office now.

A corrupt tdb wouldn't cause a failure in the place you sent in.

I'd use memcheck to look for bad ram.

Jeremy.
-- 
To unsubscribe from this list go to the following URL and read the
instructions:  http://lists.samba.org/mailman/listinfo/samba


Re: [Samba] Signal 11 in smbd 3.0.2rc2 on printer operation!

2004-02-04 Thread Jeremy Allison
On Wed, Feb 04, 2004 at 05:35:11PM -0600, Jerry Haltom wrote:
 Our computers in our office ceased being able to print to a shared cups
 printer today. Could not find the cause. Seems the smbd processes are
 dying. Below is the stack trace produced by gdb bt full.
 
 
 
 The Samba 'panic action' script, /usr/share/samba/panic-action,
 was called for pid 5839 (/usr/opt/samba-3.0.2rc2/sbin/smbd).
 
 0x40150a59 in wait4 () from /lib/libc.so.6
 #0  0x40150a59 in wait4 () from /lib/libc.so.6
 No symbol table info available.
 #1  0x401c7e48 in __check_rhosts_file () from /lib/libc.so.6
 No symbol table info available.
 #2  0x400f57c5 in system () from /lib/libc.so.6
 No symbol table info available.
 #3  0x081e7625 in smb_panic (why=0x82af4fe internal error) at lib/util.c:1391
   cmd = 0x83ec268 /usr/share/samba/panic-action 5839
   result = 1074632376
   backtrace_stack = {0x0, 0xbfffeee0, 0xbfffefd4, 0x81d174f, 0x82af3e2, 
   0x82e43e0, 0x0, 0xbfffef08, 0x400c9319, 0xb628, 0xbfffefd4, 0x81d1757, 
   0xb51c, 0x4e4ec24c, 0xb55c, 0x8299000, 0x0, 0x400fa582, 0x82f6c4b, 
   0x83d4ae3, 0xbfffef3c, 0x81dd474, 0x83d4ad8, 0x82f6c40, 0x0, 0x0, 0x0, 0x0, 
   0xb36c, 0x821a08b, 0x83d4ad8, 0x82f6c40, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 
   0x0, 0x0, 0xb598, 0x81d4704, 0x1c, 0x84038d4, 0x68, 0x0, 0x83d4c40, 0x0, 
   0xbfffefcc, 0x81eafe1, 0x400c9319, 0x401c7e48, 0xb0fc, 0x401c7e48, 
   0xb628, 0x0, 0xbd94, 0x4011e1d6, 0x401c7e48, 0x401c56a0, 0x0, 0x0, 
   0x8200b20, 0x400}
   backtrace_size = 138426576
   backtrace_strings = (char **) 0x1c
 #4  0x081d1951 in fault_report (sig=11) at lib/fault.c:41
   counter = 1
 #5  0x081d19ba in sig_fault (sig=11) at lib/fault.c:61
 No locals.
 #6  0x400d96b8 in sigaction () from /lib/libc.so.6
 No symbol table info available.
 #7  0x0820b54b in print_queue_status (snum=1, ppqueue=0xb4d0, 
 status=0xb4d4) at printing/printing.c:2283
   keystr = STATUS/hpljet8100n\0\0\001\0\0\0 öÿ¿ [EMAIL PROTECTED]\bò\006 
 [EMAIL 
 PROTECTED]:´\0\0\0\0\0\0\0\0Döÿ¿,ôÿ¿Q0\026\b\004\0\0\0Döÿ¿\024\0\0\0\0\0\0\0\001\0\0\0\0\0\0\0\004\0\0\0\0\0\0\0\002\0\0\08U=\bLôÿ¿\0361\026\b\220\026=\bDöÿ¿xôÿ¿\004\0\0\0\0\0\0\0\001\0\0\0|ôÿ¿ð\214!\b\001\0\0\0Döÿ¿xôÿ¿,L\023\b\0\0\0\0\224ýÿ¿\214ôÿ¿Kð\017\b...
   data = {dptr = 0x0, dsize = 264}
   key = {dptr = 0xb3ac STATUS/hpljet8100n, dsize = 18}
   printername = 0x82f6c40 hpljet8100n
   pdb = (struct tdb_print_db *) 0x83d4ac8
   count = 0

This is a strange crash location. It appears to be in the call here :

if (!get_stored_queue_info(pdb, snum, count, ppqueue)) {
release_print_db(pdb);
return 0;

I don't immediately see any bad pointers etc. in this call...

Jeremy.
-- 
To unsubscribe from this list go to the following URL and read the
instructions:  http://lists.samba.org/mailman/listinfo/samba


Re: [Samba] Signal 11 in smbd 3.0.2rc2 on printer operation!

2004-02-04 Thread Jerry Haltom
I don't know enough about what is going on to know if what I'm saying
makes any sense. :) But I'm working on it.

Could a corrupt .tdb of any sort be causing this? It is touching a
variable named pdb (maybe printer db?), but I'm not sure what that is.

Since I had the problem I have deleted all of my tbds and recreated
necessary data. However, I did it as part of a few troubleshooting
steps, and didn't check 100% to make sure my changes fixed it.

It appears to be fixed on the two systems I can access from home to test
from, however, our office has 25 more systems, which I can't get from
home to test with. I will follow up on this if the problem is resolved.

If somebody could confirm a corrupted tdb could be responsible for this,
I won't pursue it any further unless it resurfaces. Thanks!

On Wed, 2004-02-04 at 19:08, Jeremy Allison wrote:
 On Wed, Feb 04, 2004 at 05:35:11PM -0600, Jerry Haltom wrote:
  Our computers in our office ceased being able to print to a shared cups
  printer today. Could not find the cause. Seems the smbd processes are
  dying. Below is the stack trace produced by gdb bt full.
  
  
  
  The Samba 'panic action' script, /usr/share/samba/panic-action,
  was called for pid 5839 (/usr/opt/samba-3.0.2rc2/sbin/smbd).
  
  0x40150a59 in wait4 () from /lib/libc.so.6
  #0  0x40150a59 in wait4 () from /lib/libc.so.6
  No symbol table info available.
  #1  0x401c7e48 in __check_rhosts_file () from /lib/libc.so.6
  No symbol table info available.
  #2  0x400f57c5 in system () from /lib/libc.so.6
  No symbol table info available.
  #3  0x081e7625 in smb_panic (why=0x82af4fe internal error) at lib/util.c:1391
  cmd = 0x83ec268 /usr/share/samba/panic-action 5839
  result = 1074632376
  backtrace_stack = {0x0, 0xbfffeee0, 0xbfffefd4, 0x81d174f, 0x82af3e2, 
0x82e43e0, 0x0, 0xbfffef08, 0x400c9319, 0xb628, 0xbfffefd4, 0x81d1757, 
0xb51c, 0x4e4ec24c, 0xb55c, 0x8299000, 0x0, 0x400fa582, 0x82f6c4b, 
0x83d4ae3, 0xbfffef3c, 0x81dd474, 0x83d4ad8, 0x82f6c40, 0x0, 0x0, 0x0, 0x0, 
0xb36c, 0x821a08b, 0x83d4ad8, 0x82f6c40, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 
0x0, 0x0, 0xb598, 0x81d4704, 0x1c, 0x84038d4, 0x68, 0x0, 0x83d4c40, 0x0, 
0xbfffefcc, 0x81eafe1, 0x400c9319, 0x401c7e48, 0xb0fc, 0x401c7e48, 
0xb628, 0x0, 0xbd94, 0x4011e1d6, 0x401c7e48, 0x401c56a0, 0x0, 0x0, 
0x8200b20, 0x400}
  backtrace_size = 138426576
  backtrace_strings = (char **) 0x1c
  #4  0x081d1951 in fault_report (sig=11) at lib/fault.c:41
  counter = 1
  #5  0x081d19ba in sig_fault (sig=11) at lib/fault.c:61
  No locals.
  #6  0x400d96b8 in sigaction () from /lib/libc.so.6
  No symbol table info available.
  #7  0x0820b54b in print_queue_status (snum=1, ppqueue=0xb4d0, 
  status=0xb4d4) at printing/printing.c:2283
  keystr = STATUS/hpljet8100n\0\0\001\0\0\0 öÿ¿ [EMAIL PROTECTED]\bò\006 
  [EMAIL 
  PROTECTED]:´\0\0\0\0\0\0\0\0Döÿ¿,ôÿ¿Q0\026\b\004\0\0\0Döÿ¿\024\0\0\0\0\0\0\0\001\0\0\0\0\0\0\0\004\0\0\0\0\0\0\0\002\0\0\08U=\bLôÿ¿\0361\026\b\220\026=\bDöÿ¿xôÿ¿\004\0\0\0\0\0\0\0\001\0\0\0|ôÿ¿ð\214!\b\001\0\0\0Döÿ¿xôÿ¿,L\023\b\0\0\0\0\224ýÿ¿\214ôÿ¿Kð\017\b...
  data = {dptr = 0x0, dsize = 264}
  key = {dptr = 0xb3ac STATUS/hpljet8100n, dsize = 18}
  printername = 0x82f6c40 hpljet8100n
  pdb = (struct tdb_print_db *) 0x83d4ac8
  count = 0
 
 This is a strange crash location. It appears to be in the call here :
 
 if (!get_stored_queue_info(pdb, snum, count, ppqueue)) {
 release_print_db(pdb);
 return 0;
 
 I don't immediately see any bad pointers etc. in this call...
 
 Jeremy.

-- 
To unsubscribe from this list go to the following URL and read the
instructions:  http://lists.samba.org/mailman/listinfo/samba