HI All.

We've been having some problems the past couple days with one of our Samba Servers. We currently have it running two instances. One instance is using local auth (historical) the other instances is using our newish domain for authentication. We're slowly migrating users over to the domain so having both methods available was the solution we came up with.

Yesterday we had a crash and couldn't remotely login to the server. Our data partition is VERY near capacity, but the system and var partitions are fine. After a reboot some strange issues with the smb, nmb and winbind processes.. couldn't stop/start them properly. kept complaining they were already running. The domain users could access their drives, but not print... and the non domain users couldn't access their drives, but could print. hah.

After another reboot and a fsck, we started up the local auth instance of samba and seemed ok. and started up the domain auth instance and things seems to be working. Over the course of the night about 1900 smb processes had built up and the load avg on the server was through the roof. One user alone had 1100 of those processes.

In the logs there was a samba panic and internal error (more details and backtrace to follow)

We're unfortunately running an older version of samba, on an older distro of SLES... but everything was working fine for almost the past year.

SLES 10.1
kernel 2.6.16.46-0.12-smp
samba-3.0.36-7.1

smb.conf
[global]
#       client code page = 437
       interfaces = [ip removed].10/24
       bind interfaces only = true
       workgroup = WORKGROUP
       netbios name = smbserver
       netbios aliases = CBNsmbserver
       server string = smbserver
       encrypt passwords = Yes
       map to guest = Bad User
       passwd program = /usr/bin/passwd
       log file = /var/log/samba/log.smbd
       lock directory = /var/lib/samba
       time server = Yes
       kernel oplocks = No
       oplock break wait time = 20
       deadtime = 10
       keepalive = 30
       admin users = george
       os level = 34
       local master = Yes
       preferred master = Yes
# domain master for WORKGROUP
       domain master = Yes
# The only WINS server
       wins support = Yes
# Local master browsers on the different subnets plus locale1 domain master
       remote browse sync = [ips removed]
       remote announce = [ip removed]/WORKGROUP [ip removed]/PHONE
       unix extensions = Yes
       log level = 2
       max log size = 1048576
       syslog = 0
       printing = CUPS
       printcap name = CUPS
       use client driver = Yes
socket options = SO_KEEPALIVE IPTOS_LOWDELAY TCP_NODELAY
#       veto files = /*.eml/*.nws/riched20.dll/*.{*}-
       browseable = Yes
       writeable = Yes
       create mask = 0600
       directory mask = 0700
       locking = Yes

smb.ad.conf  [for domain auth instance]
[global]
       realm = dc.blah
       security = ads
       winbind separator = ^
       winbind nested groups = yes
       winbind enum groups = yes
       winbind enum users = yes
       password server = domaincont.dc.blah
#       client code page = 437
       interfaces = [ip removed].11/24
       bind interfaces only = true
       workgroup = dc
       netbios name = smbserverAD
       netbios aliases = smbserverAD
       server string = smbserverAD
       encrypt passwords = Yes
       map to guest = Bad User
       passwd program = /usr/bin/passwd
       log file = /var/log/samba/log.ad.smbd
       lock directory = /var/lib/samba.ad
       time server = Yes
       kernel oplocks = No
       oplock break wait time = 20
       deadtime = 10
       keepalive = 30
       admin users =  %D^george
       os level = 34
       local master = Yes
       preferred master = No
# domain master for WORKGROUP
       domain master = Yes
# The only WINS server
       wins support = Yes
# Local master browsers on the different subnets plus locale1 domain master
       remote browse sync = [ips removed]
       remote announce = [ip removed]/WORKGROUP [ip removed]/PHONE
unix extensions = Yes log level = 2
       max log size = 1048576
       syslog = 0
       printing = CUPS
       printcap name = CUPS
       use client driver = Yes
       socket options = SO_KEEPALIVE IPTOS_LOWDELAY TCP_NODELAY
#       veto files = /*.eml/*.nws/riched20.dll/*.{*}-
       browseable = Yes
       writeable = Yes
       create mask = 0600
       directory mask = 0700
       locking = Yes


Here's a snippit of the logs from the panic.

Apr 15 08:00:11 shalimar smbd[28735]: [2011/04/15 08:00:11, 0] printing/printing_db.c:get_print_db_byname(40) Apr 15 08:00:11 shalimar smbd[28735]: PANIC: assert failed at printing/printing_db.c(40) Apr 15 08:00:11 shalimar smbd[28735]: [2011/04/15 08:00:11, 0] lib/substitute.c:alloc_sub_basic(463) Apr 15 08:00:11 shalimar smbd[28735]: alloc_sub_basic: NULL source string! This should not happen Apr 15 08:00:11 shalimar smbd[28735]: [2011/04/15 08:00:11, 0] lib/fault.c:fault_report(41) Apr 15 08:00:11 shalimar smbd[28735]: =============================================================== Apr 15 08:00:11 shalimar smbd[28735]: [2011/04/15 08:00:11, 0] lib/fault.c:fault_report(42) Apr 15 08:00:11 shalimar smbd[28735]: INTERNAL ERROR: Signal 11 in pid 28735 (3.0.36-7.1-2365-SUSE-CODE10) Apr 15 08:00:11 shalimar smbd[28735]: Please read the Trouble-Shooting section of the Samba3-HOWTO Apr 15 08:00:11 shalimar smbd[28735]: [2011/04/15 08:00:11, 0] lib/fault.c:fault_report(44)
Apr 15 08:00:11 shalimar smbd[28735]:
Apr 15 08:00:11 shalimar smbd[28735]: From: http://www.samba.org/samba/docs/Samba3-HOWTO.pdf Apr 15 08:00:11 shalimar smbd[28735]: [2011/04/15 08:00:11, 0] lib/fault.c:fault_report(45) Apr 15 08:00:11 shalimar smbd[28735]: =============================================================== Apr 15 08:00:11 shalimar smbd[28735]: [2011/04/15 08:00:11, 0] lib/util.c:smb_panic(1633)
Apr 15 08:00:11 shalimar smbd[28735]:   PANIC (pid 28735): internal error
Apr 15 08:00:19 shalimar smbd[28735]:   BACKTRACE: 17 stack frames:
Apr 15 08:00:19 shalimar smbd[28735]: #0 /usr/sbin/smbd(log_stack_trace+0x1c) [0x555555773f8c] Apr 15 08:00:19 shalimar smbd[28735]: #1 /usr/sbin/smbd(smb_panic+0x41) [0x555555774081]
Apr 15 08:00:19 shalimar smbd[28735]:    #2 /usr/sbin/smbd [0x555555761ea2]
Apr 15 08:00:20 shalimar smbd[28735]: #3 /lib64/libc.so.6 [0x2b2fdfbadc10] Apr 15 08:00:20 shalimar smbd[28735]: #4 /usr/sbin/smbd(Get_Pwnam_alloc+0x2b) [0x55555576693b] Apr 15 08:00:20 shalimar smbd[28735]: #5 /usr/sbin/smbd(Get_Pwnam+0xb) [0x555555766c9b] Apr 15 08:00:20 shalimar smbd[28735]: #6 /usr/sbin/smbd(get_user_home_dir+0x9) [0x555555766cd9]
Apr 15 08:00:20 shalimar smbd[28735]:    #7 /usr/sbin/smbd [0x555555779fb4]
Apr 15 08:00:20 shalimar smbd[28735]: #8 /usr/sbin/smbd(standard_sub_advanced+0x26) [0x55555577a866]
Apr 15 08:00:20 shalimar smbd[28735]:    #9 /usr/sbin/smbd [0x555555794d0e]
Apr 15 08:00:20 shalimar smbd[28735]: #10 /usr/sbin/smbd(print_queue_status+0x38a) [0x55555579547a] Apr 15 08:00:20 shalimar smbd[28735]: #11 /usr/sbin/smbd(update_monitored_printq_cache+0x43) [0x5555556a1293]
Apr 15 08:00:20 shalimar smbd[28735]:    #12 /usr/sbin/smbd [0x555555628a6b]
Apr 15 08:00:20 shalimar smbd[28735]: #13 /usr/sbin/smbd(smbd_process+0x430) [0x555555629640] Apr 15 08:00:20 shalimar smbd[28735]: #14 /usr/sbin/smbd(main+0x1203) [0x555555828903] Apr 15 08:00:20 shalimar smbd[28735]: #15 /lib64/libc.so.6(__libc_start_main+0xf4) [0x2b2fdfb9b154]
Apr 15 08:00:20 shalimar smbd[28735]:    #16 /usr/sbin/smbd [0x5555555bc779]
Apr 15 08:00:20 shalimar smbd[28735]: [2011/04/15 08:00:20, 0] lib/fault.c:dump_core(181) Apr 15 08:00:20 shalimar smbd[28735]: dumping core in /var/log/samba/cores/smbd
Apr 15 08:00:20 shalimar smbd[28735]:



--
Matt Ingram
Intermediate Unix Administrator, IS
Canadian Bank Note Company, Limited
\m/

--
To unsubscribe from this list go to the following URL and read the
instructions:  https://lists.samba.org/mailman/options/samba

Reply via email to