Re: [Samba] Overloaded samba server. Is it a bug? (but not a samba bug)
Well. Finally we resolve it. The problem was with the QLA driver, we applied a kernel patch (kernel-bigsmp-2.6.5-7.234.i586.rpm) provided by SuSE support and it is working fine. The patch will be provided soon in next SLES9 Support Pack 3. Anyway, thanks all of you for your help! Martín On Nov 04, 2005 01:36 PM, Jeremy Allison <[EMAIL PROTECTED]> wrote: > On Fri, Nov 04, 2005 at 10:51:52AM -0300, Martin wrote: > > > > How could we find it out? How could we get enough debugging level to > > reach > > this information? > > > > When the smbd proccess stopped in D state the strace does not show > > any line... > > Attach to it with gdb and type "bt". > > Jeremy. > -- To unsubscribe from this list go to the following URL and read the instructions: https://lists.samba.org/mailman/listinfo/samba
Re: [Samba] Overloaded samba server. Is it a bug?
On Nov 03, 2005 03:18 PM, Jeremy Allison <[EMAIL PROTECTED]> wrote: > On Thu, Nov 03, 2005 at 05:16:49AM -0300, Martin wrote: > > Roger, > > > > On Thursday 03 November 2005 03:22, Roger Eisenecher wrote: > > > Hi all > > > > > > Martin schrieb: > > > > 1TB with reiserfs in LVM > > > > > > We have a similar installation: Kernel 2.6.5-7.201-smp (the > > > official > > > kernel of SuSE 9.1 Professional) and we are using openldap and > > > reiserfs > > > too. Additonally we are using quota on the filesystem. Our server > > > hangs > > > often in this situation with a load of 350!!! The interesting part > > > is > > > that the cpu's are 92% idle. If we deactivate the quota subsystem > > > the > > > server will work for a longer time, but it could also happen that > > > the > > > load reaches 350... Only a reboot will solve this problem... > > This is exacltly our same sympthom. > > We have already disable the quota without success. Still got the > > problem. > > > > > > > Martin: Which kernel are you using? Do you use quota on your > > > filesystem? > > > > > > > This is a SLES9 running > > kernel-bigsmp-2.6.5-7.201.i586 > > > > We had also had problems with later version > > kernel-bigsmp-2.6.5-7.193.i586 > > > > Note: We decided to run 32bits kernel on the EM64T Intel platform. > > Can you reproduce this problem on a different filesystem than > Reiser ? I'm trying to narrow down the problem here. Nop. It's quite difficult with 1200 users using it. > > Jeremy. > -- To unsubscribe from this list go to the following URL and read the instructions: https://lists.samba.org/mailman/listinfo/samba
Re: [Samba] Overloaded samba server. Is it a bug?
On Wednesday 02 November 2005 19:50, Jeremy Allison wrote: > On Wed, Nov 02, 2005 at 06:53:36PM -0300, Martin wrote: > > #> strace -f -p > > > > RDONLY|O_NONBLOCK|O_LARGEFILE|O_DIRECTORY) = 18 [ ... ] > > 2005/INVERSION", {st_mode=S_IFDIR|0770, st_size=128, ...}) = 0 > > stat64("Estructura_Central/marketing/Medios/Victor/insitucional > > 2005/INVERSION/cao 2.xls", 0xbfffcec0) = -1 ENOENT (No such file or > > directory) > > stat64("Estructura_Central/marketing/Medios/Victor/insitucional > > 2005/INVERSION/cao 2.xls", 0xbfffcec0) = -1 ENOENT (No such file or > > directory) > > What filesystem is this ? 1TB with reiserfs in LVM -- Mrtn -- To unsubscribe from this list go to the following URL and read the instructions: https://lists.samba.org/mailman/listinfo/samba
Re: [Samba] Group membership limitation
On Fri Oct 28 14:44:02 GMT 2005 Bruno Guerreiro wrote: > I've made this question over and over, but still no answer till now. > So here goes again, maybe I have better luck this time. > Is there any limitation to the number of groups a samba user may > belong? > I've found out that if the user belongs to more than 60 to 70 groups, > group-based share access stop working. > From another post in this ml, i've found out that kernel 2.4.xx had a > 32 > group membership limitation, but i'm using 2.6.xx which has a 65536 > groups > limit. > Is there any place in samba where I shoulb be looking? > Any info/pointers would be much appreciated. Have you check with getent command if your platform response correctly? try "getent group " It should returns a members list like a line from /etc/group. If it does not work, check your entry in nsswitch.conf and replace "passwd compat" by "passwd ldap" (do the same for the group and maybe for shadow) Another thing you could try is use the recently "ldapsam:trusted = yes" option... take care of the considerations to make it work! Saludos, Martín -- To unsubscribe from this list go to the following URL and read the instructions: https://lists.samba.org/mailman/listinfo/samba
Re: [Samba] Overloaded samba server. Is it a bug?
On Fri Oct 28 14:44:02 GMT 2005 Bruno Guerreiro wrote: > I've made this question over and over, but still no answer till now. > So here goes again, maybe I have better luck this time. > Is there any limitation to the number of groups a samba user may > belong? > I've found out that if the user belongs to more than 60 to 70 groups, > group-based share access stop working. > From another post in this ml, i've found out that kernel 2.4.xx had a > 32 > group membership limitation, but i'm using 2.6.xx which has a 65536 > groups > limit. > Is there any place in samba where I shoulb be looking? > Any info/pointers would be much appreciated. Have you check with getent command if your platform response correctly? try "getent group " It should returns a members list like a line from /etc/group. If it does not work, check your entry in nsswitch.conf and replace "passwd compat" by "passwd ldap" (do the same for the group and maybe for shadow) Another thing you could try is use the recently "ldapsam:trusted = yes" option... take care of the considerations to make it work! Saludos, Martín -- To unsubscribe from this list go to the following URL and read the instructions: https://lists.samba.org/mailman/listinfo/samba
RE: [Samba] Overloaded samba server. Is it a bug?
On Oct 28, 2005 02:11 PM, MJBarber wrote: > I am running Suse 9.2 Pro in a corporate environment with 3.0.14a and > it works great.Just my 0.02... Well, when Samba is running before the load begins to rise, it's around 0.50 (with aproximately 1000 users logued in and 500 in high activity) > If you truly think this is a samba problem try a different version to > either replicate the issue or to have it point to a different piece of > the > puzzle. What is your complete config? We are using the Samba 3.0.20b because we need a new feature included in this version. (SeTakeOwnerShipPrivilege) We haven't been able to use root user as administrator of extended file system ACLs because the ldapsam:trusted is preventing us from using it. (NT_STATUS_UNSUCCESSFUL) > You said the load went sky high in a matter of seconds...do you see > which > process is running wild (smbd, nmbd, winbindd...). We've done an strace to the partent process of all smbds (it follows all the forks) and we didn't see nothing relevant. Here is our smb.conf, and winbindd is not being used. srvsmb02:~ # cat /etc/samba/smb.conf [global] workgroup = DOMAIN passdb backend = ldapsam:ldap://10.10.6.130 netbios name = SRVSMBFS netbios aliases = SRVSMBPS ldap admin dn = cn=admin,o=domain ldap suffix = ou=ar,o=domain ldap group suffix = ou=grupos_openldap ldap machine suffix = ou=maquinas ldap timeout = 2 idmap backend = ldap:ldap://10.10.6.130 idmap uid = 1-4 idmap gid = 1-4 unix charset = ISO8859-15 add machine script = /usr/local/sbin/smbldap-useradd -w %u domain logons = yes domain master = yes local master = yes show add printer wizard = no bind interfaces only = yes interfaces = 10.10.6.75/24 username level = 15 username map = /etc/samba/smbusers ldapsam:trusted = yes preferred master = yes ldap ssl = no wins support = yes printing = cups printcap name = cups printcap cache time = 750 cups options = raw map to guest = Bad User logon path = logon home = \\%L\%U\.9xprofile logon drive = H: os level = 255 log level = 3 socket options = IPTOS_LOWDELAY TCP_NODELAY cups server = 10.10.6.78 veto files = /*.eml/*.nws/riched20.dll/*.{*}/aquota.user/aquota.group/.msprofile/lost+found/ hide files = /aquota.user/aquota.group/.msprofile/ enable privileges = yes acl group control = yes logon script = ARRANQUE.BAT inherit owner = yes inherit acls = yes disable spoolss = yes log file = /var/log/samba/machines/log.%m [homes] comment = Home Directories valid users = %S browseable = No read only = No [profiles] comment = Network Profiles Service path = %H read only = No store dos attributes = Yes create mask = 0600 directory mask = 0700 browseable = no [printers] comment = All Printers path = /var/tmp printable = Yes create mask = 0600 browseable = No [netlogon] comment = netlogon service path = /var/lib/samba/netlogon browseable = no guest ok = . Continue ---8<---8< Thanks for your interest, Martín -- To unsubscribe from this list go to the following URL and read the instructions: https://lists.samba.org/mailman/listinfo/samba
RE: [Samba] Overloaded samba server. Is it a bug?
> I am running Suse 9.2 Pro in a corporate environment with 3.0.14a and > it > works great. > Just my 0.02... Well, when Samba is running before the load begins to rise, it's around 0.50 (with aproximately 1000 users logued in and 500 in high activity) > If you truly think this is a samba problem try a different version to > either replicate the issue or to have it point to a different piece of > the > puzzle. What is your complete config? We are using the Samba 3.0.20b because we need a new feature included in this version. (SeTakeOwnerShipPrivilege) We haven't been able to use root user as administrator of extended file system ACLs because the ldapsam:trusted is preventing us from using it. (NT_STATUS_UNSUCCESSFUL) > You said the load went sky high in a matter of seconds...do you see > which > process is running wild (smbd, nmbd, winbindd...). We've done an strace to the partent process of all smbds (it follows all the forks) and we didn't see nothing relevant. Here is our smb.conf, and winbindd is not being used. srvsmb02:~ # cat /etc/samba/smb.conf [global] workgroup = DOMAIN passdb backend = ldapsam:ldap://10.10.6.130 netbios name = SRVSMBFS netbios aliases = SRVSMBPS ldap admin dn = cn=admin,o=domain ldap suffix = ou=ar,o=domain ldap group suffix = ou=grupos_openldap ldap machine suffix = ou=maquinas ldap timeout = 2 idmap backend = ldap:ldap://10.10.6.130 idmap uid = 1-4 idmap gid = 1-4 unix charset = ISO8859-15 add machine script = /usr/local/sbin/smbldap-useradd -w %u domain logons = yes domain master = yes local master = yes show add printer wizard = no bind interfaces only = yes interfaces = 10.10.6.75/24 username level = 15 username map = /etc/samba/smbusers ldapsam:trusted = yes preferred master = yes ldap ssl = no wins support = yes printing = cups printcap name = cups printcap cache time = 750 cups options = raw map to guest = Bad User logon path = logon home = \\%L\%U\.9xprofile logon drive = H: os level = 255 log level = 3 socket options = IPTOS_LOWDELAY TCP_NODELAY cups server = 10.10.6.78 veto files = /*.eml/*.nws/riched20.dll/*.{*}/aquota.user/aquota.group/.msprofile/lost+found/ hide files = /aquota.user/aquota.group/.msprofile/ enable privileges = yes acl group control = yes logon script = ARRANQUE.BAT inherit owner = yes inherit acls = yes disable spoolss = yes log file = /var/log/samba/machines/log.%m [homes] comment = Home Directories valid users = %S browseable = No read only = No [profiles] comment = Network Profiles Service path = %H read only = No store dos attributes = Yes create mask = 0600 directory mask = 0700 browseable = no [printers] comment = All Printers path = /var/tmp printable = Yes create mask = 0600 browseable = No [netlogon] comment = netlogon service path = /var/lib/samba/netlogon browseable = no guest ok = . Continue ---8<---8< Thanks for your interest, Martín > > -Original Message- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] On Behalf > Of > [EMAIL PROTECTED] > Sent: Friday, October 28, 2005 12:48 PM > To: [EMAIL PROTECTED] > Cc: samba@lists.samba.org > Subject: Re: [Samba] Overloaded samba server. Is it a bug? > > First of all, why run SuSe when CentOS is free, runs faster and is > more up > to date? I have basically the same setup you have except our system is > a > quad xeon system and CentOS runs flawlessly 24/7. We used to > experiment > with SuSe but it is not good for a corporate environment. > Just a heads up as I have been doing this for 17 years and CentOS is > the > cream of the crop for the money. > > Martin Scandroli wrote: > > >Experts, > > > >We've just migrated from samba 2.2.8a to samba 3.0.20b in a very > >large > >corporate environment. Everything was really fine in our lab, but we > >began experiment serious load problems on the productive servers the > >morning after the procedure took place. I'll try (briefly) to > >describe > >the characteristics of the scenario: > > > >Resources: > > > >Old Environment: > > > >Hardware: > >Dell PowerEdge 2650 > >Intel Xeon Processor > >2 GB Ram >
[Samba] Overloaded samba server. Is it a bug?
Experts, We've just migrated from samba 2.2.8a to samba 3.0.20b in a very large corporate environment. Everything was really fine in our lab, but we began experiment serious load problems on the productive servers the morning after the procedure took place. I'll try (briefly) to describe the characteristics of the scenario: Resources: Old Environment: Hardware: Dell PowerEdge 2650 Intel Xeon Processor 2 GB Ram Raid 5 (via perc raid controller) on 10k scsi disks Software: SuSE Linux Enterprise Server 8 Samba 2.2.8a Servers cups printing service openldap2 as backend (with replicas all over the country, about 3000 objects in the tree) HeartBeat as high availability Service Everything was charming here!! New Environment Hardware: Dell PowerEdge 2850 Servers 2 Intel Xeon 3.2 GHz (HT i think... i see 4 of them) Processors 4 GB Ram Raid 5 (via Perc raid controller) on 15k scsi disks Software SuSE Linux Enterprise Server 9 Samba 3.0.20b Servers cups printing service Novell eDirectory 8.7.3.4 as backend (Very distributed too, about 4000 objects in the tree) HeartBeat as high availability Service drbd to keep samba configuracion replicated among the cluster nodes. Problems we're having (or had, just as a usefull comment): eDirectory turned out to be much slower than openldap2 when responding to nss_ldap queries (i mean about 7 or 8 times slower) so queries asking for members of large groups (i.e: groups with about 1500 users and above) were usually terminated with an RPC timeout Everything started to work when we added the ldapsam:trusted=yes parameter. It dramatically reduced the response times and affected queries began to work. The implementation of this feature produced some other problems (we've found workarrounds but i'll comment them just to provide some feedback). 1) The samba server used to die seconds after it was started. Something about the nobody user and it's primary group prevented it from working in a proper manner. We solved this inconvinient by adding de user nobody and it's corresponding primary group to the backend. 2) Root user was no longer recognized, (we still trying to figure out why, the user's been added to the tree, but nothing changed) so we used the new role based administration provided by samba 3 as a workarround (SeMachinAccount...), and no more troubles about it. 3)THIS ISSUE IS KILLING US!!! Something happens in a determined moment of the day (rush hour). Everything is running smoothly (0.3 - 0.4 of load average) when the load start to grow indefinitely!!. It raises from 0.3 to 50 in a matter of seconds!, and it keeps growing till the server dies. We couldn't find the reason of this, but it happens in a two hors interval. Before and after this interval, there are no errors of any kind. I'll paste some log errors (just the ones i saw). I don't think they're the cause of our problems, buy you're the experts. Any clue? do you need me to gather some kind of information? any DoS bug reported for this samba version? Any help will be highly appreciated Regards, Martin -- from /var/log/messages Oct 25 04:34:15 srvsmb01 smbd[2961]: [2005/10/25 04:34:15, 0] lib/util_sock.c:send_smb(762) Oct 25 04:34:15 srvsmb01 smbd[2961]: Error writing 4 bytes to client. -1. (Connection reset by peer) Oct 25 04:40:36 srvsmb01 smbd[2983]: [2005/10/25 04:40:36, 0] lib/util_sock.c:get_peer_addr(1222) Oct 25 04:40:36 srvsmb01 smbd[2983]: getpeername failed. Error was Transport endpoint is not connected Oct 25 04:40:36 srvsmb01 smbd[2983]: [2005/10/25 04:40:36, 0] lib/util_sock.c:write_data(554) Oct 25 04:40:36 srvsmb01 smbd[2983]: write_data: write failure in writing to client 167.252.104.98. Error Connection reset by peer (this happens very often) From /var/log/samba/log.nmbd tdb(unnamed): tdb_open_ex: /var/lib/samba/unexpected.tdb (2059,2959) is already open in this process [2005/10/26 04:17:01, 2] tdb/tdbutil.c:tdb_log(767) tdb(unnamed): tdb_open_ex: /var/lib/samba/unexpected.tdb (2059,2959) is already open in this process [2005/10/26 04:17:01, 2] tdb/tdbutil.c:tdb_log(767) tdb(unnamed): tdb_open_ex: /var/lib/samba/unexpected.tdb (2059,2959) is already open in this process [2005/10/26 04:17:01, 2] tdb/tdbutil.c:tdb_log(767) tdb(unnamed): tdb_open_ex: /var/lib/samba/unexpected.tdb (2059,2959) is already open in this process [2005/10/26 04:17:01, 2] tdb/tdbutil.c:tdb_log(767) tdb(unnamed): tdb_open_ex: /var/lib/samba/unexpected.tdb (2059,2959) is already open in this process [2005/10/26 04:17:01, 2] tdb/tdbutil.c:tdb_log(767) tdb(unnamed): tdb_open_ex: /var/lib