Re: [Samba] Overloaded samba server. Is it a bug? (but not a samba bug)

2005-11-10 Thread Martin Scandroli
Well, we finally resolved it.
The problem was with the QLA driver. We applied a kernel update
(kernel-bigsmp-2.6.5-7.234.i586.rpm) provided by SuSE support, and the
server is now working fine.
The patch will be included in the upcoming SLES9 Support Pack 3.

Anyway, thanks to all of you for your help!
Martín

On Nov 04, 2005 01:36 PM, Jeremy Allison <[EMAIL PROTECTED]> wrote:

> On Fri, Nov 04, 2005 at 10:51:52AM -0300, Martin wrote:
> > 
> > How can we find that out? How can we get a debug level high enough to
> > reach this information?
> > 
> > When an smbd process is stuck in D state, strace does not show
> > any output...
> 
> Attach to it with gdb and type "bt".
> 
> Jeremy.
> 
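
For anyone landing on this thread with the same symptom: a process in D state is in uninterruptible sleep inside the kernel, which is why strace shows nothing. As a minimal sketch (not something posted in the thread), the helper below narrows `ps` output down to D-state processes so the kernel wait channel of each one can be read off. The function name `d_state_only` is made up for illustration, and the `ps` columns in the usage comment assume the procps version of `ps`.

```shell
#!/bin/sh
# Hypothetical helper: keep only rows whose STAT column (2nd field) starts
# with "D" (uninterruptible sleep). Reads "PID STAT WCHAN COMMAND" rows
# on stdin, so it works on live ps output or on a saved capture.
d_state_only() {
    awk '$2 ~ /^D/ { print }'
}

# Live usage (procps ps assumed):
#   ps -eo pid,stat,wchan:32,comm | d_state_only
```

For a single stuck PID, `gdb -p PID -batch -ex bt` is a non-interactive form of Jeremy's suggestion. Keep in mind that gdb shows only the user-space stack, and that attaching may itself block for as long as the process stays in the kernel.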

--
To unsubscribe from this list go to the following URL and read the
instructions:  https://lists.samba.org/mailman/listinfo/samba


Re: [Samba] Overloaded samba server. Is it a bug?

2005-11-03 Thread Martin Scandroli
On Nov 03, 2005 03:18 PM, Jeremy Allison <[EMAIL PROTECTED]> wrote:

> On Thu, Nov 03, 2005 at 05:16:49AM -0300, Martin wrote:
> > Roger,
> > 
> > On Thursday 03 November 2005 03:22, Roger Eisenecher wrote:
> > > Hi all
> > >
> > > Martin schrieb:
> > > > 1TB with reiserfs in LVM
> > >
> > > We have a similar installation: kernel 2.6.5-7.201-smp (the official
> > > kernel of SuSE 9.1 Professional), and we are using openldap and
> > > reiserfs too. Additionally, we are using quota on the filesystem. Our
> > > server often hangs in this situation with a load of 350! The
> > > interesting part is that the CPUs are 92% idle. If we deactivate the
> > > quota subsystem, the server works for a longer time, but the load can
> > > still reach 350... Only a reboot solves this problem...
> > This is exactly the same symptom we have.
> > We have already disabled quota without success. We still have the
> > problem.
> > 
> > 
> > > Martin: Which kernel are you using? Do you use quota on your
> > > filesystem?
> > >
> > 
> > This is SLES9 running
> > kernel-bigsmp-2.6.5-7.201.i586
> > 
> > We also had problems with the earlier version
> > kernel-bigsmp-2.6.5-7.193.i586
> > 
> > Note: We decided to run a 32-bit kernel on the Intel EM64T platform.
> 
> Can you reproduce this problem on a different filesystem than
> Reiser ? I'm trying to narrow down the problem here.
No. That is quite difficult with 1200 users on the server.

> 
> Jeremy.
> 



Re: [Samba] Overloaded samba server. Is it a bug?

2005-11-02 Thread Martin Scandroli
On Wednesday 02 November 2005 19:50, Jeremy Allison wrote:
> On Wed, Nov 02, 2005 at 06:53:36PM -0300, Martin wrote:
> > #> strace -f -p 
> >
> > RDONLY|O_NONBLOCK|O_LARGEFILE|O_DIRECTORY) = 18

[ ... ]

> > 2005/INVERSION", {st_mode=S_IFDIR|0770, st_size=128, ...}) = 0
> > stat64("Estructura_Central/marketing/Medios/Victor/insitucional
> > 2005/INVERSION/cao 2.xls", 0xbfffcec0) = -1 ENOENT (No such file or
> > directory)
> > stat64("Estructura_Central/marketing/Medios/Victor/insitucional
> > 2005/INVERSION/cao 2.xls", 0xbfffcec0) = -1 ENOENT (No such file or
> > directory)
>
> What filesystem is this ?
1TB with reiserfs in LVM

--
Mrtn



Re: [Samba] Group membership limitation

2005-10-28 Thread Martin Scandroli

On Fri Oct 28 14:44:02 GMT 2005 Bruno Guerreiro wrote:

> I've asked this question over and over, but still have no answer.
> So here goes again; maybe I'll have better luck this time.
> Is there any limit on the number of groups a samba user may belong to?
> I've found that if the user belongs to more than 60 to 70 groups,
> group-based share access stops working.
> From another post on this list, I found out that kernel 2.4.xx had a
> 32-group membership limit, but I'm using 2.6.xx, which has a limit of
> 65536 groups.
> Is there any place in samba where I should be looking?
> Any info/pointers would be much appreciated.

Have you checked with the getent command whether your platform responds
correctly? Try "getent group " followed by the name of one of the groups.
It should return a member list formatted like a line from /etc/group.

If it does not work, check your entries in nsswitch.conf and replace
"passwd: compat" with "passwd: ldap" (do the same for group and maybe
for shadow).

Another thing you could try is the recently added "ldapsam:trusted = yes"
option... but mind the requirements for making it work!
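
A minimal sketch of that check, assuming a POSIX shell. The function names and the user and group names in the usage comment are placeholders, not anything taken from Bruno's setup:

```shell
#!/bin/sh
# Hypothetical helpers for checking group resolution through NSS.

# Number of groups the system resolves for a user ("id -G" prints one
# numeric GID per group, separated by spaces).
count_user_groups() {
    id -G "$1" | awk '{ print NF }'
}

# Split the member field (4th colon-separated field of a group(5) line,
# as printed by "getent group NAME") into one member per line.
members_from_group_line() {
    cut -d: -f4 | tr ',' '\n' | sed '/^$/d'
}

# Usage (user and group names are placeholders):
#   count_user_groups someuser
#   getent group somegroup | members_from_group_line | wc -l
```

If the count reported for the user is already lower than the number of groups assigned in the directory, the lookup is probably going wrong below Samba, in the NSS layer.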


Saludos, 
Martín



RE: [Samba] Overloaded samba server. Is it a bug?

2005-10-28 Thread Martin Scandroli

On Oct 28, 2005 02:11 PM, MJBarber wrote:

> I am running Suse 9.2 Pro in a corporate environment with 3.0.14a and
> it works great. Just my 0.02...

Well, while Samba is running normally, before the load begins to rise, the
load average is around 0.50 (with approximately 1000 users logged in and
500 of them highly active).

> If you truly think this is a samba problem try a different version to
> either replicate the issue or to have it point to a different piece of
> the puzzle. What is your complete config?

We are using Samba 3.0.20b because we need a new feature included in
this version (SeTakeOwnershipPrivilege). We haven't been able to use the
root user as administrator of extended file system ACLs because
ldapsam:trusted prevents us from using it (NT_STATUS_UNSUCCESSFUL).

> You said the load went sky high in a matter of seconds...do you see
> which process is running wild (smbd, nmbd, winbindd...).

We ran strace on the parent process of all the smbds (following all
the forks) and didn't see anything relevant.

Here is our smb.conf; winbindd is not being used.

srvsmb02:~ # cat /etc/samba/smb.conf
[global]
workgroup = DOMAIN
passdb backend = ldapsam:ldap://10.10.6.130
netbios name = SRVSMBFS
netbios aliases = SRVSMBPS
ldap admin dn = cn=admin,o=domain
ldap suffix = ou=ar,o=domain
ldap group suffix = ou=grupos_openldap
ldap machine suffix = ou=maquinas
ldap timeout = 2
idmap backend = ldap:ldap://10.10.6.130
idmap uid = 1-4
idmap gid = 1-4
unix charset = ISO8859-15
add machine script = /usr/local/sbin/smbldap-useradd -w %u
domain logons = yes
domain master = yes
local master = yes
show add printer wizard = no
bind interfaces only = yes
interfaces = 10.10.6.75/24
username level = 15
username map = /etc/samba/smbusers
ldapsam:trusted = yes
preferred master = yes
ldap ssl = no
wins support = yes
printing = cups
printcap name = cups
printcap cache time = 750
cups options = raw
map to guest = Bad User
logon path =
logon home = \\%L\%U\.9xprofile
logon drive = H:
os level = 255
log level = 3
socket options = IPTOS_LOWDELAY TCP_NODELAY
cups server = 10.10.6.78
veto files = /*.eml/*.nws/riched20.dll/*.{*}/aquota.user/aquota.group/.msprofile/lost+found/
hide files = /aquota.user/aquota.group/.msprofile/
enable privileges = yes
acl group control = yes
logon script = ARRANQUE.BAT
inherit owner = yes
inherit acls = yes
disable spoolss = yes
log file = /var/log/samba/machines/log.%m
[homes]
comment = Home Directories
valid users = %S
browseable = No
read only = No
[profiles]
comment = Network Profiles Service
path = %H
read only = No
store dos attributes = Yes
create mask = 0600
directory mask = 0700
browseable = no
[printers]
comment = All Printers
path = /var/tmp
printable = Yes
create mask = 0600
browseable = No
[netlogon]
comment = netlogon service
path = /var/lib/samba/netlogon
browseable = no
guest ok = . Continue
---8<---8<


Thanks for your interest,
Martín





> 
> -Original Message-
> From: [EMAIL PROTECTED]
> [mailto:[EMAIL PROTECTED] On Behalf
> Of
> [EMAIL PROTECTED]
> Sent: Friday, October 28, 2005 12:48 PM
> To: [EMAIL PROTECTED]
> Cc: samba@lists.samba.org
> Subject: Re: [Samba] Overloaded samba server. Is it a bug?
> 
> First of all, why run SuSe when CentOS is free, runs faster and is
> more up to date? I have basically the same setup you have except our
> system is a quad xeon system and CentOS runs flawlessly 24/7. We used
> to experiment with SuSe but it is not good for a corporate environment.
> Just a heads up as I have been doing this for 17 years and CentOS is
> the cream of the crop for the money.
> 
> Martin Scandroli wrote:
> 
> >Experts,
> >
> >We've just migrated from samba 2.2.8a to samba 3.0.20b in a very
> >large
> >corporate environment. Everything was really fine in our lab, but we 
> >began experiment serious load problems on the productive servers the 
> >morning after the procedure took place. I'll try (briefly) to
> >describe
> >the characteristics of the scenario:
> >
> >Resources:
> >
> >Old Environment:
> >
> >Hardware:
> >Dell PowerEdge 2650
> >Intel Xeon Processor
> >2 GB Ram
> 

[Samba] Overloaded samba server. Is it a bug?

2005-10-26 Thread Martin Scandroli

Experts,

We've just migrated from samba 2.2.8a to samba 3.0.20b in a very large
corporate environment. Everything was fine in our lab, but we began
experiencing serious load problems on the production servers the morning
after the migration took place. I'll try to (briefly) describe the
characteristics of the scenario:

Resources:

Old Environment:

  Hardware:
    Dell PowerEdge 2650
    Intel Xeon processor
    2 GB RAM
    RAID 5 (via PERC RAID controller) on 10k SCSI disks
  Software:
    SuSE Linux Enterprise Server 8
    Samba 2.2.8a servers
    cups printing service
    openldap2 as backend (with replicas all over the country,
      about 3000 objects in the tree)
    HeartBeat as high availability service

Everything worked like a charm here!


New Environment:

  Hardware:
    Dell PowerEdge 2850 servers
    2 Intel Xeon 3.2 GHz processors (HT, I think... I see 4 of them)
    4 GB RAM
    RAID 5 (via PERC RAID controller) on 15k SCSI disks

  Software:
    SuSE Linux Enterprise Server 9
    Samba 3.0.20b servers
    cups printing service
    Novell eDirectory 8.7.3.4 as backend (very distributed too,
      about 4000 objects in the tree)
    HeartBeat as high availability service
    drbd to keep the samba configuration replicated among the cluster
      nodes

Problems we're having (or had; included just as a useful comment):

eDirectory turned out to be much slower than openldap2 when responding
to nss_ldap queries (about 7 or 8 times slower), so queries asking for
the members of large groups (i.e. groups with about 1500 users and
above) were usually terminated with an RPC timeout.

Everything started to work when we added the ldapsam:trusted=yes
parameter. It dramatically reduced the response times, and the affected
queries began to work.
Enabling this feature produced some other problems (we've found
workarounds, but I'll mention them just to provide some feedback).

1) The samba server used to die seconds after it was started.
Something about the nobody user and its primary group prevented it from
working properly. We solved this by adding the nobody user and its
corresponding primary group to the backend.
2) The root user was no longer recognized (we're still trying to figure
out why; the user has been added to the tree, but nothing changed), so
we used the new role-based administration provided by samba 3 as a
workaround (SeMachineAccount...), and have had no more trouble with it.



3) THIS ISSUE IS KILLING US!!!

Something happens at a particular time of the day (rush hour).
Everything is running smoothly (load average of 0.3 - 0.4) when the load
starts to grow without bound. It rises from 0.3 to 50 in a matter of
seconds, and it keeps growing until the server dies. We couldn't find
the cause, but it happens within a two-hour interval. Before and after
this interval, there are no errors of any kind.
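
On Linux the load average counts tasks in uninterruptible (D) sleep as well as runnable ones, so the load can climb while the CPUs sit idle. A minimal sketch for catching the moment such a spike starts, assuming /proc/loadavg is available. The function name, the threshold of 10 and the log path in the usage comment are arbitrary choices for illustration:

```shell
#!/bin/sh
# Hypothetical helper: succeed (exit 0) when the 1-minute load average,
# the first field of a /proc/loadavg-style line on stdin, exceeds the
# threshold given as $1. awk does the comparison because sh has no
# decimal arithmetic. Empty input counts as "not exceeded".
load_exceeds() {
    awk -v limit="$1" 'NR == 1 { found = ($1 > limit) } END { exit !found }'
}

# Usage: once a minute, record blocked processes while the load is high.
#   while sleep 60; do
#       if load_exceeds 10 < /proc/loadavg; then
#           date
#           ps -eo pid,stat,wchan:32,comm | awk '$2 ~ /^D/'
#       fi
#   done >> /tmp/load-spike.log
```

A capture like this shows which processes are blocked and on what, which is usually more telling than the load figure itself.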

I'll paste some log errors (just the ones I saw). I don't think
they're the cause of our problems, but you're the experts.

Any clue? Do you need me to gather some kind of information? Is there
any DoS bug reported for this samba version?

Any help will be highly appreciated.

Regards, 
Martin

--

from /var/log/messages

Oct 25 04:34:15 srvsmb01 smbd[2961]: [2005/10/25 04:34:15, 0] lib/util_sock.c:send_smb(762)
Oct 25 04:34:15 srvsmb01 smbd[2961]:   Error writing 4 bytes to client. -1. (Connection reset by peer)
Oct 25 04:40:36 srvsmb01 smbd[2983]: [2005/10/25 04:40:36, 0] lib/util_sock.c:get_peer_addr(1222)
Oct 25 04:40:36 srvsmb01 smbd[2983]: getpeername failed. Error was Transport endpoint is not connected
Oct 25 04:40:36 srvsmb01 smbd[2983]: [2005/10/25 04:40:36, 0] lib/util_sock.c:write_data(554)
Oct 25 04:40:36 srvsmb01 smbd[2983]: write_data: write failure in writing to client 167.252.104.98. Error Connection reset by peer

(this happens very often)
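
To judge whether those resets come from a handful of clients or from everywhere, the entries can be tallied per client address. A rough sketch, assuming each `write_data` message sits on a single line in the log file (unwrapped, unlike in this mail). The function name is made up, and the log path in the usage line is simply the file quoted in this post:

```shell
#!/bin/sh
# Hypothetical helper: count smbd "write failure" entries per client IP.
# Reads log lines on stdin; prints "count address", busiest client first.
resets_per_client() {
    grep -o 'writing to client [0-9][0-9.]*' |
        awk '{ sub(/\.$/, "", $4); print $4 }' |
        sort | uniq -c | sort -rn |
        awk '{ print $1, $2 }'
}

# Usage:
#   resets_per_client < /var/log/messages
```

The `sub()` call strips the sentence-ending period that follows the address in the log message.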

From /var/log/samba/log.nmbd

tdb(unnamed): tdb_open_ex: /var/lib/samba/unexpected.tdb (2059,2959) is already open in this process
[2005/10/26 04:17:01, 2] tdb/tdbutil.c:tdb_log(767)
tdb(unnamed): tdb_open_ex: /var/lib/samba/unexpected.tdb (2059,2959) is already open in this process
[2005/10/26 04:17:01, 2] tdb/tdbutil.c:tdb_log(767)
tdb(unnamed): tdb_open_ex: /var/lib/samba/unexpected.tdb (2059,2959) is already open in this process
[2005/10/26 04:17:01, 2] tdb/tdbutil.c:tdb_log(767)
tdb(unnamed): tdb_open_ex: /var/lib/samba/unexpected.tdb (2059,2959) is already open in this process
[2005/10/26 04:17:01, 2] tdb/tdbutil.c:tdb_log(767)
tdb(unnamed): tdb_open_ex: /var/lib/samba/unexpected.tdb (2059,2959) is already open in this process
[2005/10/26 04:17:01, 2] tdb/tdbutil.c:tdb_log(767)
tdb(unnamed): tdb_open_ex: /var/lib