RES: Bind 9.6.1 stops after few hours.
Hi Rob, I could not reproduce this behavior on my Linux lab machine using queryperf. As Jinmei pointed this seens to be a Solaris thing. I still could not reproduce the problem I faced on my own production DNS... Thanks for your post! Laurence -Mensagem original- De: Rob Payne [mailto:rnspa...@the-paynes.com] Enviada em: terça-feira, 7 de julho de 2009 12:56 Para: JINMEI Tatuya / 神明達哉 Cc: Laurence Stendard; bind-users@lists.isc.org Assunto: Re: Bind 9.6.1 stops after few hours. On Sun, Jul 05, 2009 at 02:42:34PM -0700, JINMEI Tatuya / 神明達哉 wrote: > At Fri, 3 Jul 2009 17:31:57 -0300, > "Laurence Stendard" wrote: > > After an upgrade to 9.6.1 we noticed the Bind daemon stops after few > > hours. > What do you mean by "stop"? Did the daemon crash, simply not respond > to queries, or something else? I don't know if this is the same as what Laurence is seeing. Testing 9.6.1 on Solaris 10/sparc, with a local build (THREADS, no MEMFILL, openssl 0.9.8k) the server stops responding to queries made from the network (LAN), until a local query comes in (dig @localhost ...). > >From which version did you upgrade your named? > How often does that happen? To reproduce this: queryperf -> 9.6.1 acting as a mostly recursive server (haven't tested with a server configured as authoritative-only). The server stops responding during a 30 second queryperf run. Running a script on the name server with 'dig @localhost...' once per second wakes the system up and the server continues answering queries made via the network. > Does the problem change if you disable threads and/or epoll > (via --disable-threads / --disable-epoll)? Have not tested either of these changes, yet. I have not tested the packages from sunfreeware, yet. -rob ___ bind-users mailing list bind-users@lists.isc.org https://lists.isc.org/mailman/listinfo/bind-users
Re: Bind 9.6.1 stops after few hours.
On 07.07.2009 / 11:55:34 -0400, Rob Payne wrote: > > > What do you mean by "stop"? Did the daemon crash, simply not respond > > to queries, or something else? > > I don't know if this is the same as what Laurence is seeing. Testing > 9.6.1 on Solaris 10/sparc, with a local build (THREADS, no MEMFILL, > openssl 0.9.8k) the server stops responding to queries made from the > network (LAN), until a local query comes in (dig @localhost ...). We're using 9.6.0-P1 in solaris 10 x86 zone, acting as both recursive and authoritative server (a bit loaded, like 1k concurrent recursive queries during daytime hours seen with 'rndc status') and don't seeing any problems with it. Bind was configured as './configure --with-openssl=no' since we don't use DNSSEC. ___ bind-users mailing list bind-users@lists.isc.org https://lists.isc.org/mailman/listinfo/bind-users
Re: Bind 9.6.1 stops after few hours.
At Tue, 7 Jul 2009 11:55:34 -0400, Rob Payne wrote: > > > After an upgrade to 9.6.1 we noticed the Bind daemon stops after few > > > hours. > > > What do you mean by "stop"? Did the daemon crash, simply not respond > > to queries, or something else? > > I don't know if this is the same as what Laurence is seeing. Testing > 9.6.1 on Solaris 10/sparc, with a local build (THREADS, no MEMFILL, > openssl 0.9.8k) the server stops responding to queries made from the > network (LAN), until a local query comes in (dig @localhost ...). You may want to try this: 2489. [port] solaris: Workaround Solaris's kernel bug about /dev/poll: http://bugs.opensolaris.org/view_bug.do?bug_id=6724237 Define ISC_SOCKET_USE_POLLWATCH at build time to enable this workaround. [RT #18870] Note, however, that this is workaround after all and may still cause problematic behavior. The essential fix is to apply Sun's patch to the kernel bug (I hear it exists, but don't know how widely it's available). --- JINMEI, Tatuya Internet Systems Consortium, Inc. ___ bind-users mailing list bind-users@lists.isc.org https://lists.isc.org/mailman/listinfo/bind-users
Re: Bind 9.6.1 stops after few hours.
On Sun, Jul 05, 2009 at 02:42:34PM -0700, JINMEI Tatuya / 神明達哉 wrote: > At Fri, 3 Jul 2009 17:31:57 -0300, > "Laurence Stendard" wrote: > > After an upgrade to 9.6.1 we noticed the Bind daemon stops after few > > hours. > What do you mean by "stop"? Did the daemon crash, simply not respond > to queries, or something else? I don't know if this is the same as what Laurence is seeing. Testing 9.6.1 on Solaris 10/sparc, with a local build (THREADS, no MEMFILL, openssl 0.9.8k) the server stops responding to queries made from the network (LAN), until a local query comes in (dig @localhost ...). > >From which version did you upgrade your named? > How often does that happen? To reproduce this: queryperf -> 9.6.1 acting as a mostly recursive server (haven't tested with a server configured as authoritative-only). The server stops responding during a 30 second queryperf run. Running a script on the name server with 'dig @localhost...' once per second wakes the system up and the server continues answering queries made via the network. > Does the problem change if you disable threads and/or epoll > (via --disable-threads / --disable-epoll)? Have not tested either of these changes, yet. I have not tested the packages from sunfreeware, yet. -rob pgpGgqNrLCdb8.pgp Description: PGP signature ___ bind-users mailing list bind-users@lists.isc.org https://lists.isc.org/mailman/listinfo/bind-users
Re: Bind 9.6.1 stops after few hours.
At Fri, 3 Jul 2009 17:31:57 -0300, "Laurence Stendard" wrote: > After an upgrade to 9.6.1 we noticed the Bind daemon stops after few > hours. What do you mean by "stop"? Did the daemon crash, simply not respond to queries, or something else? >From which version did you upgrade your named? How often does that happen? Does the problem change if you disable threads and/or epoll (via --disable-threads / --disable-epoll)? --- JINMEI, Tatuya Internet Systems Consortium, Inc. ___ bind-users mailing list bind-users@lists.isc.org https://lists.isc.org/mailman/listinfo/bind-users
Re: Bind 9.6.1 stops after few hours.
I'm using 9.6.1 from sunfreeware.com on a x86 IBM dual core (Solaris 10) and never had a problem with it. Good luck! Julian - Original Message - From: Laurence Stendard To: bind-users@lists.isc.org Sent: Friday, July 03, 2009 2:31 PM Subject: Bind 9.6.1 stops after few hours. Dear All, After an upgrade to 9.6.1 we noticed the Bind daemon stops after few hours. Our servers are RHEL 4.6 32 bits. Bind daemon was compiled com threads and IPv6 (./configure --enable-threads --enable-ipv6) and runs under chroot jail. The only problem I found on logs was: 24-Jun-2009 16:06:35.843 general: error: socket.c:4922: unexpected error: 24-Jun-2009 16:06:35.843 general: error: 22/Invalid argument 24-Jun-2009 16:06:36.412 general: error: socket.c:4922: unexpected error: 24-Jun-2009 16:06:36.412 general: error: 22/Invalid argument 24-Jun-2009 16:06:38.089 general: error: socket.c:4922: unexpected error: But, since they happen only when the daemon starts I think they are unrelated to this issue... Is anybody else experiencing something like this problem? Many thanks, Laurence -- ___ bind-users mailing list bind-users@lists.isc.org https://lists.isc.org/mailman/listinfo/bind-users___ bind-users mailing list bind-users@lists.isc.org https://lists.isc.org/mailman/listinfo/bind-users
Bind 9.6.1 stops after few hours.
Dear All, After an upgrade to 9.6.1 we noticed the Bind daemon stops after few hours. Our servers are RHEL 4.6 32 bits. Bind daemon was compiled com threads and IPv6 (./configure --enable-threads --enable-ipv6) and runs under chroot jail. The only problem I found on logs was: 24-Jun-2009 16:06:35.843 general: error: socket.c:4922: unexpected error: 24-Jun-2009 16:06:35.843 general: error: 22/Invalid argument 24-Jun-2009 16:06:36.412 general: error: socket.c:4922: unexpected error: 24-Jun-2009 16:06:36.412 general: error: 22/Invalid argument 24-Jun-2009 16:06:38.089 general: error: socket.c:4922: unexpected error: But, since they happen only when the daemon starts I think they are unrelated to this issue... Is anybody else experiencing something like this problem? Many thanks, Laurence ___ bind-users mailing list bind-users@lists.isc.org https://lists.isc.org/mailman/listinfo/bind-users