On 07/06/11 13:51, I wrote:
I now have this situation on one Solaris 10 slave; the problem
probably also exists on the other Sol 10 slave and the two
Ubuntu hardy slaves:

The _tcp zone on the master MS DNS Server:

     1238 600 86400 3600

The _tcp zone on the BIND 9.7.3-P1 Solaris 10 server disk:

     1239       ; serial
     900        ; refresh (15 minutes)
     600        ; retry (10 minutes)
     86400      ; expire (1 day)
     3600       ; minimum (1 hour)

The _udp zone on the master MS DNS Server:

     842 900 600 86400 3600

The _udp zone on the BIND 9.7.3-P1 Solaris 10 server disk:
     843        ; serial
     900        ; refresh (15 minutes)
     600        ; retry (10 minutes)
     86400      ; expire (1 day)
     3600       ; minimum (1 hour)

Note that the zone serial number for both zones on the master is
one LESS than the serial number on the slave.  The last messages
in /var/adm/messages are

     _tcp:
     Jun  4 07:46:57 serial number (1238) received from master ... <
ours (1239)
     Jun  4 07:47:35 zone ... expired
     Jun  4 07:47:35 zone ... transfer started
     Jun  4 07:47:35 zone ... transferred serial 1238
     Jun  4 07:47:35 zone ... Transfer completed: ...

     _udp:
     Jun  4 07:39:22 serial number (842) received from master ... <
ours (843)
     Jun  4 07:42:22 zone ... expired
     Jun  4 07:42:22 zone ... transfer started
     Jun  4 07:42:22 zone ... transferred serial 842
     Jun  4 07:42:22 zone ... Transfer completed

There was a zone serial number mismatch, each zone expired three days
ago, and new zones were transferred from the master.  But the zone
files on disk still have the higher serial numbers.  There are no .jnl
files on the disk.  A "dig" on the server for the zone serial numbers
shows the lower numbers, so BIND has those correct serial numbers.  I
assume that if I stopped BIND (rndc stop) and restarted it, then I
would again see the serial number mismatches.  I can try this during
the day, as this server is not heavily used.  Is there any debugging I
need to run?  Thanks.


I ran a test this morning on one of the Solaris 10 slave servers.
A query to the server showed serial numbers:

     _tcp   1238
     _udp    842

Both of these match the zone on the MS Windows DNS Server.
I checked the zone files on the slave server:

     _tcp   1239
     _udp    843

Both of these are increased by one from what BIND returns in
response to a query.

The two zones have NO .jnl files.

I did

     ./rndc stop
     <<Wait for the "exiting" message.>>
     /etc/init.d/named.anl start;tail -f /var/adm/messages

Once BIND started, the serial numbers were INCREASED, as I
expected they would be, given the lack of .jnl files.

And a few minutes later BIND complained about the serial
number on the master being less than that on the slave
for both zones.  I consider this a bug in BIND 9.
What further diagnostics do I need to get?

I have another Solaris 10 slave on which, I assume, I can
duplicate this.  And from past experience, in one day, after
the zone has expired and been refreshed, I will be in the same
state on this slave.
-
----------------------------------------------------------------------
Barry S. Finkel
Computing and Information Systems Division
Argonne National Laboratory          Phone:    +1 (630) 252-7277
9700 South Cass Avenue               Facsimile:+1 (630) 252-4601
Building 240, Room 5.B.8             Internet: bsfin...@anl.gov
Argonne, IL   60439-4828             IBMMAIL:  I1004994
_______________________________________________
bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users

Reply via email to