[OpenAFS] VL_RegisterAddrs rpc failed (code=5376, err=22)
>> [EMAIL PROTECTED] root]# udebug bongo 7003 >> Host's addresses are: 192.168.7.68 192.168.4.68 192.168.1.68 192.168.8.68 >> 192.168.9.68 >> Host's 192.168.7.68 time is Wed Sep 26 10:51:59 2007 >> Local time is Wed Sep 26 10:52:02 2007 (time differential 3 secs) >> Last yes vote for 68.7.168.192 was 0 secs ago (sync site); > >Looks like an identity crisis. I bet you have a local hostname that binds to >68.7.168.192 in dns or /etc/hosts and in CellServDB you're calling the host >192.168.7.68 or somesuch? > see also: http://www.mail-archive.com/openafs-info@openafs.org/msg24691.html i'm absolutely sure that my name-services are fine and consistent: [EMAIL PROTECTED] root]# cat /etc/openafs/CellServDB freakout.de #Cell name 192.168.7.68#bongo [EMAIL PROTECTED] root]# cat /etc/hosts # bongo:/etc/hosts 127.0.0.1 localhost loopback 192.168.7.68bongo.freakout.de bongo [EMAIL PROTECTED] root]# nslookup bongo Server: 127.0.0.1 Address:127.0.0.1#53 Name: bongo.freakout.de Address: 192.168.7.68 [EMAIL PROTECTED] root]# nslookup 192.168.7.68 Server: 127.0.0.1 Address:127.0.0.1#53 68.7.168.192.in-addr.arpa name = bongo.freakout.de. ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info
[OpenAFS] VL_RegisterAddrs rpc failed (code=5376, err=22)
>> >> cannot get around this problem - see error messages from FileLog: >> >> >> >> Tue Sep 25 16:15:21 2007 VL_RegisterAddrs rpc failed; will retry >> periodically (code=5376, err=22) >> > >> >use "translate_et 5376" : >> > >> >~: translate_et 5376 >> >5376 (u).0 = no quorum elected >> >~: >> > >> >So your database servers haven't head a quorum when the fileserver >> >started. Registration of a fileserver requires a write operation on the >> >vlserver which is only possible on the sync site. Without a quorum the >> >sync site can't be elcted... >> > >> >Do a "udebug 7003" to all your database servers >> >to find out what happens! >> > >> >> [EMAIL PROTECTED] root]# udebug bongo 7003 >> Host's addresses are: 192.168.7.68 192.168.4.68 192.168.1.68 192.168.8.68 >> 192.168.9.68 >> Host's 192.168.7.68 time is Wed Sep 26 10:51:59 2007 >> Local time is Wed Sep 26 10:52:02 2007 (time differential 3 secs) >> Last yes vote for 68.7.168.192 was 0 secs ago (sync site); > >Looks like an identity crisis. I bet you have a local hostname that binds to >68.7.168.192 in dns or /etc/hosts and in CellServDB you're calling the host >192.168.7.68 or somesuch? > i'm absolutely sure that my name-services are fine and consistent: [EMAIL PROTECTED] root]# cat /etc/openafs/CellServDB freakout.de #Cell name 192.168.7.68#bongo [EMAIL PROTECTED] root]# cat /etc/hosts # bongo:/etc/hosts 127.0.0.1 localhost loopback 192.168.7.68bongo.freakout.de bongo [EMAIL PROTECTED] root]# nslookup bongo Server: 127.0.0.1 Address:127.0.0.1#53 Name: bongo.freakout.de Address: 192.168.7.68 [EMAIL PROTECTED] root]# nslookup 192.168.7.68 Server: 127.0.0.1 Address:127.0.0.1#53 68.7.168.192.in-addr.arpa name = bongo.freakout.de. ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info
Re: [OpenAFS] VL_RegisterAddrs rpc failed (code=5376, err=22)
On 9/26/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote: > > >> cannot get around this problem - see error messages from FileLog: > >> > >> Tue Sep 25 16:15:21 2007 VL_RegisterAddrs rpc failed; will retry > periodically (code=5376, err=22) > > > >use "translate_et 5376" : > > > >~: translate_et 5376 > >5376 (u).0 = no quorum elected > >~: > > > >So your database servers haven't head a quorum when the fileserver > >started. Registration of a fileserver requires a write operation on the > >vlserver which is only possible on the sync site. Without a quorum the > >sync site can't be elcted... > > > >Do a "udebug 7003" to all your database servers > >to find out what happens! > > > > [EMAIL PROTECTED] root]# udebug bongo 7003 > Host's addresses are: 192.168.7.68 192.168.4.68 192.168.1.68 192.168.8.68 > 192.168.9.68 > Host's 192.168.7.68 time is Wed Sep 26 10:51:59 2007 > Local time is Wed Sep 26 10:52:02 2007 (time differential 3 secs) > Last yes vote for 68.7.168.192 was 0 secs ago (sync site); Looks like an identity crisis. I bet you have a local hostname that binds to 68.7.168.192 in dns or /etc/hosts and in CellServDB you're calling the host 192.168.7.68 or somesuch?
[OpenAFS] VL_RegisterAddrs rpc failed (code=5376, err=22)
>> cannot get around this problem - see error messages from FileLog: >> >> Tue Sep 25 16:15:21 2007 VL_RegisterAddrs rpc failed; will retry >> periodically (code=5376, err=22) > >use "translate_et 5376" : > >~: translate_et 5376 >5376 (u).0 = no quorum elected >~: > >So your database servers haven't head a quorum when the fileserver >started. Registration of a fileserver requires a write operation on the >vlserver which is only possible on the sync site. Without a quorum the >sync site can't be elcted... > >Do a "udebug 7003" to all your database servers >to find out what happens! > [EMAIL PROTECTED] root]# udebug bongo 7003 Host's addresses are: 192.168.7.68 192.168.4.68 192.168.1.68 192.168.8.68 192.168.9.68 Host's 192.168.7.68 time is Wed Sep 26 10:51:59 2007 Local time is Wed Sep 26 10:52:02 2007 (time differential 3 secs) Last yes vote for 68.7.168.192 was 0 secs ago (sync site); Last vote started 0 secs ago (at Wed Sep 26 10:52:02 2007) Local db version is 1190723057.5 I am sync site forever (1 server) Recovery state 1f Sync site's db version is 1190723057.5 0 locked pages, 0 of them for write [EMAIL PROTECTED] root]# ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info
Re: [OpenAFS] VL_RegisterAddrs rpc failed (code=5376, err=22)
[EMAIL PROTECTED] wrote: cannot get around this problem - see error messages from FileLog: Tue Sep 25 16:15:21 2007 File server starting Tue Sep 25 16:15:21 2007 afs_krb_get_lrealm failed, using freakout.de. Tue Sep 25 16:15:21 2007 VL_RegisterAddrs rpc failed; will retry periodically (code=5376, err=22) Tue Sep 25 16:15:21 2007 Set thread id 14 for FSYNC_sync Tue Sep 25 16:15:21 2007 VInitVolumePackage: beginning single-threaded fileserver startup Tue Sep 25 16:15:21 2007 VInitVolumePackage: using 1 thread to attach volumes on 1 partition(s) Tue Sep 25 16:15:21 2007 Partition /vicepa: attaching volumes Tue Sep 25 16:15:22 2007 Partition /vicepa: attached 4 volumes; 0 volumes not attached Tue Sep 25 16:15:22 2007 Set thread id 15 for 'FiveMinuteCheckLWP' Tue Sep 25 16:15:22 2007 Set thread id 16 for 'HostCheckLWP' Tue Sep 25 16:15:22 2007 Set thread id 17 for 'FsyncCheckLWP' Tue Sep 25 16:15:22 2007 Getting FileServer name... Tue Sep 25 16:15:22 2007 FileServer host name is 'bongo' Tue Sep 25 16:15:22 2007 Getting FileServer address... Tue Sep 25 16:15:22 2007 FileServer bongo has address 192.168.7.68 (0x4407a8c0 or 0xc0a80744 in host byte order) Tue Sep 25 16:15:22 2007 File Server started Tue Sep 25 16:15:22 2007 i tried to follow the tips from different articles using NetInfo and NetRestrict - none worked. i tried to check "code=5376, err=22" from the source-code but i cannot get enough information to find the source of the problem. use "translate_et 5376" : ~: translate_et 5376 5376 (u).0 = no quorum elected ~: So your database servers haven't head a quorum when the fileserver started. Registration of a fileserver requires a write operation on the vlserver which is only possible on the sync site. Without a quorum the sync site can't be elcted... Do a "udebug 7003" to all your database servers to find out what happens! Hartmut Reuter Please help i'm lost on this one. ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info -- - Hartmut Reuter e-mail [EMAIL PROTECTED] phone +49-89-3299-1328 RZG (Rechenzentrum Garching) fax +49-89-3299-1301 Computing Center of the Max-Planck-Gesellschaft (MPG) and the Institut fuer Plasmaphysik (IPP) - ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info
[OpenAFS] VL_RegisterAddrs rpc failed (code=5376, err=22)
cannot get around this problem - see error messages from FileLog: Tue Sep 25 16:15:21 2007 File server starting Tue Sep 25 16:15:21 2007 afs_krb_get_lrealm failed, using freakout.de. Tue Sep 25 16:15:21 2007 VL_RegisterAddrs rpc failed; will retry periodically (code=5376, err=22) Tue Sep 25 16:15:21 2007 Set thread id 14 for FSYNC_sync Tue Sep 25 16:15:21 2007 VInitVolumePackage: beginning single-threaded fileserver startup Tue Sep 25 16:15:21 2007 VInitVolumePackage: using 1 thread to attach volumes on 1 partition(s) Tue Sep 25 16:15:21 2007 Partition /vicepa: attaching volumes Tue Sep 25 16:15:22 2007 Partition /vicepa: attached 4 volumes; 0 volumes not attached Tue Sep 25 16:15:22 2007 Set thread id 15 for 'FiveMinuteCheckLWP' Tue Sep 25 16:15:22 2007 Set thread id 16 for 'HostCheckLWP' Tue Sep 25 16:15:22 2007 Set thread id 17 for 'FsyncCheckLWP' Tue Sep 25 16:15:22 2007 Getting FileServer name... Tue Sep 25 16:15:22 2007 FileServer host name is 'bongo' Tue Sep 25 16:15:22 2007 Getting FileServer address... Tue Sep 25 16:15:22 2007 FileServer bongo has address 192.168.7.68 (0x4407a8c0 or 0xc0a80744 in host byte order) Tue Sep 25 16:15:22 2007 File Server started Tue Sep 25 16:15:22 2007 i tried to follow the tips from different articles using NetInfo and NetRestrict - none worked. i tried to check "code=5376, err=22" from the source-code but i cannot get enough information to find the source of the problem. Please help i'm lost on this one. ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info