Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-29 Thread Stephan Wonczak

  Hi Kanou!

On Sun, 27 Jul 2008, kanou wrote:

snip

Don't be so unpatient, your reluctance to give proper information

I'm sorry and I thought I was giving you all the information I found.


First of all, 2 database-servers are a really bad idea.

Allright. I will change that.


After that, go to your boss and ask for a third database-server.

I did and I will install a third one.


  While more redundancy (i.e. a third database server) is always a good 
idea, it is not strictly necessary, much less 'a bad idea to run with two 
database servers'.
  Christof probably was thinking about the 'split brain' problem, which 
does not come into play with the AFS architecture; we are proof against 
that one. I made a posting about this a while ago; it should be in the 
archives.


Dipl. Chem. Dr. Stephan Wonczak

Zentrum fuer Angewandte Informatik (ZAIK)
Regionales Rechenzentrum der Universitaet zu Koeln (RRZK)
Universitaet zu Koeln, Robert-Koch-Strasse 10, 50931 Koeln
Tel: ++49/(0)221/478-5577, Fax: ++49/(0)221/478-5590

___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-29 Thread Brandon S. Allbery KF8NH


On 2008 Jul 29, at 3:44, Stephan Wonczak wrote:

 While more redundancy (i.e. a third database server) is always a  
good idea, it is not strictly necessary, much less 'a bad idea to  
run with two database servers'.
 Christof probably was thinking about the 'split brain' problem,  
which does not come into play with the AFS architecture; we are  
proof against that one. I made a posting about this a while ago; it  
should be in the archives.



Actually he's thinking about a screw condition that used to happen  
with voting for a sync site if you have 2 database servers and the  
lower-IP-numbered one goes missing.  I *think* it has been fixed now.


--
brandon s. allbery [solaris,freebsd,perl,pugs,haskell] [EMAIL PROTECTED]
system administrator [openafs,heimdal,too many hats] [EMAIL PROTECTED]
electrical and computer engineering, carnegie mellon universityKF8NH


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-29 Thread Harald Barth

While more redundancy (i.e. a third database server) is always a
good idea, it is not strictly necessary, much less 'a bad idea to
run with two database servers'.

Agree. The host with the lowest IP gets an extra half vote, so there is never a 
draw.

But back to your problem. I would tackle it like that: Set up a new DB
host. Wait for it to replicate. Check that replica is OK. (1) Shoot
original down. Erase database on original. Take first one up again and
let it replicate back.

Harald.

(1) If the error replicates we have another problem, but let's first try this, 
ok?
___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-29 Thread Stephan Wonczak

  Hi Brandon!

On Tue, 29 Jul 2008, Brandon S. Allbery KF8NH wrote:



On 2008 Jul 29, at 3:44, Stephan Wonczak wrote:

While more redundancy (i.e. a third database server) is always a good idea, 
it is not strictly necessary, much less 'a bad idea to run with two 
database servers'.
Christof probably was thinking about the 'split brain' problem, which does 
not come into play with the AFS architecture; we are proof against that 
one. I made a posting about this a while ago; it should be in the archives.



Actually he's thinking about a screw condition that used to happen with 
voting for a sync site if you have 2 database servers and the 
lower-IP-numbered one goes missing.  I *think* it has been fixed now.


  Not really a bug. If the lowest numbered DB server goes AWOL the 
remaining server has no chance to become master. I cannot distinguish 
(without human help) wether there is a simple communication problem or if 
the lower server really is down. So it stays in RO-Mode, as it should.
  As an admin you then have the choice to go back to a single-server 
scenario, or bring the other one back online. I do agree however that this 
is not too nice, and you are probably better off with three DB servers.


Dipl. Chem. Dr. Stephan Wonczak

Zentrum fuer Angewandte Informatik (ZAIK)
Regionales Rechenzentrum der Universitaet zu Koeln (RRZK)
Universitaet zu Koeln, Robert-Koch-Strasse 10, 50931 Koeln
Tel: ++49/(0)221/478-5577, Fax: ++49/(0)221/478-5590

___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-28 Thread kanou

Hello,
I'm sorry for being unpatient, but there were a thousand people  
waiting for their files and I was really nervous about the systems'  
crash.


So I used our backups from the last weekend and now the machines are  
running again. Now we are able to create new users and other things  
but there's still the problem with the database.


Could someone please tell me how to rebuild/repair the database  
without loosing all data in it?	


Thanks for your help!
Cheers
kanou


Don't be so unpatient, your reluctance to give proper information

I'm sorry and I thought I was giving you all the information I found.


First of all, 2 database-servers are a really bad idea.

Allright. I will change that.


After that, go to your boss and ask for a third database-server.

I did and I will install a third one.
___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-28 Thread kanou

Hello,
I'm sorry for being unpatient, but there were a thousand people  
waiting for their files and I was really nervous about the systems'  
crash.


So I used our backups from the last weekend and now the machines are  
running again. Now we are able to create new users and other things  
but there's still the problem with the database:


Could someone please tell me how to rebuild/repair the database  
without loosing all data in it?	


Thanks for your help!
Cheers
kanou


Don't be so unpatient, your reluctance to give proper information

I'm sorry and I thought I was giving you all the information I found.


First of all, 2 database-servers are a really bad idea.

Allright. I will change that.


After that, go to your boss and ask for a third database-server.

I did and I will install a third one.
___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-23 Thread kanou

Hello,
well, there is a file called db_verify.c in the folder /usr/src/ 
modules/openafs/ptserver but I don' know how to build it.

Greetings
kanou

Am 22.07.2008 um 22:29 schrieb Derrick Brashear:

There's a db verify tool in src/ptserver; Are you able to build and  
run it?


On Tue, Jul 22, 2008 at 4:15 PM, kanou [EMAIL PROTECTED] wrote:
Hello,
i realy need some help. If I can't get this mess fixed I will be in  
serious trouble.


Our AFS is running for more than 5 years, but I just joined the  
company a few weeks ago - so I'm pretty unexperienced with AFS.


It started last week. There were messages like this:
pts: database needs rebuilding.

So I stopped the server with something like
bos stop -server myserver -instance ptserver and
did something perhaps not pretty good: Made backups of /var/lib/ 
openafs/db and

deleted prdb.DB0 and prdb.DBSYS1.

Restart:
bos create -server myserver -instance ptserver -type simple -cmd / 
usr/lib/openafs/ptserver


That did it. The systems broken.
Starting afsd prints:
afsd: Can't mount AFS on /afs(22)

The logs print:

== /var/log/openafs/FileLog ==
Tue Jul 22 21:43:39 2008 Couldn't get CPS for AnyUser, will try  
again in 30 seconds; code=267275.


and this:

== /var/log/openafs/PtLog ==
ptserver: Unknown code pt 11 (267275) Can't rebuild database because  
it is not empty


So playing back the prdb.DB0 and prdb.DBSYS1 and restarting the  
server didn't help.

Please help me.

Sincerely
kanou
___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info





Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-23 Thread Steven Jenkins
On Wed, Jul 23, 2008 at 10:51 AM, kanou [EMAIL PROTECTED] wrote:
 Hello,
 well, there is a file called db_verify.c in the folder
 /usr/src/modules/openafs/ptserver but I don' know how to build it.

If I recall correctly, db_verify gets renamed to 'prdb_check' during
the install, so you should check for the existence of that file.

If you can't find it, you'll need to build it from source code: the
directions on the AFSLore wiki are a good place to start:

http://www.dementia.org/twiki/bin/view/AFSLore/HowToBuildOpenAFSFromSource

If you have problems building openafs-stable-1_4_x, you could get
openafs-stable-1_4_7 instead, as that is the latest official release.

Once you have built the tree, src/ptserver/db_verify should get built,
so you can simply copy it out of the source tree for your use.  If it
doesn't get built automatically for you, you can cd into src/ptserver
and do a 'make db_verify' manuall.

Also, feel free to ask for help here  or on the irc channel.

Steven Jenkins
End Point Corporation
http://www.endpoint.com/
___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-23 Thread kanou

Thanks for your answer.
Well I found the file prdb_check. It doesnt print any errors. Only  
thing I can find is with

./prdb_check -database /var/lib/openafs/db/prdb.DB0 -uheader -verbose
this line:
Ubik header size is 0 (should be 64)

So there are no errors! I can start the server and everything runs  
fine but the machine wont mount /afs!

kanou

Am 23.07.2008 um 17:26 schrieb Steven Jenkins:


On Wed, Jul 23, 2008 at 10:51 AM, kanou [EMAIL PROTECTED] wrote:

Hello,
well, there is a file called db_verify.c in the folder
/usr/src/modules/openafs/ptserver but I don' know how to build it.


If I recall correctly, db_verify gets renamed to 'prdb_check' during
the install, so you should check for the existence of that file.

If you can't find it, you'll need to build it from source code: the
directions on the AFSLore wiki are a good place to start:

http://www.dementia.org/twiki/bin/view/AFSLore/HowToBuildOpenAFSFromSource

If you have problems building openafs-stable-1_4_x, you could get
openafs-stable-1_4_7 instead, as that is the latest official release.

Once you have built the tree, src/ptserver/db_verify should get built,
so you can simply copy it out of the source tree for your use.  If it
doesn't get built automatically for you, you can cd into src/ptserver
and do a 'make db_verify' manuall.

Also, feel free to ask for help here  or on the irc channel.

Steven Jenkins
End Point Corporation
http://www.endpoint.com/


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-23 Thread kanou

My logs on the second machine tell me:

== /var/log/openafs/FileLog.old ==
Wed Jul 23 19:03:37 2008 File server starting
Wed Jul 23 19:03:37 2008 afs_krb_get_lrealm failed, myserver2.
Wed Jul 23 19:03:37 2008 VL_RegisterAddrs rpc failed; will retry  
periodically (code=5376, err=4)
Wed Jul 23 19:03:37 2008 Couldn't get CPS for AnyUser, will try again  
in 30 seconds; code=267275.


== /var/log/openafs/SalvageLog ==
07/23/2008 19:08:27 SALVAGING OF PARTITION /vicepa COMPLETED

and aklog gives me:
aklog: Couldn't get hrf.uni-koeln.de AFS tickets:
aklog: Cannot contact any KDC for requested realm while getting AFS  
tickets


damn! i did not do anything on that second one!



Just to make sure you're working on the correct file:
As I understand you first deleted the file /var/lib/openafs/db/ 
prdb.DB0.

This file was then probably recreated when you restarted the ptserver.
Run this command on the backupfile you made first (or better on a  
copy of the backup file).


T/Christof

From: [EMAIL PROTECTED] [openafs-info- 
[EMAIL PROTECTED] On Behalf Of kanou [EMAIL PROTECTED]

Sent: Wednesday, July 23, 2008 6:46 PM
To: openafs-info@openafs.org
Subject: Re: [OpenAFS] Serious trouble, mounting /afs, ptserver,  
database rebuilding


Thanks for your answer.
Well I found the file prdb_check. It doesnt print any errors. Only
thing I can find is with
./prdb_check -database /var/lib/openafs/db/prdb.DB0 -uheader -verbose
this line:
Ubik header size is 0 (should be 64)

So there are no errors! I can start the server and everything runs
fine but the machine wont mount /afs!
kanou

Am 23.07.2008 um 17:26 schrieb Steven Jenkins:


On Wed, Jul 23, 2008 at 10:51 AM, kanou [EMAIL PROTECTED] wrote:

Hello,
well, there is a file called db_verify.c in the folder
/usr/src/modules/openafs/ptserver but I don' know how to build it.


If I recall correctly, db_verify gets renamed to 'prdb_check' during
the install, so you should check for the existence of that file.

If you can't find it, you'll need to build it from source code: the
directions on the AFSLore wiki are a good place to start:

http://www.dementia.org/twiki/bin/view/AFSLore/HowToBuildOpenAFSFromSource

If you have problems building openafs-stable-1_4_x, you could get
openafs-stable-1_4_7 instead, as that is the latest official release.

Once you have built the tree, src/ptserver/db_verify should get  
built,

so you can simply copy it out of the source tree for your use.  If it
doesn't get built automatically for you, you can cd into src/ptserver
and do a 'make db_verify' manuall.

Also, feel free to ask for help here  or on the irc channel.

Steven Jenkins
End Point Corporation
http://www.endpoint.com/


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-23 Thread Derrick Brashear
On Wed, Jul 23, 2008 at 1:10 PM, kanou [EMAIL PROTECTED] wrote:

 I did the check on the copy. So well, my first afs-Server just mounted /afs
 but now the second one is in trouble with the same errors and i even cant
 get any kerberos tickets on that machine.


the same errors. mount failed (error 22) I assume?

Can you reboot it? Once mounting fails you need to start with a fresh load
of the kernel module, and it's not going to unload.

I assume the root.afs volume is not on the same server, or, if it is, you're
doing something clever like delaying client start until the server is
running? If not... fix that?


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-23 Thread Hartmut Reuter

kanou wrote:

My logs on the second machine tell me:

== /var/log/openafs/FileLog.old ==
Wed Jul 23 19:03:37 2008 File server starting
Wed Jul 23 19:03:37 2008 afs_krb_get_lrealm failed, myserver2.
Wed Jul 23 19:03:37 2008 VL_RegisterAddrs rpc failed; will retry  
periodically (code=5376, err=4)



code 5376 means no quorum elected. Are you sure your database servers 
are all running?


Try udebug server 7002 for the ptserver
and udebug server 7003 for the vldb

Wed Jul 23 19:03:37 2008 Couldn't get CPS for AnyUser, will try again  
in 30 seconds; code=267275.


== /var/log/openafs/SalvageLog ==
07/23/2008 19:08:27 SALVAGING OF PARTITION /vicepa COMPLETED

and aklog gives me:
aklog: Couldn't get hrf.uni-koeln.de AFS tickets:
aklog: Cannot contact any KDC for requested realm while getting AFS  
tickets


damn! i did not do anything on that second one!



Just to make sure you're working on the correct file:
As I understand you first deleted the file /var/lib/openafs/db/ prdb.DB0.
This file was then probably recreated when you restarted the ptserver.
Run this command on the backupfile you made first (or better on a  
copy of the backup file).


T/Christof

From: [EMAIL PROTECTED] [openafs-info- [EMAIL PROTECTED] 
On Behalf Of kanou [EMAIL PROTECTED]

Sent: Wednesday, July 23, 2008 6:46 PM
To: openafs-info@openafs.org
Subject: Re: [OpenAFS] Serious trouble, mounting /afs, ptserver,  
database rebuilding


Thanks for your answer.
Well I found the file prdb_check. It doesnt print any errors. Only
thing I can find is with
./prdb_check -database /var/lib/openafs/db/prdb.DB0 -uheader -verbose
this line:
Ubik header size is 0 (should be 64)

So there are no errors! I can start the server and everything runs
fine but the machine wont mount /afs!
kanou

Am 23.07.2008 um 17:26 schrieb Steven Jenkins:


On Wed, Jul 23, 2008 at 10:51 AM, kanou [EMAIL PROTECTED] wrote:


Hello,
well, there is a file called db_verify.c in the folder
/usr/src/modules/openafs/ptserver but I don' know how to build it.



If I recall correctly, db_verify gets renamed to 'prdb_check' during
the install, so you should check for the existence of that file.

If you can't find it, you'll need to build it from source code: the
directions on the AFSLore wiki are a good place to start:

http://www.dementia.org/twiki/bin/view/AFSLore/HowToBuildOpenAFSFromSource 



If you have problems building openafs-stable-1_4_x, you could get
openafs-stable-1_4_7 instead, as that is the latest official release.

Once you have built the tree, src/ptserver/db_verify should get  built,
so you can simply copy it out of the source tree for your use.  If it
doesn't get built automatically for you, you can cd into src/ptserver
and do a 'make db_verify' manuall.

Also, feel free to ask for help here  or on the irc channel.

Steven Jenkins
End Point Corporation
http://www.endpoint.com/



___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info



___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info



--
-
Hartmut Reuter  e-mail  [EMAIL PROTECTED]
phone+49-89-3299-1328
fax  +49-89-3299-1301
RZG (Rechenzentrum Garching)webhttp://www.rzg.mpg.de/~hwr
Computing Center of the Max-Planck-Gesellschaft (MPG) and the
Institut fuer Plasmaphysik (IPP)
-
___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-23 Thread kanou

And this:

== /var/log/openafs/PtLog ==
ptserver: Unknown code pt 11 (267275) Can't rebuild database because  
it is not empty



Am 23.07.2008 um 19:10 schrieb kanou:

I did the check on the copy. So well, my first afs-Server just  
mounted /afs but now the second one is in trouble with the same  
errors and i even cant get any kerberos tickets on that machine.



Am 23.07.2008 um 18:52 schrieb Christof Hanke:


Just to make sure you're working on the correct file:
As I understand you first deleted the file /var/lib/openafs/db/ 
prdb.DB0.
This file was then probably recreated when you restarted the  
ptserver.
Run this command on the backupfile you made first (or better on a  
copy of the backup file).


T/Christof

From: [EMAIL PROTECTED] [openafs-info- 
[EMAIL PROTECTED] On Behalf Of kanou [EMAIL PROTECTED]

Sent: Wednesday, July 23, 2008 6:46 PM
To: openafs-info@openafs.org
Subject: Re: [OpenAFS] Serious trouble, mounting /afs, ptserver,  
database rebuilding


Thanks for your answer.
Well I found the file prdb_check. It doesnt print any errors. Only
thing I can find is with
./prdb_check -database /var/lib/openafs/db/prdb.DB0 -uheader -verbose
this line:
Ubik header size is 0 (should be 64)

So there are no errors! I can start the server and everything runs
fine but the machine wont mount /afs!
kanou

Am 23.07.2008 um 17:26 schrieb Steven Jenkins:


On Wed, Jul 23, 2008 at 10:51 AM, kanou [EMAIL PROTECTED] wrote:

Hello,
well, there is a file called db_verify.c in the folder
/usr/src/modules/openafs/ptserver but I don' know how to build it.


If I recall correctly, db_verify gets renamed to 'prdb_check' during
the install, so you should check for the existence of that file.

If you can't find it, you'll need to build it from source code: the
directions on the AFSLore wiki are a good place to start:

http://www.dementia.org/twiki/bin/view/AFSLore/HowToBuildOpenAFSFromSource

If you have problems building openafs-stable-1_4_x, you could get
openafs-stable-1_4_7 instead, as that is the latest official  
release.


Once you have built the tree, src/ptserver/db_verify should get  
built,
so you can simply copy it out of the source tree for your use.  If  
it
doesn't get built automatically for you, you can cd into src/ 
ptserver

and do a 'make db_verify' manuall.

Also, feel free to ask for help here  or on the irc channel.

Steven Jenkins
End Point Corporation
http://www.endpoint.com/


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-23 Thread Steven Jenkins
On Wed, Jul 23, 2008 at 1:30 PM, kanou [EMAIL PROTECTED] wrote:
 And this:

 == /var/log/openafs/PtLog ==
 ptserver: Unknown code pt 11 (267275) Can't rebuild database because it is
 not empty


I suggest dealing with first things first: until your ptserver is up
and running correctly, your fileservers won't be up and running
correctly, and you won't be able to access files.

Could you clarify whether or not providing the -entries option to
prdb_check resulted in output for your database?  If that produces
good output, you can

1- rebuild the protection database
2- restart your ptservers
3- restart your fileservers
4- finally, you can see what problems you encounter accessing files

Steven Jenkins
End Point Corporation
http;//www.endpoint.com/
___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-23 Thread kanou
To: openafs-info@openafs.org
Subject: Re: [OpenAFS] Serious trouble, mounting /afs, ptserver,   
database rebuilding


Thanks for your answer.
Well I found the file prdb_check. It doesnt print any errors. Only
thing I can find is with
./prdb_check -database /var/lib/openafs/db/prdb.DB0 -uheader - 
verbose

this line:
Ubik header size is 0 (should be 64)

So there are no errors! I can start the server and everything runs
fine but the machine wont mount /afs!
kanou

Am 23.07.2008 um 17:26 schrieb Steven Jenkins:


On Wed, Jul 23, 2008 at 10:51 AM, kanou [EMAIL PROTECTED] wrote:


Hello,
well, there is a file called db_verify.c in the folder
/usr/src/modules/openafs/ptserver but I don' know how to build it.



If I recall correctly, db_verify gets renamed to 'prdb_check'  
during

the install, so you should check for the existence of that file.

If you can't find it, you'll need to build it from source code: the
directions on the AFSLore wiki are a good place to start:

http://www.dementia.org/twiki/bin/view/AFSLore/HowToBuildOpenAFSFromSource

If you have problems building openafs-stable-1_4_x, you could get
openafs-stable-1_4_7 instead, as that is the latest official  
release.


Once you have built the tree, src/ptserver/db_verify should get   
built,
so you can simply copy it out of the source tree for your use.   
If it
doesn't get built automatically for you, you can cd into src/ 
ptserver

and do a 'make db_verify' manuall.

Also, feel free to ask for help here  or on the irc channel.

Steven Jenkins
End Point Corporation
http://www.endpoint.com/



___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info

___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info



--
-
Hartmut Reuter  e-mail  [EMAIL PROTECTED]
phone+49-89-3299-1328
fax  +49-89-3299-1301
RZG (Rechenzentrum Garching)webhttp://www.rzg.mpg.de/~hwr
Computing Center of the Max-Planck-Gesellschaft (MPG) and the
Institut fuer Plasmaphysik (IPP)
-


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-23 Thread kanou
 server 7002 for the ptserver
and udebug server 7003 for the vldb

Wed Jul 23 19:03:37 2008 Couldn't get CPS for AnyUser, will try  
again  in 30 seconds; code=267275.

== /var/log/openafs/SalvageLog ==
07/23/2008 19:08:27 SALVAGING OF PARTITION /vicepa COMPLETED
and aklog gives me:
aklog: Couldn't get hrf.uni-koeln.de AFS tickets:
aklog: Cannot contact any KDC for requested realm while getting  
AFS  tickets

damn! i did not do anything on that second one!

Just to make sure you're working on the correct file:
As I understand you first deleted the file /var/lib/openafs/db/  
prdb.DB0.
This file was then probably recreated when you restarted the  
ptserver.
Run this command on the backupfile you made first (or better on  
a  copy of the backup file).


T/Christof

From: [EMAIL PROTECTED] [openafs-info- [EMAIL PROTECTED] 
] On Behalf Of kanou [EMAIL PROTECTED]

Sent: Wednesday, July 23, 2008 6:46 PM
To: openafs-info@openafs.org
Subject: Re: [OpenAFS] Serious trouble, mounting /afs, ptserver,   
database rebuilding


Thanks for your answer.
Well I found the file prdb_check. It doesnt print any errors. Only
thing I can find is with
./prdb_check -database /var/lib/openafs/db/prdb.DB0 -uheader - 
verbose

this line:
Ubik header size is 0 (should be 64)

So there are no errors! I can start the server and everything runs
fine but the machine wont mount /afs!
kanou

Am 23.07.2008 um 17:26 schrieb Steven Jenkins:


On Wed, Jul 23, 2008 at 10:51 AM, kanou [EMAIL PROTECTED] wrote:


Hello,
well, there is a file called db_verify.c in the folder
/usr/src/modules/openafs/ptserver but I don' know how to build  
it.



If I recall correctly, db_verify gets renamed to 'prdb_check'  
during

the install, so you should check for the existence of that file.

If you can't find it, you'll need to build it from source code:  
the

directions on the AFSLore wiki are a good place to start:

http://www.dementia.org/twiki/bin/view/AFSLore/HowToBuildOpenAFSFromSource

If you have problems building openafs-stable-1_4_x, you could get
openafs-stable-1_4_7 instead, as that is the latest official  
release.


Once you have built the tree, src/ptserver/db_verify should get   
built,
so you can simply copy it out of the source tree for your use.   
If it
doesn't get built automatically for you, you can cd into src/ 
ptserver

and do a 'make db_verify' manuall.

Also, feel free to ask for help here  or on the irc channel.

Steven Jenkins
End Point Corporation
http://www.endpoint.com/



___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info

___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info



--
-
Hartmut Reuter  e-mail  [EMAIL PROTECTED]
phone+49-89-3299-1328
fax  +49-89-3299-1301
RZG (Rechenzentrum Garching)webhttp://www.rzg.mpg.de/~hwr
Computing Center of the Max-Planck-Gesellschaft (MPG) and the
Institut fuer Plasmaphysik (IPP)
-


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


Re: [OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-23 Thread kanou
:37 2008 afs_krb_get_lrealm failed, myserver2.
Wed Jul 23 19:03:37 2008 VL_RegisterAddrs rpc failed; will retry   
periodically (code=5376, err=4)



code 5376 means no quorum elected. Are you sure your database  
servers are all running?


Try udebug server 7002 for the ptserver
and udebug server 7003 for the vldb

Wed Jul 23 19:03:37 2008 Couldn't get CPS for AnyUser, will try  
again  in 30 seconds; code=267275.

== /var/log/openafs/SalvageLog ==
07/23/2008 19:08:27 SALVAGING OF PARTITION /vicepa COMPLETED
and aklog gives me:
aklog: Couldn't get hrf.uni-koeln.de AFS tickets:
aklog: Cannot contact any KDC for requested realm while getting  
AFS  tickets

damn! i did not do anything on that second one!

Just to make sure you're working on the correct file:
As I understand you first deleted the file /var/lib/openafs/db/  
prdb.DB0.
This file was then probably recreated when you restarted the  
ptserver.
Run this command on the backupfile you made first (or better on  
a  copy of the backup file).


T/Christof

From: [EMAIL PROTECTED] [openafs-info- [EMAIL PROTECTED] 
] On Behalf Of kanou [EMAIL PROTECTED]

Sent: Wednesday, July 23, 2008 6:46 PM
To: openafs-info@openafs.org
Subject: Re: [OpenAFS] Serious trouble, mounting /afs,  
ptserver,  database rebuilding


Thanks for your answer.
Well I found the file prdb_check. It doesnt print any errors. Only
thing I can find is with
./prdb_check -database /var/lib/openafs/db/prdb.DB0 -uheader - 
verbose

this line:
Ubik header size is 0 (should be 64)

So there are no errors! I can start the server and everything runs
fine but the machine wont mount /afs!
kanou

Am 23.07.2008 um 17:26 schrieb Steven Jenkins:


On Wed, Jul 23, 2008 at 10:51 AM, kanou [EMAIL PROTECTED] wrote:


Hello,
well, there is a file called db_verify.c in the folder
/usr/src/modules/openafs/ptserver but I don' know how to build  
it.



If I recall correctly, db_verify gets renamed to 'prdb_check'  
during

the install, so you should check for the existence of that file.

If you can't find it, you'll need to build it from source code:  
the

directions on the AFSLore wiki are a good place to start:

http://www.dementia.org/twiki/bin/view/AFSLore/HowToBuildOpenAFSFromSource

If you have problems building openafs-stable-1_4_x, you could get
openafs-stable-1_4_7 instead, as that is the latest official  
release.


Once you have built the tree, src/ptserver/db_verify should  
get  built,
so you can simply copy it out of the source tree for your use.   
If it
doesn't get built automatically for you, you can cd into src/ 
ptserver

and do a 'make db_verify' manuall.

Also, feel free to ask for help here  or on the irc channel.

Steven Jenkins
End Point Corporation
http://www.endpoint.com/



___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info

___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info



--
-
Hartmut Reuter  e-mail  [EMAIL PROTECTED]
phone+49-89-3299-1328
fax  +49-89-3299-1301
RZG (Rechenzentrum Garching)webhttp://www.rzg.mpg.de/~hwr
Computing Center of the Max-Planck-Gesellschaft (MPG) and the
Institut fuer Plasmaphysik (IPP)
-


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


[OpenAFS] Serious trouble, mounting /afs, ptserver, database rebuilding

2008-07-22 Thread kanou

Hello,
i realy need some help. If I can't get this mess fixed I will be in  
serious trouble.


Our AFS is running for more than 5 years, but I just joined the  
company a few weeks ago - so I'm pretty unexperienced with AFS.


It started last week. There were messages like this:
pts: database needs rebuilding.

So I stopped the server with something like
bos stop -server myserver -instance ptserver and
did something perhaps not pretty good: Made backups of /var/lib/ 
openafs/db and

deleted prdb.DB0 and prdb.DBSYS1.

Restart:
bos create -server myserver -instance ptserver -type simple -cmd /usr/ 
lib/openafs/ptserver


That did it. The systems broken.
Starting afsd prints:
afsd: Can't mount AFS on /afs(22)

The logs print:

== /var/log/openafs/FileLog ==
Tue Jul 22 21:43:39 2008 Couldn't get CPS for AnyUser, will try again  
in 30 seconds; code=267275.


and this:

== /var/log/openafs/PtLog ==
ptserver: Unknown code pt 11 (267275) Can't rebuild database because  
it is not empty


So playing back the prdb.DB0 and prdb.DBSYS1 and restarting the server  
didn't help.

Please help me.

Sincerely
kanou
___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info