Re: [OpenAFS] Tokens seem to be automatically discarded
On Friday 11 April 2008, Derrick Brashear wrote: > "gcpags"; you can disable it several ways, including by compiling > src/venus/gcpags.c into gcpags and running it. > Thanks! Gunther -- ________ Hans-Gunther Borrmann <[EMAIL PROTECTED]> Rechenzentrum der Universitaet Freiburg Hermann-Herder-Str. 10, D79104 FREIBURG Tel.: +49 761/203-4652 Fax: +49 761/203-4643 ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info
[OpenAFS] Tokens seem to be automatically discarded
Hello, I encountered a major problem (for me at least) after migrating from client version 1.4.0 to 1.4.6. Tokens bound by uid (there is no PAG) seem to disappear under still unknown conditions. These tokens are generated and renewed by one single process and are used by different processes which are spawned independently from time to time. No users log in with these uids. Is there some sort of garbage collection of "unused" tokens? Any help is appreciated. Gunther
[OpenAFS] AIX 5.3 ML03: afs_dynamic_auth
Hello all, the version of afs_dynamic_auth supplied in rs_aix53.tar.gz leads to a serious problem. Commands which convert uids to names segfault, e.g. "id" or "ls -l". Everything works well with the IBM version of the module installed. Is there already a solution, or any chance of getting this fixed? Thanks in advance for any help. -- Hans-Gunther Borrmann
Re: [OpenAFS] Integrated Login on AIX 5.3
On Wednesday 22 February 2006 16:45, Jay Compton wrote:
> Do you make any modifications to your systems after a blank install other
> than installing the latest tarball (1.4.0) from OpenAFS and the changes to
> /usr/lib/security/methods.cfg and /etc/security/user? I've made all these
> changes to no avail so something must be different on my installation even
> though I tried a completely new AIX install. No matter what I do AIX
> doesn't seem to recognize the users from a login prompt, it's as if they
> don't exist.

Hello Jay, I did not make any modifications other than those I described. I am running AIX 5.3 ML03 with the 32-bit kernel. What error message do you get during a login attempt?
-- Hans-Gunther Borrmann
Re: [OpenAFS] Fileserver very busy
On Tuesday 21 February 2006 15:00, Jeffrey Altman wrote:
> Hans-Gunther Borrmann wrote:
> > On Saturday 18 February 2006 16:02, Jeffrey Altman wrote:
> >> What versions are the clients and servers?
> >
> > Client: OpenAFS 1.3.71 built 2004-09-20
> > Server: OpenAFS 1.3.77 built 2005-01-18
>
> 1.3.77 is a development build that was released in December 2004.
>
> 1.3.71 is a development build that was released in August 2004.
>
> Since 1.3.77 there have been 554 non-Windows commits and 255 Windows
> commits. There have been numerous bugs fixed since then. You might
> want to consider using something a bit newer.
>
> time 363.764627, pid 35349: Returning code 2 from 11
> time 363.764628, pid 35349: strategy done vp 0x38b251f0 code 0xd left 0x1000
> time 363.764925, pid 34833: Analyze RPC op 2 conn 0x39327a00 code 0xd user 0x4114
>
> This sequence indicates a FetchStatus operation being denied due to an
> EACCESS error. In other words, your client is attempting to read some
> that it doesn't have permission to access.
>
> Jeffrey Altman

Thanks for this information. I certainly should upgrade. But the servers are IBM pSeries machines, and OpenAFS is not so well supported under AIX, so I am always glad to have working servers and clients and do not upgrade before there is a real need to do so. I do not want to blame anyone: I realize that AIX is not widely used, is not a priority, and that manpower is scarce. My answer is delayed because I was out of the office for 10 days. Thanks for your support.
Gunther Borrmann
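The quoted reply reads the hexadecimal status "code 0xd" in the trace as an access-denied error. A minimal sketch of that decoding step follows, assuming the hex codes are plain errno values (0xd is 13, which is EACCES on POSIX systems). The function name `decode_codes` is made up for illustration and is not part of fstrace or OpenAFS; the decimal "Returning code 2 from 11" values are deliberately left alone because the thread does not say how to interpret them.

```python
import errno
import re

# Match only hexadecimal "code 0x..." fields, as they appear in the trace lines.
CODE_RE = re.compile(r"\bcode (0x[0-9a-fA-F]+)\b")


def decode_codes(trace_line):
    """Return (value, errno name) pairs for each hex code field in one trace line."""
    decoded = []
    for raw in CODE_RE.findall(trace_line):
        value = int(raw, 16)
        decoded.append((value, errno.errorcode.get(value, "unknown")))
    return decoded


if __name__ == "__main__":
    line = ("time 363.764925, pid 34833: Analyze RPC op 2 "
            "conn 0x39327a00 code 0xd user 0x4114")
    print(decode_codes(line))  # [(13, 'EACCES')]
```

Only the field that follows the word "code" is decoded; the connection and user values on the same line are not status codes and are ignored.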
Re: [OpenAFS] Integrated Login on AIX 5.3
On Tuesday 21 February 2006 15:36, Jay Compton wrote:
> Hey Hans,
>
> Thanks for the reply! So, the main question I have is, aside from the uid
> bug you located, do integrated logins work for you in ML03? That's my main
> problem, I could even help chase the bug down if I could get logins working
> on my system. I will try using your settings below and see if things work
> for me. Are you running the latest version of OpenAFS as well?

Yes, apart from the bug it works. People can log in and get a token. I have installed rs_aix53.tar.gz from the latest release on openafs.org. In fact, all my users are authenticated by AFS.
-- Hans-Gunther Borrmann
Re: [OpenAFS] Integrated Login on AIX 5.3
On Friday 17 February 2006 15:44, Jay Compton wrote:
> Hello,
>
> I am currently running the latest stable build of OpenAFS (1.4.0) on AIX
> 5.3, maintenance level 3. The afsd daemon starts up and runs fine, but I
> can't get integrated login working. I have setup the /etc/security files as
> the documentation suggests to no avail thus far. I was wondering if anyone
> had successfully setup logins using AIX's default authentication grammar or
> through PAM, which I am trying to look into as well.
>
> Any help would be greatly appreciated, thanks!

It should work with the following configuration, TRANSARC paths assumed:

/usr/lib/security/methods.cfg:

    AFS:
            program = /usr/vice/etc/afs_dynamic_auth
            options = authonly

For a user to be authenticated by AFS, define the following in the user's stanza in the /etc/security/user file:

    SYSTEM=AFS
    registry=files

This worked for me with AIX 5.3 ML01. Under AIX 5.3 ML03, commands which convert a uid to a name segfault. I have opened a bug report on this. Example: "ls somedir" works, "ls -l somedir" segfaults.
-- Hans-Gunther Borrmann
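The configuration described above can be sanity-checked before a login attempt. What follows is only a sketch: it assumes the usual AIX stanza layout (a name ending in a colon, followed by indented `key = value` lines, with `*` starting a comment line), and the helper names `parse_stanzas` and `afs_method_ok` are hypothetical, not part of AIX or OpenAFS.

```python
def parse_stanzas(text):
    """Parse AIX-style stanza text into {stanza_name: {key: value}}."""
    stanzas = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("*"):  # assumption: '*' marks a comment line
            continue
        if line.endswith(":") and "=" not in line:
            current = stanzas.setdefault(line[:-1], {})
        elif "=" in line and current is not None:
            key, value = line.split("=", 1)
            current[key.strip()] = value.strip()
    return stanzas


def afs_method_ok(methods_cfg_text):
    """Check that the AFS stanza names afs_dynamic_auth with the authonly option."""
    afs = parse_stanzas(methods_cfg_text).get("AFS", {})
    return (afs.get("program", "").endswith("/afs_dynamic_auth")
            and afs.get("options") == "authonly")


# The stanza from the message, TRANSARC paths assumed.
METHODS_CFG = """\
AFS:
        program = /usr/vice/etc/afs_dynamic_auth
        options = authonly
"""

if __name__ == "__main__":
    print(afs_method_ok(METHODS_CFG))
```

On a real system the text would be read from /usr/lib/security/methods.cfg; the same parser could be pointed at the user stanza to confirm the SYSTEM and registry attributes.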
Re: [OpenAFS] Fileserver very busy
On Saturday 18 February 2006 16:02, Jeffrey Altman wrote:
> What versions are the clients and servers?

Client: OpenAFS 1.3.71 built 2004-09-20
Server: OpenAFS 1.3.77 built 2005-01-18

> Is the client mobile?

No.

> Or behind a NAT?

No.

> Or perhaps using a transient VPN?

No. Client and server each have two network interfaces. One interface of each is connected to the same switch. The other interfaces are not connected to the same switch but are in the same subnet. I use this network setup on all my machines.
-- Hans-Gunther Borrmann
Re: [OpenAFS] Gentoo amd64: OpenAFS 1.4.1-rc6
On Friday 10 February 2006 18:51, Stefaan wrote:
> No compilation problem at all on my 2.6.15-gentoo-r2 on amd64. I think
> vanilla-sources-2.6.12.5 should work as well, but
> vanilla-sources-2.6.11.12 seems not to provide the symbol you're
> missing.
> Guess you're stuck somewhere in between those last two, before the
> symbol was introduced.
>
> Stefaan

I have upgraded the kernel to:

Linux sv4 2.6.15-gentoo-r5 #1 SMP Wed Feb 15 16:19:18 MET 2006 x86_64 AMD Opteron(tm) Processor 246 AuthenticAMD GNU/Linux

But I still cannot compile openafs-1.4.1-rc6:

In file included from /root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.15-gentoo-r5-MP/osi_module.c:42:
include/linux/seq_file.h:43: warning: `printk' is an unrecognized format function type
/root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.15-gentoo-r5-MP/osi_module.c: In function `afs_ioctl':
/root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.15-gentoo-r5-MP/osi_module.c:294: error: `TIF_32BIT' undeclared (first use in this function)
/root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.15-gentoo-r5-MP/osi_module.c:294: error: (Each undeclared identifier is reported only once
/root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.15-gentoo-r5-MP/osi_module.c:294: error: for each function it appears in.)
make[6]: *** [/root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.15-gentoo-r5-MP/osi_module.o] Error 1

Any ideas?

Gunther
[OpenAFS] Fileserver very busy
Hello, I observe that one of my fileservers is very busy over long periods of time, several days or even a week. tcpdump on the fileserver shows only the following traffic:

16:09:12.981900338 server.afs3-fileserver > client.afs3-callback: udp 66
16:09:12.982000367 server.afs3-fileserver > client.afs3-callback: udp 32
16:09:12.982190070 client.afs3-callback > server.afs3-fileserver: udp 44
16:09:12.982315764 server.afs3-fileserver > client.afs3-callback: udp 32
16:09:12.982498415 client.afs3-callback > server.afs3-fileserver: udp 44
16:09:12.982622973 server.afs3-fileserver > client.afs3-callback: udp 32
16:09:12.982808396 client.afs3-callback > server.afs3-fileserver: udp 44
16:09:12.982934593 server.afs3-fileserver > client.afs3-callback: udp 32
16:09:12.983123949 client.afs3-callback > server.afs3-fileserver: udp 44
16:09:12.983247845 server.afs3-fileserver > client.afs3-callback: udp 32
16:09:12.983437670 client.afs3-callback > server.afs3-fileserver: udp 44

fstrace on the client shows only the following:

time 363.764627, pid 35349: Returning code 2 from 11
time 363.764628, pid 35349: strategy done vp 0x38b251f0 code 0xd left 0x1000
time 363.764925, pid 34833: Analyze RPC op 2 conn 0x39327a00 code 0xd user 0x4114
time 363.764927, pid 34833: Returning code 2 from 11
time 363.764928, pid 34833: strategy done vp 0x38b251f0 code 0xd left 0x1000
time 363.765223, pid 35607: Analyze RPC op 2 conn 0x39327a00 code 0xd user 0x4114
time 363.765224, pid 35607: Returning code 2 from 11
time 363.765225, pid 35607: strategy done vp 0x38b251f0 code 0xd left 0x1000
time 363.765524, pid 34575: Analyze RPC op 2 conn 0x39327a00 code 0xd user 0x4114
time 363.765526, pid 34575: Returning code 2 from 11
time 363.765527, pid 34575: strategy done vp 0x38b251f0 code 0xd left 0x1000
time 363.765823, pid 34317: Analyze RPC op 2 conn 0x39327a00 code 0xd user 0x4114
time 363.765825, pid 34317: Returning code 2 from 11
time 363.765826, pid 34317: strategy done vp 0x38b251f0 code 0xd left 0x1000
time 363.766124, pid 35091: Analyze RPC op 2 conn 0x39327a00 code 0xd user 0x4114
time 363.766126, pid 35091: Returning code 2 from 11
time 363.766127, pid 35091: strategy done vp 0x38b251f0 code 0xd left 0x1000
time 363.766425, pid 35349: Analyze RPC op 2 conn 0x39327a00 code 0xd user 0x4114
time 363.766427, pid 35349: Returning code 2 from 11
time 363.766428, pid 35349: strategy done vp 0x38b251f0 code 0xd left 0x1000
time 363.766723, pid 34833: Analyze RPC op 2 conn 0x39327a00 code 0xd user 0x4114
time 363.766725, pid 34833: Returning code 2 from 11
time 363.766726, pid 34833: strategy done vp 0x38b251f0 code 0xd left 0x1000
time 363.767028, pid 35607: Analyze RPC op 2 conn 0x39327a00 code 0xd user 0x4114

What is going on? Thanks for any hints.
-- Hans-Gunther Borrmann
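The capture in this message is a tight loop of small UDP packets between the same two endpoints. A short sketch for summarizing such a capture is given here; it assumes tcpdump lines in exactly the "time src > dst: udp size" shape shown in the message, and the names `LINE_RE` and `summarize` are invented for this example.

```python
import re
from collections import Counter

# time, source endpoint, destination endpoint, UDP payload size
LINE_RE = re.compile(r"^(\S+) (\S+) > (\S+): udp (\d+)$")


def summarize(lines):
    """Count packets per (source, destination, payload size)."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match:
            _timestamp, src, dst, size = match.groups()
            counts[(src, dst, int(size))] += 1
    return counts


# First four packets of the capture in the message.
SAMPLE = [
    "16:09:12.981900338 server.afs3-fileserver > client.afs3-callback: udp 66",
    "16:09:12.982000367 server.afs3-fileserver > client.afs3-callback: udp 32",
    "16:09:12.982190070 client.afs3-callback > server.afs3-fileserver: udp 44",
    "16:09:12.982315764 server.afs3-fileserver > client.afs3-callback: udp 32",
]

if __name__ == "__main__":
    for (src, dst, size), count in sorted(summarize(SAMPLE).items()):
        print(f"{src} -> {dst} udp {size}: {count} packet(s)")
```

Run over a longer capture, a summary like this makes it easy to see whether a single client accounts for the load.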
[OpenAFS] Gentoo amd64: OpenAFS 1.4.1-rc6
Hello, I just tried to compile 1.4.1-rc6 under Gentoo. The compilation stops with the following error(s): In file included from /root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.12-MP/osi_module.c:42: include/linux/seq_file.h:43: warning: `printk' is an unrecognized format function type /root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.12-MP/osi_module.c: In function `afs_ioctl': /root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.12-MP/osi_module.c:294: error: `TIF_32BIT' undeclared (first use in this function) /root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.12-MP/osi_module.c:294: error: (Each undeclared identifier is reported only once /root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.12-MP/osi_module.c:294: error: for each function it appears in.) make[6]: *** [/root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.12-MP/osi_module.o] Error 1 make[5]: *** [_module_/root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.12-MP] Error 2 make[5]: Leaving directory `/usr/src/linux-2.6.12' make[4]: *** [libafs.ko] Error 2 make[4]: Leaving directory `/root/openafs/openafs-1.4.1-rc6/src/libafs/MODLOAD-2.6.12-MP' make[3]: *** [linux_compdirs] Error 2 make[3]: Leaving directory `/root/openafs/openafs-1.4.1-rc6/src/libafs' make[2]: *** [libafs] Error 2 make[2]: Leaving directory `/root/openafs/openafs-1.4.1-rc6' make[1]: *** [build] Error 2 make[1]: Leaving directory `/root/openafs/openafs-1.4.1-rc6' make: *** [all] Error 2 System Information: vanilla kernel Portage 2.0.54 (default-linux/amd64/2005.0, gcc-3.4.4, glibc-2.3.5-r2, 2.6.12 x86_64) = System uname: 2.6.12 x86_64 AMD Opteron(tm) Processor 246 Gentoo Base System version 1.6.14 ccache version 2.3 [disabled] dev-lang/python: 2.2.3, 2.3.4-r1, 2.4.2 sys-apps/sandbox:1.2.12 sys-devel/autoconf: 2.13, 2.59-r6 sys-devel/automake: 1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r1 sys-devel/binutils: 2.16.1 sys-devel/libtool: 1.5.22 virtual/os-headers: 2.6.11-r2 ACCEPT_KEYWORDS="amd64" AUTOCLEAN="yes" 
CBUILD="x86_64-pc-linux-gnu" CFLAGS="-march=athlon64 -O2 -pipe" CHOST="x86_64-pc-linux-gnu" CONFIG_PROTECT="/etc /usr/kde/2/share/config /usr/kde/3/share/config /usr/lib/X11/xkb /usr/share/config /var/qmail/control" CONFIG_PROTECT_MASK="/etc/gconf /etc/terminfo /etc/env.d" CXXFLAGS="-march=athlon64 -O2 -pipe" DISTDIR="/usr/portage/distfiles" FEATURES="autoconfig distlocks sandbox sfperms strict" GENTOO_MIRRORS="http://pandemonium.tiscali.de/pub/gentoo/"; PKGDIR="/usr/portage/packages" PORTAGE_TMPDIR="/var/tmp" PORTDIR="/usr/portage" PORTDIR_OVERLAY="/usr/local/portage" SYNC="rsync://rsync.gentoo.org/gentoo-portage" USE="amd64 X acl alsa avi berkdb bitmap-fonts bzip2 cdb crypt cups curl eds emboss encode expat f77 foomaticdb fortran gd gdbm gif gnome gpm gstreamer gtk gtk2 imagemagick imlib ipv6 java jpeg kde ldap lzw lzw-tiff mhash motif mp3 mpeg mysql ncurses nls opengl pam pcre pdflib perl php png python qt quicktime readline sdl slang spell ssl tcpd tiff truetype truetype-fonts type1-fonts udev usb userlocales xml2 xpm xv zlib userland_GNU kernel_linux elibc_glibc" Unset: ASFLAGS, CTARGET, LANG, LC_ALL, LDFLAGS, LINGUAS, MAKEOPTS Any help is appreciated -- Hans-Gunther Borrmann <[EMAIL PROTECTED]> Rechenzentrum der Universitaet Freiburg Hermann-Herder-Str. 10, D79104 FREIBURG Tel.: +49 761/203-4652 Fax: +49 761/203-4643 ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info
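The build fails because `TIF_32BIT` is undeclared, and the reply in this thread suggests that older kernel sources simply do not provide the symbol. One way to check a given source tree is to search its headers for the definition. This is a sketch under stated assumptions: the function name `find_define` is hypothetical, and the path in the usage comment is taken from the make output above ("Leaving directory `/usr/src/linux-2.6.12'").

```python
import os
import re


def find_define(include_root, symbol):
    """Return (path, line number) for every '#define <symbol>' under include_root."""
    pattern = re.compile(r"^\s*#\s*define\s+" + re.escape(symbol) + r"\b")
    hits = []
    for dirpath, _dirnames, filenames in os.walk(include_root):
        for name in sorted(filenames):
            if not name.endswith(".h"):
                continue
            path = os.path.join(dirpath, name)
            with open(path, errors="replace") as header:
                for lineno, line in enumerate(header, 1):
                    if pattern.match(line):
                        hits.append((path, lineno))
    return hits


if __name__ == "__main__":
    # Example path from the build log; adjust to the tree the module is built against.
    for path, lineno in find_define("/usr/src/linux-2.6.12/include", "TIF_32BIT"):
        print(f"{path}:{lineno}")
```

An empty result for the tree the module is compiled against would be consistent with the "undeclared" error.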
[OpenAFS] Gentoo amd64: Each process attempting to access a certain directory is blocked
Hello all,

what I did:

    cd ~                             # my home in AFS
    tar -cvf tars/tar.tar .backup    # .backup is the mount point of the backup volume

After some time the tar hangs. Thereafter each process attempting to access the directory ~/tar hangs. It is impossible to terminate the processes in any way. In the meantime the directory remains accessible as normal from other AFS clients. Only a reboot resolves the problem. This in fact makes one of our web servers unusable, because the same happens with directories that PHP scripts access. Within a short time several hundred scripts are hanging and the whole server stops serving until it is rebooted.

I have been running OpenAFS 1.3.85 very successfully for several months on the same hardware under
Portage 2.0.53 (default-linux/amd64/2004.3, gcc-3.4.3, glibc-2.3.4.20041102-r1, 2.6.12 x86_64)
System uname: 2.6.12 x86_64 AMD Opteron(tm) Processor 246

Affected System Information

OpenAFS Version:
* net-fs/openafs
Available versions: *1.2.10-r1 !1.2.13-r2 1.4.0 1.4.0-r1 1.4.0-r2
Installed: 1.4.0-r2
Homepage: http://www.openafs.org/
Description: The OpenAFS distributed file system

System Information: vanilla kernel
Portage 2.0.54 (default-linux/amd64/2005.0, gcc-3.4.4, glibc-2.3.5-r2, 2.6.12 x86_64)
System uname: 2.6.12 x86_64 AMD Opteron(tm) Processor 246
Gentoo Base System version 1.6.14
ccache version 2.3 [disabled]
dev-lang/python: 2.2.3, 2.3.4-r1, 2.4.2
sys-apps/sandbox: 1.2.12
sys-devel/autoconf: 2.13, 2.59-r6
sys-devel/automake: 1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r1
sys-devel/binutils: 2.16.1
sys-devel/libtool: 1.5.22
virtual/os-headers: 2.6.11-r2
ACCEPT_KEYWORDS="amd64"
AUTOCLEAN="yes"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-march=athlon64 -O2 -pipe"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/kde/2/share/config /usr/kde/3/share/config /usr/lib/X11/xkb /usr/share/config /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/terminfo /etc/env.d"
CXXFLAGS="-march=athlon64 -O2 -pipe"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoconfig
distlocks sandbox sfperms strict" GENTOO_MIRRORS="http://pandemonium.tiscali.de/pub/gentoo/"; PKGDIR="/usr/portage/packages" PORTAGE_TMPDIR="/var/tmp" PORTDIR="/usr/portage" PORTDIR_OVERLAY="/usr/local/portage" SYNC="rsync://rsync.gentoo.org/gentoo-portage" USE="amd64 X acl alsa avi berkdb bitmap-fonts bzip2 cdb crypt cups curl eds emboss encode expat f77 foomaticdb fortran gd gdbm gif gnome gpm gstreamer gtk gtk2 imagemagick imlib ipv6 java jpeg kde ldap lzw lzw-tiff mhash motif mp3 mpeg mysql ncurses nls opengl pam pcre pdflib perl php png python qt quicktime readline sdl slang spell ssl tcpd tiff truetype truetype-fonts type1-fonts udev usb userlocales xml2 xpm xv zlib userland_GNU kernel_linux elibc_glibc" Unset: ASFLAGS, CTARGET, LANG, LC_ALL, LDFLAGS, LINGUAS, MAKEOPTS -- Hans-Gunther Borrmann <[EMAIL PROTECTED]> Rechenzentrum der Universitaet Freiburg Hermann-Herder-Str. 10, D79104 FREIBURG Tel.: +49 761/203-4652 Fax: +49 761/203-4643 ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info
[OpenAFS] AIX 5.3 binary version of OpenAFS
Hello all, is the server in the AIX 5.3 binary distribution rs_aix53.tar.gz built with large file support? If not, why not (problems?)? I ask because I had serious problems with one of the 1.3.x releases when I compiled the server with large file support. Is anyone running the 1.4.0 server under AIX 5.x with large file support enabled? Thanks in advance for any answers. -- Hans-Gunther Borrmann
[OpenAFS] 1.4.0 rc7 under AIX 5.3
Hello all, to give some feedback: the client itself works very well. It compiles and runs without problems. The AIX loadable authentication module afs_dynamic_auth is not usable. If a user logs in and is authenticated by this module, all commands he runs segfault. I have not yet tested the server. -- Hans-Gunther Borrmann
Re: [OpenAFS] Still Kernel Bug message and client hang
On Wednesday 13 July 2005 16:49, Derrick J Brashear wrote:
> If 1.3.85 proves as stable as we believe and hope it will, it will form
> the basis of the 1.4.0-rc1 in the next week anyway.

Just for feedback: I have now been running 1.3.85 for nearly 6 days on the university webserver without any problems. Thanks to all who develop OpenAFS.
-- Hans-Gunther Borrmann
Re: [OpenAFS] Cross Domain Auth & OpenAFS use
On Friday 15 July 2005 10:11, Lars Schimmer wrote:
> And another small problem: the root.afs is mounted rl, is there a easy way
> to make a new entry under /afs instead of removing all root.cell.readonly
> and make the changes, or not?

Mount root.afs read-write under your AFS home directory, make the changes, and then run vos release.
Gunther
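The advice above amounts to four steps: mount the read-write root.afs somewhere writable, add the new mount point inside it, remove the temporary mount, and release the volume. The sketch below only builds the command lines and does not run them. It assumes the standard `fs mkmount`, `fs rmmount`, and `vos release` commands; the scratch directory, entry name, and volume name are placeholders, and `root_afs_edit_commands` is a made-up helper.

```python
import shlex


def root_afs_edit_commands(scratch_dir, new_entry, volume):
    """Build the command sequence for adding one mount point to root.afs."""
    return [
        # Temporarily mount the read-write root.afs volume at scratch_dir.
        ["fs", "mkmount", "-dir", scratch_dir, "-vol", "root.afs", "-rw"],
        # Create the new entry inside the read-write copy.
        ["fs", "mkmount", "-dir", f"{scratch_dir}/{new_entry}", "-vol", volume],
        # Remove the temporary mount point again.
        ["fs", "rmmount", "-dir", scratch_dir],
        # Push the change to the read-only replicas.
        ["vos", "release", "-id", "root.afs"],
    ]


if __name__ == "__main__":
    # All three arguments are illustrative placeholders.
    for command in root_afs_edit_commands("rootafs-rw", "newentry", "some.volume"):
        print(shlex.join(command))
```

Printing the commands first gives a chance to review them before running anything against a production cell.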
Re: [OpenAFS] Still Kernel Bug message and client hang
On Wednesday 13 July 2005 17:43, chas williams - CONTRACTOR wrote:
> that's a stupid debugging message i left in the code. sorry. it
> can safely be ignored. and removed from the source.

Where is the message? I cannot leave AFS running because my /var fills up.
Gunther
Re: [OpenAFS] Still Kernel Bug message and client hang
On Wednesday 13 July 2005 16:20, Derrick J Brashear wrote: > So dsid you mention openafs version and i just didn't see it? if the > version was less than 1.3.85, well, upgrade. > I upgraded to 1.3.85. Now /var/log/messages very quickly fills up with the following message: check_bad_parent(www): bad parent vcp->mvid->Fid.Volume != pvc->fid.Fid.Volume Gunther -- ________ Hans-Gunther Borrmann <[EMAIL PROTECTED]> Rechenzentrum der Universitaet Freiburg Hermann-Herder-Str. 10, D79104 FREIBURG Tel.: +49 761/203-4652 Fax: +49 761/203-4643 ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info
Re: [OpenAFS] Still Kernel Bug message and client hang
On Wednesday 13 July 2005 16:52, chas williams - CONTRACTOR wrote: > is this 1.3.84? > Yes. I'm just installing 1.3.85 as Derrick advised. -- ____ Hans-Gunther Borrmann <[EMAIL PROTECTED]> Rechenzentrum der Universitaet Freiburg Hermann-Herder-Str. 10, D79104 FREIBURG Tel.: +49 761/203-4652 Fax: +49 761/203-4643 ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info
Re: [OpenAFS] Still Kernel Bug message and client hang
On Wednesday 13 July 2005 16:20, Derrick J Brashear wrote:
> So dsid you mention openafs version and i just didn't see it?

Yes. It is 1.3.84.

> if the version was less than 1.3.85, well, upgrade.

I'll do so as quickly as possible. Thanks!
Gunther
[OpenAFS] Skipping Already locked vcp
Hello, what does the following error message mean: AFS_VMA_CLOSE(30246): Skipping Already locked vcp=810075b85e48 vmap=810075b85e50 It is repeated six times in /var/log/messages. I see it on the client which crashes from time to time. See my previous posting. Gunther -- Hans-Gunther Borrmann <[EMAIL PROTECTED]> Rechenzentrum der Universitaet Freiburg Hermann-Herder-Str. 10, D79104 FREIBURG Tel.: +49 761/203-4652 Fax: +49 761/203-4643 ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info
[OpenAFS] Still Kernel Bug message and client hang
Hello, I still see the following Kernel Bug message in /var/log/messages. Thereafter any process trying to access a file, local or not, hangs. The machine has to be reset.

Kernel BUG at "fs/inode.c":1094
invalid operand: [1] SMP
CPU 1
ide_cd cdrom
Pid: 5316, comm: php.exe Tainted: P 2.6.12
RIP: 0010:[] {iput+30}
RSP: 0018:810062db3dc8 EFLAGS: 00010246
RAX: 880c6540 RBX: 8100f778d800 RCX:
RDX: 81007951ae90 RSI: 810062db3d58 RDI: 8100f778d800
RBP: 8100f778d800 R08: R09: 810062db3d58
R10: 000b R11: 8809df50 R12: 8100f309ecd0
R13: 880c5a00 R14: 7fc17e30 R15: 7fc1b210
FS: 008f4ae0(0063) GS:804627c0() knlGS:08208760
CS: 0010 DS: ES: CR0: 8005003b
CR2: 0051fa10 CR3: dbe9e000 CR4: 06e0
Process php.exe (pid: 5316, threadinfo 810062db2000, task 81007951ae90)
Stack: 880c5a00 880660a1 8100f7eb4400 8100f7eb4400
8809e0d5 810062db3e68 810062db3ef8 0296
Call Trace:{:libafs:afs_PutVCache+161} {:libafs:afs_linux_getattr+389}
{vfs_lstat+61} {sys_newlstat+31} {system_call+126}
Code: 0f 0b 40 1e 32 80 ff ff ff ff 46 04 66 66 90 66 66 90 48 85
RIP {iput+30} RSP

Hardware: AMD Dual Opteron
Operating system: Gentoo Linux
Kernel: vanilla kernel 2.6.12
OpenAFS 1.3.84
cacheinfo: /afs:/usr/vice/cache:1651000
real cache size: 2064208K
afsd parameters: "-fakestat -stat 1 -dcache 4000 -daemons 5 -volumes 256 -files 5"

This problem is urgent for me because I have to migrate our central web service to this machine. All our web pages are in AFS, so any help is appreciated. Thanks in advance.
Gunther
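The message gives both the cacheinfo line and a "real cache size". A cacheinfo line has three colon-separated fields: the AFS mount point, the cache directory, and the cache size in 1 KB blocks. As a small worked example, the sketch below parses that line and compares the configured size with the 2064208K figure, which is assumed here to be the size of the cache partition. The function names are hypothetical.

```python
def parse_cacheinfo(line):
    """Split 'mountpoint:cachedir:size_in_1K_blocks' into its three fields."""
    mount_point, cache_dir, blocks = line.strip().split(":")
    return mount_point, cache_dir, int(blocks)


def cache_fraction(cacheinfo_line, partition_kb):
    """Fraction of the partition that the configured cache size would occupy."""
    _mount_point, _cache_dir, cache_kb = parse_cacheinfo(cacheinfo_line)
    return cache_kb / partition_kb


if __name__ == "__main__":
    line = "/afs:/usr/vice/cache:1651000"   # from the message
    partition_kb = 2064208                  # "real cache size" from the message
    print(parse_cacheinfo(line))
    print(f"{cache_fraction(line, partition_kb):.1%} of the partition")
```

With the numbers from the message the configured cache comes to roughly 80% of the partition.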
Re: [OpenAFS] OpenAFS Client crash
On Monday 11 July 2005 11:06, Hans-Gunther Borrmann wrote: > Hello, > > I experienced the following client crash: > > Jul 11 10:12:49 --- [cut here ] - [please bite here ] > - > Kernel BUG at inode:1109 > nvalid operand: [1] SMP > CPU 0 > Modules linked in: libafs ipt_multiport ipt_state ip_conntrack ipt_LOG > iptable_filter ip_tables tg3 ide_cd cdrom > Pid: 27008, comm: php.exe Tainted: P 2.6.12-rc2 > RIP: 0010:[] {iput+30} > RSP: 0018:810073777dc8 EFLAGS: 00010246 > RAX: 880c3fa0 RBX: 810002e7ec00 RCX: > RDX: 8100f96a4a00 RSI: 810073777d58 RDI: 810002e7ec00 > RBP: 810002e7ec00 R08: R09: 810073777d58 > R10: 8100f903e400 R11: 8809bef0 R12: 81007f7eb090 > R13: 880c3460 R14: 7f8a9620 R15: 7f8adcf6 > FS: 008f4ae0(0063) GS:80460d00() > knlGS:082069a0 CS: 0010 DS: ES: CR0: 8005003b > CR2: 00452b80 CR3: 72e4d000 CR4: 06e0 > Process php.exe (pid: 27008, threadinfo 810073776000, task > 8100f96a4a00) > Stack: 880c3460 880640b1 8100f903e000 8100f903e000 > 810002e7ec00 8809c075 810073777e68 810073777ef8 > 810073777e68 0296 > Call Trace:{:libafs:afs_PutVCache+161} > {:libafs:afs_linux_getattr+389} > {vfs_lstat+61} {sys_newlstat+31} > {system_call+126} > > Code: 0f 0b cf 30 32 80 ff ff ff ff 55 04 66 66 90 66 66 90 48 85 > RIP {iput+30} RSP > > The machine stayed running, but each program accessing any local file hung. > > OpenAFS: 1.3.81 > Operating System: Gentoo Linux with 64 bit SMP vanilla kernel 2.6.12-rc2 > Hardware: Dual AMD Opteron. > > Any help or hints are appreciated. The crash happend during the first > attempt to migrate the university webservice to this machine. The > webservice completely depends on AFS. > > Gunther Upgrade to kernel 2.6.12 and OpenAFS 1.3.84 solved the problem. -- Hans-Gunther Borrmann <[EMAIL PROTECTED]> Rechenzentrum der Universitaet Freiburg Hermann-Herder-Str. 10, D79104 FREIBURG Tel.: +49 761/203-4652 Fax: +49 761/203-4643 ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info
[OpenAFS] OpenAFS Client crash
Hello, I experienced the following client crash:

Jul 11 10:12:49 --- [cut here ] - [please bite here ] -
Kernel BUG at inode:1109
invalid operand: [1] SMP
CPU 0
Modules linked in: libafs ipt_multiport ipt_state ip_conntrack ipt_LOG iptable_filter ip_tables tg3 ide_cd cdrom
Pid: 27008, comm: php.exe Tainted: P 2.6.12-rc2
RIP: 0010:[] {iput+30}
RSP: 0018:810073777dc8 EFLAGS: 00010246
RAX: 880c3fa0 RBX: 810002e7ec00 RCX:
RDX: 8100f96a4a00 RSI: 810073777d58 RDI: 810002e7ec00
RBP: 810002e7ec00 R08: R09: 810073777d58
R10: 8100f903e400 R11: 8809bef0 R12: 81007f7eb090
R13: 880c3460 R14: 7f8a9620 R15: 7f8adcf6
FS: 008f4ae0(0063) GS:80460d00() knlGS:082069a0
CS: 0010 DS: ES: CR0: 8005003b
CR2: 00452b80 CR3: 72e4d000 CR4: 06e0
Process php.exe (pid: 27008, threadinfo 810073776000, task 8100f96a4a00)
Stack: 880c3460 880640b1 8100f903e000 8100f903e000
810002e7ec00 8809c075 810073777e68 810073777ef8
810073777e68 0296
Call Trace:{:libafs:afs_PutVCache+161} {:libafs:afs_linux_getattr+389}
{vfs_lstat+61} {sys_newlstat+31} {system_call+126}
Code: 0f 0b cf 30 32 80 ff ff ff ff 55 04 66 66 90 66 66 90 48 85
RIP {iput+30} RSP

The machine stayed running, but each program accessing any local file hung.

OpenAFS: 1.3.81
Operating System: Gentoo Linux with 64-bit SMP vanilla kernel 2.6.12-rc2
Hardware: Dual AMD Opteron.

Any help or hints are appreciated. The crash happened during the first attempt to migrate the university web service to this machine. The web service depends completely on AFS.
Gunther
Re: [OpenAFS] another problem with vos release
On Wednesday 15 June 2005 16:48, Ken Hornstein wrote:
> >RWrite: 536871953
> >number of sites -> 2
> > server leda partition /vicepa RW Site -- New release -- Old
> > release server atlas.cg.cs.tu-bs.de partition /vicepb RO Site -- Old
> > release -- Old release
>
> I couldn't help noticing that you don't have a RO volume on the same
> machine as the RW copy.
>
> This once happened to us here on one volume (don't ask; someone here
> INSISTED that it didn't cause any problems). I noticed "weird"
> behavior during vos releases. I can't quantify "weird" anymore than
> that; it was strange and it was a while ago, and I pestered people
> until it had a RO on the same machine as the RW copy, and the weirdness
> went away. I could easily believe that not having a RO copy on the
> same machine as the RW copy would be a corner case that isn't tested
> that much, and maybe it would be worth putting one there just to see if
> it solves some of your problems.

What I noticed in this situation was that "vos release" of a "large" volume took an incredibly long time. "Large" in this case meant ~100000 files. Having an RO copy on the same server reduced the time to a normal value.
-- Hans-Gunther Borrmann
[OpenAFS] Unusable empty partition
Hello,

I have one partition on a server, which is empty:

Total number of volumes on server localhost partition /vicepcy: 0
Total volumes onLine 0 ; Total volumes offLine 0 ; Total busy 0

But I cannot move any volumes to this partition:

[EMAIL PROTECTED]:~> vos move www.natscan.kirsche7 servera by serverb cy -verbose
Starting transaction on source volume 537085072 ... done
Cloning source volume 537085072 ... done
Ending the transaction on the source volume 537085072 ... done
Starting transaction on the cloned volume 537091168 ... done
Creating the destination volume 537085072 ... done
Dumping from clone 537091168 on source to volume 537085072 on destination ...Failed to move data for the volume 537085072
VOLSER: Problems encountered in doing the dump !
vos move: operation interrupted, cleanup in progress...
clear transaction contexts
access VLDB
move incomplete - attempt cleanup of target partition - no guarantee
cleanup complete - user verify desired result

The VolserLog shows:

Wed Jun 1 10:18:03 2005 VAttachVolume: Failed to open /vicepcy/V0537085072.vl (errno 2)
Wed Jun 1 10:18:03 2005 1 Volser: CreateVolume: volume 537085072 (www.natscan.kirsche7) created
unable to allocate inode: File exists
Wed Jun 1 10:18:03 2005 1 Volser: ReadVnodes: Restore aborted
Wed Jun 1 10:18:03 2005 1 Volser: Delete: volume 537085072 deleted

and "df -k" gives:

[EMAIL PROTECTED]:logs]# df -k /vicepcy
Filesystem    1024-blocks      Free %Used    Iused %Iused Mounted on
/dev/vicepcy    262144000 202566836   23%     2191     1% /vicepcy

which means that about 60 GB are occupied. What to do? The server is a namei server. My idea is therefore to simply remove all files and directories except

./lost+found
./Lock
./Lock/vicepcy
./AFSIDat
./AFSIDat/README

Will this be safe?

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
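Before removing anything, it may help to see exactly what lies outside the skeleton listed in the message. The following is only a sketch: `list_unexpected` is a hypothetical helper name, it assumes a `find` that supports the POSIX `-path` primary, and it prints paths without deleting anything. Whether removing those paths is safe is the open question of the message and is not answered here.

```shell
#!/bin/sh
# Sketch only: list everything under a /vicep partition that is NOT part
# of the skeleton named in the message (lost+found, Lock, Lock/<partition>,
# AFSIDat, AFSIDat/README). Nothing is removed.
# list_unexpected is a hypothetical helper, not an OpenAFS tool.
list_unexpected() {
    part=$1
    name=$(basename "$part")
    find "$part" \
        ! -path "$part" \
        ! -path "$part/lost+found" \
        ! -path "$part/lost+found/*" \
        ! -path "$part/Lock" \
        ! -path "$part/Lock/$name" \
        ! -path "$part/AFSIDat" \
        ! -path "$part/AFSIDat/README" \
        -print | sort
}
```

Called as `list_unexpected /vicepcy`, this would show which files and directories account for the occupied space.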
[OpenAFS] 1.3.82 Server under AIX
Hello,

I tested the 1.3.82 server under AIX 5.1 ML07, single processor, 32-bit kernel. The salvager still core dumps if compiled with "--enable-largefile-fileserver". It seems to work if compiled without it.

Is there any hope of getting large file support under AIX? All my servers run under AIX and I will need "large files" in the near future.

Compilation information:

CC=cc ./configure --enable-namei-fileserver \
    --enable-largefile-fileserver \
    --enable-fast-restart \
    --enable-bitmap-later \
    --enable-tivoli-tsm \
    --enable-transarc-paths \
    --disable-pam

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
[OpenAFS] 1.3.81 Server under AIX 5.1
Hello,

I tested the 1.3.81 server under AIX 5.1 ML07, single processor, 32-bit kernel. The salvager core dumps:

[EMAIL PROTECTED]:root]# bos salvage localhost -all
bos: shutting down fs.
Starting salvage.
bos: salvage completed
bos: restarting fs.

[EMAIL PROTECTED]:root]# bos status localhost
Instance fs, currently running normally.
Auxiliary status is: file server running.
Instance ptserver, currently running normally.
Instance vlserver, currently running normally.
Instance kaserver, currently running normally.

[EMAIL PROTECTED]:root]# vos listvol localhost
Total number of volumes on server localhost partition /vicepa: 3
root.afs           536870962 RW      2 K On-line
root.cell          536870965 RW      3 K On-line
root.cell.readonly 536870966 RO      3 K On-line
Total volumes onLine 3 ; Total volumes offLine 0 ; Total busy 0
Total number of volumes on server localhost partition /vicepb: 1
usr.hgb            536870968 RW 564128 K On-line
Total volumes onLine 1 ; Total volumes offLine 0 ; Total busy 0

[EMAIL PROTECTED]:root]# bos getlog localhost SalvageLog
Fetching log file 'SalvageLog'...
@(#) OpenAFS 1.3.81 built 2005-04-14
04/20/2005 10:42:12 STARTING AFS SALVAGER 2.4 (/usr/afs/bin/salvager -f)
04/20/2005 10:42:12 Starting salvage of file system partition /vicepa
04/20/2005 10:42:12 Starting salvage of file system partition /vicepb
04/20/2005 10:42:12 SALVAGING FILE SYSTEM PARTITION /vicepa (device=vicepa)
04/20/2005 10:42:12 ***Forced salvage of all volumes on this partition***
04/20/2005 10:42:12 3 nVolumesInInodeFile 84
04/20/2005 10:42:12 SALVAGING VOLUME 536870962.
04/20/2005 10:42:12 root.afs (536870962) not updated (created 04/20/2005 10:16)
04/20/2005 10:42:12 totalInodes 5
04/20/2005 10:42:12 "Salvage volume group" core dumped!
04/20/2005 10:42:12 CHECKING CLONED VOLUME 536870966.
04/20/2005 10:42:12 root.cell.readonly (536870966) updated 04/20/2005 10:18
04/20/2005 10:42:12 "Salvage volume group" core dumped!
04/20/2005 10:42:12 SALVAGING OF PARTITION /vicepa COMPLETED
04/20/2005 10:42:12 SALVAGING FILE SYSTEM PARTITION /vicepb (device=vicepb)
04/20/2005 10:42:12 ***Forced salvage of all volumes on this partition***
04/20/2005 10:42:12 1 nVolumesInInodeFile 28
04/20/2005 10:42:12 SALVAGING VOLUME 536870968.
04/20/2005 10:42:12 usr.hgb (536870968) updated 04/20/2005 10:38
04/20/2005 10:42:12 Vnode 60: version < inode version; fixed (old status)
04/20/2005 10:42:12 "Salvage volume group" core dumped!
04/20/2005 10:42:12 SALVAGING OF PARTITION /vicepb COMPLETED

Compilation information:

CC=cc ./configure --enable-namei-fileserver \
    --enable-largefile-fileserver \
    --enable-fast-restart \
    --enable-bitmap-later \
    --enable-tivoli-tsm \
    --enable-transarc-paths \
    --disable-pam

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
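Note that "bos salvage" reports "salvage completed" even though the salvager child processes dumped core, so the failure only shows up in the SalvageLog. A small sketch for pulling the affected volume IDs out of a saved copy of that log follows. `salvage_coredumps` is a hypothetical helper name, and the script assumes the log keeps the line format shown in the message above.

```shell
#!/bin/sh
# Sketch only: print the ID of the volume the salvager was working on
# each time a "core dumped!" line appears in a SalvageLog file.
# salvage_coredumps is a hypothetical helper, not an OpenAFS tool.
salvage_coredumps() {
    awk '
        # Remember the most recent volume the salvager announced.
        /SALVAGING VOLUME|CHECKING CLONED VOLUME/ {
            vol = $NF
            sub(/\.$/, "", vol)   # drop the trailing period
        }
        # Report that volume when a core dump is logged.
        /core dumped!/ { print vol }
    ' "$1"
}
```

Usage could look like `bos getlog localhost SalvageLog > /tmp/SalvageLog; salvage_coredumps /tmp/SalvageLog`, where the temporary file name is just an example.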
[OpenAFS] 1.3.81 under AIX 5.1
Hello,

I further tested the 1.3.81 client under AIX 5.1, single processor, 32-bit kernel. If root.afs of the workstation's cell is not available, the workstation crashes. In this special case root.afs of the cell in question had not yet been created, and afsd was started without -dynroot. With -dynroot the client works well, so this is a minor problem.

Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
[OpenAFS] 1.3.81 under AIX 5.1
Hello,

I tested the 1.3.81 client under AIX 5.1, single processor, 32-bit kernel. Here are my results:

It is the first OpenAFS version which survives the first klog under AIX 5.1. Thanks to all who contributed. I did some stress tests. The client seems to work well.

"make dest" terminates with an error message:

/u/b/borrmann/sw/openafs/1.3.81/aix/openafs-1.3.81/src/pinstall/pinstall ./doc/LICENSE /u/b/borrmann/sw/openafs/1.3.81/aix/openafs-1.3.81/rs_aix51/dest/LICENSE
Can't open source file ``./doc/LICENSE'': No such file or directory
make: 1254-004 The error code from the last command is 1.
make: 1254-005 Ignored error code 1 from last command.

afs_dynamic_auth and afs_dynamic_kerbauth are completely missing. The 1.3.80 versions seem to work.

Gunther Borrmann
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] Loosing tokens...
On Monday 04 April 2005 16:57, Frank Burkhardt wrote:
> Hi,
>
> On Mon, Apr 04, 2005 at 10:44:37AM -0400, Jim Rees wrote:
> > Sure, I've restarted ssh. Hmm. Is unpagsh replacable by pagsh? I've no
> > unpagsh on my system here...
> >
> > Pagsh is better than unpagsh for this purpose. You don't want sshd to
> > ever have tokens.
>
> It's not. Two ssh-sessions (even of different users) would be in the same
> PAG. unpagsh prevents that.
>
> Frank

what about: at now <

Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] orphaned root.afs problem during 1.3.80 client upgrade
On Wednesday 30 March 2005 19:44, ted creedon wrote:
> Yes, root.cell.readonly did disappear on both machines.
> Looks like it happened when a vos release on doris_disk.vol id 536870944
> was done from hiawatha . Seems to have created bogus.536870944 id
> 536870944 on nanook. The byte count on 536870944 was different on both
> machines at the time...
>
> What's the best way to repair? fs mkmounts on both machines?

bos salvage nanook. Then look at the salvage log. The manual describes the handling of read-only volumes as follows:

The bos salvage command salvages (restores internal consistency to) one or more volumes on the file server machine named by the -server argument. When processing one or more partitions, the command restores consistency to corrupted read/write volumes where possible. For read-only or backup volumes, it inspects only the volume header:

If the volume header is corrupted, the Salvager removes the volume completely and records the removal in its log file, /usr/afs/logs/SalvageLog. Issue the vos release or vos backup command to create the read-only or backup volume again.

If the volume header is intact, the Salvager skips the volume (does not check for corruption in the contents). However, if the File Server notices corruption as it initializes, it sometimes refuses to attach the volume or bring it online. In this case, it is simplest to remove the volume by issuing the vos remove or vos zap command. Then issue the vos release or vos backup command to create it again.

Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] orphaned root.afs problem during 1.3.80 client upgrade
On Wednesday 30 March 2005 04:24, ted creedon wrote:
> vos syncvldb is OK on 10.1.1.180 (hiawatha) but a local client doesn't see
> the RO cell "bigcell". "Connection timed out" error.

I assume that "bigcell" is the mount point for root.cell in your root.afs. If a client wants to access /afs/bigcell, the volume root.cell.readonly is needed, because AFS follows the read-only path as long as possible. But this volume is offline, so you get "Connection timed out". You should try to get root.cell.readonly online again.

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
[OpenAFS] Testing 1.3.80 under AIX 5.1: klog crashes the client.
Hello,

as with all previous versions, klog crashes the client almost immediately. In my test the crash came with the second klog. I did not access any files or directories besides /afs.

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] can't force deletion of off-line volume
On Saturday 19 March 2005 01:25, Wes Chow wrote:
> I'm having problems with a volume that isn't in the VLDB, but shows up
> off line with "vos listvol":
>
> hippo:~# vos listvol hippo /vicepb | grep 20040102
> datafiles.20040102 536937070 RW 5688181 K Off-line
>
> hippo:~# vos online hippo /vicepb datafiles.20040102
> VLDB: no such entry
>
> hippo:~# vos listvldb datafiles.20040102
> VLDB: no such entry
>
> hippo:~# vos remove hippo /vicepb datafiles.20040102
> Can't find volume name 'datafiles.20040102' in VLDB
> VLDB: no such entry
>
> hippo:~# vos zap hippo /vicepb datafiles.20040102
> VLDB: no such entry
>
> If my understanding is correct, the volume still exists on the server
> but isn't in the VLDB. The "vos zap" command is supposed to delete
> volumes off a server without consulting the VLDB. However, it seems
> as if the zap command is trying to access the VLDB?
>
> I've also run both "vos syncvldb" and "vos syncserv", but I haven't
> been able to get that volume back into a useable state.
>
> Any help would be appreciated.
>
> Thanks,
> Wes

From the manual:

The -force flag removes a volume even if it cannot be "attached" (brought online), which can happen either because the volume is extremely damaged or because the Salvager functioned abnormally. Without this flag, this command cannot remove volumes that are not attachable. See also the Cautions section.

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
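To make the -force suggestion concrete, here is a sketch that builds the corresponding "vos zap" call from "vos listvol" output. It passes the numeric volume ID from the second listvol column instead of the volume name; the assumption (not confirmed in this thread) is that a numeric ID spares vos the name lookup in the VLDB that produced "no such entry". `zap_cmd_for` is a hypothetical helper and it only prints the command, so the result can be checked before anything is removed.

```shell
#!/bin/sh
# Sketch only: read "vos listvol" output on stdin and PRINT (not run)
# a "vos zap ... -force" command for the named volume, using the numeric
# volume ID from the second column.
# zap_cmd_for is a hypothetical helper, not an OpenAFS tool.
zap_cmd_for() {
    server=$1
    partition=$2
    volname=$3
    awk -v s="$server" -v p="$partition" -v n="$volname" '
        $1 == n {
            printf "vos zap -server %s -partition %s -id %s -force\n", s, p, $2
        }
    '
}
```

For the case above this would be used as `vos listvol hippo /vicepb | zap_cmd_for hippo /vicepb datafiles.20040102`.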
Re: [OpenAFS] Problems with OpenAFS Fileserver.../ Kerberos5 Problem
On Thursday 17 March 2005 21:03, Lars Schimmer wrote:
> Jeffrey Altman wrote:
> | Lars Schimmer wrote:
> |> If there are two entries in the keyfile, one from the old kaserver and one
> |> from the Kerberos5 server, and the krb5 one has a kvno of 1, the kaserver
> |> one a kvno of 0, is it possible for all clients (linux and windows) to get
> |> tokens via kaserv? The new kerberos server isn't in their (clients)
> |> CellServDB yet.
> |
> | Windows clients use MIT KFW for Kerberos 5 support. The locations of
> | KDCs are determined either from the krb5.ini file or DNS SRV records.
> | CellServDB is not used for token acquisition when Kerberos 5 support
> | is being used.
>
> Thx for the fast answer, but I meant the other way round.
> If the KDC is up and running and the old kaservers are still up and running,
> and the windows clients have only the "old" kaserver in their CellServDB
> and they have no kerberos on their system, can the windows clients still
> log on to AFS and get tokens via kaserv?
> I mean, with now 2 entries in the keyfile, can the servers select the
> right one for Windows AFS clients without kerberos?

As far as I remember from my tests the answer is yes.

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] Unable to move volumes
On Friday 11 March 2005 15:52, Hartmut Reuter wrote:
> If you look into
> /vicepcp/AFSIDat/S=/SNo+U/special
>
> is there anything?

/vicepcp/AFSIDat/S=/SNo+U/special does not exist:

# find /vicepcp/AFSIDat/S=/SNo+U/ -print
/vicepcp/AFSIDat/S=/SNo+U/
/vicepcp/AFSIDat/S=/SNo+U/+
/vicepcp/AFSIDat/S=/SNo+U/+/+
/vicepcp/AFSIDat/S=/SNo+U/+/+/=2
/vicepcp/AFSIDat/S=/SNo+U/+/+/06
/vicepcp/AFSIDat/S=/SNo+U/+/+/2A
/vicepcp/AFSIDat/S=/SNo+U/+/+/4E
/vicepcp/AFSIDat/S=/SNo+U/+/+/6U
/vicepcp/AFSIDat/S=/SNo+U/+/+/8M
/vicepcp/AFSIDat/S=/SNo+U/+/+/AQ
/vicepcp/AFSIDat/S=/SNo+U/+/+/CY

> If so remove the subtree
>
> /vicepcp/AFSIDat/S=/SNo+U
>
> and - if existent - /vicepcp/V0537085534.vl

This file does not exist.

> and try again. The message "unable to allocate inode: File exists" looks
> like there is some volume special file from an earlier try around.
>
> Hartmut

Thanks
Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] Unable to move volumes
On Wednesday 09 March 2005 23:11, Horst Birthelmer wrote:
> ... and you're sure there are no files left from some other operations
> like your testing of the salvager on your system??
> because the code in IH_CREATE... is pretty straight forward.
> (I just assume it's a namei fileserver)
>
> I moved a few GiBs onto my AIX fileserver and back the last days and
> never had any problems.
>
> Horst

I did not test the salvager on this system. I used my test cell.

I moved about 900 volumes from the TRANSARC source server to the same OpenAFS server partition without any problems. The fileserver and volserver were 1.3.77 compiled with large file support. Serious trouble arose when I started to move volumes which contained websites to another partition. Volumes got damaged and the volserver frequently core dumped. Our whole webservice (we host about 200 websites, with all web pages in AFS) was down for half a day. I then replaced the fileserver and volserver with a 1.3.65 version compiled without large file support, which solved my problems. I had already been running this version for several months on a small fileserver.

Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] Unable to move volumes
On Wednesday 09 March 2005 17:21, Derrick J Brashear wrote:
> On Wed, 9 Mar 2005, Hans-Gunther Borrmann wrote:
> > On the destination the VolserLog contains:
> > Wed Mar 9 16:52:11 2005 VAttachVolume: Failed to open
> > /vicepcp/V0537085440.vl (errno 2)
> > Wed Mar 9 16:52:11 2005 1 Volser: CreateVolume: volume 537085440
> > (usr.md0) created
> > unable to allocate inode: File exists
>
> Namei or inode? I'm unsure why the file would exist, but if it's namei, I
> suppose you could use a syscall tracer and see what's getting EEXIST, I'd
> be curious to hear what you have that's in the way.

Sorry. I forgot the VolserLog:

Fri Mar 11 14:38:57 2005 VAttachVolume: Failed to open /vicepcp/V0537085534.vl (errno 2)
Fri Mar 11 14:38:57 2005 1 Volser: CreateVolume: volume 537085534 (usr.sperling) created
unable to allocate inode: File exists
Fri Mar 11 14:38:57 2005 1 Volser: ReadVnodes: Restore aborted
Fri Mar 11 14:38:57 2005 1 Volser: Delete: volume 537085534 deleted

Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] Unable to move volumes
On Wednesday 09 March 2005 17:21, Derrick J Brashear wrote:
> On Wed, 9 Mar 2005, Hans-Gunther Borrmann wrote:
> > On the destination the VolserLog contains:
> > Wed Mar 9 16:52:11 2005 VAttachVolume: Failed to open
> > /vicepcp/V0537085440.vl (errno 2)
> > Wed Mar 9 16:52:11 2005 1 Volser: CreateVolume: volume 537085440
> > (usr.md0) created
> > unable to allocate inode: File exists
>
> Namei or inode?

Namei

> I'm unsure why the file would exist, but if it's namei, I
> suppose you could use a syscall tracer and see what's getting EEXIST, I'd
> be curious to hear what you have that's in the way.

Hello,

I have traced the volserver on the destination of the failing move. Unfortunately I could not use truss:

# truss -p 15496
truss: 0915-023 Cannot control process #15496.

So I had to trace the kernel. You will find the trace report in

/afs/uni-freiburg.de/usr/b/borrmann/public/volsertrace

I traced FILE ACTIVITY (open, close, read, write). I hope you will find a hint as to what is wrong with this partition.

Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
[OpenAFS] Unable to move volumes
Hello,

I tried to move several volumes from a TRANSARC fileserver to an OpenAFS fileserver and got the following error message for two volumes (shown only for one):

Starting transaction on source volume 537085440 ... done
Cloning source volume 537085440 ... done
Ending the transaction on the source volume 537085440 ... done
Starting transaction on the cloned volume 537089023 ... done
Creating the destination volume 537085440 ... done
Dumping from clone 537089023 on source to volume 537085440 on destination ...Failed to move data for the volume 537085440
VOLSER: Problems encountered in doing the dump !
vos move: operation interrupted, cleanup in progress...
clear transaction contexts
access VLDB
move incomplete - attempt cleanup of target partition - no guarantee
cleanup complete - user verify desired result

On the destination the VolserLog contains:

Wed Mar 9 16:52:11 2005 VAttachVolume: Failed to open /vicepcp/V0537085440.vl (errno 2)
Wed Mar 9 16:52:11 2005 1 Volser: CreateVolume: volume 537085440 (usr.md0) created
unable to allocate inode: File exists
Wed Mar 9 16:52:11 2005 1 Volser: ReadVnodes: Restore aborted
Wed Mar 9 16:52:11 2005 1 Volser: Delete: volume 537085440 deleted

What to do?

Operating System: AIX 5.1
fileserver: OpenAFS 1.3.65 built 2004-08-09
volserver: OpenAFS 1.3.65 built 2004-08-09
all other: OpenAFS 1.3.77 built 2005-01-18

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
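When several moves fail this way, it can be handy to pull the affected volumes out of the VolserLog in one pass. The sketch below assumes the log format quoted above, with "unable to allocate inode" logged right after the CreateVolume line of the same volume. `failed_creates` is a hypothetical helper name.

```shell
#!/bin/sh
# Sketch only: print "<volume id> (<volume name>)" for every
# "unable to allocate inode" error in a VolserLog file, using the
# preceding CreateVolume line to identify the volume.
# failed_creates is a hypothetical helper, not an OpenAFS tool.
failed_creates() {
    awk '
        /Volser: CreateVolume: volume/ {
            # The ID and the name in parentheses follow the word "volume".
            for (i = 1; i < NF; i++)
                if ($i == "volume") { vol = $(i + 1); name = $(i + 2) }
        }
        /unable to allocate inode/ { print vol, name }
    ' "$1"
}
```

On the destination server this could be run as `failed_creates /usr/afs/logs/VolserLog`, assuming the Transarc-style log path used elsewhere in this thread.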
[OpenAFS] Server Test of 1.3.79 under AIX 5.1
Hello,

I tested the server of openafs-snap-2005-03-07 under AIX 5.1.

Call of configure:

CC=cc ./configure --enable-namei-fileserver \
    --enable-largefile-fileserver \
    --enable-fast-restart \
    --enable-bitmap-later \
    --enable-tivoli-tsm \
    --enable-transarc-paths \
    --disable-pam

Results:

The salvager core dumps:

Starting salvage.
Tue Mar 8 17:38:21 2005: Assertion failed! file vol-salvage.c, line 3317.
bos: salvage completed

If I replace the salvager by one compiled without "--enable-largefile-fileserver", salvage runs but may erase the contents of volumes! Note that there are only files <2GB on the test server!

If I run a large file enabled server (all files < 2GB!) and later replace the binaries by a version compiled without "--enable-largefile-fileserver", salvage also may erase the contents of volumes! So there is no safe way back!

The server compiled without large file support seems to work but needs further testing.

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] releasing volumes automatically
On Thursday 10 February 2005 10:55, Marco Spatz wrote:
> Hi,
>
> I've set up an OpenAFS und a 4 machines cluster, and have made replicas
> of the user's home volumes. I know, that generally, this is a bad idea,
> but I want to use this as backup the user can access without my help
> (I've mounted the readonly volumes to another mountpoint).
> And now I want the AFS system to release this volumes at night, but I
> don't know how to this. Thought about writing a cronjob, but I don't
> know how do gain access as admin to get the rights to execute 'vos
> release'. Is there any possibility to tell OpenAFS to release certain (or
> all) changed volumes at a certain time? Would be a great help.
>
> Thanks for your help,
>
> Marco

You can run a cron job on your AFS fileservers which takes a list of the volumes to release and uses "vos release -localauth". That's what I do.

Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
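A minimal sketch of such a cron job, under a few assumptions: the list file holds one volume name per line, the job runs as root on a server machine (so that -localauth can read the server key), and `release_volumes` plus the `VOS` override are hypothetical names introduced here for illustration. This is not the script actually used in Freiburg.

```shell
#!/bin/sh
# Sketch only: release every volume named in a list file using
# "vos release <volume> -localauth".
# release_volumes and the VOS variable are hypothetical; setting
# VOS="echo vos" gives a dry run that only prints the commands.
release_volumes() {
    list=$1
    rc=0
    while IFS= read -r vol; do
        # Skip empty lines and comment lines.
        case $vol in
            ''|'#'*) continue ;;
        esac
        # </dev/null keeps the command from reading the list file.
        ${VOS:-vos} release "$vol" -localauth </dev/null || rc=1
    done < "$list"
    return $rc
}
```

A dry run such as `VOS="echo vos" release_volumes /path/to/volume.list` shows what would be released before the script is put into root's crontab; the list path is a placeholder.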
Re: [OpenAFS] 1.3.77 under AIX 5.1: unusable
On Wednesday 09 February 2005 18:17, Jeffrey Hutzelman wrote:
> On Wednesday, February 09, 2005 03:41:31 PM +0100 Hans-Gunther Borrmann wrote:
> > I tried to apply your 2 diffs but failed. Many many hunks fail because
> > the context is not found in my sources due to missing or adittional
> > blanks. Two examples:
>
> Try running patch with -l, which will make it ignore differences in
> whitespace between the patch and your existing sources.

This does not help. My call to patch was:

patch --verbose -R -p3 --ignore-whitespace --input ~/afs/hzr-diff1

I should have mentioned it.

Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] 1.3.77 under AIX 5.1: unusable
On Wednesday 09 February 2005 15:45, Jim Rees wrote:
> Looks to me like you need the "-l" option to patch.

My call to patch (I should have added it):

patch --verbose -R -p3 --ignore-whitespace --input ~/afs/hzr-diff1 2>&1|tee /tmp/log1

The system I am working on is Suse Linux 9.1.

Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] 1.3.77 under AIX 5.1: unusable
On Monday 07 February 2005 15:34, Derrick J Brashear wrote:
> [stuff about the server deleted. one thing at a time]

true.

> Really, I'd like to try this, and Sven Oehme was even kind enough to send
> a machine... which I need to get out of customs, which I'm sure is going
> to be like getting teeth pulled.
>
> You can try applying /afs/andrew.cmu.edu/usr/shadow/hzr-diff1 and
> hzr-diff2 in reverse, build, and see if it makes a difference. There is
> little of substance there. If it works, it should be easy to figure out
> why. If not, I don't know. Those are diffs of Hartmut's CVS tree against
> OpenAFS 1.3.77-ish CVS head.

I tried to apply your two diffs but failed. Many hunks fail because the context is not found in my sources, due to missing or additional blanks. Two examples:

Output from patch:

Hmm... Looks like a unified diff to me...
The text leading up to this was:
--
|diff -xCVS -urtw openafs/src/afs/AIX/osi_config.c /usr/tmp/openafs/src/afs/AIX/osi_config.c
|--- openafs/src/afs/AIX/osi_config.c Sat Nov 4 05:03:16 2000
|+++ /usr/tmp/openafs/src/afs/AIX/osi_config.c Thu Aug 19 01:42:24 2004
--
Patching file openafs/src/afs/AIX/osi_config.c using Plan A...
Hunk #1 succeeded at 33.
Hunk #2 succeeded at 53.
Hunk #3 succeeded at 65.
Hunk #4 succeeded at 85.
Hunk #5 succeeded at 133.
Hunk #6 succeeded at 161.
Hunk #7 succeeded at 170.
Hunk #8 FAILED at 257.
Hunk #9 succeeded at 282 with fuzz 2.
Hunk #10 FAILED at 290.
Hunk #11 FAILED at 320.
Hunk #12 FAILED at 329.
4 out of 12 hunks FAILED -- saving rejects to file openafs/src/afs/AIX/osi_config.c.rej

The first two failing hunks:

@@ -284,24 +257,17 @@
 { (void *) &vnodefops, "vnodefops" },
 { (void *) &ifnet, "ifnet" },
 { (void *) &jfs_icache_lock,"jfs_icache_lock" },
-#ifndef AFS_AIX51_ENV
 { (void *) &proc_tbl_lock, "proc_tbl_lock" },
-#endif
 { 0,0 },
 };
.
@@ -325,31 +290,27 @@
 fpalloc(vp, flag, type, ops, fpp)
 struct vnode *vp;
 struct fileops *ops;
- struct file **fpp;
-{
+struct file **fpp; {

The source lines corresponding to the context lines:

{(void *)&vnodefops, "vnodefops"},
{(void *)&ifnet, "ifnet"},
{(void *)&jfs_icache_lock, "jfs_icache_lock"},
#ifndef AFS_AIX51_ENV
{(void *)&proc_tbl_lock, "proc_tbl_lock"},
#endif
{0, 0},

and:

fpalloc(vp, flag, type, ops, fpp)
struct vnode *vp;
struct fileops *ops;
struct file **fpp;

My sources are from openafs-snap-2005-01-17.tar.gz

What to do?

Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] 1.3.77 under AIX 5.1: unusable
On Monday 07 February 2005 11:36, Horst Birthelmer wrote:
> On Feb 7, 2005, at 11:19 AM, Hans-Gunther Borrmann wrote:
> I just didn't have the time to do any debugging, sorry, but I talked to
> Derrick about that backtrace and we weren't sure what that could be.

This client crash only happened once. What I normally observed was that the client suddenly showed the fileservers of my cell as unavailable, and I had to reboot the client to access AFS space again.

> Just a little patience would help perhaps. I know about the problem but
> just haven't had the time to look into it.

No problem. I'll be patient.

> What bothers me more is the problem with the volserver. I don't
> remember me doing anything to that code. ;-)
>
> Horst

What bothers me most is the salvager crashing. I am in the middle of the migration from TRANSARC servers to OpenAFS servers. One OpenAFS server is already in production with about a thousand volumes. At the moment I have stopped the migration. But I do not want to go back.

Thanks to all for your assistance!

Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] 1.3.77 under AIX 5.1: unusable
On Monday 07 February 2005 07:29, Derrick J Brashear wrote:
> > CRASH INFORMATION:
> > CPU 0 CSA 2FF3B400 at time of crash, error code for LEDs: 3000
> > pvthread+004700 STACK:
> > [05B53F2C]afs_RemoveVCB+C8 (69746572 [??])
> > [05B4A328]afs_GetDCache+001774 (33CF1720, , , 2FF3B188,
>
> Of course, nothing has changed in afs_RemoveVCB in some time.
> But, looking at afs_vcache.c the aix 5.1 changes from Hartmut showed up in
> 1.3.71. I thought you said you needed to go back to 1.3.65 before things
> worked. Am I mistaken ?

Yes, you are mistaken. I always resort to Hartmut's kernel extensions from his 15.03.04 build. No extensions of 1.3.x have worked for me so far.

The 1.3.65 server worked for me, except probably the salvager. I simply forgot to test it, because in the last three years I never had to salvage any volume.

At the moment I am running a 1.3.77 server in production with fileserver and volserver taken from 1.3.65. The salvager from 1.3.77 does not work. It crashes. The salvager from 1.3.65 doesn't work either. It stops with the following error message:

df -k /vicepcp
Filesystem    1024-blocks     Free %Used    Iused %Iused Mounted on
/dev/vicepcp     52428800 33109272   37%    98082     2% /vicepcp

[EMAIL PROTECTED]:bin]# bos salvage localhost cp usr.hgb -localauth -showlog
Starting salvage.
bos: salvage completed
SalvageLog:
@(#) OpenAFS 1.3.65 built 2004-08-09
02/07/2005 11:16:31 STARTING AFS SALVAGER 2.4 (/usr/afs/bin/salvager /vicepcp 537087849)
Unable to allocate enough space to read inode table; vicepcp not salvaged

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652 Fax: +49 761/203-4643
Re: [OpenAFS] 1.3.77 under AIX 5.1: unusable
On Tuesday 01 February 2005 16:05, Jim Rees wrote:
> You said you're getting core dumps. Can you send us the stack traces?

Hello Jim,

here is another stack trace. It is from the salvager:

bos salvage localhost cp usr.hgb -showlog -localauth
Starting salvage.
bos: salvage completed
SalvageLog:
@(#) OpenAFS 1.3.77 built 2005-02-02
02/02/2005 16:59:06 STARTING AFS SALVAGER 2.4 (/usr/afs/bin/salvager /vicepcp 537087849)
02/02/2005 16:59:06 2 nVolumesInInodeFile 56
02/02/2005 16:59:06 CHECKING CLONED VOLUME 537087980.
02/02/2005 16:59:06 usr.hgb.backup (537087980) updated 01/24/2005 14:15
02/02/2005 16:59:06 "Salvage volume group" core dumped!

dbx -I src/vol rs_aix51/dest/root.server/usr/afs/bin/salvager ../dumps/core
Type 'help' for help.
reading symbolic information ...warning: no source compiled with -g
[using memory image in ../dumps/core]
IOT/Abort trap in raise at 0xd01e68a8
0xd01e68a8 (raise+0x4c) 80410014 lwz r2,0x14(r1)
(dbx) where
raise(??) at 0xd01e68a8
abort() at 0xd01f5560
AssertionFailed(0x200012c4, 0xe01) at 0x10017f18
ClearROInUseBit(0x2002cab8) at 0x1000a2f4
SalvageVolumeHeaderFile(0x2002ba04, 0x2002d0f8, 0x0, 0x0, 0x2ff21fdc) at 0x10002000
DoSalvageVolumeGroup(0x2002b9e8, 0x2) at 0x10002970
SalvageFileSys1(0x20026498, 0x20034f69) at 0x10005cd8
SalvageFileSys(0x20026498, 0x20034f69) at 0x10005ed8
handleit(0x20022a18) at 0x10001a14
cmd_Dispatch(0x4, 0x20024198) at 0x1001c0b8
main(0x3, 0x2ff228b0) at 0x1000132c

This salvager problem hits me hardest, because no salvager version works on the OpenAFS server and I have about 150 GB of data on it, including important web pages.

One further piece of information: the fileserver is a namei server and the vicepcp partition is a JFS2 filesystem.

Thanks for any assistance.

Gunther
Re: [OpenAFS] 1.3.77 under AIX 5.1: unusable
On Tuesday 01 February 2005 16:05, Jim Rees wrote:
> You said you're getting core dumps. Can you send us the stack traces?

This is a stack trace from a client crash:

[EMAIL PROTECTED]:root]# kdb /tmp/vmcore.20 /unix
The specified kernel file is a UP kernel
/tmp/vmcore.20 mapped from @ 7000 to @ 744f7c7d
Preserving 950098 bytes of symbol table
First symbol __mulh
Component Names:
 1) dmp_minimal [5 entries]
 2) proc [241 entries]
 3) thrd [577 entries]
 4) ldr [2 entries]
 5) errlg [3 entries]
 6) bos [7 entries]
 7) ipc [7 entries]
 8) vmm [20 entries]
 9) rtastrc [8 entries]
10) sscsidd [2 entries]
11) scdisk [6 entries]
12) lvm [2 entries]
13) tty [4 entries]
14) smint0 [6 entries]
15) netstat [10 entries]
16) phxent_dd [5 entries]
17) bldd [5 entries]
18) kbddd [2 entries]
19) mousedd [2 entries]
20) jfs2 [1 entries]
Component Dump Table has 915 entries
START END
3500 01759EC8 _system_configuration+20
2FF3B400 2FF80A70 __ublock+00
2FF22FF4 2FF22FF8 environ+00
2FF22FF8 2FF22FFC errno+00
E000 F000 lkwseg+1000
PFT: id0007 raddr.0180 eaddr.0180 size.. align. valid..1 ros0 holes..0 io.0 seg1 wimg...2
PVT: id0008 raddr.00603000 eaddr. size.. align. valid..1 ros0 holes..0 io.0 seg1 wimg...2
Dump analysis on CHRP_UP_PCI POWER_PC POWER_630 machine with 1 cpu(s) (64-bit registers)
Processing symbol table...
...done
(0)> stat
SYSTEM_CONFIGURATION:
CHRP_UP_PCI POWER_PC POWER_630 machine with 1 cpu(s) (64-bit registers)
SYSTEM STATUS:
sysname... AIX
nodename.. ibm1
release... 1
version... 5
machine... 00415FAC4C00
nid... 415FAC4C
time of crash: Tue Jan 18 10:53:57 2005
age of system: 14 min., 41 sec.
xmalloc debug: disabled
CRASH INFORMATION:
CPU 0 CSA 2FF3B400 at time of crash, error code for LEDs: 3000
pvthread+004700 STACK:
[05B53F2C]afs_RemoveVCB+C8 (69746572 [??])
[05B4A328]afs_GetDCache+001774 (33CF1720, , , 2FF3B188, 2FF3B178, 2FF3B180, 0001)
[05B8464C]BPrefetch+8C (05BD2D68)
[05B84E98]afs_BackgroundDaemon+0002A8 ()
[05B2B390]afs_syscall_call+000238 (0002, , 2FF22FFC, D0B2, , 6000)
[05B2AE94]syscall+A0 (001C, 0002, , 2FF22FFC, D0B2, , 6000)
[3A50].sys_call+00 ()

Gunther
[OpenAFS] 1.3.77 under AIX 5.1: unusable
Hello all,

here are the results of my serious testing of 1.3.77 under AIX 5.1:

The server is not usable:

° VOLSERVER IS BROKEN
"vos move" and "vos dump" from a 1.3.77 server do not work. The error message of volserver in the log is "Volser: DumpFile: no memory". When I moved volumes from a TRANSARC server to the OpenAFS server, some volumes got seriously damaged. If a process on a client tried to access such a volume, the process hung completely. volserver crashed frequently. "vos zap -force" may loop forever, with volserver producing one core dump after another. fileserver and volserver from 1.3.65 both work.

° SALVAGER IS BROKEN
It always crashes, even when salvaging an intact volume:

SalvageLog:
@(#) OpenAFS 1.3.77 built 2005-01-18
02/01/2005 14:57:25 STARTING AFS SALVAGER 2.4 (/usr/afs/bin/salvager /vicepcp 537087849)
02/01/2005 14:57:25 2 nVolumesInInodeFile 56
02/01/2005 14:57:28 CHECKING CLONED VOLUME 537087980.
02/01/2005 14:57:28 usr.hgb.backup (537087980) updated 01/24/2005 14:15
02/01/2005 14:57:28 "Salvage volume group" core dumped!

The salvager from 1.3.65 doesn't work either. It does not crash, but it only produces an error message.

The client is not usable:

The kernel extensions crash the system, or the fileservers suddenly become unavailable and stay so. One has to reboot to get them back. This is definitely a problem of the kernel extensions. No version of them from openafs.org works for me. See also my previous postings.

If there is any way I can help to solve these problems (other than providing patches; unfortunately I am not a programmer), I'll try to do it.

Gunther
Re: [OpenAFS] Urgent: Unable to dump volumes
On Wednesday 26 January 2005 21:03, Horst Birthelmer wrote:
> On Jan 26, 2005, at 6:35 PM, Hans-Gunther Borrmann wrote:
> > Hello all,
> >
> > during the migration from TRANSARC fileservers to OpenAFS namei
> > fileservers I ran into major problems. One is now:
> >
> > I cannot dump volumes which reside on the OpenAFS Fileserver
> >
> > [EMAIL PROTECTED]:www]# vos dump www.uniradio.var -file /tmp/dumptest
> > Error in rx_EndCall
> > VOLSER: Problems encountered in doing the dump !
> > Error in vos dump command.
> > VOLSER: Problems encountered in doing the dump !
> >
> > The VolserLog shows:
> > Wed Jan 26 18:30:00 2005 1 Volser: DumpFile: no memory
>
> If it is what I think it is ... and I definitely have to go test it a
> little more, then that would be a bug related to one I fixed some time
> ago.
>
> The filesize in the AIX stat is 64bit and if you give that value to a
> malloc you'll get what the higher 32bits of that number are saying (we
> have a big endian machine) and in 90% of the cases that's 0. ;-)
>
> Horst

I just installed fileserver and volserver from 1.3.65, which was the version I used for my tests. These binaries work!

--
Hans-Gunther Borrmann
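Horst's explanation of the "no memory" message can be illustrated with a small C sketch. This is not the volserver code: `first_word_big_endian` and `checked_alloc` are hypothetical helpers written only for this illustration. The first one emulates what a big-endian machine sees when the first four bytes of a 64-bit size are read as a 32-bit value; the second shows one possible way to guard an allocation against that kind of mix-up.

```c
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical helper (not OpenAFS code): emulate what a big-endian
 * machine sees when only the first 4 bytes of a 64-bit size are read
 * as a 32-bit value. The value is laid out in big-endian byte order
 * explicitly, so the result is the same on any host. */
static uint32_t first_word_big_endian(uint64_t size64)
{
    unsigned char bytes[8];
    int i;

    for (i = 0; i < 8; i++)
        bytes[i] = (unsigned char)(size64 >> (56 - 8 * i));

    /* The first four bytes hold the HIGH 32 bits of the value. */
    return ((uint32_t)bytes[0] << 24) | ((uint32_t)bytes[1] << 16) |
           ((uint32_t)bytes[2] << 8) | (uint32_t)bytes[3];
}

/* Hypothetical helper: allocate a buffer for a 64-bit file size only
 * if the size is non-zero and fits into size_t; otherwise return NULL. */
static void *checked_alloc(uint64_t size64)
{
    if (size64 == 0 || size64 > (uint64_t)SIZE_MAX)
        return NULL;
    return malloc((size_t)size64);
}
```

For any file smaller than 4 GB the high word is 0, so code that passes only that word to malloc would request 0 bytes, which could plausibly surface as the "no memory" message in the VolserLog.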
[OpenAFS] Urgent: Unable to replicate certain volumes
Hello all,

during the migration from TRANSARC fileservers to OpenAFS namei fileservers I ran into major problems. One of them is this: it is not possible to replicate certain volumes whose RW copy resides on an OpenAFS fileserver to TRANSARC fileservers (sv8 is OpenAFS, sv7 is TRANSARC).

[EMAIL PROTECTED]:www]# vos addsite sv7 bf www.mathphys
Added replication site sv7 /vicepbf for volume www.mathphys
[EMAIL PROTECTED]:www]# vos exa www.mathphys
www.mathphys 537084871 RW 17367 K On-line
    sv8.ruf.uni-freiburg.de /vicepcf
    RWrite 537084871 ROnly 537084872 Backup 0
    MaxQuota 102400 K
    Creation    Wed Oct 23 13:02:16 2002
    Copy        Wed Jan 26 10:53:34 2005
    Backup      Never
    Last Update Wed Jan 26 16:11:03 2005
    1946 accesses in the past day (i.e., vnode references)

    RWrite: 537084871 ROnly: 537084872
    number of sites -> 3
       server sv8.ruf.uni-freiburg.de partition /vicepcf RW Site
       server sv8.ruf.uni-freiburg.de partition /vicepcf RO Site
       server sv7.ruf.uni-freiburg.de partition /vicepbf RO Site -- Not released
[EMAIL PROTECTED]:www]# vos rel www.mathphys
Release failed: VOLSER: Problems encountered in doing the dump !
The volume 537084871 could not be released to the following 1 sites:
sv7.ruf.uni-freiburg.de /vicepbf
VOLSER: release could not be completed
Error in vos release command.
VOLSER: release could not be completed
[EMAIL PROTECTED]:www]# vos listvol sv7 bf|fgrep Off-line
www.mathematik.readonly    537084000 RO   65917 K Off-line
www.molbiotech.readonly    537087480 RO  135419 K Off-line
www.organogenesis.readonly 537087835 RO     278 K Off-line
www.orient.readonly        537084375 RO 3862573 K Off-line
www.physchem.readonly      537084131 RO     743 K Off-line

VolserLog on sv7:
Wed Jan 26 19:18:21 2005 trans 4247 on volume 537083921 is older than 300 seconds
Wed Jan 26 19:18:51 2005 trans 4247 on volume 537083921 is older than 330 seconds
Wed Jan 26 19:19:21 2005 trans 4247 on volume 537083921 is older than 360 seconds
Wed Jan 26 19:19:51 2005 trans 4247 on volume 537083921 is older than 390 seconds
Wed Jan 26 19:20:21 2005 trans 4247 on volume 537083921 is older than 420 seconds
Wed Jan 26 19:20:51 2005 trans 4247 on volume 537083921 is older than 450 seconds
Wed Jan 26 19:21:21 2005 trans 4247 on volume 537083921 is older than 480 seconds
Wed Jan 26 19:21:51 2005 trans 4247 on volume 537083921 is older than 510 seconds
Wed Jan 26 19:22:21 2005 trans 4247 on volume 537083921 is older than 540 seconds
Wed Jan 26 19:22:51 2005 trans 4247 on volume 537083921 is older than 570 seconds
Wed Jan 26 19:22:56 2005 1 Volser: Delete: volume 537083921 deleted
Wed Jan 26 19:23:45 2005 1 Volser: CreateVolume: volume 537084872 (www.mathphys.readonly) created
Wed Jan 26 19:23:45 2005 1 Volser: WriteFile: Error reading dump file 1 size=2048 nbytes=2048 (0 of 2048); restore aborted
Wed Jan 26 19:23:45 2005 1 Volser: ReadVnodes: IDEC inode 231439

VolserLog on sv8:
Wed Jan 26 19:23:40 2005 1 Volser: Clone: Recloning volume 537084871 to volume 537084872
Wed Jan 26 19:23:45 2005 1 Volser: DumpFile: no memory

vos exa:
vos exa 537084871
www.mathphys 537084871 RW 17367 K On-line
    sv8.ruf.uni-freiburg.de /vicepcf
    RWrite 537084871 ROnly 537084872 Backup 0
    MaxQuota 102400 K
    Creation    Wed Oct 23 13:02:16 2002
    Copy        Wed Jan 26 10:53:34 2005
    Backup      Never
    Last Update Wed Jan 26 16:11:03 2005
    1986 accesses in the past day (i.e., vnode references)

    RWrite: 537084871 ROnly: 537084872 RClone: 537084872
    number of sites -> 3
       server sv8.ruf.uni-freiburg.de partition /vicepcf RW Site -- New release
       server sv8.ruf.uni-freiburg.de partition /vicepcf RO Site -- New release
       server sv7.ruf.uni-freiburg.de partition /vicepbf RO Site -- Old release

Any help is appreciated.

Operating system: AIX 5.1
OpenAFS: openafs-snap-2005-01-17

Gunther
[OpenAFS] Urgent: Unable to dump volumes
Hello all,

during the migration from TRANSARC fileservers to OpenAFS namei fileservers I ran into major problems. One of them is this: I cannot dump volumes which reside on the OpenAFS fileserver.

[EMAIL PROTECTED]:www]# vos dump www.uniradio.var -file /tmp/dumptest
Error in rx_EndCall
VOLSER: Problems encountered in doing the dump !
Error in vos dump command.
VOLSER: Problems encountered in doing the dump !

The VolserLog shows:
Wed Jan 26 18:30:00 2005 1 Volser: DumpFile: no memory

Any help is appreciated.

Operating system: AIX 5.1
OpenAFS: openafs-snap-2005-01-17

Gunther
[OpenAFS] Unable to zap a volume
Hello all,

during the migration from TRANSARC fileservers to OpenAFS namei fileservers I ran into major problems. One of them is this: I cannot replicate a volume:

[EMAIL PROTECTED]:www]# vos exa www.uni.neurolab
www.uni.neurolab 537069091 RW 8118 K On-line
    sv8.ruf.uni-freiburg.de /vicepcf
    RWrite 537069091 ROnly 537069092 Backup 0
    MaxQuota 2 K
    Creation    Tue Oct 9 16:51:10 2001
    Copy        Wed Jan 26 13:20:01 2005
    Backup      Never
    Last Update Wed Jul 2 12:26:55 2003
    3457 accesses in the past day (i.e., vnode references)

    RWrite: 537069091
    number of sites -> 1
       server sv8.ruf.uni-freiburg.de partition /vicepcf RW Site
[EMAIL PROTECTED]:www]# vos addsite sv8 cf www.uni.neurolab
Added replication site sv8 /vicepcf for volume www.uni.neurolab
[EMAIL PROTECTED]:www]# vos exa www.uni.neurolab
www.uni.neurolab 537069091 RW 8118 K On-line
    sv8.ruf.uni-freiburg.de /vicepcf
    RWrite 537069091 ROnly 537069092 Backup 0
    MaxQuota 2 K
    Creation    Tue Oct 9 16:51:10 2001
    Copy        Wed Jan 26 13:20:01 2005
    Backup      Never
    Last Update Wed Jul 2 12:26:55 2003
    3457 accesses in the past day (i.e., vnode references)

    RWrite: 537069091
    number of sites -> 2
       server sv8.ruf.uni-freiburg.de partition /vicepcf RW Site
       server sv8.ruf.uni-freiburg.de partition /vicepcf RO Site -- Not released
[EMAIL PROTECTED]:www]# vos rel www.uni.neurolab
Volume needs to be salvaged
Error in vos release command.
Volume needs to be salvaged

The FileLog:
Wed Jan 26 17:23:08 2005 VAttachVolume: Error reading smallVnode vol header /vicepcf//V0537069092.vl; error=101
Wed Jan 26 17:23:08 2005 VAttachVolume: Error attaching volume /vicepcf//V0537069092.vl; volume needs salvage; error=101

A salvage does not help. I ran vos remsite ..., which works. Then I tried "vos zap -force" on 537069092. This command loops forever and volserver produces one core dump after another.

Any help is appreciated.

Operating system: AIX 5.1
OpenAFS: openafs-snap-2005-01-17

Gunther
Re: [OpenAFS] OpenAFS-Client crashes AIX 5.1
On Tuesday 18 January 2005 11:45, Horst Birthelmer wrote:
>
> This is IMHO not the same problem. It doesn't look like a kernel
> allocation problem to me.
> BTW, can you start your client with -nosettime. I had some problems
> with that in the past, and never got the time to look deeper into it.
> Maybe it doesn't help but it doesn't hurt either.
>
> Horst

Hello Horst,

I used -nosettime. The problem is still there, but it is not reproducible. I saw it a second time. I tried the tar of my AFS home directory several times. It never succeeded. One attempt ended in a crash. The other times the fileserver holding the volume suddenly became unavailable and remained inaccessible from my test client.

Gunther
Re: [OpenAFS] Further Problem with 1.3.77 under AIX 5.1
On Thursday 13 January 2005 16:26, Derrick J Brashear wrote:
> On Thu, 13 Jan 2005, Hans-Gunther Borrmann wrote:
> > ° The fileservers are multihomed. The test-machine has no access to
> > 10.1.2.x, the fileservers are only reachable by their 132.230.6.x
> > addresses.
>
> I can give you a patch for this to try if you're willing.

It seems that some of my mails get lost. So once more: I am willing to try your patch.

By the way, the problem still persists in openafs-snap-2005-01-17.tar.gz.

Gunther
[OpenAFS] OpenAFS-Client crashes AIX 5.1
Hello all,

What I did:

I compiled openafs-snap-2005-01-17 under AIX 5.1 with the following patch to rxkad.p.h, which defines the maximum ticket length as 344:

*** rxkad.p.h 2005-01-18 09:52:10.0 +0100
--- ./rxkad.p.h.orig 2004-12-07 17:49:11.0 +0100
***************
*** 17,23 ****
--- 17,31 ----
  #define MAXKTCTICKETLIFETIME (30*24*3600)
  #define MINKTCTICKETLEN 32
+ #if defined(AFS_AIX52_ENV)
+ #ifdef __XCOFF64__
+ #define MAXKTCTICKETLEN 12000 /* was 344 */
+ #else
  #define MAXKTCTICKETLEN 344
+ #endif
+ #else
+ #define MAXKTCTICKETLEN 12000 /* was 344 */
+ #endif
  #define MAXKTCNAMELEN 64 /* name & inst should be 256 */
  #define MAXKTCREALMLEN 64 /* should be 256 */

The call to configure is:

CC=cc ./configure --with-afs-sysname=rs_aix51 \
    --enable-namei-fileserver \
    --enable-largefile-fileserver \
    --enable-fast-restart \
    --enable-bitmap-later \
    --enable-tivoli-tsm \
    --enable-transarc-paths \
    --disable-pam

After installing this version on my test machine I started the client, got a token and tarred my AFS home directory to /dev/null. After some time the system crashed. Crash information follows:

[EMAIL PROTECTED]:ras]# kdb /tmp/dump /unix
The specified kernel file is a UP kernel
/tmp/dump mapped from @ 7000 to @ 744f7c7d
Preserving 950098 bytes of symbol table
First symbol __mulh
Component Names:
 1) dmp_minimal [5 entries]
 2) proc [241 entries]
 3) thrd [577 entries]
 4) ldr [2 entries]
 5) errlg [3 entries]
 6) bos [7 entries]
 7) ipc [7 entries]
 8) vmm [20 entries]
 9) rtastrc [8 entries]
10) sscsidd [2 entries]
11) scdisk [6 entries]
12) lvm [2 entries]
13) tty [4 entries]
14) smint0 [6 entries]
15) netstat [10 entries]
16) phxent_dd [5 entries]
17) bldd [5 entries]
18) kbddd [2 entries]
19) mousedd [2 entries]
20) jfs2 [1 entries]
Component Dump Table has 915 entries
START END
3500 01759EC8 _system_configuration+20
2FF3B400 2FF80A70 __ublock+00
2FF22FF4 2FF22FF8 environ+00
2FF22FF8 2FF22FFC errno+00
E000 F000 lkwseg+1000
PFT: id0007 raddr.0180 eaddr.0180 size.. align. valid..1 ros0 holes..0 io.0 seg1 wimg...2
PVT: id0008 raddr.00603000 eaddr. size.. align. valid..1 ros0 holes..0 io.0 seg1 wimg...2
Dump analysis on CHRP_UP_PCI POWER_PC POWER_630 machine with 1 cpu(s) (64-bit registers)
Processing symbol table...
...done
(0)> stat
SYSTEM_CONFIGURATION:
CHRP_UP_PCI POWER_PC POWER_630 machine with 1 cpu(s) (64-bit registers)
SYSTEM STATUS:
sysname... AIX
nodename.. ibm1
release... 1
version... 5
machine... 00415FAC4C00
nid... 415FAC4C
time of crash: Tue Jan 18 10:53:57 2005
age of system: 14 min., 41 sec.
xmalloc debug: disabled
CRASH INFORMATION:
CPU 0 CSA 2FF3B400 at time of crash, error code for LEDs: 3000
pvthread+004700 STACK:
[05B53F2C]afs_RemoveVCB+C8 (69746572 [??])
[05B4A328]afs_GetDCache+001774 (33CF1720, , , 2FF3B188, 2FF3B178, 2FF3B180, 0001)
[05B8464C]BPrefetch+8C (05BD2D68)
[05B84E98]afs_BackgroundDaemon+0002A8 ()
[05B2B390]afs_syscall_call+000238 (0002, , 2FF22FFC, D0B2, , 6000)
[05B2AE94]syscall+A0 (001C, 0002, , 2FF22FFC, D0B2, , 6000)
[3A50].sys_call+00 ()

Gunther
Re: [OpenAFS] crash on AIX 5.2
On Monday 17 January 2005 16:21, Horst Birthelmer wrote:
> The crash is in the xmalloc routine, exactly the one that has always
> crashed with large tickets.
> So the simplest thing is to set the ticket size to a fixed 344 bytes,
> without "ifdefs" and the like, recompile OpenAFS and try again.
> Apparently the guard via XCOFF64 is not sufficient, so that in 32-bit
> mode you still end up in the wrong branch and allocate too much
> kernel memory.
> I did not have it in my variant either; I had the ticket size fixed
> at 344.
>
>
> Horst

Following your advice I have patched rxkad.p.h. This seems to solve the problem that the system crashes almost immediately after getting a token. Now the system still crashes, but only after some time. I'll open a new thread on this issue. My change to rxkad.p.h follows.

Gunther

*** rxkad.p.h 2005-01-18 09:52:10.0 +0100
--- ./rxkad.p.h.orig 2004-12-07 17:49:11.0 +0100
***************
*** 17,23 ****
--- 17,31 ----
  #define MAXKTCTICKETLIFETIME (30*24*3600)
  #define MINKTCTICKETLEN 32
+ #if defined(AFS_AIX52_ENV)
+ #ifdef __XCOFF64__
+ #define MAXKTCTICKETLEN 12000 /* was 344 */
+ #else
  #define MAXKTCTICKETLEN 344
+ #endif
+ #else
+ #define MAXKTCTICKETLEN 12000 /* was 344 */
+ #endif
  #define MAXKTCNAMELEN 64 /* name & inst should be 256 */
  #define MAXKTCREALMLEN 64 /* should be 256 */
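To make the effect of the fixed limit concrete, here is a minimal C sketch, assuming the two constants from the header excerpt above. `store_ticket` is a hypothetical helper invented for this illustration and is not an OpenAFS function; it only shows how a fixed MAXKTCTICKETLEN bounds both the buffer size and the lengths that are accepted.

```c
#include <stddef.h>
#include <string.h>

#define MINKTCTICKETLEN 32   /* value from the rxkad.p.h excerpt */
#define MAXKTCTICKETLEN 344  /* fixed value, without any #ifdef */

/* Hypothetical helper (not part of OpenAFS): copy a ticket into a
 * fixed-size buffer only if its length is within the configured limits.
 * Returns 0 on success, -1 if the ticket is too short or too long. */
static int store_ticket(char dst[MAXKTCTICKETLEN], const char *src, size_t len)
{
    if (src == NULL || len < MINKTCTICKETLEN || len > MAXKTCTICKETLEN)
        return -1;
    memcpy(dst, src, len);
    return 0;
}
```

With the limit fixed at 344, a 12000-byte ticket is simply rejected instead of forcing a large kernel allocation; the price is that such large tickets cannot be used on this client.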
[OpenAFS] Further Problem with 1.3.77 under AIX 5.1
Hello,

I have compiled openafs-snap-2005-01-10. Besides the fact that getting a token and accessing AFS crashes the machine, I found the following problem:

Without any token I tar a large AFS area with campus-wide available files to /dev/null. After some time I get the following error messages:

a rs_aix51/gaussian-03/g03/l405.hlp 6 blocks.
a rs_aix51/gaussian-03/g03/l502.exe 18195 blocks.
tar: 0511-182 Read error on
afs: Lost contact with file server 10.1.2.26 in cell uni-freiburg.de (multi-homed address; other same-host interfaces maybe up)
afs: Lost contact with file server 10.1.2.27 in cell uni-freiburg.de (multi-homed address; other same-host interfaces maybe up)
afs: Lost contact with file server 132.230.6.235 in cell uni-freiburg.de (all multi-homed ip addresses down for the server)
afs: Lost contact with file server 132.230.6.236 in cell uni-freiburg.de (all multi-homed ip addresses down for the server)
afs: setting clock back 10 seconds (of 45, via 10.1.2.26 in cell uni-freiburg.de); clock is still fast.
rs_aix51/gaussian-03/g03/l502.exe: A remote host did not respond within the timeout period.
a rs_aix51/gaussian-03/g03/l502.hlp 68 blocks.
tar: 0511-182 Read error on rs_aix51/gaussian-03/g03/l502.hlp: A remote host did not respond within the timeout period.
a rs_aix51/gaussian-03/g03/l503.exe 2922 blocks.
tar: 0511-182 Read error on rs_aix51/gaussian-03/g03/l503.exe: A remote host did not respond within the timeout period.
a rs_aix51/gaussian-03/g03/l503.hlp 7 blocks.
tar: 0511-182 Read error on rs_aix51/gaussian-03/g03/l503.hlp: A remote host did not respond within the timeout period.
a rs_aix51/gaussian-03/g03/l504.exe 3307 blocks.
tar: 0511-182 Read error on rs_aix51/gaussian-03/g03/l504.exe: A remote host did not respond within the timeout period.
a rs_aix51/gaussian-03/g03/l506.exe 6171 blocks.
tar: 0511-182 Read error on rs_aix51/gaussian-03/g03/l506.exe: A remote host did not respond within the timeout period.
tar: rs_aix51/gaussian-03/g03/l506.hlp: A remote host did not respond within the timeout period.
.
.   # the tar continues a little bit
.
a share/sw-tools-1.0/sbin/lnlibe 1 blocks.
a share/sw-tools-1.0/sbin/lnman 1 blocks.
a share/sw-tools-1.0/sbin/lnsbin 1 blocks.
a share/sw-tools-1.0/sbin/mkman 1 blocks.
a share/sw-tools-1.0/sbin/rmman 1 blocks.
a share/sw-tools-1.0/Links 1 blocks.
a share/sw-tools-1.0/Id 1 blocks.
a share/sw-tools-1.0/README 1 blocks.
a share/sw-tools-1.0/History 5 blocks.
tar: share/xyz: A remote host did not respond within the timeout period.

The tar then finishes.

° The fileservers are multihomed. The test machine has no access to 10.1.2.x; the fileservers are only reachable by their 132.230.6.x addresses.
° After reporting that "all multi-homed ip addresses" are down, the test machine has no further access to AFS, apart from some files which are still in the cache. fs checkservers always says "These servers unavailable due to network or server problems: sv6.ruf.uni-freiburg.de sv7.ruf.uni-freiburg.de".
° Stopping and starting AFS does not help. I have to reboot.
° While the test machine is in this state, a machine connected to the same hub is able to tar the same area without any problems. So there is no problem with the network or the servers themselves.
° The problem is reproducible.
° The problem does not show up if I replace the kernel extensions with those of Hartmut Reuter contained in his 15.03.04 version.

Thanks in advance for any help.

Gunther
Re: [OpenAFS] crash on AIX 5.2
On Tuesday 11 January 2005 17:45, Hartmut Reuter wrote:
> Jeffrey Altman wrote:
> > Hartmut Reuter wrote:
> >> I am in the process of tracking down all differences between my good
> >> version and 1.3.77.
> >>
> >> I am now not very distant from 1.3.77, and at least one problem seems
> >> to be the new code in afs_pioctl.c for get and set tokens along with
> >> the huge ticket size introduced for compatibility with active directory.
> >> Keeping the old ticket size and the old code for tokens in afs_pioctl.c
> >> results in a fairly stable client. At least I can get a token, make
> >> clean in the openafs-tree and make dest without crashing the system.
> >> This is certainly not enough testing for putting it into production,
> >> but a hint where the problem may be hidden.
> >>
> >> Hartmut
> >
> > We know the problem is in the set/get token code on AIX. More than
> > likely the stack is too small to support a 12000 byte object and it
> > is getting blown away on AIX. The question is:
> >
> > * where is this object that is located on the stack?
> >
> > If you can find that, then you will have solved the bug.
>
> Does not look like stack overflow. The crash always happens in xmalloc1:
>
> (0)> f
> pvthread+00A500 STACK:
> [006021F0]xmalloc1+0007AC (0200, F1E00C22E000,
> , F1E00C22E000, 0400, F1E03B964269,
> 0002, 003E4338 [??])
> [00606B70]xmalloc+000208 (??, ??, ??)
> [08E41978]afs_osi_Alloc+5C (??)
> [08EBC6DC]afs_HandlePioctl+0003D4 (, 800C5608800C5608,
> F0002FF3A400, , F0002FF3A438)
> [08EC74F8]afs_syscall_pioctl+000294 (, 800C5608800C5608,
> 2FF21FC0, )
> [08E46000]syscall+0001A0 (00140014, ,
> 800C5608800C5608, 2FF21FC02FF21FC0, , 2E6D70672E6D7067,
> 00800080)
> [08E45DB8]lpioctl+50 (, 800C5608800C5608,
> 2FF21FC0, )
> [379C]sc_msr_2_point+000028 ()
> Not a valid dump data area @ 2FF21CF0
> (0)>
>
> So storage on the kernel heap was probably overwritten.
>
> Hartmut
>
> > Jeffrey Altman

--
Hans-Gunther Borrmann
Re: [OpenAFS] crash on AIX 5.2
On Tuesday 11 January 2005 15:24, Horst Birthelmer wrote:
> On Jan 11, 2005, at 3:17 PM, Hans-Gunther Borrmann wrote:
> > On Tuesday 11 January 2005 14:30, Horst Birthelmer wrote:
> >> you shouldn't need the sysname if everything is OK... ;-)
> >
> > compiles without "sysname".
> >
> >> Now check if there is a passage in your rxkad.h that looks like:
> >>
> >>
> >> #if defined(AFS_AIX52_ENV)
> >> #ifdef __XCOFF64__
> >> #define MAXKTCTICKETLEN 12000 /* was 344 */
> >> #else
> >> #define MAXKTCTICKETLEN 344
> >> #endif
> >> #else
> >> #define MAXKTCTICKETLEN 12000 /* was 344 */
> >> #endif
> >>
> >> it's pretty much at the beginning
> >
> > rxkad.h contains these lines.
> >
> >> BTW, this is a 32 bit Kernel on a AIX 5.2 machine?? Right??
> >
> > It is a 32 bit kernel on an AIX 5.1 (!) machine.
>
> If it crashes you can try to change the lines above to AFS_AIX51_ENV
> since that would be defined on a AIX 5.2 machine as well.
> I don't know for sure since I never tested on AIX 5.1...
>
> Horst

Still crashes :-(.

Gunther
Re: [OpenAFS] crash on AIX 5.2
On Tuesday 11 January 2005 14:30, Horst Birthelmer wrote:
>
> you shouldn't need the sysname if everything is OK... ;-)

It compiles without "sysname".

> Now check if there is a passage in your rxkad.h that looks like:
>
>
> #if defined(AFS_AIX52_ENV)
> #ifdef __XCOFF64__
> #define MAXKTCTICKETLEN 12000 /* was 344 */
> #else
> #define MAXKTCTICKETLEN 344
> #endif
> #else
> #define MAXKTCTICKETLEN 12000 /* was 344 */
> #endif
>
> it's pretty much at the beginning

rxkad.h contains these lines.

> BTW, this is a 32 bit Kernel on a AIX 5.2 machine?? Right??

It is a 32 bit kernel on an AIX 5.1 (!) machine.

Gunther
Re: [OpenAFS] crash on AIX 5.2
On Tuesday 11 January 2005 11:14, Horst Birthelmer wrote:
> On Jan 11, 2005, at 11:09 AM, Hans-Gunther Borrmann wrote:
> >
> > I have compiled 1.3.77 under AIX 5.1 and see the same problem. In my
> > case the machine crashes after getting a token. It seems to work before.
> >
> > I see this problem with all versions of OpenAFS I compiled. My
> > workaround is always to use kernel extensions I got some times ago
> > from Hartmut Reuter.
> > They work. If this is an old problem, is any patch available somwhere?
>
> Yes. In CVS for some time now
>
>
> Horst

Hello,

I've tried the 10.1.05 snapshot and the machine still crashes.

What I did:

I fetched openafs-snap-2005-01-10.tar.gz and stored the contents of the tar to directory .

cd /../openafs-1.3.77; make clean
rsync -av / /../openafs-1.3.77 # possible (??) or is this my fault

Call of configure:

CC=cc ./configure --with-afs-sysname=rs_aix51 \
    --enable-namei-fileserver \
    --enable-largefile-fileserver \
    --enable-tivoli-tsm \
    --enable-transarc-paths \
    --disable-pam
make
make dest

Any idea?

Gunther
Re: [OpenAFS] crash on AIX 5.2
On Friday 07 January 2005 21:39, Horst Birthelmer wrote:
> On Jan 5, 2005, at 9:04 AM, Dr. Elmar Abeln wrote:
> > Hi all !
> >
> > i tried to install Open-AFS (1.3.74) on an AIX 5.2
> > The Compile part worked without an error. But after loading
> > the kernel extensions (export.ext and afs.ext.32) and starting
> > the AFS-Daemon afsd the machine crashes.
>
> The Machine crashes right away or after getting a token??
> Since you use the afs.ext.32 module it is possible that you run into
> that old kernel allocation problem.
>

I have compiled 1.3.77 under AIX 5.1 and see the same problem. In my case the machine crashes after getting a token. It seems to work before that.

I see this problem with all versions of OpenAFS I have compiled. My workaround is always to use kernel extensions I got some time ago from Hartmut Reuter. They work. If this is an old problem, is a patch available somewhere?

Gunther
Re: [OpenAFS] Vos move / vos remsite errors..
On Thursday 18 November 2004 12:58, Lars Schimmer wrote:
> 2. vos remsite doesn't work as expected.
> On my Debian 1.3.73 version (Debian sarge, 1.3.73 experimental source deb,
> kernel 2.4.27) I run the fileserver and a few RO copies of volumes.
> Now I want to move the HD from this server to another. vos move doesn't
> work with RO copies, so: vos remsite server a, vos addsite server b.
> Here is the workflow:
>
> vos listvol tetris
> Total number of volumes on server tetris partition /vicepb: 12
> CVS.readonly 536871206 RO 1761781 K On-line
>
> ~ vos remsite tetris b CVS
> Deleting the replication site for volume 536871205 ...Removed replication
> site tetris /vicepb for volume CVS
>
> [EMAIL PROTECTED]:~ ! vos listvol tetris
> Total number of volumes on server tetris partition /vicepb: 12
> CVS.readonly 536871206 RO 1761781 K On-line
>
> Huh?
> I deleted the replication site and right after that the replication of the
> volume is still online?
> And another thing:
>
> [EMAIL PROTECTED]:~ ! vos remsite tetris b CVS
> This site is not a replication site
> Error in vos remsite command.
> VOLSER: illegal operation
>
> Huh?
> The RO copy is still there, but can't be removed? A salvage says everything
> is OK and a syncvldb or syncserv puts the RO copy back in use.
> How can I delete the RO copy on this particular fileserver?

From the manual:

The vos remsite command removes the read-only replication site specified by the -machine and -partition arguments from the Volume Location Database (VLDB) entry for the indicated volume, which is read/write.

This command is useful for removing read-only sites that were mistakenly created with the vos addsite command, before the vos release command actually releases them. If a read-only copy already exists at the site, it is not affected. However, if this read-only site was the last site housing any version of the volume, then the entire VLDB entry is removed, even if a copy of the read-only version still actually exists at the site. The VL Server does not correct the discrepancy until the vos syncserv and vos syncvldb commands are run.

Cautions

Do not use this command as the standard way to remove a read-only volume, because it can create a discrepancy between the VLDB and the volumes on file server machines. Use the vos remove command instead.

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652  Fax: +49 761/203-4643
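The manual's advice to use vos remove rather than vos remsite might look roughly like the sketch below for the volume in the transcript. It is a dry run that only prints the commands. The server, partition and volume names are taken from the quoted output; the helper name remove_ro_site is made up for illustration, and whether vos remove succeeds once the VLDB site entry has already been deleted by vos remsite is not confirmed in this thread.

```shell
# Dry-run helper (hypothetical name): prints the vos commands instead of
# running them, so the sketch works on a machine without AFS installed.
remove_ro_site() {
    server=$1
    partition=$2
    volume=$3
    # Remove the read-only copy at this site (volume data and VLDB site entry).
    echo "vos remove -server $server -partition $partition -id $volume.readonly"
    # Afterwards, inspect what the VLDB and the servers report for the volume.
    echo "vos examine -id $volume"
}

remove_ro_site tetris b CVS
```

Dropping the echo wrappers would turn the two lines into the real commands; that step is left to the administrator.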
Re: [OpenAFS] how to restore data from multiple incremental dumps
On Wednesday 17 November 2004 00:57, Tom Jones wrote:
> Hi
>
> We have OpenAFS 1.2.111-rhel3.0.1 on Redhat Linux 3.0.1. Since there is no
> direct TSM client available, we are using butc to dump data to a local disk
> backup data file and then archiving to the TSM server using the TSM B/A client.
>
> We are taking a full dump on Saturday and differential incremental dumps on
> Mon, Tues, Wed, Thursday.
>
> /dev0sat expires in 17d
> /dev0mon expires in 16d
> /dev0tues expires in 15d
> /dev0wed expires in 14d
> /dev0thur expires in 13d
>
> How to restore data dumped on Wednesday? According to the documents
>
> backup volrestore -server -partition
> -volume -portoffset a b c d
>
> where port offset "a" points to the full dump backup data file,
> "b" points to the incremental backup data file (Monday),
> "c" points to the subsequent incremental data file (Tuesday),
> "d" points to the subsequent incremental data file (Wednesday)

In which document did you find that information? The AFS administrator reference says:

backup volrestore -server -partition -volume + [-extension ] [-date +] [-portoffset +] [-n] [-localauth] [-cell ] [-help]

backup volr -s -pa -v + [-e ] [-d +] [-po +] [-n] [-l] [-c ] [-h]

-date
Specifies a date and optionally time; the restored volume includes data from dumps performed before the date only. Provide a value in the format mm/dd/ [hh:MM], where the required mm/dd/ portion indicates the month (mm), day (dd), and year (), and the optional hh:MM portion indicates the hour and minutes in 24-hour format (for example, the value 14:36 represents 2:36 p.m.). If omitted, the time defaults to 59 seconds after midnight (00:00:59 hours).

Valid values for the year range from 1970 to 2037; higher values are not valid because the latest possible date in the standard UNIX representation is in February 2038. The command interpreter automatically reduces any later date to the maximum value.

If this argument is omitted, the Backup System restores all possible dumps including the most recently created.

Note: A plus sign follows this argument in the command's syntax statement because it accepts a multiword value which does not need to be enclosed in double quotes or other delimiters, not because it accepts multiple dates. Provide only one date (and optionally, time) definition.

-portoffset
Specifies one or more port offset numbers (up to a maximum of 128), each corresponding to a Tape Coordinator to use in the operation. If there is more than one value, the Backup System uses the first one when restoring the full dump of each volume, the second one when restoring the level 1 incremental dump of each volume, and so on. It uses the final value in the list when restoring dumps at the corresponding depth in the dump hierarchy and all dumps at lower levels.

Provide this argument unless the default value of 0 (zero) is appropriate for all dumps. If 0 is just one of the values in the list, provide it explicitly in the appropriate order.

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652  Fax: +49 761/203-4643
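Going by the reference text quoted above, the restore point would be chosen with -date, while -portoffset picks a Tape Coordinator per dump level rather than a dump file. A minimal dry-run sketch under several assumptions: the server, partition and volume names are hypothetical, the Wednesday dump is assumed to have been taken on 17 November 2004 (so a cutoff of the following day, at the default 00:00:59, would still include it), the four-digit year in the date is assumed, and a single port offset of 0 assumes one Tape Coordinator handles every dump level.

```shell
# Dry-run helper (hypothetical name): prints the backup command rather than
# executing it, so it can be reviewed before being run for real.
volrestore_cmd() {
    server=$1
    partition=$2
    volume=$3
    cutoff=$4      # only dumps performed before this date are restored
    port=$5        # port offset of the Tape Coordinator to use
    echo "backup volrestore -server $server -partition $partition -volume $volume -date $cutoff -portoffset $port"
}

# Hypothetical names; cutoff one day after the assumed Wednesday dump.
volrestore_cmd fs1.example.org /vicepa user.tom 11/18/2004 0
```

Adding -n to the printed command would, per the syntax quoted above, be available for a trial run before restoring anything.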
Re: [OpenAFS] Inquiry: Backing up OpenAFS with IBM Tivoli Storage Manager (TSM)
On Thursday 07 October 2004 11:06, Christopher Odenbach wrote:
> Hi,
>
> at the University of Paderborn we are using OpenAFS on Linux servers and
> Tivoli on a Solaris box. We would like to back up our AFS space with Tivoli,
> but unfortunately up to now IBM has only released an IBM AFS client for IBM
> AIX.
>
> I am actually in contact with IBM and have asked whether it would be
> possible to release a client for OpenAFS (on Linux or Solaris). As this
> does not seem to be too easy IBM wants to know if anybody else would be
> interested in such a connector.
>
> So I simply ask the question: Would anyone else be interested in an OpenAFS
> connector for Tivoli?

Definitely YES.

> How do other people manage their backups?

Our servers still run under AIX and I use butc and TSM 5.2.

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652  Fax: +49 761/203-4643
Re: [OpenAFS] klog incompatibility
On Thursday 05 August 2004 17:32, Hartmut Reuter wrote:
> As far as I remember it is the crypt() call which makes the trouble.
> This is located in src/des/crypt.c
> Probably it's sufficient to compile this one without -O.

You are right. It is sufficient to compile src/des/crypt.c without -O.

Gunther
--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652  Fax: +49 761/203-4643
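The thread does not say how the -O flag was removed for that one file. One possible approach, sketched here purely as an illustration, is to filter the optimisation flags out of the compiler flag string before compiling src/des/crypt.c by hand. The helper name strip_opt and the example flags are hypothetical and not taken from the OpenAFS build system.

```shell
# Hypothetical helper: returns the given flag string with every -O* option
# (-O, -O2, -O3, ...) removed and all other flags kept in order.
strip_opt() {
    out=""
    for flag in $1; do
        case $flag in
            -O*) ;;                               # drop optimisation flags
            *) out="$out${out:+ }$flag" ;;        # keep everything else
        esac
    done
    echo "$out"
}

# Example flags are made up for illustration.
strip_opt "-g -O2 -I../include"
```

The resulting string could then be passed to the compiler for crypt.c alone, leaving the rest of the tree built with its usual flags.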
Re: [OpenAFS] afs.ext.32 and afs.ext.64 not generated
On Wednesday 16 June 2004 15:05, Jeffrey Hutzelman wrote:
> On Tuesday, June 15, 2004 23:06:22 +0200 Horst Birthelmer
> <[EMAIL PROTECTED]> wrote:
> >> Thank you. The kernel extensions are now generated. By the way
> >> MakefileProto.AIX is generated by configure so the suggested change
> >> has to be made after each call to configure at the moment.
> >
> > That's right!
> >
> > Somebody has to check that into cvs
>
> So make the change to MakefileProto.AIX.in, and submit a patch.

Here is my patch:

*** MakefileProto.AIX.in	2004-06-16 16:59:24.0 +0200
--- MakefileProto.AIX.in.orig	2004-03-17 08:51:57.0 +0100
***************
*** 85,91 ****
      ln -fs /usr/include/sys sys
      ln -fs /usr/include/nfs nfs
      ln -fs /usr/include/jfs ufs
!     for m in ${KMODS} ; do \
          KDIR=MODLOAD-$$m ; \
          mkdir -p $${KDIR} ; \
          ln -fs ../Makefile $${KDIR}/Makefile ; \
--- 85,91 ----
      ln -fs /usr/include/sys sys
      ln -fs /usr/include/nfs nfs
      ln -fs /usr/include/jfs ufs
!     for m in $${KMODS} ; do \
          KDIR=MODLOAD-$$m ; \
          mkdir -p $${KDIR} ; \
          ln -fs ../Makefile $${KDIR}/Makefile ; \
***************
*** 94,100 ****
      done

  ${COMPDIRS} ${INSTDIRS} ${DESTDIRS}:
!     for m in ${KMODS} ; do \
          KDIR=MODLOAD-$$m ; \
          echo Building in directory: $${KDIR} ; \
          if [ "$$m" = "32" ] ; then \
--- 94,100 ----
      done

  ${COMPDIRS} ${INSTDIRS} ${DESTDIRS}:
!     for m in $${KMODS} ; do \
          KDIR=MODLOAD-$$m ; \
          echo Building in directory: $${KDIR} ; \
          if [ "$$m" = "32" ] ; then \

--
Hans-Gunther Borrmann <[EMAIL PROTECTED]>
Rechenzentrum der Universitaet Freiburg
Hermann-Herder-Str. 10, D79104 FREIBURG
Tel.: +49 761/203-4652  Fax: +49 761/203-4643
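The two spellings in the patch differ in who expands the variable. In a make recipe, `${KMODS}` is replaced by make with its own value before the shell runs, while `$${KMODS}` reaches the shell as `${KMODS}` and is expanded there. Judging by the file timestamps, the version with `${KMODS}` appears to be the modified one, though the thread does not state this explicitly. The sketch below imitates what the shell ends up looping over in each case; the value "32 64" for the make variable and the helper name count_iterations are assumptions for illustration.

```shell
# Assumed value of the make variable KMODS; the shell itself has no KMODS.
unset KMODS
make_value="32 64"

# Counts how many times a "for m in <words>" loop would run.
count_iterations() {
    n=0
    for m in $1; do
        n=$((n + 1))
    done
    echo "$n"
}

# Recipe written as ${KMODS}: make has already substituted its value.
count_iterations "$make_value"

# Recipe written as $${KMODS}: the shell expands its own, unset variable.
count_iterations "${KMODS}"
```

With the shell variable unset the second loop body never runs, which would be consistent with the thread subject that afs.ext.32 and afs.ext.64 were not generated.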