[OpenAFS] rxk: authentication expired

2014-03-14 Thread Timothy Balcer
and retry the release, however I am concerned about the error. How is it that an authentication can expire when the client is automatically reinitiated and aklog'd every 6 hours? -- Timothy Balcer / IT Services Telmate / San Francisco, CA Direct / (415) 300-4313 Customer Service / (800) 205-5510

[OpenAFS] Re: Ubik trouble

2014-01-12 Thread Timothy Balcer
12, 2014 at 4:17 PM, Timothy Balcer wrote: > Wow. Just as you send it outyou find the problem. Always the way :-) > > This turned out to be a subtle network problem. There was a change in the > cross colo VPN link by our provider, and while it didn't affect the > majori

[OpenAFS] Re: Ubik trouble

2014-01-12 Thread Timothy Balcer
oblem like this, try bypassing parts of your network to see if there is a network fault hiding amongst the trees. Best, On Sun, Jan 12, 2014 at 3:54 PM, Timothy Balcer wrote: > Hey folks. Odd problem. I have gone over many things, and I am stumped. I > am tempted to just destroy and rebui

[OpenAFS] Ubik trouble

2014-01-12 Thread Timothy Balcer
.7 Any help is appreciated! I have already upgraded and rebooted all three servers, lowest to highest, to no avail. I also attempted moving the data files out of the way and starting the affected vlserver, but the same symptoms remain. -- Timothy Balcer / IT Services Telmate / San Francisco,

[OpenAFS] Connection timed out and device doesn't exist finally solved

2013-12-24 Thread Timothy Balcer
ss did not release it, even though the IP was no longer active. So I moved one volume. That worked. But I didn't want to do that for the entire fileserver. So I entered -rxbind to the fileserver process and restarted it. Voila. Problem solved. -- Timothy Balcer / IT Services Telmate / Sa

Re: [OpenAFS] Re: Quickie: server prefs

2013-12-20 Thread Timothy Balcer
Okie.. that directory was, in fact, mounted RW in the RO path. Interesting! Blind sided me a bit there :) Let me ask, is this what happens if you use the -rw flag to fs mkmount? I just re read the man page and.. it does seem to be true. On Fri, Dec 20, 2013 at 9:10 PM, Timothy Balcer wrote

Re: [OpenAFS] Re: Quickie: server prefs

2013-12-20 Thread Timothy Balcer
ume access > will be to the rw volume. (unless a mount point is explicitly to the > volume.readonly.) > > > > > On 12/20/2013 7:11 PM, Timothy Balcer wrote: > > ˇTo be more clear.. the problematic server (one I was salvaging) had thr > > RW/RO pair, and was pref 5005.

[OpenAFS] Re: Quickie: server prefs

2013-12-20 Thread Timothy Balcer
000. It still went for the 35000 pref server. On Fri, Dec 20, 2013 at 3:10 PM, Timothy Balcer wrote: > I took down a file server for salvaging today, part of routine > maintenance. It had a RW/RO pair on it of volume 'photos', and there were > two RO copies on two other fileserv

[OpenAFS] Quickie: server prefs

2013-12-20 Thread Timothy Balcer
ocal RW/RO volume for the RO path.. doing an strace showed this was true. The problem will fix itself once the salvage is over, but this is disturbing... what can I do to avoid this sort of problem in future? Am I doing something wrong here? Better troubleshooting? Advice appreciated :) -- Tim

Re: [OpenAFS] Re: Client Cache Question

2013-07-09 Thread Timothy Balcer
rs to AFS. It breaks security, of course. But for performance purposes, and on a private VLAN, it serves my purpose well. On Fri, Jun 28, 2013 at 4:15 PM, Andrew Deason wrote: > On Fri, 28 Jun 2013 12:29:38 -0700 > Timothy Balcer wrote: > > > On another VM in the same local colo,

Re: [OpenAFS] Re: Client Cache Question

2013-06-28 Thread Timothy Balcer
In addition, plenty of memory free on both client and server, and no CPU spikes. On Fri, Jun 28, 2013 at 2:14 PM, Timothy Balcer wrote: > strace gave zero lines more after these after the cat totally completed. > So it blocked on close. > > > On Fri, Jun 28, 2013 at 1:12 PM,

Re: [OpenAFS] Re: Client Cache Question

2013-06-28 Thread Timothy Balcer
strace gave zero lines more after these after the cat totally completed. So it blocked on close. On Fri, Jun 28, 2013 at 1:12 PM, Timothy Balcer wrote: > Stracing that 'cat' yields this at the end: > read(3, "rojects_limit: 10\n authenticati"..., 32768) = 26624 >

Re: [OpenAFS] Re: Client Cache Question

2013-06-28 Thread Timothy Balcer
he transfer to complete to see what the final strace lines give. -- Timothy Balcer / IT Services Telmate / San Francisco, CA Direct / (415) 300-4313 Customer Service / (800) 205-5510

Re: [OpenAFS] Re: Client Cache Question

2013-06-28 Thread Timothy Balcer
o the client in question to see what changes. On Tue, Jun 25, 2013 at 2:03 PM, Timothy Balcer wrote: > Im aware of udp protocol problems.. I've tested network throughput and > responsivity for udp/tcp several different ways, and it's dandy. > > Ok I'll do a f

Re: [OpenAFS] Re: Client Cache Question

2013-06-25 Thread Timothy Balcer
, 25 Jun 2013 13:49:06 -0700 > Timothy Balcer wrote: > > > So, this means that there is no networkly or I/O reason on the > > destination or source that could be the culprit particularly. > > That's not necessarily true; even discarding the issues with Rx (the > transpor

Re: [OpenAFS] Re: Client Cache Question

2013-06-25 Thread Timothy Balcer
trace to all the afs background daemons? Best, On Tue, Jun 25, 2013 at 12:14 PM, Andrew Deason wrote: > On Mon, 24 Jun 2013 15:58:45 -0700 > Timothy Balcer wrote: > > >- Soon after client restart, rsync is very fast.. less than a second, > >compared to rs

Re: [OpenAFS] Re: Client Cache Question

2013-06-25 Thread Timothy Balcer
at 5:50 PM, Anne Salemme wrote: > what about the cache? how big is it, and is it on its own disk partition? > > anne > > > ---------- > *From:* Timothy Balcer > *To:* Andrew Deason > *Cc:* openafs-info@openafs.org > *Sent:* Monday, June 24, 20

Re: [OpenAFS] Re: Client Cache Question

2013-06-24 Thread Timothy Balcer
y. - Network speeds are good On Mon, Jun 24, 2013 at 1:39 PM, Andrew Deason wrote: > On Fri, 21 Jun 2013 16:26:22 -0700 > Timothy Balcer wrote: > > > This seems counter intuitive... the 100 or so files do not go over the > > 500,000 block cache size. They are fairly s

[OpenAFS] Client Cache Question

2013-06-21 Thread Timothy Balcer
with chunksize 13 and the same behavior occurs as before. This seems counter intuitive... the 100 or so files do not go over the 500,000 block cache size. They are fairly small (10's to 100's of kilobytes). Why would increasing cache size impact performance Negatively in such a case? -

Re: [OpenAFS] Re: Tuning the client and server

2013-01-28 Thread Timothy Balcer
The files are being not being read after they are written.. its a bulk write operation. Thanks or that tip on dcache.. I somehow didn't catch it was for a memory cache only... On Mon, Jan 28, 2013 at 2:05 PM, Andrew Deason wrote: > On Mon, 28 Jan 2013 12:58:17 -0800 > Timothy Ba

[OpenAFS] Tuning the client and server

2013-01-28 Thread Timothy Balcer
ded. I feel I am missing something.. although I do understand the basics here, and the relation of the various parameters to activity, I am guessing that I am missing something important re: tuning client vs. fileserver for bulk operations. Can anyone advise? Thanks muchly, -- Timothy Balcer / IT Services

[OpenAFS] Huh :-)

2012-12-05 Thread Timothy Balcer
Well now, this seems like a very nice possibility as an AFS file server cluster, in a box: SuperMicro * 2015TA-HTRF *Just thought I'd share the smile I had when I saw that it had 8 nodes per 2U ;) -- Timothy Balcer / IT Services Telmate / San Francisco, CA Direct / (415) 300-4313 Cus

Re: [OpenAFS] difficulty with afsio

2012-11-12 Thread Timothy Balcer
ix/HDRWQ91.html Thanks for the pointer! On Mon, Nov 12, 2012 at 5:38 AM, Jeffrey Altman wrote: > On 11/10/2012 2:45 PM, Timothy Balcer wrote: >> If I turn off dynroot, only the read only volume shows up in the tree, >> and the read write volume is not accessible. This coincides

Re: [OpenAFS] Cache partition choice still limited to ext2 on Linux?

2012-11-10 Thread Timothy Balcer
il: +49 (0)176 34473913 > GPG Public Key C2E467BB | Jabber: dirk.heinri...@altum.de -- Timothy Balcer / IT Services Telmate / San Francisco, CA Direct / (415) 300-4313 Customer Service / (800) 205-5510 ___ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info

[OpenAFS] difficulty with afsio

2012-11-10 Thread Timothy Balcer
ing odd, or non standard, with my cell configuration, and it is affecting my ability to use afsio to write/read files to the RW volumes. afsio isn't using the afs client and is going after things directly. Any advice? I am free to reconfigure my cell as I need to accommodate. -- Timothy Balcer /

[OpenAFS] Re: Advice on a use case

2012-11-09 Thread Timothy Balcer
it done, I can send it off for people to review and use :) If I could have some help with getting afsio working properly, I think that will be enough to get me going. Thanks also to everyone else as well.. I am learning more about the underpinnings of file systems working with AFS than I have in a very long time! -- Timothy Balcer / IT Services Telmate / San Francisco, CA Direct / (415) 300-4313 Customer Service / (800) 205-5510 -- Timothy Balcer / IT Services Telmate / San Francisco, CA Direct / (415) 300-4313 Customer Service / (800) 205-5510

Re: [OpenAFS] Re: Advice on a use case

2012-11-09 Thread Timothy Balcer
On Fri, Nov 9, 2012 at 9:45 AM, Andrew Deason wrote: > On Thu, 8 Nov 2012 22:48:56 -0800 > Timothy Balcer wrote: > > > Well, unless I am missing something seriously obvious, for example it > > took 1.5 hours to rsync a subdirectory to an AFS volume that had not a > &g

Re: [OpenAFS] Re: Advice on a use case

2012-11-08 Thread Timothy Balcer
On Tue, Nov 6, 2012 at 8:49 AM, Andrew Deason wrote: > On Tue, 6 Nov 2012 00:06:53 -0800 > Timothy Balcer wrote: > > > I have a need to think about replicating large volumes (multigigabyte) > > of large number (many terabytes of data total), to at least two other > &g

[OpenAFS] Advice on a use case

2012-11-06 Thread Timothy Balcer
talking about multiple gigabytes per release, I can imagine it being extremely difficult. Is there a schema that i can use with OpenAFS that will help alleviate this problem? Or perhaps another approach I am missing that may solve it better? Best, --Timothy Balcer PS: FYI the reason I chose AFS

[OpenAFS] [workaround] NFS export oddness

2012-11-01 Thread Timothy Balcer
I found a workaround for now.. I hope it finds some use amongst the community, assuming the linux NFS translator really isn't working properly (as seems to be intimated in the emails I recently found) You can use unionfs with an afs volume and a normal volume, and then NFS export THAT unionfs dire

[OpenAFS] NFS export oddness

2012-11-01 Thread Timothy Balcer
(No such file or directory) open("/proc/fs/openafs/afs_ioctl", O_RDWR) = 3 ioctl(3, CAPI_REGISTER or SNDCTL_COPR_LOAD, 0x7fff18ec3a10) = -1 ENODEV (No such device) close(3)= 0 write(2, "Sorry, the nfs-exporter type is "..., 75Sorry, the nfs-exporter type is curren

Re: [OpenAFS] Re: odd problem with RW site after a botched replica

2012-10-31 Thread Timothy Balcer
fileserver > processes. > > Excellent, and it's in line with what I thought was the case. Thanks muchly. > -- > Andrew Deason > adea...@sinenomine.net > > ___ > OpenAFS-info mailing list > OpenAFS-info@openafs.org > ht

Re: [OpenAFS] Re: odd problem with RW site after a botched replica

2012-10-31 Thread Timothy Balcer
wrote: > On Tue, 30 Oct 2012 20:07:57 -0700 > Timothy Balcer wrote: > > > In other news, the latest salvage has been running for 12 hours... I > > straced the busiest pid and it is happily verifying all the links and > > contents (open(), close(), pread() ad infinitum),

Re: [OpenAFS] Re: odd problem with RW site after a botched replica

2012-10-30 Thread Timothy Balcer
p you all posted. There wasn't an error in the AFS logs that > indicated that salvager proceses had been killed due to OOM. It was only in > the kernel logs. > > Salvage was a success! The volume is back in operation and working correctly. So, one more for the books. When salvaging, watch for OOM killing. > -- > Timothy Balcer >

Re: [OpenAFS] Re: odd problem with RW site after a botched replica

2012-10-30 Thread Timothy Balcer
Oct 30, 2012 at 11:14 PM, Timothy Balcer wrote: > >> The OOM killer *is* the kernel, so the AFS logs just know it's dead, >> >>> not that the kernel >>> decided "hy" >>> >> >> Yeah.. I knew that ;-) What I was suggesting w

Re: [OpenAFS] Re: odd problem with RW site after a botched replica

2012-10-30 Thread Timothy Balcer
p SIGTERM, and log it on the way out, no? -- > Derrick > -- Timothy Balcer / IT Services Telmate / San Francisco, CA Direct / (415) 300-4313 Customer Service / (800) 205-5510

Re: [OpenAFS] Re: odd problem with RW site after a botched replica

2012-10-30 Thread Timothy Balcer
salvage, and it may succeed. I'll keep you all posted. There wasn't an error in the AFS logs that indicated that salvager proceses had been killed due to OOM. It was only in the kernel logs. -- Timothy Balcer

Re: [OpenAFS] Re: odd problem with RW site after a botched replica

2012-10-30 Thread Timothy Balcer
ot;renewing" a volume in the VLDB from the data set on the partition? That is to say, can you remove a volume from the VLDB and regenerate it in the VLDB from what is on disk alone? Thanks again for all your help. :) > > -- > Andrew Deason > adea...@sinenomine.net > > _

Re: [OpenAFS] Re: odd problem with RW site after a botched replica

2012-10-29 Thread Timothy Balcer
On Mon, Oct 29, 2012 at 12:03 PM, Andrew Deason wrote: > On Mon, 29 Oct 2012 11:41:09 -0700 > Timothy Balcer wrote: > > > I have a volume that had a replica, which has now been removed with > > vos remsite. > > In the future, you should remove RO sites with 'vos

[OpenAFS] odd problem with RW site after a botched replica

2012-10-29 Thread Timothy Balcer
ere is the vos listvldb output: user.snap RWrite: 536870935 number of sites -> 1 server afs-db.foo.com partition /vicepb RW Site -- New release The rest of the volumes have no "release" notation on them for the RW sites. Any pointers? -- Timothy Balcer / IT Services Tel