and retry the release,
however I am concerned about the error. How is it that an authentication
can expire when the client is automatically reinitiated and aklog'd every 6
hours?
--
Timothy Balcer / IT Services
Telmate / San Francisco, CA
Direct / (415) 300-4313
Customer Service / (800) 205-5510
12, 2014 at 4:17 PM, Timothy Balcer wrote:
> Wow. Just as you send it outyou find the problem. Always the way :-)
>
> This turned out to be a subtle network problem. There was a change in the
> cross colo VPN link by our provider, and while it didn't affect the
> majori
oblem like this, try bypassing parts of your network to see if there is a
network fault hiding amongst the trees.
Best,
On Sun, Jan 12, 2014 at 3:54 PM, Timothy Balcer wrote:
> Hey folks. Odd problem. I have gone over many things, and I am stumped. I
> am tempted to just destroy and rebui
.7
Any help is appreciated! I have already upgraded and rebooted all three
servers, lowest to highest, to no avail. I also attempted moving the data
files out of the way and starting the affected vlserver, but the same
symptoms remain.
--
Timothy Balcer / IT Services
Telmate / San Francisco,
ss did not release it, even though the IP
was no longer active.
So I moved one volume. That worked. But I didn't want to do that for the
entire fileserver.
So I entered -rxbind to the fileserver process and restarted it.
Voila. Problem solved.
--
Timothy Balcer / IT Services
Telmate / Sa
Okie.. that directory was, in fact, mounted RW in the RO path. Interesting!
Blind sided me a bit there :)
Let me ask, is this what happens if you use the -rw flag to fs mkmount? I
just re read the man page and.. it does seem to be true.
On Fri, Dec 20, 2013 at 9:10 PM, Timothy Balcer wrote
ume access
> will be to the rw volume. (unless a mount point is explicitly to the
> volume.readonly.)
>
>
>
>
> On 12/20/2013 7:11 PM, Timothy Balcer wrote:
> > ˇTo be more clear.. the problematic server (one I was salvaging) had thr
> > RW/RO pair, and was pref 5005.
000.
It still went for the 35000 pref server.
On Fri, Dec 20, 2013 at 3:10 PM, Timothy Balcer wrote:
> I took down a file server for salvaging today, part of routine
> maintenance. It had a RW/RO pair on it of volume 'photos', and there were
> two RO copies on two other fileserv
ocal RW/RO volume for the RO path.. doing an
strace showed this was true.
The problem will fix itself once the salvage is over, but this is
disturbing... what can I do to avoid this sort of problem in future? Am I
doing something wrong here? Better troubleshooting? Advice appreciated :)
--
Tim
rs to AFS. It
breaks security, of course. But for performance purposes, and on a private
VLAN, it serves my purpose well.
On Fri, Jun 28, 2013 at 4:15 PM, Andrew Deason wrote:
> On Fri, 28 Jun 2013 12:29:38 -0700
> Timothy Balcer wrote:
>
> > On another VM in the same local colo,
In addition, plenty of memory free on both client and server, and no CPU
spikes.
On Fri, Jun 28, 2013 at 2:14 PM, Timothy Balcer wrote:
> strace gave zero lines more after these after the cat totally completed.
> So it blocked on close.
>
>
> On Fri, Jun 28, 2013 at 1:12 PM,
strace gave zero lines more after these after the cat totally completed. So
it blocked on close.
On Fri, Jun 28, 2013 at 1:12 PM, Timothy Balcer wrote:
> Stracing that 'cat' yields this at the end:
> read(3, "rojects_limit: 10\n authenticati"..., 32768) = 26624
>
he transfer to complete to see
what the final strace lines give.
--
Timothy Balcer / IT Services
Telmate / San Francisco, CA
Direct / (415) 300-4313
Customer Service / (800) 205-5510
o the client
in question to see what changes.
On Tue, Jun 25, 2013 at 2:03 PM, Timothy Balcer wrote:
> Im aware of udp protocol problems.. I've tested network throughput and
> responsivity for udp/tcp several different ways, and it's dandy.
>
> Ok I'll do a f
, 25 Jun 2013 13:49:06 -0700
> Timothy Balcer wrote:
>
> > So, this means that there is no networkly or I/O reason on the
> > destination or source that could be the culprit particularly.
>
> That's not necessarily true; even discarding the issues with Rx (the
> transpor
trace
to all the afs background daemons?
Best,
On Tue, Jun 25, 2013 at 12:14 PM, Andrew Deason wrote:
> On Mon, 24 Jun 2013 15:58:45 -0700
> Timothy Balcer wrote:
>
> >- Soon after client restart, rsync is very fast.. less than a second,
> >compared to rs
at 5:50 PM, Anne Salemme wrote:
> what about the cache? how big is it, and is it on its own disk partition?
>
> anne
>
>
> ----------
> *From:* Timothy Balcer
> *To:* Andrew Deason
> *Cc:* openafs-info@openafs.org
> *Sent:* Monday, June 24, 20
y.
- Network speeds are good
On Mon, Jun 24, 2013 at 1:39 PM, Andrew Deason wrote:
> On Fri, 21 Jun 2013 16:26:22 -0700
> Timothy Balcer wrote:
>
> > This seems counter intuitive... the 100 or so files do not go over the
> > 500,000 block cache size. They are fairly s
with chunksize 13 and the same behavior occurs
as before.
This seems counter intuitive... the 100 or so files do not go over the
500,000 block cache size. They are fairly small (10's to 100's of
kilobytes). Why would increasing cache size impact performance Negatively
in such a case?
-
The files are being not being read after they are written.. its a bulk
write operation.
Thanks or that tip on dcache.. I somehow didn't catch it was for a memory
cache only...
On Mon, Jan 28, 2013 at 2:05 PM, Andrew Deason wrote:
> On Mon, 28 Jan 2013 12:58:17 -0800
> Timothy Ba
ded.
I feel I am missing something.. although I do understand the basics here,
and the relation of the various parameters to activity, I am guessing that
I am missing something important re: tuning client vs. fileserver for bulk
operations. Can anyone advise?
Thanks muchly,
--
Timothy Balcer / IT Services
Well now, this seems like a very nice possibility as an AFS file server
cluster, in a box:
SuperMicro * 2015TA-HTRF
*Just thought I'd share the smile I had when I saw that it had 8 nodes per
2U ;)
--
Timothy Balcer / IT Services
Telmate / San Francisco, CA
Direct / (415) 300-4313
Cus
ix/HDRWQ91.html
Thanks for the pointer!
On Mon, Nov 12, 2012 at 5:38 AM, Jeffrey Altman
wrote:
> On 11/10/2012 2:45 PM, Timothy Balcer wrote:
>> If I turn off dynroot, only the read only volume shows up in the tree,
>> and the read write volume is not accessible. This coincides
il: +49 (0)176 34473913
> GPG Public Key C2E467BB | Jabber: dirk.heinri...@altum.de
--
Timothy Balcer / IT Services
Telmate / San Francisco, CA
Direct / (415) 300-4313
Customer Service / (800) 205-5510
___
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info
ing odd, or non standard, with
my cell configuration, and it is affecting my ability to use afsio to
write/read files to the RW volumes. afsio isn't using the afs client and is
going after things directly.
Any advice? I am free to reconfigure my cell as I need to accommodate.
--
Timothy Balcer /
it done, I
can send it off for people to review and use :) If I could have some help
with getting afsio working properly, I think that will be enough to get me
going.
Thanks also to everyone else as well.. I am learning more about the
underpinnings of file systems working with AFS than I have in a very long
time!
--
Timothy Balcer / IT Services
Telmate / San Francisco, CA
Direct / (415) 300-4313
Customer Service / (800) 205-5510
--
Timothy Balcer / IT Services
Telmate / San Francisco, CA
Direct / (415) 300-4313
Customer Service / (800) 205-5510
On Fri, Nov 9, 2012 at 9:45 AM, Andrew Deason wrote:
> On Thu, 8 Nov 2012 22:48:56 -0800
> Timothy Balcer wrote:
>
> > Well, unless I am missing something seriously obvious, for example it
> > took 1.5 hours to rsync a subdirectory to an AFS volume that had not a
> &g
On Tue, Nov 6, 2012 at 8:49 AM, Andrew Deason wrote:
> On Tue, 6 Nov 2012 00:06:53 -0800
> Timothy Balcer wrote:
>
> > I have a need to think about replicating large volumes (multigigabyte)
> > of large number (many terabytes of data total), to at least two other
> &g
talking about
multiple gigabytes per release, I can imagine it being extremely difficult.
Is there a schema that i can use with OpenAFS that will help alleviate this
problem? Or perhaps another approach I am missing that may solve it better?
Best,
--Timothy Balcer
PS: FYI the reason I chose AFS
I found a workaround for now.. I hope it finds some use amongst the
community, assuming the linux NFS translator really isn't working properly
(as seems to be intimated in the emails I recently found)
You can use unionfs with an afs volume and a normal volume, and then NFS
export THAT unionfs dire
(No such file or
directory)
open("/proc/fs/openafs/afs_ioctl", O_RDWR) = 3
ioctl(3, CAPI_REGISTER or SNDCTL_COPR_LOAD, 0x7fff18ec3a10) = -1 ENODEV (No
such device)
close(3)= 0
write(2, "Sorry, the nfs-exporter type is "..., 75Sorry, the nfs-exporter
type is curren
fileserver
> processes.
>
> Excellent, and it's in line with what I thought was the case. Thanks
muchly.
> --
> Andrew Deason
> adea...@sinenomine.net
>
> ___
> OpenAFS-info mailing list
> OpenAFS-info@openafs.org
> ht
wrote:
> On Tue, 30 Oct 2012 20:07:57 -0700
> Timothy Balcer wrote:
>
> > In other news, the latest salvage has been running for 12 hours... I
> > straced the busiest pid and it is happily verifying all the links and
> > contents (open(), close(), pread() ad infinitum),
p you all posted. There wasn't an error in the AFS logs that
> indicated that salvager proceses had been killed due to OOM. It was only in
> the kernel logs.
>
> Salvage was a success!
The volume is back in operation and working correctly.
So, one more for the books. When salvaging, watch for OOM killing.
> --
> Timothy Balcer
>
Oct 30, 2012 at 11:14 PM, Timothy Balcer wrote:
>
>> The OOM killer *is* the kernel, so the AFS logs just know it's dead,
>>
>>> not that the kernel
>>> decided "hy"
>>>
>>
>> Yeah.. I knew that ;-) What I was suggesting w
p
SIGTERM, and log it on the way out, no?
--
> Derrick
>
--
Timothy Balcer / IT Services
Telmate / San Francisco, CA
Direct / (415) 300-4313
Customer Service / (800) 205-5510
salvage,
and it may succeed.
I'll keep you all posted. There wasn't an error in the AFS logs that
indicated that salvager proceses had been killed due to OOM. It was only in
the kernel logs.
--
Timothy Balcer
ot;renewing" a volume in the VLDB from the data set on the partition? That is
to say, can you remove a volume from the VLDB and regenerate it in the
VLDB from what is on disk alone?
Thanks again for all your help. :)
>
> --
> Andrew Deason
> adea...@sinenomine.net
>
> _
On Mon, Oct 29, 2012 at 12:03 PM, Andrew Deason wrote:
> On Mon, 29 Oct 2012 11:41:09 -0700
> Timothy Balcer wrote:
>
> > I have a volume that had a replica, which has now been removed with
> > vos remsite.
>
> In the future, you should remove RO sites with 'vos
ere is the vos listvldb output:
user.snap
RWrite: 536870935
number of sites -> 1
server afs-db.foo.com partition /vicepb RW Site -- New release
The rest of the volumes have no "release" notation on them for the RW sites.
Any pointers?
--
Timothy Balcer / IT Services
Tel
40 matches
Mail list logo