Dormando, thanks for the info :)

Right now I've isolated the types of data I'm putting into the memcache. In one instance are counters only, and in another instance is metadata (like arrays of keys in the first instance). Long way of saying I'm isolating the problem in our application.

Haven't tried the patch yet (waiting on my app changes to crash :) ), and am beginning to read through assoc.c, naively wondering if the lookahead patch will work as expected. If I understand right, both the lookahead and it pointers are moving targets, and would only detect local loops 2 deep. Wouldn't you want to save an orig for comparing to lookahead?

Do you have a real-time communication mechanism, like IRC? This kind of thing might be easier done there.

You know, I haven't tried the 64bit compile-time option, and I'm trying to use a large amount of memory (8GB). What is the effect of compiling with the 64bit option?

Thanks,
-Michael

dormando wrote:
Sorry to hear :\

Well, hold tight for now.

Info for the list: This is likely another case of the assoc_find bug.
We're putting all bughunting resources into tracking it down right now.
Hopefully we will have a fix soon, and I will be sending more info to
the list to help out as soon as I can.

The bug's in assoc.c somewhere - Something causes the linked lists to
have a loop.

Michael; Check the archives a few days back for a post from trond norbye
with a patch for detecting and bombing out on such loops. If you try
that out it could help us narrow things down.

We've otherwise been completely unable to reproduce the bug outside of
the wild.

Thanks,
-Dormando

Michael D'Agosta wrote:
The latter - both pegging the CPU and not accepting new connections.
This only happens every couple of days, but I can notice any other
symptoms that are relevant...

-MD

dormando wrote:
Hey,

Is memcached simply not accepting new connections and idling, or is it
pegging the CPU and not accepting new connections?

-Dormando

Michael D'Agosta wrote:
Hello,

New to the list - hi people. I was reading a thread from April:
  http://lists.danga.com/pipermail/memcached/2008-April/006683.html

It seems that during busy times, one of our memcached instances will
stop accepting new connections; when I run telnet host port, I don't get
a prompt. It's MC 1.2.5 on CentOS 4.6 w/ libevent 1.1a. We didn't
compile in threading before, so I'm giving that a test drive as I type.

Did anyone discover solutions or workarounds to the problem?

Thanks in advance,

Mike

Reply via email to