How is your counter 100% reliable?  It could have a delayed
transaction (or a bunch) and then a memcache failure.

I agree my write-to-memcache-first counter would be (slightly) more
complex, and like yours, it _could_ have issues if memcache failed for
one reason or another.  In that (rare) case mine likely _would_ lose
some counts, and yours likely would not -- but I think it is a rare
case, and I think the very occasional loss of some counts is more than
made up for by an order of magnitude (or more) fewer writes, and
drastically faster response times.
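
To make that concrete, here is a minimal sketch of what I mean by
write-to-memcache-first: increments go to memcache, and only every
FLUSH_EVERY-th increment pays for a (sharded) datastore write.  The
CounterShard model, the key prefix, and the FLUSH_EVERY threshold are
all illustrative, not from any real library:

    import random
    from google.appengine.api import memcache
    from google.appengine.ext import db

    FLUSH_EVERY = 10  # buffer this many increments per datastore write

    class CounterShard(db.Model):
        name = db.StringProperty(required=True)
        count = db.IntegerProperty(default=0)

    def increment(name, num_shards=20):
        key = 'counter_buf:' + name
        buffered = memcache.incr(key)
        if buffered is None:
            # key missing (first use, or evicted) -- recreate it; the
            # add/incr race can drop a count, which this design accepts
            memcache.add(key, 1)
            buffered = 1
        if buffered >= FLUSH_EVERY:
            def txn():
                # fold the whole buffered batch into one random shard
                shard_key = '%s:%d' % (name,
                                       random.randint(0, num_shards - 1))
                shard = CounterShard.get_by_key_name(shard_key)
                if shard is None:
                    shard = CounterShard(key_name=shard_key, name=name)
                shard.count += FLUSH_EVERY
                shard.put()
            db.run_in_transaction(txn)
            memcache.decr(key, delta=FLUSH_EVERY)

The datastore total can lag by up to FLUSH_EVERY - 1 counts (or lose a
buffered batch if memcache evicts the key before a flush), which is
exactly the trade-off I am arguing is worth it.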

According to the guidance in this video,
http://www.youtube.com/watch?v=CmyFcChTc4M&eurl=http://www.technorati.com/videos/youtube.com/watch%3Fv%3DCmyFcChTc4M
(21:10), you will hit a watchdog if your avg. request time is >300ms,
and I have seen it happen.  I need to avoid that, and I don't see
another way to do so when the app requires multiple counter
increments/request, except with my counter example (far) above.

  -Josh

On Nov 3, 2:17 pm, yejun <[EMAIL PROTECTED]> wrote:
> I am not saying they are the same.  But you are degrading a 100%
> reliable solution to 99% or 99.9%.  To me, 99.9% and 99% have similar
> reliability compared to 100%.  And the complexity of combining
> memcache writes and sharded writes may also make the solution somewhat
> unreliable.
>
> On Nov 3, 5:11 pm, josh l <[EMAIL PROTECTED]> wrote:
>
> > Why do you say with memcache I would only write to the datastore once
> > per second, or every 10 seconds?  This doesn't make any sense... If I
> > only attempt to write 1/10 of my counter increments to the datastore,
> > I will still be writing a few times/second at least (I am writing
> > this to expect >30QPS, but it will likely be more).
>
> > Also, my issue/question is not regarding Transaction Collisions.  Of
> > course these are avoidable by just using many shards, and adding more
> > shards until collisions are extremely infrequent.  My issue is
> > regarding total request/response time.  Attempting to write anything
> > to the datastore takes time.  It just does.  The cpu cycles may not
> > count against us (in $cost), but the time does (or seems to, at
> > least).  If a request ends up needing to write to multiple counters
> > (and all of mine do), the total response time quickly reaches many
> > hundreds of ms.  I don't want this; I want it quicker.
>
> > My testing shows memcache to be fairly reliable.  I am not worried
> > about the very occasional (I hope) memcache issue.  I am worried about
> > total time per request.  I believe I can drastically shorten this time
> > by not attempting to write to the datastore as often, and having the
> > counter use just memcache most of the time for increments.  I can't
> > imagine I am the only person to think of this?
>
> >   -Josh
>
> > On Nov 3, 1:56 pm, yejun <[EMAIL PROTECTED]> wrote:
>
> > > The reason not to use a sharded counter with memcache-cached writes
> > > is that with memcache you will only write to the datastore once per
> > > second, or every 10 seconds.  I see no reason why you need a sharded
> > > counter at that kind of frequency.
> > > And when a memcache reset happens, losing 1/10 of a second of data
> > > or a full second of data seems to have similar reliability to me.
>
> > > On Nov 3, 4:32 pm, josh l <[EMAIL PROTECTED]> wrote:
>
> > > > Yejun,
>
> > > > I've been told that the memcached framework on GAE uses a first-
> > > > created, first-deleted algorithm, and that we have a finite amount
> > > > (a few hundred megs, but maybe even up to a GB) of memcache for any
> > > > specific app.  This means that once you hit your limit, older
> > > > objects WILL get deleted.  And my app will definitely be going over
> > > > this max limit.  This is not a huge deal to me (it probably won't
> > > > happen that my counter gets deleted that often, and it's ok if it's
> > > > a little bit off), but I figured my counter might as well handle
> > > > that small case anyway.
>
> > > > Again, the much bigger case: not writing to the datastore each
> > > > time you want to increment.  And yes, I am aware of why to use the
> > > > sharded counter.  The point is, what if you have about 50QPS coming
> > > > in (so you definitely need a sharded counter with a lot of shards),
> > > > and every single request is writing to ~3 different counters?  Now
> > > > each request takes a while because of the datastore writes it
> > > > attempts each time, even with no Transaction collisions on
> > > > shard-writes.  And I also believe there is a GAE watchdog checking
> > > > whether, over a period of time, your average request is >300ms.
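> > > >
> > > > (For concreteness, a minimal sketch of the plain sharded increment
> > > > I mean -- every call pays a transactional datastore write.  The
> > > > model and names are illustrative:)
> > > >
> > > >     import random
> > > >     from google.appengine.ext import db
> > > >
> > > >     class CounterShard(db.Model):
> > > >         name = db.StringProperty(required=True)
> > > >         count = db.IntegerProperty(default=0)
> > > >
> > > >     def increment(name, num_shards=20):
> > > >         def txn():
> > > >             # the random shard spreads write contention across
> > > >             # entities, but every increment still hits the disk
> > > >             shard_key = '%s:%d' % (name,
> > > >                                    random.randint(0, num_shards - 1))
> > > >             shard = CounterShard.get_by_key_name(shard_key)
> > > >             if shard is None:
> > > >                 shard = CounterShard(key_name=shard_key, name=name)
> > > >             shard.count += 1
> > > >             shard.put()
> > > >         db.run_in_transaction(txn)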
>
> > > > So I am simply saying: why not cut down on the total datastore
> > > > writes, write to a shard only 1/10 of the time, and still get the
> > > > correct totals?  This is the reasoning for my arguments above.  Now
> > > > you have an order of magnitude fewer datastore writes, and the
> > > > average response time is way down.  This sounds good to me, and I
> > > > am sure others who plan to write apps with a number of
> > > > sharded-counter increments per request might feel similarly.  Am I
> > > > missing something obvious here?
>
> > > >   -Josh
>
> > > > On Nov 3, 12:54 pm, yejun <[EMAIL PROTECTED]> wrote:
>
> > > > > I believe the memcached algorithm keeps the most-used objects,
> > > > > not just evicting by creation time.
> > > > > The reason you use a sharded counter is that you want the
> > > > > increment operation to always hit the disk.  Otherwise you don't
> > > > > need shards, because you don't access the datastore very often
> > > > > when writes are cached in memcache.  Sharding is used here to
> > > > > improve write concurrency.
>
> > > > > On Nov 3, 3:43 pm, josh l <[EMAIL PROTECTED]> wrote:
>
> > > > > > Yejun,
>
> > > > > > Thanks for the updated code example.  Since a counter is such
> > > > > > a common need, I think it might be helpful if we all worked
> > > > > > together on the same codebase, rather than forking each time,
> > > > > > or at least if we could be specific about the changes we made
> > > > > > (I know I can do a diff, but if you noted what exactly you
> > > > > > changed, and why, that would be awesome for all future users
> > > > > > who are curious).
>
> > > > > > Moving on, I think I didn't explain myself well regarding the
> > > > > > destruction of the memcache object.  Imagine an app where
> > > > > >  1) There will definitely be at least 10 counter requests/sec
> > > > > > for the same named counter (let's call it the TotalRequests
> > > > > > counter, referred to by some Stats model)
> > > > > >  2) Lots and lots of other entities get written to memcache
> > > > > > (millions of entities in the system, and each gets cached upon
> > > > > > initial request)
>
> > > > > > In this situation, it is guaranteed that objects in our
> > > > > > memcache will disappear after some use, since we have less
> > > > > > total memcache available than the size of the items that will
> > > > > > be cached over a few days of use.  Now, which items get
> > > > > > removed?  In this case, our counter is the first-created item
> > > > > > in memcache, and definitely one of the first items to be nuked
> > > > > > when we hit the max storage limit for our app's memcache.  To
> > > > > > ensure it never gets nuked for being the 'oldest object in
> > > > > > memcache', we could 'occasionally' destroy/recreate it.  Maybe,
> > > > > > for example, I could have a time_created on it, and if it's
> > > > > > older than a few hours, nuke/recreate it upon resetting it (see
> > > > > > the sketch below).  I figured I might as well do this every
> > > > > > time, but anyway, hopefully you see my point as to why I was
> > > > > > thinking about the need to destroy/recreate.
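> > > > > >
> > > > > > (A minimal sketch of that age-based destroy/recreate, assuming
> > > > > > the counter lives in memcache as a dict with a time_created
> > > > > > field; MAX_AGE and the names are illustrative:)
> > > > > >
> > > > > >     import time
> > > > > >     from google.appengine.api import memcache
> > > > > >
> > > > > >     MAX_AGE = 3 * 3600  # recreate after a few hours
> > > > > >
> > > > > >     def reset_counter(name):
> > > > > >         entry = memcache.get(name)
> > > > > >         now = time.time()
> > > > > >         if entry is None or now - entry['time_created'] > MAX_AGE:
> > > > > >             # destroy/recreate so it gets a fresh time_created
> > > > > >             memcache.delete(name)
> > > > > >             entry = {'count': 0, 'time_created': now}
> > > > > >         else:
> > > > > >             entry['count'] = 0
> > > > > >         memcache.set(name, entry)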
>
> > > > > > Much more important than this very occasional miscount for a
> > > > > > destroyed memcache item, though, is my general idea of just not
> > > > > > even attempting to write to a shard entity unless we've had a
> > > > > > few (10? 50?) counter increments.  I am getting ~350ms/request
> > > > > > average due to the time it takes to write to the shards
> > > > > > (multiple counters/request), and this is my main concern with
> > > > > > the current code.
>
> > > > > > I will diff your code (thanks again) and check it out this afternoon.
>
> > > > > >   -Josh
>
> > > > > > On Nov 3, 12:22 pm, yejun <[EMAIL PROTECTED]> wrote:
>
> > > > > > > > To solve this, I'm also planning to destroy and recreate
> > > > > > > > the memcache object upon successful datastore write (and
> > > > > > > > associated memcache delay being reset to zero).
>
> > > > > > > You shouldn't do this.  It completely negates the reason why
> > > > > > > you use this counter class.
> > > > > > > If you just need a counter for a local object, you should
> > > > > > > just save the counter with the object itself.
>
> > > > > > > This counter class should only be used in situations where
> > > > > > > the same named counter needs to be incremented more than 10
> > > > > > > times per second by different requests/users concurrently.