Some of these bursts of non-responsiveness are dramatic.

E.g., on 10-12 1.48p log time, we had over 400 of these 10-s and 30-s
timeouts (i.e., just during 1.48p to 1.49p - 1 minute). I've not
pasted them here because my last post was annoying enough for most,
I'm sure.

j

On Oct 12, 9:14 am, Jason C <jason.a.coll...@gmail.com> wrote:
> We (appid steprep) are starting to see bursts of 30-s and 10-s
> timeouts occur again. E.g.,
>
> 10-12 07:59AM 45.772 /_ah/queue/deferred 500 10012ms
> 10-12 07:59AM 43.185 /_ah/queue/deferred 500 10202ms
> 10-12 07:59AM 16.844 /_ah/queue/deferred 500 11799ms
> 10-12 07:57AM 44.693 /_ah/queue/deferred 500 29328ms
> 10-12 07:57AM 40.662 /_ah/queue/deferred 500 29515ms
> 10-12 07:57AM 29.826 /_ah/queue/deferred 500 35161ms
> 10-12 07:57AM 34.542 /_ah/queue/deferred 500 29803ms
> 10-12 07:57AM 28.881 /_ah/queue/deferred 500 33018ms
> 10-12 07:57AM 29.604 /_ah/queue/deferred 500 29740ms
> 10-12 07:57AM 29.205 /_ah/queue/deferred 500 29261ms
> 10-12 07:57AM 26.146 /_ah/queue/deferred 500 30481ms
> 10-12 07:57AM 25.589 /_ah/queue/deferred 500 29358ms
> 10-12 07:57AM 23.584 /_ah/queue/deferred 500 29611ms
> 10-12 07:57AM 41.243 /mgmt/alerts/trigger/ 500 10005ms
> 10-12 07:57AM 04.023 /_ah/queue/deferred 500 35939ms
> 10-12 07:57AM 05.307 /_ah/queue/deferred 500 29630ms
> 10-12 07:56AM 54.670 /_ah/queue/deferred 500 29673ms
> 10-12 07:56AM 52.672 /_ah/queue/deferred 500 29831ms
> 10-12 07:56AM 52.380 /_ah/queue/deferred 500 29921ms
> 10-12 07:56AM 52.379 /_ah/queue/deferred 500 29452ms
> 10-12 07:56AM 30.482 /_ah/queue/deferred 500 29534ms
> 10-12 07:56AM 28.382 /_ah/queue/deferred 500 29502ms
> 10-12 07:56AM 19.713 /_ah/queue/deferred 500 37933ms
> 10-12 07:56AM 24.241 /_ah/queue/deferred 500 29304ms
> 10-12 07:56AM 22.052 /_ah/queue/deferred 500 29793ms
> 10-12 07:56AM 18.893 /_ah/queue/deferred 500 31390ms
> 10-12 07:56AM 19.341 /_ah/queue/deferred 500 30034ms
> 10-12 07:56AM 18.722 /_ah/queue/deferred 500 29845ms
> 10-12 07:56AM 18.695 /_ah/queue/deferred 500 29467ms
> 10-12 07:56AM 33.521 /exec-reports/queue/increment-all-time-data-
> counter 500 10027ms
> 10-12 07:54AM 12.524 /_ah/queue/deferred 500 29353ms
> 10-12 07:54AM 11.365 /_ah/queue/deferred 500 29925ms
>
> and a little bit earlier:
>
> 10-12 07:48AM 42.456 /_ah/queue/deferred 500 10008ms
> 10-12 07:48AM 41.223 /_ah/queue/deferred 500 10007ms
> 10-12 07:48AM 38.840 /_ah/queue/deferred 500 10058ms
> 10-12 07:48AM 37.868 /_ah/queue/deferred 500 10033ms
> 10-12 07:48AM 37.283 /_ah/queue/deferred 500 10006ms
> 10-12 07:48AM 36.481 /_ah/queue/deferred 500 10008ms
> 10-12 07:48AM 36.218 /_ah/queue/deferred 500 10001ms
> 10-12 07:48AM 36.184 /mgmt/jobs/put/ 500 10027ms
> 10-12 07:48AM 35.902 /_ah/queue/deferred 500 10001ms
> 10-12 07:48AM 35.888 /_ah/queue/deferred 500 10002ms
> 10-12 07:48AM 35.792 /exec-reports/queue/increment-all-time-data-
> counter 500 10007ms
> 10-12 07:48AM 34.955 /_ah/queue/deferred 500 10074ms
> 10-12 07:48AM 34.678 /_ah/queue/deferred 500 10007ms
> 10-12 07:48AM 33.743 /exec-reports/queue/increment-all-time-data-
> counter 500 10005ms
> 10-12 07:48AM 31.323 /mgmt/jobs/put/deferred/?jobId=f54cddcf-05db-4cdc-
> a2ec-3859f8cb970e 500 10045ms
> 10-12 07:48AM 30.146 /_ah/queue/deferred 500 10001ms
> 10-12 07:48AM 30.106 /_ah/queue/deferred 500 10039ms
> 10-12 07:48AM 29.551 /_ah/queue/deferred 500 10013ms
> 10-12 07:48AM 29.332 /mgmt/jobs/put/deferred/?jobId=1135b765-3199-4961-
> a232-1ee46c235058 500 10036ms
> 10-12 07:48AM 27.608 /_ah/queue/deferred 500 10015ms
> 10-12 07:48AM 27.566 /mgmt/jobs/put/deferred/?
> jobId=26addd00-9958-4747-9f48-b8f5acc61103 500 10012ms
> 10-12 07:48AM 27.172 /mgmt/jobs/put/deferred/?
> jobId=8b69f017-1dc9-4b0f-8491-c3154a2b5dbb 500 10117ms
> 10-12 07:48AM 27.154 /mgmt/jobs/put/deferred/?jobId=5f387ec5-142a-43fc-
> be04-5d911d34c528 500 10001ms
> 10-12 07:48AM 26.930 /exec-reports/queue/increment-daily-data-counter
> 500 10024ms
> 10-12 07:48AM 26.942 /exec-reports/queue/increment-all-time-data-
> counter 500 10009ms
> 10-12 07:48AM 26.869 /mgmt/jobs/put/deferred/?
> jobId=aa4d1923-0ce9-4ad5-88ef-7c835ba3db14 500 10031ms
> 10-12 07:48AM 26.377 /_ah/queue/deferred 500 10001ms
> 10-12 07:48AM 26.376 /exec-reports/queue/increment-all-time-data-
> counter 500 10001ms
> 10-12 07:48AM 26.357 /_ah/queue/deferred 500 10020ms
> 10-12 07:48AM 26.347 /mgmt/jobs/put/deferred/?jobId=27b61fd1-
> b980-4a8e-9b26-cb517a2fbe2a 500 10006ms
> 10-12 07:48AM 26.345 /_ah/queue/deferred 500 10006ms
> 10-12 07:48AM 26.342 /exec-reports/queue/increment-all-time-data-
> counter 500 10008ms
>
> There are all tasks that run quickly (i.e., much less than 30-s) under
> "normal" operation. I can find more example if desired.
>
> j
>
> On Oct 11, 11:04 am, Jeff Schwartz <jefftschwa...@gmail.com> wrote:
>
>
>
> > I am only guessing but I imagine that 'cluster' is some form of load
> > balancing segment that Google uses to service and route requests to the
> > virtual servers and this error/warning is indicative of when that queue is
> > very busy. And regardless of if it is an error or a mere warning it still
> > resulted in a 500, extremely high resource usage and a negative experience
> > for the end user.
>
> > On Mon, Oct 11, 2010 at 12:56 PM, Jamie H <ja...@mhztech.com> wrote:
> > > What do you mean by "cold instance startup that timed out" ?
>
> > > Sometimes I will get the error:
> > > Request was aborted after waiting too long to attempt to service your
> > > request. This may happen sporadically when the App Engine serving
> > > cluster is under unexpectedly high or uneven load. If you see this
> > > message frequently, please contact the App Engine team.
>
> > > I see this error occasionally when my site has NO load, so I assume
> > > this is a cold instance startup that timed out?
>
> > > I've been trying to find out about this issue for a while with little
> > > response.  It is fairly troubling to see this error randomly in
> > > several apps of mine.  I never noticed it until I realized it is a
> > > "warning" not an "error" so you must set your log severity
> > > appropriately to find these.  People have told me this error occurs
> > > when all your apps instances are busy and requests wait too long in
> > > queue for an instance.  However, I see it when my site has NO load so
> > > I don't quite believe this is always the case...
>
> > > On Oct 11, 11:30 am, Eli Jones <eli.jo...@gmail.com> wrote:
> > > > Seems like if you want direct one on one comunication and service
> > > > guarantees, you gotta try to get into the Appengine for Business 
> > > > preview:
>
> > > >http://code.google.com/appengine/business/
>
> > > > <http://code.google.com/appengine/business/>But then.. you have to pay.
>
> > > > Also, from seeing your logs below.. it really appears that the 8:07:46
> > > log
> > > > was for a cold instance startup that timed out, the 8:07:59 log is for a
> > > > cold start that succeeded, and the 8:08:05 log is for a hot instance.
>
> > > > Are you sure that log number 2 (for 8:07:59) didn't have this additional
> > > > info when you expand it?:
>
> > > > "This request caused a new process to be started for your application,
> > > and
> > > > thus caused your application code to be loaded for the first time. This
> > > > request may thus take longer and use more CPU than a typical request for
> > > > your application."
>
> > > > On Mon, Oct 11, 2010 at 12:19 PM, Jeff Schwartz <jefftschwa...@gmail.com
> > > >wrote:
>
> > > > > Here is an example of my logs:
> > > > > 1) 10-11 08:08AM 05.539 /account.groovy 200 218ms 265cpu_ms
> > > 13api_cpu_ms
> > > > > 2kb Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.9.2.10)
> > > > > Gecko/20100914 Firefox/3.6.10 ( .NET CLR 3.5.30729; 
> > > > > .NET4.0C),gzip(gfe)
> > > 2)
> > > > > 10-11 08:07AM 59.775 /account.groovy 200 2401ms 1976cpu_ms 
> > > > > 13api_cpu_ms
> > > > > 2kb Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.9.2.10)
> > > > > Gecko/20100914 Firefox/3.6.10 ( .NET CLR 3.5.30729; 
> > > > > .NET4.0C),gzip(gfe)
> > > 3)
> > > > > 10-11 08:07AM 46.856 /account.groovy 500 10015ms 0cpu_ms 0kb
> > > Mozilla/5.0
> > > > > (Windows; U; Windows NT 6.0; en-US; rv:1.9.2.10) Gecko/20100914
> > > > > Firefox/3.6.10 ( .NET CLR 3.5.30729; .NET4.0C),gzip(gfe) All of the
> > > above
> > > > > are for the exact same code operating on the exact same data.
>
> > > > > Excluding #3 above, the server was already live and ready to receive
> > > > > requests. Notice the variance in response time for #1 & #2. That is
> > > alarming
> > > > > don't you think?
>
> > > > > This is not indicative of a stable environment where one should be
> > > billed
> > > > > for using it. Once issues like these are resolved then billing would 
> > > > > be
> > > > > entirely acceptable.
>
> > > > > Again, I don't believe I am being overly critical but I have yet to
> > > receive
> > > > > a response from anyone at Google regarding my plea for assistance. If
> > > they
> > > > > would prefer to contact me off-line via email then that would be 
> > > > > wholly
> > > > > agreeable on my part.
>
> > > > > Jeff
>
> > > > >  --
> > > > > You received this message because you are subscribed to the Google
> > > Groups
> > > > > "Google App Engine" group.
> > > > > To post to this group, send email to google-appengine@googlegroups.com
> > > .
> > > > > To unsubscribe from this group, send email to
> > > > > google-appengine+unsubscr...@googlegroups.com<google-appengine%2Bunsubscrib
> > > > >  e...@googlegroups.com>
> > > <google-appengine%2bunsubscr...@googlegroups.com<google-appengine%252Bunsub
> > >  scr...@googlegroups.com>
>
> > > > > .
> > > > > For more options, visit this group at
> > > > >http://groups.google.com/group/google-appengine?hl=en.
>
> > > --
> > > You received this message because you are subscribed to the Google Groups
> > > "Google App Engine" group.
> > > To post to this group, send email to google-appeng...@googlegroups.com.
> > > To unsubscribe from this group, send email to
> > > google-appengine+unsubscr...@googlegroups.com<google-appengine%2Bunsubscrib
> > >  e...@googlegroups.com>
> > > .
> > > For more options, visit this group at
> > >http://groups.google.com/group/google-appengine?hl=en.
>
> > --
> > Jeff

-- 
You received this message because you are subscribed to the Google Groups 
"Google App Engine" group.
To post to this group, send email to google-appeng...@googlegroups.com.
To unsubscribe from this group, send email to 
google-appengine+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/google-appengine?hl=en.

Reply via email to