Hi,
I am using the App Engine flexible environment.

I am seeing failures when there is a sudden burst of requests to our system
(about 30 requests in one second). The server gets back to normal within
5-10 seconds (perhaps because the traffic decreases again).
Normally, requests arrive at about 5-8 per second.

I am running with 2 instances.

<automatic-scaling>
  <min-num-instances>2</min-num-instances>
  <max-num-instances>10</max-num-instances>
  <cool-down-period-sec>120</cool-down-period-sec>
  <cpu-utilization>
    <target-utilization>0.7</target-utilization>
  </cpu-utilization>
</automatic-scaling>
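
For scale, here is a rough back-of-the-envelope sketch of what the burst
means per instance. It assumes the load is spread evenly across the 2
minimum instances, which may not hold in practice; the class and method
names (BurstLoad, perInstanceRate, burstFactor) are only illustrative.

```java
public class BurstLoad {
    // Requests per second each instance has to absorb, ASSUMING an even
    // spread across instances (the real load balancer may not be even).
    static double perInstanceRate(double requestsPerSecond, int instances) {
        if (instances <= 0) {
            throw new IllegalArgumentException("instances must be positive");
        }
        return requestsPerSecond / instances;
    }

    // How many times larger the burst rate is than a given normal rate.
    static double burstFactor(double burstRate, double normalRate) {
        if (normalRate <= 0) {
            throw new IllegalArgumentException("normalRate must be positive");
        }
        return burstRate / normalRate;
    }

    public static void main(String[] args) {
        int instances = 2;        // min-num-instances from the config above
        double burst = 30.0;      // requests/second during the burst
        double normalLow = 5.0;   // lower end of the usual rate
        double normalHigh = 8.0;  // upper end of the usual rate

        System.out.printf("normal per instance: %.1f to %.1f req/s%n",
                perInstanceRate(normalLow, instances),
                perInstanceRate(normalHigh, instances));
        System.out.printf("burst per instance: %.1f req/s%n",
                perInstanceRate(burst, instances));
        System.out.printf("burst is %.2fx to %.2fx the normal rate%n",
                burstFactor(burst, normalHigh),
                burstFactor(burst, normalLow));
    }
}
```

Under that even-spread assumption, each instance goes from about 2.5-4
requests per second to about 15 during the burst.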


AFAICT the failed requests never reach my servlet. (I added a log statement
as the first line of the servlet, and it is not printed for the failed
requests.) "loading_request" is 0 for the failed requests as well.

The failed requests return HTTP error code 502.

Are the requests being queued while a new instance comes up? If so, is it
possible to override that, that is, start bringing up the new instances but
keep serving from the existing ones in the meantime (in case startup takes
a while)? The log shows some of the failed requests waiting for 600
seconds, though most fail in 10-15 ms.
The failures affect between 0.5% and 2% of the requests.
