Hello! We have a dockized PHP Nginx application running on google app engine flex. We have automatic scaling and minimum 1 instance is always running. As soon as traffic spikes, users receive 50x gateway errors for some time. The autoscaler works as intended but after checking the logs, all errors originate from the idle instance due to app engine sending most of the requests received to the first instance overloading it.
Why is app engine not distributing requests evenly? What can we do to make sure the service is available during high traffic bursts? Thank you for your time! -- You received this message because you are subscribed to the Google Groups "Google App Engine" group. To unsubscribe from this group and stop receiving emails from it, send an email to google-appengine+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/google-appengine/ac4d6a3b-549a-4f3d-8663-75d48a3add0an%40googlegroups.com.