Re: [google-appengine] Re: Outages?
Just to add some information: When timing the load time of my app (not from the dev server startup but starting from the point my code is beging to load) it takes my application less than 8 seconds to load (few cases were 5secs). So I still can't figure out why on AppEngine it can sometimes take more than 60 seconds... I'd appreciate some analysis guidelines so I can further investigate what makes the app (sometimes) load very slowly. On Monday, April 16, 2012 10:13:50 AM UTC+3, Eliran Bivas wrote: Thanks for the clarification, it should be part of the documentation for AppEngine architecture. As for my question, I believe that a Maven multi-module Spring application has different view on loading times. In earlier posts here it was mentioned that loading several JARs might be an issue. BUT, that's how Maven works. Even core Spring Framework consist of more than 10 JARs. Are there any recommendation on what to look for (analysis tools would be great) in such deployment? Or maybe how such configuration should behave (prefer lazy bean initialization over pre-init singletons)? What about the JARs, would flatten them to a single Uber-JAR will help? And again, my context loading is without DB operations or any HTTP connection. I assume that as my project gets larger, and additional JARs will be added - are there any best practices for such scenario also? On Monday, April 16, 2012 2:53:14 AM UTC+3, Jeff Schnitzer wrote: Woah! 30s to start your app in your local dev environment? That's nuts. You have a mistaken perception of GAE. In nearly all respects, your local dev environment will perform a single thread of execution faster than production. Your local machine has dedicated CPU cores and I/O bandwidth, all local. It has a mock datastore which likely has no synchronization issues. In production, the filesystem is loading across a network. Nearly all service calls require an RPC to a remote machine somewhere else in the cluster - maybe to several machines. You're sharing CPU cores and RAM with a dozen other apps, some of which might be really busy. What you get in production is a system that is not especially fast but nearly always consistent no matter what the queries/sec or database size. Your best strategy is to figure out why your app is taking so long to start up and address that. Jeff On Fri, Apr 13, 2012 at 2:31 PM, Eliran Bivas eliran.bi...@gmail.comwrote: Do you have any recommendations how to reduce the amount of IO operations during startup? My app uses Spring and like many other Java best practices followed, my project consists of several Maven modules which results several jars creation. I do not load any data from datastore or require HTTP connection during startup. I understand the AppEngine filesystem is extremely slow, but on my local dev machine loading my app takes ~30sec, so I assume that in a much superior infrastructure it should take even faster. I believe AppEngine should provide some property to extend loading requests to reach over 60 seconds (like cron operations are allowed 10mins runtime). Thanks in advance On Friday, April 13, 2012 5:51:15 PM UTC+3, Jeff Schnitzer wrote: Right. The problem is the 61244ms that it takes to start your app. How long does it normally take? Look at past warmup requests (the ones that work) and see how long they take. My guess is that the number is close to 60s. If GAE gets marginally slower, it pushes you over the edge. As for why your app takes so long to startup, I can't begin to speculate. What does it do at startup? Do any of your frameworks do classpath scanning? Do you have a lot of big jars? Zillions of class files? Do you load data from the datastore or blobstore? You need to diagnose the warmup requests that *do* work. Sure, look at appstats - although that will only show issues if your warmup fetches data from services (ie, not the filesystem). However, you really should be able to think about it a few minutes and figure out the problem. App startup is almost 100% deterministic. There are no parameters. You know what it does. Jeff On Fri, Apr 13, 2012 at 9:55 AM, Rui Oliveira rucaso2...@gmail.comwrote: Hi Jeff Thanks for your replay. Your answer was very important for me to start looking to the right part of the problem. Just to clarify :startup time is the time to start a new instance right? How can I analyze the startup time inside the server? I'm looking to the appengine logs, appstats, and speedtracer, but in none of them I can find whats happening during the startup. If I deploy the program I'm getting this kind of logs: 1. 012-04-13 11:33:35.889 /_ah/warmup 500 61244ms 0kb 2. W2012-04-13 11:33:35.787 EXCEPTION com.google.apphosting.api.** Dead**lineExceededException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 UTC and was still
Re: [google-appengine] Re: Outages?
As far as anyone outside Google can tell, the biggest issue involves reading files off the filesystem. This may have changed recently since it's been a while since I (or any of the other users who commonly post about such stuff) did any measurements, but reading files seems to be painfully slow. Some things to consider: * Tools that do classpath scanning (Spring, Resteasy, JDO, etc) open every jar and class file looking for annotations. These are usually the biggest enemy. If you can disable scanning, it will usually help a lot - although this means more manual configuration. * Lots of little files take longer than a few big ones. I found a small but significant improvement (maybe 20%) by jaring up my class files - not enough that I actually still do it. The numbers might be different if you use classpath scanning. * Consider carefully each jar in your project and whether you really need it. Jeff On Fri, Apr 20, 2012 at 3:34 AM, Eliran Bivas eliran.bi...@gmail.comwrote: Just to add some information: When timing the load time of my app (not from the dev server startup but starting from the point my code is beging to load) it takes my application less than 8 seconds to load (few cases were 5secs). So I still can't figure out why on AppEngine it can sometimes take more than 60 seconds... I'd appreciate some analysis guidelines so I can further investigate what makes the app (sometimes) load very slowly. On Monday, April 16, 2012 10:13:50 AM UTC+3, Eliran Bivas wrote: Thanks for the clarification, it should be part of the documentation for AppEngine architecture. As for my question, I believe that a Maven multi-module Spring application has different view on loading times. In earlier posts here it was mentioned that loading several JARs might be an issue. BUT, that's how Maven works. Even core Spring Framework consist of more than 10 JARs. Are there any recommendation on what to look for (analysis tools would be great) in such deployment? Or maybe how such configuration should behave (prefer lazy bean initialization over pre-init singletons)? What about the JARs, would flatten them to a single Uber-JAR will help? And again, my context loading is without DB operations or any HTTP connection. I assume that as my project gets larger, and additional JARs will be added - are there any best practices for such scenario also? On Monday, April 16, 2012 2:53:14 AM UTC+3, Jeff Schnitzer wrote: Woah! 30s to start your app in your local dev environment? That's nuts. You have a mistaken perception of GAE. In nearly all respects, your local dev environment will perform a single thread of execution faster than production. Your local machine has dedicated CPU cores and I/O bandwidth, all local. It has a mock datastore which likely has no synchronization issues. In production, the filesystem is loading across a network. Nearly all service calls require an RPC to a remote machine somewhere else in the cluster - maybe to several machines. You're sharing CPU cores and RAM with a dozen other apps, some of which might be really busy. What you get in production is a system that is not especially fast but nearly always consistent no matter what the queries/sec or database size. Your best strategy is to figure out why your app is taking so long to start up and address that. Jeff On Fri, Apr 13, 2012 at 2:31 PM, Eliran Bivas eliran.bi...@gmail.comwrote: Do you have any recommendations how to reduce the amount of IO operations during startup? My app uses Spring and like many other Java best practices followed, my project consists of several Maven modules which results several jars creation. I do not load any data from datastore or require HTTP connection during startup. I understand the AppEngine filesystem is extremely slow, but on my local dev machine loading my app takes ~30sec, so I assume that in a much superior infrastructure it should take even faster. I believe AppEngine should provide some property to extend loading requests to reach over 60 seconds (like cron operations are allowed 10mins runtime). Thanks in advance On Friday, April 13, 2012 5:51:15 PM UTC+3, Jeff Schnitzer wrote: Right. The problem is the 61244ms that it takes to start your app. How long does it normally take? Look at past warmup requests (the ones that work) and see how long they take. My guess is that the number is close to 60s. If GAE gets marginally slower, it pushes you over the edge. As for why your app takes so long to startup, I can't begin to speculate. What does it do at startup? Do any of your frameworks do classpath scanning? Do you have a lot of big jars? Zillions of class files? Do you load data from the datastore or blobstore? You need to diagnose the warmup requests that *do* work. Sure, look at appstats - although that will only show issues if your warmup fetches data from services (ie, not the filesystem). However, you
Re: [google-appengine] Re: Outages?
Oh, what I needed to mention is that if you're trying to time your app load with (say) a filter, a lot of this load time elapses before your code begins executing. Jeff On Fri, Apr 20, 2012 at 11:44 AM, Jeff Schnitzer j...@infohazard.orgwrote: As far as anyone outside Google can tell, the biggest issue involves reading files off the filesystem. This may have changed recently since it's been a while since I (or any of the other users who commonly post about such stuff) did any measurements, but reading files seems to be painfully slow. Some things to consider: * Tools that do classpath scanning (Spring, Resteasy, JDO, etc) open every jar and class file looking for annotations. These are usually the biggest enemy. If you can disable scanning, it will usually help a lot - although this means more manual configuration. * Lots of little files take longer than a few big ones. I found a small but significant improvement (maybe 20%) by jaring up my class files - not enough that I actually still do it. The numbers might be different if you use classpath scanning. * Consider carefully each jar in your project and whether you really need it. Jeff On Fri, Apr 20, 2012 at 3:34 AM, Eliran Bivas eliran.bi...@gmail.comwrote: Just to add some information: When timing the load time of my app (not from the dev server startup but starting from the point my code is beging to load) it takes my application less than 8 seconds to load (few cases were 5secs). So I still can't figure out why on AppEngine it can sometimes take more than 60 seconds... I'd appreciate some analysis guidelines so I can further investigate what makes the app (sometimes) load very slowly. On Monday, April 16, 2012 10:13:50 AM UTC+3, Eliran Bivas wrote: Thanks for the clarification, it should be part of the documentation for AppEngine architecture. As for my question, I believe that a Maven multi-module Spring application has different view on loading times. In earlier posts here it was mentioned that loading several JARs might be an issue. BUT, that's how Maven works. Even core Spring Framework consist of more than 10 JARs. Are there any recommendation on what to look for (analysis tools would be great) in such deployment? Or maybe how such configuration should behave (prefer lazy bean initialization over pre-init singletons)? What about the JARs, would flatten them to a single Uber-JAR will help? And again, my context loading is without DB operations or any HTTP connection. I assume that as my project gets larger, and additional JARs will be added - are there any best practices for such scenario also? On Monday, April 16, 2012 2:53:14 AM UTC+3, Jeff Schnitzer wrote: Woah! 30s to start your app in your local dev environment? That's nuts. You have a mistaken perception of GAE. In nearly all respects, your local dev environment will perform a single thread of execution faster than production. Your local machine has dedicated CPU cores and I/O bandwidth, all local. It has a mock datastore which likely has no synchronization issues. In production, the filesystem is loading across a network. Nearly all service calls require an RPC to a remote machine somewhere else in the cluster - maybe to several machines. You're sharing CPU cores and RAM with a dozen other apps, some of which might be really busy. What you get in production is a system that is not especially fast but nearly always consistent no matter what the queries/sec or database size. Your best strategy is to figure out why your app is taking so long to start up and address that. Jeff On Fri, Apr 13, 2012 at 2:31 PM, Eliran Bivas eliran.bi...@gmail.comwrote: Do you have any recommendations how to reduce the amount of IO operations during startup? My app uses Spring and like many other Java best practices followed, my project consists of several Maven modules which results several jars creation. I do not load any data from datastore or require HTTP connection during startup. I understand the AppEngine filesystem is extremely slow, but on my local dev machine loading my app takes ~30sec, so I assume that in a much superior infrastructure it should take even faster. I believe AppEngine should provide some property to extend loading requests to reach over 60 seconds (like cron operations are allowed 10mins runtime). Thanks in advance On Friday, April 13, 2012 5:51:15 PM UTC+3, Jeff Schnitzer wrote: Right. The problem is the 61244ms that it takes to start your app. How long does it normally take? Look at past warmup requests (the ones that work) and see how long they take. My guess is that the number is close to 60s. If GAE gets marginally slower, it pushes you over the edge. As for why your app takes so long to startup, I can't begin to speculate. What does it do at startup? Do any of your frameworks do classpath scanning? Do you have a lot of big jars? Zillions of
Re: [google-appengine] Re: Outages?
Do you mean Python or Java apps sitting at over 100mb after spin up? I work on some extremely large Python apps, they can sit around ~75mb. If your Python app is heavier than that at startup... you are probably doing stuff in a very questionable way. Robert On Fri, Apr 13, 2012 at 11:02, Brandon Wirtz drak...@digerat.com wrote: My guess is that the number is close to 60s. If GAE gets marginally slower, it pushes you over the edge. I’ll be doing a video very shortly about how to keep your Instances from dying on startup. But one easy way to tell if this is the issue is to up your instance size AND test that you are having the same issue on a Dynamic Backend. If your issues magically go away, but your startups take 65 seconds, you know that you need to optimize your startup. One should also watch the Instance memory usage in the dash board. Often people have instances that just spun up and are sitting at 119 of 128m of memory usage and then wonder why their instances recycle so often. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Thanks for the clarification, it should be part of the documentation for AppEngine architecture. As for my question, I believe that a Maven multi-module Spring application has different view on loading times. In earlier posts here it was mentioned that loading several JARs might be an issue. BUT, that's how Maven works. Even core Spring Framework consist of more than 10 JARs. Are there any recommendation on what to look for (analysis tools would be great) in such deployment? Or maybe how such configuration should behave (prefer lazy bean initialization over pre-init singletons)? What about the JARs, would flatten them to a single Uber-JAR will help? And again, my context loading is without DB operations or any HTTP connection. I assume that as my project gets larger, and additional JARs will be added - are there any best practices for such scenario also? On Monday, April 16, 2012 2:53:14 AM UTC+3, Jeff Schnitzer wrote: Woah! 30s to start your app in your local dev environment? That's nuts. You have a mistaken perception of GAE. In nearly all respects, your local dev environment will perform a single thread of execution faster than production. Your local machine has dedicated CPU cores and I/O bandwidth, all local. It has a mock datastore which likely has no synchronization issues. In production, the filesystem is loading across a network. Nearly all service calls require an RPC to a remote machine somewhere else in the cluster - maybe to several machines. You're sharing CPU cores and RAM with a dozen other apps, some of which might be really busy. What you get in production is a system that is not especially fast but nearly always consistent no matter what the queries/sec or database size. Your best strategy is to figure out why your app is taking so long to start up and address that. Jeff On Fri, Apr 13, 2012 at 2:31 PM, Eliran Bivas eliran.bi...@gmail.comwrote: Do you have any recommendations how to reduce the amount of IO operations during startup? My app uses Spring and like many other Java best practices followed, my project consists of several Maven modules which results several jars creation. I do not load any data from datastore or require HTTP connection during startup. I understand the AppEngine filesystem is extremely slow, but on my local dev machine loading my app takes ~30sec, so I assume that in a much superior infrastructure it should take even faster. I believe AppEngine should provide some property to extend loading requests to reach over 60 seconds (like cron operations are allowed 10mins runtime). Thanks in advance On Friday, April 13, 2012 5:51:15 PM UTC+3, Jeff Schnitzer wrote: Right. The problem is the 61244ms that it takes to start your app. How long does it normally take? Look at past warmup requests (the ones that work) and see how long they take. My guess is that the number is close to 60s. If GAE gets marginally slower, it pushes you over the edge. As for why your app takes so long to startup, I can't begin to speculate. What does it do at startup? Do any of your frameworks do classpath scanning? Do you have a lot of big jars? Zillions of class files? Do you load data from the datastore or blobstore? You need to diagnose the warmup requests that *do* work. Sure, look at appstats - although that will only show issues if your warmup fetches data from services (ie, not the filesystem). However, you really should be able to think about it a few minutes and figure out the problem. App startup is almost 100% deterministic. There are no parameters. You know what it does. Jeff On Fri, Apr 13, 2012 at 9:55 AM, Rui Oliveira rucaso2...@gmail.comwrote: Hi Jeff Thanks for your replay. Your answer was very important for me to start looking to the right part of the problem. Just to clarify :startup time is the time to start a new instance right? How can I analyze the startup time inside the server? I'm looking to the appengine logs, appstats, and speedtracer, but in none of them I can find whats happening during the startup. If I deploy the program I'm getting this kind of logs: 1. 012-04-13 11:33:35.889 /_ah/warmup 500 61244ms 0kb 2. W2012-04-13 11:33:35.787 EXCEPTION com.google.apphosting.api.** Dead**lineExceededException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 UTC and was still e 3. E2012-04-13 11:33:35.790 javax.servlet.**Ser**vletContext log: unavailable javax.servlet.**UnavailableExcep**tion: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 U 4. W2012-04-13 11:33:35.810 Failed startup of context com.google.apphosting.utils.**je**tty.**RuntimeAppEngineWebAppCont** ext@**1811e2c{/,/base/data/home/apps/s~airmenudemo/29.**35817792 **1953 5. C2012-04-13 11:33:35.816 Uncaught exception from servlet
Re: [google-appengine] Re: Outages?
Do you have any recommendations how to reduce the amount of IO operations during startup? My app uses Spring and like many other Java best practices followed, my project consists of several Maven modules which results several jars creation. I do not load any data from datastore or require HTTP connection during startup. I understand the AppEngine filesystem is extremely slow, but on my local dev machine loading my app takes ~30sec, so I assume that in a much superior infrastructure it should take even faster. I believe AppEngine should provide some property to extend loading requests to reach over 60 seconds (like cron operations are allowed 10mins runtime). Thanks in advance On Friday, April 13, 2012 5:51:15 PM UTC+3, Jeff Schnitzer wrote: Right. The problem is the 61244ms that it takes to start your app. How long does it normally take? Look at past warmup requests (the ones that work) and see how long they take. My guess is that the number is close to 60s. If GAE gets marginally slower, it pushes you over the edge. As for why your app takes so long to startup, I can't begin to speculate. What does it do at startup? Do any of your frameworks do classpath scanning? Do you have a lot of big jars? Zillions of class files? Do you load data from the datastore or blobstore? You need to diagnose the warmup requests that *do* work. Sure, look at appstats - although that will only show issues if your warmup fetches data from services (ie, not the filesystem). However, you really should be able to think about it a few minutes and figure out the problem. App startup is almost 100% deterministic. There are no parameters. You know what it does. Jeff On Fri, Apr 13, 2012 at 9:55 AM, Rui Oliveira rucaso2...@gmail.comwrote: Hi Jeff Thanks for your replay. Your answer was very important for me to start looking to the right part of the problem. Just to clarify :startup time is the time to start a new instance right? How can I analyze the startup time inside the server? I'm looking to the appengine logs, appstats, and speedtracer, but in none of them I can find whats happening during the startup. If I deploy the program I'm getting this kind of logs: 1. 012-04-13 11:33:35.889 /_ah/warmup 500 6**1244ms 0kb 2. W2012-04-13 11:33:35.787 EXCEPTION com.google.apphosting.api.**DeadlineExceededException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 UTC and was still e 3. E2012-04-13 11:33:35.790 javax.servlet.**ServletContext log: unavailable javax.servlet.**UnavailableException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 U 4. W2012-04-13 11:33:35.810 Failed startup of context com.google.apphosting.utils.**jetty.**RuntimeAppEngineWebAppContext@** 1811e2c{/,/base/data/home/**apps/s~airmenudemo/29.**358177921953 5. C2012-04-13 11:33:35.816 Uncaught exception from servlet javax.servlet.**UnavailableException: Initialization failed. at com.google.apphosting.runtime.**jetty.AppVersionHandlerMa 6. I2012-04-13 11:33:35.819 This request caused a new process to be started for your application, and thus caused your application code to be loaded for the first time. This requ 7. W2012-04-13 11:33:35.819 A problem was encountered with the process that handled this request, causing it to exit. This is likely to cause a new process to be used for the nex On appstats: ( http://www.airmenudemo.**appspot.com/appstats/statshttp://www.airmenudemo.appspot.com/appstats/stats ) (16) 2012-04-13 11:36:31.587 GET /appstats/ 307 http://www.airmenudemo.appspot.com/appstats/details?time=1334316991587real=215ms api=0ms overhead=0ms (0 RPCs) (17) 2012-04-13 11:35:49.207 GET /symbolmanifest.json 404 http://www.airmenudemo.appspot.com/appstats/details?time=1334316949207real=609ms api=0ms overhead=0ms (0 RPCs) (18) 2012-04-13 11:27:25.255 GET /_ah/warmup 200 http://www.airmenudemo.appspot.com/appstats/details?time=1334316445255real=441ms api=0ms overhead=0ms (0 RPCs) As you can see after deploy the appstats don't log nothing. After deploy I can't even open appstats. After some minutes / hours everything starts ok. Thanks Rui -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/frES_he8DHYJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit
Re: [google-appengine] Re: Outages?
Woah! 30s to start your app in your local dev environment? That's nuts. You have a mistaken perception of GAE. In nearly all respects, your local dev environment will perform a single thread of execution faster than production. Your local machine has dedicated CPU cores and I/O bandwidth, all local. It has a mock datastore which likely has no synchronization issues. In production, the filesystem is loading across a network. Nearly all service calls require an RPC to a remote machine somewhere else in the cluster - maybe to several machines. You're sharing CPU cores and RAM with a dozen other apps, some of which might be really busy. What you get in production is a system that is not especially fast but nearly always consistent no matter what the queries/sec or database size. Your best strategy is to figure out why your app is taking so long to start up and address that. Jeff On Fri, Apr 13, 2012 at 2:31 PM, Eliran Bivas eliran.bi...@gmail.comwrote: Do you have any recommendations how to reduce the amount of IO operations during startup? My app uses Spring and like many other Java best practices followed, my project consists of several Maven modules which results several jars creation. I do not load any data from datastore or require HTTP connection during startup. I understand the AppEngine filesystem is extremely slow, but on my local dev machine loading my app takes ~30sec, so I assume that in a much superior infrastructure it should take even faster. I believe AppEngine should provide some property to extend loading requests to reach over 60 seconds (like cron operations are allowed 10mins runtime). Thanks in advance On Friday, April 13, 2012 5:51:15 PM UTC+3, Jeff Schnitzer wrote: Right. The problem is the 61244ms that it takes to start your app. How long does it normally take? Look at past warmup requests (the ones that work) and see how long they take. My guess is that the number is close to 60s. If GAE gets marginally slower, it pushes you over the edge. As for why your app takes so long to startup, I can't begin to speculate. What does it do at startup? Do any of your frameworks do classpath scanning? Do you have a lot of big jars? Zillions of class files? Do you load data from the datastore or blobstore? You need to diagnose the warmup requests that *do* work. Sure, look at appstats - although that will only show issues if your warmup fetches data from services (ie, not the filesystem). However, you really should be able to think about it a few minutes and figure out the problem. App startup is almost 100% deterministic. There are no parameters. You know what it does. Jeff On Fri, Apr 13, 2012 at 9:55 AM, Rui Oliveira rucaso2...@gmail.comwrote: Hi Jeff Thanks for your replay. Your answer was very important for me to start looking to the right part of the problem. Just to clarify :startup time is the time to start a new instance right? How can I analyze the startup time inside the server? I'm looking to the appengine logs, appstats, and speedtracer, but in none of them I can find whats happening during the startup. If I deploy the program I'm getting this kind of logs: 1. 012-04-13 11:33:35.889 /_ah/warmup 500 61244ms 0kb 2. W2012-04-13 11:33:35.787 EXCEPTION com.google.apphosting.api.** Dead**lineExceededException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 UTC and was still e 3. E2012-04-13 11:33:35.790 javax.servlet.**Ser**vletContext log: unavailable javax.servlet.**UnavailableExcep**tion: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 U 4. W2012-04-13 11:33:35.810 Failed startup of context com.google.apphosting.utils.**je**tty.**RuntimeAppEngineWebAppCont** ext@**1811e2c{/,/base/data/home/apps/s~airmenudemo/29.**35817792* *1953 5. C2012-04-13 11:33:35.816 Uncaught exception from servlet javax.servlet.**UnavailableExcep**tion: Initialization failed. at com.google.apphosting.runtime.jetty.AppVersionHandlerMa 6. I2012-04-13 11:33:35.819 This request caused a new process to be started for your application, and thus caused your application code to be loaded for the first time. This requ 7. W2012-04-13 11:33:35.819 A problem was encountered with the process that handled this request, causing it to exit. This is likely to cause a new process to be used for the nex On appstats: ( http://www.airmenudemo.**appsp**ot.com/appstats/statshttp://www.airmenudemo.appspot.com/appstats/stats ) (16) 2012-04-13 11:36:31.587 GET /appstats/ 307 http://www.airmenudemo.appspot.com/appstats/details?time=1334316991587real=215ms api=0ms overhead=0ms (0 RPCs) (17) 2012-04-13 11:35:49.207 GET /symbolmanifest.json 404 http://www.airmenudemo.appspot.com/appstats/details?time=1334316949207real=609ms api=0ms overhead=0ms (0 RPCs) (18) 2012-04-13 11:27:25.255 GET /_ah/warmup 200
Re: [google-appengine] Re: Outages?
Thanks. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/uuKWxLpFYC0J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Hi Jeff Thanks for your replay. Your answer was very important for me to start looking to the right part of the problem. Just to clarify :startup time is the time to start a new instance right? How can I analyze the startup time? Where can I look for? I installed the appstats and SpeedTracer, but they don't help. Hi Jeff Thanks for your replay. Your answer was very important for me to start looking to the right part of the problem. Just to clarify :startup time is the time to start a new instance right? How can I analyze the startup time inside the server? I'm looking to the appengine logs, appstats, and speedtracer, but in none of them I can find whats happening during the startup. If I deploy the program I'm getting this kind of logs: 1. 012-04-13 11:33:35.889 /_ah/warmup 500 61244ms 0kb 2. W2012-04-13 11:33:35.787 EXCEPTION com.google.apphosting.api.DeadlineExceededException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 UTC and was still e 3. E2012-04-13 11:33:35.790 javax.servlet.ServletContext log: unavailable javax.servlet.UnavailableException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 U 4. W2012-04-13 11:33:35.810 Failed startup of context com.google.apphosting.utils.jetty.RuntimeAppEngineWebAppContext@1811e2c{/,/base/data/home/apps/s~airmenudemo/29.358177921953 5. C2012-04-13 11:33:35.816 Uncaught exception from servlet javax.servlet.UnavailableException: Initialization failed. at com.google.apphosting.runtime.jetty.AppVersionHandlerMa 6. I2012-04-13 11:33:35.819 This request caused a new process to be started for your application, and thus caused your application code to be loaded for the first time. This requ 7. W2012-04-13 11:33:35.819 A problem was encountered with the process that handled this request, causing it to exit. This is likely to cause a new process to be used for the nex On appstats: ( http://www.airmenudemo.appspot.com/appstats/stats ) (16) 2012-04-13 11:36:31.587 GET /appstats/ 307 http://www.airmenudemo.appspot.com/appstats/details?time=1334316991587real=215ms api=0ms overhead=0ms (0 RPCs) (17) 2012-04-13 11:35:49.207 GET /symbolmanifest.json 404 http://www.airmenudemo.appspot.com/appstats/details?time=1334316949207real=609ms api=0ms overhead=0ms (0 RPCs) (18) 2012-04-13 11:27:25.255 GET /_ah/warmup 200 http://www.airmenudemo.appspot.com/appstats/details?time=1334316445255real=441ms api=0ms overhead=0ms (0 RPCs) As you can see after deploy the appstats don't log nothing. After deploy I can't even open appstats. After some minutes / hours everything starts ok. Thanks Rui -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/pBbSCmsBtk0J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Hi Jeff Thanks for your replay. Your answer was very important for me to start looking to the right part of the problem. Just to clarify :startup time is the time to start a new instance right? How can I analyze the startup time inside the server? I'm looking to the appengine logs, appstats, and speedtracer, but in none of them I can find whats happening during the startup. If I deploy the program I'm getting this kind of logs: 1. 012-04-13 11:33:35.889 /_ah/warmup 500 61244ms 0kb 2. W2012-04-13 11:33:35.787 EXCEPTION com.google.apphosting.api.DeadlineExceededException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 UTC and was still e 3. E2012-04-13 11:33:35.790 javax.servlet.ServletContext log: unavailable javax.servlet.UnavailableException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 U 4. W2012-04-13 11:33:35.810 Failed startup of context com.google.apphosting.utils.jetty.RuntimeAppEngineWebAppContext@1811e2c{/,/base/data/home/apps/s~airmenudemo/29.358177921953 5. C2012-04-13 11:33:35.816 Uncaught exception from servlet javax.servlet.UnavailableException: Initialization failed. at com.google.apphosting.runtime.jetty.AppVersionHandlerMa 6. I2012-04-13 11:33:35.819 This request caused a new process to be started for your application, and thus caused your application code to be loaded for the first time. This requ 7. W2012-04-13 11:33:35.819 A problem was encountered with the process that handled this request, causing it to exit. This is likely to cause a new process to be used for the nex On appstats: ( http://www.airmenudemo.appspot.com/appstats/stats ) (16) 2012-04-13 11:36:31.587 GET /appstats/ 307 http://www.airmenudemo.appspot.com/appstats/details?time=1334316991587real=215ms api=0ms overhead=0ms (0 RPCs) (17) 2012-04-13 11:35:49.207 GET /symbolmanifest.json 404 http://www.airmenudemo.appspot.com/appstats/details?time=1334316949207real=609ms api=0ms overhead=0ms (0 RPCs) (18) 2012-04-13 11:27:25.255 GET /_ah/warmup 200 http://www.airmenudemo.appspot.com/appstats/details?time=1334316445255real=441ms api=0ms overhead=0ms (0 RPCs) As you can see after deploy the appstats don't log nothing. After deploy I can't even open appstats. After some minutes / hours everything starts ok. Thanks Rui -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/frES_he8DHYJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Right. The problem is the 61244ms that it takes to start your app. How long does it normally take? Look at past warmup requests (the ones that work) and see how long they take. My guess is that the number is close to 60s. If GAE gets marginally slower, it pushes you over the edge. As for why your app takes so long to startup, I can't begin to speculate. What does it do at startup? Do any of your frameworks do classpath scanning? Do you have a lot of big jars? Zillions of class files? Do you load data from the datastore or blobstore? You need to diagnose the warmup requests that *do* work. Sure, look at appstats - although that will only show issues if your warmup fetches data from services (ie, not the filesystem). However, you really should be able to think about it a few minutes and figure out the problem. App startup is almost 100% deterministic. There are no parameters. You know what it does. Jeff On Fri, Apr 13, 2012 at 9:55 AM, Rui Oliveira rucaso2...@gmail.com wrote: Hi Jeff Thanks for your replay. Your answer was very important for me to start looking to the right part of the problem. Just to clarify :startup time is the time to start a new instance right? How can I analyze the startup time inside the server? I'm looking to the appengine logs, appstats, and speedtracer, but in none of them I can find whats happening during the startup. If I deploy the program I'm getting this kind of logs: 1. 012-04-13 11:33:35.889 /_ah/warmup 500 6**1244ms 0kb 2. W2012-04-13 11:33:35.787 EXCEPTION com.google.apphosting.api.**DeadlineExceededException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 UTC and was still e 3. E2012-04-13 11:33:35.790 javax.servlet.**ServletContext log: unavailable javax.servlet.**UnavailableException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 U 4. W2012-04-13 11:33:35.810 Failed startup of context com.google.apphosting.utils.**jetty.**RuntimeAppEngineWebAppContext@** 1811e2c{/,/base/data/home/**apps/s~airmenudemo/29.**358177921953 5. C2012-04-13 11:33:35.816 Uncaught exception from servlet javax.servlet.**UnavailableException: Initialization failed. at com.google.apphosting.runtime.**jetty.AppVersionHandlerMa 6. I2012-04-13 11:33:35.819 This request caused a new process to be started for your application, and thus caused your application code to be loaded for the first time. This requ 7. W2012-04-13 11:33:35.819 A problem was encountered with the process that handled this request, causing it to exit. This is likely to cause a new process to be used for the nex On appstats: ( http://www.airmenudemo.**appspot.com/appstats/statshttp://www.airmenudemo.appspot.com/appstats/stats ) (16) 2012-04-13 11:36:31.587 GET /appstats/ 307 http://www.airmenudemo.appspot.com/appstats/details?time=1334316991587real=215ms api=0ms overhead=0ms (0 RPCs) (17) 2012-04-13 11:35:49.207 GET /symbolmanifest.json 404 http://www.airmenudemo.appspot.com/appstats/details?time=1334316949207real=609ms api=0ms overhead=0ms (0 RPCs) (18) 2012-04-13 11:27:25.255 GET /_ah/warmup 200 http://www.airmenudemo.appspot.com/appstats/details?time=1334316445255real=441ms api=0ms overhead=0ms (0 RPCs) As you can see after deploy the appstats don't log nothing. After deploy I can't even open appstats. After some minutes / hours everything starts ok. Thanks Rui -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/frES_he8DHYJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
RE: [google-appengine] Re: Outages?
My guess is that the number is close to 60s. If GAE gets marginally slower, it pushes you over the edge. I'll be doing a video very shortly about how to keep your Instances from dying on startup. But one easy way to tell if this is the issue is to up your instance size AND test that you are having the same issue on a Dynamic Backend. If your issues magically go away, but your startups take 65 seconds, you know that you need to optimize your startup. One should also watch the Instance memory usage in the dash board. Often people have instances that just spun up and are sitting at 119 of 128m of memory usage and then wonder why their instances recycle so often. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
FWIW, warmup requests and instance startup has been very very inconsistent the past couple of weeks. I've had an app that usually took 2-3S show up in the logs as DeadLineExceeded and causing our integration tests to fail when originating from the continuous integration server and it's very sporadic. So I think AE has been more than marginally slower as of late. I think the lesson learned is to avoid starting up new instances, and put your whole app in one small file. :) On Friday, April 13, 2012 10:51:15 AM UTC-4, Jeff Schnitzer wrote: Right. The problem is the 61244ms that it takes to start your app. How long does it normally take? Look at past warmup requests (the ones that work) and see how long they take. My guess is that the number is close to 60s. If GAE gets marginally slower, it pushes you over the edge. As for why your app takes so long to startup, I can't begin to speculate. What does it do at startup? Do any of your frameworks do classpath scanning? Do you have a lot of big jars? Zillions of class files? Do you load data from the datastore or blobstore? You need to diagnose the warmup requests that *do* work. Sure, look at appstats - although that will only show issues if your warmup fetches data from services (ie, not the filesystem). However, you really should be able to think about it a few minutes and figure out the problem. App startup is almost 100% deterministic. There are no parameters. You know what it does. Jeff On Fri, Apr 13, 2012 at 9:55 AM, Rui Oliveira rucaso2...@gmail.comwrote: Hi Jeff Thanks for your replay. Your answer was very important for me to start looking to the right part of the problem. Just to clarify :startup time is the time to start a new instance right? How can I analyze the startup time inside the server? I'm looking to the appengine logs, appstats, and speedtracer, but in none of them I can find whats happening during the startup. If I deploy the program I'm getting this kind of logs: 1. 012-04-13 11:33:35.889 /_ah/warmup 500 6**1244ms 0kb 2. W2012-04-13 11:33:35.787 EXCEPTION com.google.apphosting.api.**DeadlineExceededException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 UTC and was still e 3. E2012-04-13 11:33:35.790 javax.servlet.**ServletContext log: unavailable javax.servlet.**UnavailableException: This request (08f66682e6ba5919) started at 2012/04/13 11:32:35.741 U 4. W2012-04-13 11:33:35.810 Failed startup of context com.google.apphosting.utils.**jetty.**RuntimeAppEngineWebAppContext@** 1811e2c{/,/base/data/home/**apps/s~airmenudemo/29.**358177921953 5. C2012-04-13 11:33:35.816 Uncaught exception from servlet javax.servlet.**UnavailableException: Initialization failed. at com.google.apphosting.runtime.**jetty.AppVersionHandlerMa 6. I2012-04-13 11:33:35.819 This request caused a new process to be started for your application, and thus caused your application code to be loaded for the first time. This requ 7. W2012-04-13 11:33:35.819 A problem was encountered with the process that handled this request, causing it to exit. This is likely to cause a new process to be used for the nex On appstats: ( http://www.airmenudemo.**appspot.com/appstats/statshttp://www.airmenudemo.appspot.com/appstats/stats ) (16) 2012-04-13 11:36:31.587 GET /appstats/ 307 http://www.airmenudemo.appspot.com/appstats/details?time=1334316991587real=215ms api=0ms overhead=0ms (0 RPCs) (17) 2012-04-13 11:35:49.207 GET /symbolmanifest.json 404 http://www.airmenudemo.appspot.com/appstats/details?time=1334316949207real=609ms api=0ms overhead=0ms (0 RPCs) (18) 2012-04-13 11:27:25.255 GET /_ah/warmup 200 http://www.airmenudemo.appspot.com/appstats/details?time=1334316445255real=441ms api=0ms overhead=0ms (0 RPCs) As you can see after deploy the appstats don't log nothing. After deploy I can't even open appstats. After some minutes / hours everything starts ok. Thanks Rui -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/frES_he8DHYJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/nVN_RoUpA7sJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this
Re: [google-appengine] Re: Outages?
Sounds like your startup time exceeds the max (60s?) time for a request. You need to cut down your app startup time. Jeff On Thu, Apr 12, 2012 at 5:00 AM, Rui Oliveira rucaso2...@gmail.com wrote: Hi, My appId is air-menu1. HRD. I'm getting this kind of errors always after deploy. Usually after deploy the site only cames alive after 10 - 30 minutes. Yesterday for the first time, 24 hours after deploy the site stop completely during 6 hours. 6 Continuous hours without answering one single request. This start without any kind of modification on the program or database. Am I alone? No one more is having this kind of problems? After this kind of errors start always that I refersh the browser the server starts a new instance... So it's easy to have a lot of instances in some minutes. I'm waiting to solve this issue to start business. My company has been working on this site during the last 10 month... Please Help Me. This is very very serious... Sincerely Rui 2012-04-12 01:06:30.764 /com.phonemenu.conf.Configure/myRemoteService500 70285ms 0kb Mozilla/5.0 (iPad; CPU OS 5_1 like Mac OS X) AppleWebKit/534.46 (KHTML, like Gecko) Version/5.1 Mobile/9B176 Safari/7534.48.3 188.83.217.114 - - [11/Apr/2012:17:06:30 -0700] POST /com.phonemenu.conf.Configure/myRemoteService HTTP/1.1 500 0 http://www.airmenu.com/Configure.html; Mozilla/5.0 (iPad; CPU OS 5_1 like Mac OS X) AppleWebKit/534.46 (KHTML, like Gecko) Version/5.1 Mobile/9B176 Safari/7534.48.3 www.airmenu.com ms=70286 cpu_ms=24714 api_cpu_ms=0 cpm_usd=0.686616 loading_request=1 pending_ms=8036 exit_code=104 instance=00c61b117c6af5c03528fba06c291f298706e3 C2012-04-12 01:06:30.708 Uncaught exception from servlet com.google.apphosting.runtime.HardDeadlineExceededError: This request (bc91fe1c174b394d) started at 2012/04/12 00:05:29.992 UTC and was still executing at 2012/04/12 00:06:30.679 UTC. at java.security.AccessController.getStackAccessControlContext(Native Method) at java.security.AccessController.checkPermission(AccessController.java:540) at java.lang.SecurityManager.checkPermission(SecurityManager.java:549) at com.google.apphosting.runtime.security.CustomSecurityManager.checkPermission(CustomSecurityManager.java:56) at java.lang.SecurityManager.checkRead(SecurityManager.java:888) at java.io.File.lastModified(File.java:909) at java.util.zip.ZipFile.init(ZipFile.java:143) at java.util.jar.JarFile.init(JarFile.java:150) at java.util.jar.JarFile.init(JarFile.java:87) at sun.misc.URLClassPath$JarLoader.getJarFile(URLClassPath.java:919) at sun.misc.URLClassPath$JarLoader.access$900(URLClassPath.java:723) at sun.misc.URLClassPath$JarLoader$1.run(URLClassPath.java:854) at java.security.AccessController.doPrivileged(Native Method) at sun.misc.URLClassPath$JarLoader.ensureOpenSynchronized(URLClassPath.java:846) at sun.misc.URLClassPath$JarLoader.ensureOpen(URLClassPath.java:838) at sun.misc.URLClassPath$JarLoader.init(URLClassPath.java:785) at sun.misc.URLClassPath$JarLoader.init(URLClassPath.java:743) at sun.misc.URLClassPath$3.run(URLClassPath.java:412) at java.security.AccessController.doPrivileged(Native Method) at sun.misc.URLClassPath.getLoader(URLClassPath.java:395) at sun.misc.URLClassPath.getLoader(URLClassPath.java:371) at sun.misc.URLClassPath.findResource(URLClassPath.java:201) at java.net.URLClassLoader$2.run(URLClassLoader.java:379) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findResource(URLClassLoader.java:376) at com.google.apphosting.runtime.security.UserClassLoader.findResource(UserClassLoader.java:723) at java.lang.ClassLoader.getResource(ClassLoader.java:977) at org.mortbay.resource.Resource.newSystemResource(Resource.java:203) at org.mortbay.jetty.webapp.WebXmlConfiguration.configureDefaults(WebXmlConfiguration.java:159) at org.mortbay.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1230) at org.mortbay.jetty.handler.ContextHandler.doStart(ContextHandler.java:517) at org.mortbay.jetty.webapp.WebAppContext.doStart(WebAppContext.java:467) at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50) at com.google.apphosting.runtime.jetty.AppVersionHandlerMap.createHandler(AppVersionHandlerMap.java:202) at com.google.apphosting.runtime.jetty.AppVersionHandlerMap.getHandler(AppVersionHandlerMap.java:171) at com.google.apphosting.runtime.jetty.JettyServletEngineAdapter.serviceRequest(JettyServletEngineAdapter.java:123) at com.google.apphosting.runtime.JavaRuntime$RequestRunnable.run(JavaRuntime.java:446) at com.google.tracing.TraceContext$TraceContextRunnable.runInContext(TraceContext.java:449) at
Re: [google-appengine] Re: Outages?
Hi, Thanks for your reply. The strange is that I have 5 modules inside my application. All of the requests to the different modules fail at the same time. All the modules call different functions. All my functions are very small. My database is really small at the moment. I don't do nothing on the startup. Should I? My client side after start just call one or two functions on the server side. Very simple and small functions. I turn off precompile just to test but it doesn't solve. Yesterday night after 6 hours with the application stopped, just started running like magic. When I wake up today morning the app was on the air completely ok. Sincerely Rui Oliveira -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/FQ5O-j910HgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Look in the logs for startup requests, and check the amount of time they take to complete. It's possible you have an app that normally loads in a few seconds, but a hiccup at google extended those few seconds into a time that extended past the deadline. This would be something to complain about. However, since it sounds like you haven't checked those numbers, it's likely that your app normally takes large amounts of time to start up and normal load fluctuation pushed it over the edge. That means this problem is going to happen again and again and again. Check your startup time. If it's over 30s, you should start investigating ways to fix it. Keep in mind that the biggest problem for app startup is (usually) loading jars off the incredibly-slow filesystem. The usual culprits are having a large number of jars and/or using tools like Spring that do classpath scanning. Jeff On Thu, Apr 12, 2012 at 11:54 AM, Rui Oliveira rucaso2...@gmail.com wrote: Hi, Thanks for your reply. The strange is that I have 5 modules inside my application. All of the requests to the different modules fail at the same time. All the modules call different functions. All my functions are very small. My database is really small at the moment. I don't do nothing on the startup. Should I? My client side after start just call one or two functions on the server side. Very simple and small functions. I turn off precompile just to test but it doesn't solve. Yesterday night after 6 hours with the application stopped, just started running like magic. When I wake up today morning the app was on the air completely ok. Sincerely Rui Oliveira -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/FQ5O-j910HgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Just to add to Jeff's: do use Appstats too. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/vHytJH0kcDsJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Potential fix: set performance sliders to auto. No, that doesn't help in general. I'm running from the start with auto and having the problem. It's still not fixed; would be nice to get some feedback from Google ?! On Wed, Mar 14, 2012 at 3:14 AM, Mauricio Aristizabal aris...@gmail.comwrote: Potential fix: set performance sliders to auto. This is purely anecdotal but it might mean something: After reading some post this afternoon about the instance settings not really working I switched to AUTO idle instances and AUTO pending latency (before they were set to 1-1 and 25ms-1.5s respectively). That was about 5 hours ago, and within an hour or so everything started working fine. Before that, problems had been continuous as far as I could tell for 6 days or so. Or maybe the AppEngine guys finally got it under control. On Tue, Mar 13, 2012 at 5:58 PM, stephenp slpe...@gmail.com wrote: One more here. appid: carglyplatform (HRD) It's been flaky off-and-on for a couple weeks, yesterday was better, today bad again. Lots of warmup errors, instance restarts, errors in general. Stephen On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/C5nrBOmPaPcJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
We've been working on addressing the issues with HRD apps. Your experience is probably a coincidence. There is a very small section of apps that will have a few requests (very small %) that will be a LOT slower. We've scheduled a maintenance period for March 19th that will attempt to address issues with master/slave. You can read more about it here (scroll to the bottom for the correct time): https://groups.google.com/forum/?fromgroups#!topic/google-appengine-downtime-notify/CO_x02OF9Ak In general, everyone should try to migrate your application to High Replication when they can because we have a much higher speed of iteration when implementing different fixes for production issues (I have an earlier post about why this matters - we can make almost any production changes we want to HR apps without causing app downtime). At this point, I would not even create development or staging applications using master/slave because some behaviors (eventually consistency on global queries without an entity group root) differ between master/slave and high replication. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Wed, Mar 14, 2012 at 1:35 AM, Nicanor Babula nicanor.bab...@gmail.comwrote: Now my app is working fine. What happened? I saw at some point my app's graphs reset and some huge values for instance hours values. Afterwards, the instance hours counters turned to normal and my app stpped raising errors. If there was the GAE team working on it, thank you very much, but I think you owe us at least an explanation. What should I say to my customers? The issue has been fixed or they should be expecting outages in the following days? My app is on HRD. Thanks, Cristian Il giorno martedì 6 marzo 2012 22:17:37 UTC+1, Adam Sherman ha scritto: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/hw3niAzjSIAJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Hello all. Only to share my information: My app is doing something like this: JS=xpto JS_OPTOUT=xpto1 if request.cookie.has_key(optout) or request.headers.has_key(DNT) response.write(JS_OPTOUT) else response.write(JS) The normal latency is ~3ms Now I'm seeing 13.9 ms Earlier this morning I saw ~1000ms milisec/req 24hours ago I saw ~2400ms milisec/req [image: Requests/Second (24 hrs)] I noticed this behavior on all my apps for at least the last 8 days. I'm using python 2.5/HRD. On Wed, Mar 14, 2012 at 5:35 AM, Nicanor Babula nicanor.bab...@gmail.comwrote: Now my app is working fine. What happened? I saw at some point my app's graphs reset and some huge values for instance hours values. Afterwards, the instance hours counters turned to normal and my app stpped raising errors. If there was the GAE team working on it, thank you very much, but I think you owe us at least an explanation. What should I say to my customers? The issue has been fixed or they should be expecting outages in the following days? My app is on HRD. Thanks, Cristian Il giorno martedì 6 marzo 2012 22:17:37 UTC+1, Adam Sherman ha scritto: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/hw3niAzjSIAJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/Hyli1fDXHz8J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
I'm encountering the same issue with HR app. The spike got started in about ~1 hour from now. AppID: cmsevobg Datastore: HR On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/YUHiXXkGPAgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. attachment: Screen Shot 2012-03-13 at 10.50.28 AM.png
Re: [google-appengine] Re: Outages?
Hello, Same thing here, since around an hour ago: AppID: fiveorbsgame fiveorbsgame-test On Tue, Mar 13, 2012 at 9:59 AM, Miroslav Genov mge...@gmail.com wrote: I'm encountering the same issue with HR app. The spike got started in about ~1 hour from now. AppID: cmsevobg Datastore: HR On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/YUHiXXkGPAgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Ops my mistake, our HR id is cmsevobg-hr. Normally we get initialization time for ~10 seconds, now the time is 38 seconds using F4 CPU model. Here are some exception traces that might help 1. Failed startup of context com.google.apphosting.utils.jetty.RuntimeAppEngineWebAppContext@199f8e6{/,/base/data/home/apps/s~cmsevobg-hr/production.357324755677494831} java.lang.NullPointerException at com.google.inject.servlet.GuiceServletContextListener.contextInitialized(GuiceServletContextListener.java:46) at xxx.xxx.xxx.xxx.AppBootstrap.contextInitialized(AdmBootstrap.java:189) at org.mortbay.jetty.handler.ContextHandler.startContext(ContextHandler.java:548) at org.mortbay.jetty.servlet.Context.startContext(Context.java:136) at org.mortbay.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1250) at org.mortbay.jetty.handler.ContextHandler.doStart(ContextHandler.java:517) at org.mortbay.jetty.webapp.WebAppContext.doStart(WebAppContext.java:467) at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50) at com.google.apphosting.runtime.jetty.AppVersionHandlerMap.createHandler(AppVersionHandlerMap.java:202) at com.google.apphosting.runtime.jetty.AppVersionHandlerMap.getHandler(AppVersionHandlerMap.java:171) at com.google.apphosting.runtime.jetty.JettyServletEngineAdapter.serviceRequest(JettyServletEngineAdapter.java:123) at com.google.apphosting.runtime.JavaRuntime$RequestRunnable.run(JavaRuntime.java:422) at com.google.tracing.TraceContext$TraceContextRunnable.runInContext(TraceContext.java:449) at com.google.tracing.TraceContext$TraceContextRunnable$1.run(TraceContext.java:455) at com.google.tracing.TraceContext.runInContext(TraceContext.java:695) at com.google.tracing.TraceContext$AbstractTraceContextCallback.runInInheritedContextNoUnref(TraceContext.java:333) at com.google.tracing.TraceContext$AbstractTraceContextCallback.runInInheritedContext(TraceContext.java:325) at com.google.tracing.TraceContext$TraceContextRunnable.run(TraceContext.java:453) at com.google.apphosting.runtime.ThreadGroupPool$PoolEntry.run(ThreadGroupPool.java:251) at java.lang.Thread.run(Thread.java:679) 2. C2012-03-13 11:02:31.883 Uncaught exception from servlet javax.servlet.UnavailableException: Initialization failed. at com.google.apphosting.runtime.jetty.AppVersionHandlerMap.createHandler(AppVersionHandlerMap.java:211) at com.google.apphosting.runtime.jetty.AppVersionHandlerMap.getHandler(AppVersionHandlerMap.java:171) at com.google.apphosting.runtime.jetty.JettyServletEngineAdapter.serviceRequest(JettyServletEngineAdapter.java:123) at com.google.apphosting.runtime.JavaRuntime$RequestRunnable.run(JavaRuntime.java:422) at com.google.tracing.TraceContext$TraceContextRunnable.runInContext(TraceContext.java:449) at com.google.tracing.TraceContext$TraceContextRunnable$1.run(TraceContext.java:455) at com.google.tracing.TraceContext.runInContext(TraceContext.java:695) at com.google.tracing.TraceContext$AbstractTraceContextCallback.runInInheritedContextNoUnref(TraceContext.java:333) at com.google.tracing.TraceContext$AbstractTraceContextCallback.runInInheritedContext(TraceContext.java:325) at com.google.tracing.TraceContext$TraceContextRunnable.run(TraceContext.java:453) at com.google.apphosting.runtime.ThreadGroupPool$PoolEntry.run(ThreadGroupPool.java:251) at java.lang.Thread.run(Thread.java:679) On Tuesday, March 13, 2012 10:59:18 AM UTC+2, Miroslav Genov wrote: I'm encountering the same issue with HR app. The spike got started in about ~1 hour from now. AppID: cmsevobg Datastore: HR On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're
Re: [google-appengine] Re: Outages?
Same here, very slow and frequent instance startups. I use the HR datastore. I've set idle instances to 2 F4s but I have the impression that the scheduler prefers to start a new instance instead of using one of the available idle ones. -- Pieter Coucke Onthoo BVBA zamtam.com http://www.zamtam.com | cashcontact.comhttp://www.cashcontact.com -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Same thing the last minutes on our app (HRD, Java, Low-Traffic, one instance, no new deployment, simple page just hitting MemCache): Request was aborted after waiting too long to attempt to service your request. -- User sees 500er GAE-Team, what is going on the last days? In my opinion the Google App Engine is unreliable and looks more like a alpha- or beta- cloudenvrionment Please Google share you analysis with us. Cheers Mos 2012/3/13 Sébastien Tromp sebastien.tr...@gmail.com Hello, Same thing here, since around an hour ago: AppID: fiveorbsgame fiveorbsgame-test On Tue, Mar 13, 2012 at 9:59 AM, Miroslav Genov mge...@gmail.com wrote: I'm encountering the same issue with HR app. The spike got started in about ~1 hour from now. AppID: cmsevobg Datastore: HR On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/YUHiXXkGPAgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
My graph showing ms/sec (attached) over last 24h. Average spikes were up to 32 seconds, but I think all the errors were 60+-1 sec. On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/7CPSW2zXP5QJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. attachment: MS-per-request.png
Re: [google-appengine] Re: Outages?
Hello Ikai our app is *betscoreslive*. In our settings master/slave replication is activated but we are NOT using the datastore at all and we have been experiencing the DeadlineExceedExceptions and increased instance number mentioned by the rest of the people in the discussion. Our app is only using memcache. Normal operation during traffic peaks: instances ~25, ~45 requests/second and QPS 1 Now we observe: 12 instances doing nth with ~2 requests/second and QPS 0.2 We had the Deadline exceptions since one week now with small periods of normal operation. During the problematic periods, our app is pretty unresponsive. Will the announced maintenance for master/slave apps will solve the issue for our app as well? thanks in advance Charis On Friday, March 9, 2012 9:15:35 PM UTC+2, Ikai Lan wrote: Hey everyone, Here are a few things that will help: 1. Application IDs (--- if you have nothing else, at least provide this) 2. What is your QPS? 3. What % of your requests are errors? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 7:24 AM, Ronoaldo José de Lana Pereira rpere...@beneficiofacil.com.br wrote: +1 for seeing the same problems on my app. It started to be worse after maintenance on March 7. Em sexta-feira, 9 de março de 2012 08h33min36s UTC-3, Nikolai escreveu: +1 we had to move to our backup systems. Everything is full of 500 errors or hardcore latency. Most of the 500 errors we see aren't even logged so this seems to be a goole problem one abstraction layer above the app. And yes - sometimes we have got the same feeling, that we are the only ones that use appengine in a production setting. You are not alone ;) regards, nikolai Am Dienstag, 6. März 2012 22:17:37 UTC+1 schrieb Adam Sherman: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/yixu1yAlMs4J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/W1KVvW7v-SEJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
This is very disturbing ... Our M/S app is getting higher error rates and some instances take from 15s to 70s to start. We can't do anything about this and even debug what is happening. If there was a issue with our code, they should always take 70s to start! I really can't understand or think about what in our code we are loading the whole world to take all that time... Then the solutions is: move to HRD and ... experience the same load time on app startup??? So, what can I do!!! Sorry to the rude speaking but I'm very concerned. We started the process to get operational support and plan our migration to have the SLA, but if our app startup will continue taking a lot of time I really can imagine what to do... Googlers, does the startup problem is getting solved for both M/S and HRD? Only HRD? None? Is there any thing we can do to avoid that strange behavior of our instances? Instance startup seems to be vital to the application's health: if your app takes to much time to startup then all concurrent requests to that instances die with 500's. This is very odd, and the warmup requests seems to never work. By our observations and other people observations, even if you set a fixed min_idle instances to be always there, they don't serve the traffic and you still get errors! Hope to see some answers, we really liked GAE in our first year woking with the platform, and now I feel completely lost... Best Regards, -Ronoaldo Em terça-feira, 13 de março de 2012 07h51min03s UTC-3, Richard Watson escreveu: My graph showing ms/sec (attached) over last 24h. Average spikes were up to 32 seconds, but I think all the errors were 60+-1 sec. On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/6r3NpHShJcgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
I believe this was related to: http://code.google.com/p/googleappengine/issues/detail?id=7130 This should now be fixed. On Tue, Mar 13, 2012 at 9:59 AM, Miroslav Genov mge...@gmail.com wrote: I'm encountering the same issue with HR app. The spike got started in about ~1 hour from now. AppID: cmsevobg Datastore: HR On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/YUHiXXkGPAgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- Johan Euphrosine (proppy) Developer Programs Engineer Google Developer Relations -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Hi, I believe this was also related to: http://code.google.com/p/googleappengine/issues/detail?id=7130 And should now be fixed. On Tue, Mar 13, 2012 at 10:02 AM, Sébastien Tromp sebastien.tr...@gmail.com wrote: Hello, Same thing here, since around an hour ago: AppID: fiveorbsgame fiveorbsgame-test On Tue, Mar 13, 2012 at 9:59 AM, Miroslav Genov mge...@gmail.com wrote: I'm encountering the same issue with HR app. The spike got started in about ~1 hour from now. AppID: cmsevobg Datastore: HR On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/YUHiXXkGPAgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- Johan Euphrosine (proppy) Developer Programs Engineer Google Developer Relations -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
As Chris pointed earlier in that thread, M/S app are more vulnerable to this kind of transient infrastructure issues because moving them around require a maintenance period. HRD applications are covered by the SLA, replicated around multiple datacenter, and better distributed if we notice an issue impacting one or many of them we can easily take actions without impacting other applications. I strongly suggest to you to try out the self migration tool in your administration console, depending of the size of your data and your write QPS, the read only period needed to migrate your application could be very small: https://appengine.google.com/migrating?app_id=your application id On Tue, Mar 13, 2012 at 2:37 PM, Ronoaldo José de Lana Pereira rpere...@beneficiofacil.com.br wrote: This is very disturbing ... Our M/S app is getting higher error rates and some instances take from 15s to 70s to start. We can't do anything about this and even debug what is happening. If there was a issue with our code, they should always take 70s to start! I really can't understand or think about what in our code we are loading the whole world to take all that time... Then the solutions is: move to HRD and ... experience the same load time on app startup??? So, what can I do!!! Sorry to the rude speaking but I'm very concerned. We started the process to get operational support and plan our migration to have the SLA, but if our app startup will continue taking a lot of time I really can imagine what to do... Googlers, does the startup problem is getting solved for both M/S and HRD? Only HRD? None? Is there any thing we can do to avoid that strange behavior of our instances? Instance startup seems to be vital to the application's health: if your app takes to much time to startup then all concurrent requests to that instances die with 500's. This is very odd, and the warmup requests seems to never work. By our observations and other people observations, even if you set a fixed min_idle instances to be always there, they don't serve the traffic and you still get errors! Hope to see some answers, we really liked GAE in our first year woking with the platform, and now I feel completely lost... Best Regards, -Ronoaldo Em terça-feira, 13 de março de 2012 07h51min03s UTC-3, Richard Watson escreveu: My graph showing ms/sec (attached) over last 24h. Average spikes were up to 32 seconds, but I think all the errors were 60+-1 sec. On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/6r3NpHShJcgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- Johan Euphrosine (proppy) Developer Programs Engineer Google Developer Relations -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
If you are not using the datastore it should be trivial to move your application to the new HRD infrastructure. If you don't need the same appid, just create a new HRD application and deploy your code on it. If you need the same appid, use the self migration tool: https://appengine.google.com/migrating?app_id=your application id Feel free to open a production ticket if you need any assistance migrating your application: http://code.google.com/p/googleappengine/issues/entry?template=Production%20issue On Tue, Mar 13, 2012 at 12:44 PM, charisl charisl...@gmail.com wrote: Hello Ikai our app is *betscoreslive*. In our settings master/slave replication is activated but we are NOT using the datastore at all and we have been experiencing the DeadlineExceedExceptions and increased instance number mentioned by the rest of the people in the discussion. Our app is only using memcache. Normal operation during traffic peaks: instances ~25, ~45 requests/second and QPS 1 Now we observe: 12 instances doing nth with ~2 requests/second and QPS 0.2 We had the Deadline exceptions since one week now with small periods of normal operation. During the problematic periods, our app is pretty unresponsive. Will the announced maintenance for master/slave apps will solve the issue for our app as well? thanks in advance Charis On Friday, March 9, 2012 9:15:35 PM UTC+2, Ikai Lan wrote: Hey everyone, Here are a few things that will help: 1. Application IDs (--- if you have nothing else, at least provide this) 2. What is your QPS? 3. What % of your requests are errors? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 7:24 AM, Ronoaldo José de Lana Pereira rpere...@beneficiofacil.com.**br rpere...@beneficiofacil.com.br wrote: +1 for seeing the same problems on my app. It started to be worse after maintenance on March 7. Em sexta-feira, 9 de março de 2012 08h33min36s UTC-3, Nikolai escreveu: +1 we had to move to our backup systems. Everything is full of 500 errors or hardcore latency. Most of the 500 errors we see aren't even logged so this seems to be a goole problem one abstraction layer above the app. And yes - sometimes we have got the same feeling, that we are the only ones that use appengine in a production setting. You are not alone ;) regards, nikolai Am Dienstag, 6. März 2012 22:17:37 UTC+1 schrieb Adam Sherman: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/** msg/google-appengine/-/**yixu1yAlMs4Jhttps://groups.google.com/d/msg/google-appengine/-/yixu1yAlMs4J . To post to this group, send email to google-appengine@googlegroups.**comgoogle-appengine@googlegroups.com . To unsubscribe from this group, send email to google-appengine+unsubscribe@**googlegroups.comgoogle-appengine%2bunsubscr...@googlegroups.com . For more options, visit this group at http://groups.google.com/** group/google-appengine?hl=enhttp://groups.google.com/group/google-appengine?hl=en . -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/W1KVvW7v-SEJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- Johan Euphrosine (proppy) Developer Programs Engineer Google Developer Relations -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
What is your application id? Feel free to open a production issue, if you want to investigate this offthread: http://code.google.com/p/googleappengine/issues/entry?template=Production%20issue On Tue, Mar 13, 2012 at 11:44 AM, Mos mosa...@googlemail.com wrote: Same thing the last minutes on our app (HRD, Java, Low-Traffic, one instance, no new deployment, simple page just hitting MemCache): Request was aborted after waiting too long to attempt to service your request. -- User sees 500er GAE-Team, what is going on the last days? In my opinion the Google App Engine is unreliable and looks more like a alpha- or beta- cloudenvrionment Please Google share you analysis with us. Cheers Mos 2012/3/13 Sébastien Tromp sebastien.tr...@gmail.com Hello, Same thing here, since around an hour ago: AppID: fiveorbsgame fiveorbsgame-test On Tue, Mar 13, 2012 at 9:59 AM, Miroslav Genov mge...@gmail.com wrote: I'm encountering the same issue with HR app. The spike got started in about ~1 hour from now. AppID: cmsevobg Datastore: HR On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/YUHiXXkGPAgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- Johan Euphrosine (proppy) Developer Programs Engineer Google Developer Relations -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
What is your application id? Did you already fill a production issue? On Tue, Mar 13, 2012 at 3:10 PM, Jason jaso...@gmail.com wrote: I'm using HRD and am still getting tons of 500s since early this morning. On Mar 13, 8:52 am, Johan Euphrosine pro...@google.com wrote: I believe this was related to: http://code.google.com/p/googleappengine/issues/detail?id=7130 This should now be fixed. On Tue, Mar 13, 2012 at 9:59 AM, Miroslav Genov mge...@gmail.com wrote: I'm encountering the same issue with HR app. The spike got started in about ~1 hour from now. AppID: cmsevobg Datastore: HR On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/YUHiXXkGPAgJ. To post to this group, send email to google-appengine@googlegroups.com . To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- Johan Euphrosine (proppy) Developer Programs Engineer Google Developer Relations -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- Johan Euphrosine (proppy) Developer Programs Engineer Google Developer Relations -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
What is your application id? krisen-talk(www.krisentalk.de) Feel free to open a production issue There is already an issue from someone else (following this thread a lot of people are affected): http://code.google.com/p/googleappengine/issues/detail?id=7133 Johan, what's going on with GAE the last days? It doesn't feel like a PaaS in production mode. Perhaps Google should reintroduce the Beta status. ;) On Tue, Mar 13, 2012 at 3:10 PM, Johan Euphrosine pro...@google.com wrote: What is your application id? Feel free to open a production issue, if you want to investigate this offthread: http://code.google.com/p/googleappengine/issues/entry?template=Production%20issue On Tue, Mar 13, 2012 at 11:44 AM, Mos mosa...@googlemail.com wrote: Same thing the last minutes on our app (HRD, Java, Low-Traffic, one instance, no new deployment, simple page just hitting MemCache): Request was aborted after waiting too long to attempt to service your request. -- User sees 500er GAE-Team, what is going on the last days? In my opinion the Google App Engine is unreliable and looks more like a alpha- or beta- cloudenvrionment Please Google share you analysis with us. Cheers Mos 2012/3/13 Sébastien Tromp sebastien.tr...@gmail.com Hello, Same thing here, since around an hour ago: AppID: fiveorbsgame fiveorbsgame-test On Tue, Mar 13, 2012 at 9:59 AM, Miroslav Genov mge...@gmail.comwrote: I'm encountering the same issue with HR app. The spike got started in about ~1 hour from now. AppID: cmsevobg Datastore: HR On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr On Tuesday, March 13, 2012 10:36:38 AM UTC+2, Richard Watson wrote: In case you're keeping track of issues thinking it's generally cleared up: I'm on HR and have noticed higher latencies (couple seconds instead of e.g. 300ms) lately and sometimes higher error rates (a few instead of 0-3). Yesterday over about 6 hours I got a ton of 60-second requests that threw 500's with accompanying messages [1], usually on memcache sets hitting a deadline exceeded. Also, over the last couple weeks I've been running 3 instances permanently despite sometimes shutting them down manually. Usually I get by on one just fine with bursts of 2 or 3. I've noticed that one instance serves the majority of traffic with the other two serving maybe 50 requests over many hours, so the shutdown isn't aggressive enough. This request caused a new process to be started for your application... + A problem was encountered with the process that handled this request, causing it to exit in the same request. App-id: 2dumo-hr -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/YUHiXXkGPAgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- Johan Euphrosine (proppy) Developer Programs Engineer Google Developer Relations -- You received this message because
Re: [google-appengine] Re: Outages?
We migrated our app to the HRD last night. With 4 GB of data quota in around 2M entities it took 20-30 minutes. On the new app, we are seeing response times at about 1% of the MS app - 100 times faster. Our app was read only for less than three seconds - enough to affect 10 requests from 2 users. We decided to do this without a very thorough testing because the app was broken across all of our users on M/S, and will only break in a small percentage of users on HRD. When it does break, it will be pretty harmless, and we expect to fix the inconsistency problems by the end of today. We just could not wait until Monday for a possible fix for the MS app. If you've got a similar setup or similar needs, I recommend a hastier-than-usual switch. Just watch out for the email limits~ On Tue, Mar 13, 2012 at 9:04 AM, Johan Euphrosine pro...@google.com wrote: If you are not using the datastore it should be trivial to move your application to the new HRD infrastructure. If you don't need the same appid, just create a new HRD application and deploy your code on it. If you need the same appid, use the self migration tool: https://appengine.google.com/migrating?app_id=your application id Feel free to open a production ticket if you need any assistance migrating your application: http://code.google.com/p/googleappengine/issues/entry?template=Production%20issue On Tue, Mar 13, 2012 at 12:44 PM, charisl charisl...@gmail.com wrote: Hello Ikai our app is *betscoreslive*. In our settings master/slave replication is activated but we are NOT using the datastore at all and we have been experiencing the DeadlineExceedExceptions and increased instance number mentioned by the rest of the people in the discussion. Our app is only using memcache. Normal operation during traffic peaks: instances ~25, ~45 requests/second and QPS 1 Now we observe: 12 instances doing nth with ~2 requests/second and QPS 0.2 We had the Deadline exceptions since one week now with small periods of normal operation. During the problematic periods, our app is pretty unresponsive. Will the announced maintenance for master/slave apps will solve the issue for our app as well? thanks in advance Charis On Friday, March 9, 2012 9:15:35 PM UTC+2, Ikai Lan wrote: Hey everyone, Here are a few things that will help: 1. Application IDs (--- if you have nothing else, at least provide this) 2. What is your QPS? 3. What % of your requests are errors? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 7:24 AM, Ronoaldo José de Lana Pereira rpere...@beneficiofacil.com.**br rpere...@beneficiofacil.com.brwrote: +1 for seeing the same problems on my app. It started to be worse after maintenance on March 7. Em sexta-feira, 9 de março de 2012 08h33min36s UTC-3, Nikolai escreveu: +1 we had to move to our backup systems. Everything is full of 500 errors or hardcore latency. Most of the 500 errors we see aren't even logged so this seems to be a goole problem one abstraction layer above the app. And yes - sometimes we have got the same feeling, that we are the only ones that use appengine in a production setting. You are not alone ;) regards, nikolai Am Dienstag, 6. März 2012 22:17:37 UTC+1 schrieb Adam Sherman: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/** msg/google-appengine/-/**yixu1yAlMs4Jhttps://groups.google.com/d/msg/google-appengine/-/yixu1yAlMs4J . To post to this group, send email to google-appengine@googlegroups.** com google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscribe@**googlegroups.comgoogle-appengine%2bunsubscr...@googlegroups.com . For more options, visit this group at http://groups.google.com/** group/google-appengine?hl=enhttp://groups.google.com/group/google-appengine?hl=en . -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/W1KVvW7v-SEJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- Johan Euphrosine (proppy) Developer Programs Engineer Google Developer Relations -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to
Re: [google-appengine] Re: Outages?
I have had outages throughout this morning. My app is bedbuzzserver.appspot.com, Java app on HR. From 2am until 7.48am. I have filed Production issue 7138. My instances keep getting reset (and then taking too long to start up), my '*Current Load' *logs get reset to 0 (e.g. bedbuzzserver.appspot.com/logon goes from 100 calls today, to 0) -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/mRSLiWjZgc0J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
RE: [google-appengine] Re: Outages?
We have noticed that many of the downtimes Pingdom reports are the result of AppsForDomains. If you hit your app from another app, or via AOL, or another provider that has a peering arrangement with AppEngine it will be up. I'm calling this AppsForDomains issues because typically during these outages we get error pages in AppsforDomains admin pages. In these instances Green Checks will show in the status for Appengine. But your app will fail to resolve. From: google-appengine@googlegroups.com [mailto:google-appengine@googlegroups.com] On Behalf Of Rick Mangi Sent: Tuesday, March 13, 2012 12:06 PM To: google-appengine@googlegroups.com Subject: [google-appengine] Re: Outages? Same here. I use pingdom to monitor my site and it's been down on and off for the past 24 hours to the tune of around 30 minutes. I opened an enterprise support ticket but haven't heard anything back. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/TAjLzJanor4J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Ikai, I have not moved to HRD yet. But I am pretty sure I am the only user of my application. However, ever since couples of days back, not only that it is slow but I kept on running out of quota, despite the fact that I turned on the billing. I have switched off billing yesterday as it didn't help me. Can I get a refund? The request is due to the fact that I am hitting the app less than 20 times per day, and I run out of quotas. Enabled billing didn't help. I am perplexed with a single user, how would it be possible to exhaust 0.05 million operations? see below as an example - Datastore Read Operations [image: 100%] 100%0.05 of 0.05 Million Ops0.00$0.70/ Million Ops$0.00 I don't want publish my appid, if you would like to know, I get text you at your google voice number. If that helps. Thanks for any reply. On Monday, March 12, 2012 2:52:29 PM UTC-4, Ikai Lan wrote: Hi Riley, That's a legitimate question, and one that we haven't officially answered yet. It's certainly the direction that things have been moving simply due to the nature of production management. Given that the SLA applies to HRD and not master/slave applications, you are definitely going to get a better quality of service migrating to HRD. In fact, I strongly advise that you do so. One challenge that we have when dealing with issues is to decide whether we should do emergency maintenance that requires downtime. With any production system, it's not always guaranteed that maintenance will result in issues being completely resolved, which would be really bad for app developers. At what threshold do we determine that a downtime with no guarantee of addressing the issues is worthwhile? Global 0.1% error rate? 1%? The call is not always clear cut because those errors may not be evenly distributed, and the impact may be huge, or it may be small. With master/slave applications, we do what we can to address the short term symptoms as well as the underlying system issues without impacting serving, which is often an order of magnitude more difficult (It kind of reminds me of that scene in Indiana Jones where he takes an artifact, swapping it with a bag of sand as quickly as possible to try to avoid setting off traps. Pillaging of historic artifacts is way easier when it's not dangerous, not speaking from personal experience). When your application runs on High Replication, the call is easy: there's no downtime required in 99% of cases, so we perform the maintenance right away because if it doesn't address the issue, there's no serving downtime for users. If you're not subscribed to downtime-notify, I recommend that you do so. Announcements like this will NOT and never will be moving to StackOverflow: https://groups.google.com/forum/?fromgroups#!forum/google-appengine-downtime-notify We may be announcing a maintenance in the very near future that will impact the serving of master/slave applications. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 9:46 AM, Riley rileyl...@gmail.com wrote: Ikai, it sounds like support for HR apps is being prioritized. Is that the case? Should we expect that to be the case in the future? Sorry if that's documented somewhere already~ Riley On Monday, March 12, 2012 11:44:59 AM UTC-5, Riley wrote: Our appid is activegrade, we use the m/s datastore, and get from 0-10 QPS throughout the day. Normally we have 1-4 instances running, but since this seems *mostly* related to startup, we dedicated 10 idle resident instances to run all the time. This covers us a little, but still, when a user triggers a new instance, they get the 60+ second wait and then an error. Ugh! Our costs are relatively minor - about $20 a day now that we are running these 10 ordinarily unnecessary instances - but this is a big cost for us, and embarrassing too. On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/VzKRK5UG96MJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/O-8tEZUbUEYJ. To post to this group, send email to
Re: [google-appengine] Re: Outages?
Potential fix: set performance sliders to auto. This is purely anecdotal but it might mean something: After reading some post this afternoon about the instance settings not really working I switched to AUTO idle instances and AUTO pending latency (before they were set to 1-1 and 25ms-1.5s respectively). That was about 5 hours ago, and within an hour or so everything started working fine. Before that, problems had been continuous as far as I could tell for 6 days or so. Or maybe the AppEngine guys finally got it under control. On Tue, Mar 13, 2012 at 5:58 PM, stephenp slpe...@gmail.com wrote: One more here. appid: carglyplatform (HRD) It's been flaky off-and-on for a couple weeks, yesterday was better, today bad again. Lots of warmup errors, instance restarts, errors in general. Stephen On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/C5nrBOmPaPcJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Hi Riley, That's a legitimate question, and one that we haven't officially answered yet. It's certainly the direction that things have been moving simply due to the nature of production management. Given that the SLA applies to HRD and not master/slave applications, you are definitely going to get a better quality of service migrating to HRD. In fact, I strongly advise that you do so. One challenge that we have when dealing with issues is to decide whether we should do emergency maintenance that requires downtime. With any production system, it's not always guaranteed that maintenance will result in issues being completely resolved, which would be really bad for app developers. At what threshold do we determine that a downtime with no guarantee of addressing the issues is worthwhile? Global 0.1% error rate? 1%? The call is not always clear cut because those errors may not be evenly distributed, and the impact may be huge, or it may be small. With master/slave applications, we do what we can to address the short term symptoms as well as the underlying system issues without impacting serving, which is often an order of magnitude more difficult (It kind of reminds me of that scene in Indiana Jones where he takes an artifact, swapping it with a bag of sand as quickly as possible to try to avoid setting off traps. Pillaging of historic artifacts is way easier when it's not dangerous, not speaking from personal experience). When your application runs on High Replication, the call is easy: there's no downtime required in 99% of cases, so we perform the maintenance right away because if it doesn't address the issue, there's no serving downtime for users. If you're not subscribed to downtime-notify, I recommend that you do so. Announcements like this will NOT and never will be moving to StackOverflow: https://groups.google.com/forum/?fromgroups#!forum/google-appengine-downtime-notify We may be announcing a maintenance in the very near future that will impact the serving of master/slave applications. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 9:46 AM, Riley rileyl...@gmail.com wrote: Ikai, it sounds like support for HR apps is being prioritized. Is that the case? Should we expect that to be the case in the future? Sorry if that's documented somewhere already~ Riley On Monday, March 12, 2012 11:44:59 AM UTC-5, Riley wrote: Our appid is activegrade, we use the m/s datastore, and get from 0-10 QPS throughout the day. Normally we have 1-4 instances running, but since this seems *mostly* related to startup, we dedicated 10 idle resident instances to run all the time. This covers us a little, but still, when a user triggers a new instance, they get the 60+ second wait and then an error. Ugh! Our costs are relatively minor - about $20 a day now that we are running these 10 ordinarily unnecessary instances - but this is a big cost for us, and embarrassing too. On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/VzKRK5UG96MJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Regarding that maintenance period: https://groups.google.com/forum/?fromgroups#!topic/google-appengine-downtime-notify/CO_x02OF9Ak It's happening next Monday, March 19th at 4pm US/Pacific (19th March, 23:00 GMT). -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 11:52 AM, Ikai Lan (Google) ika...@google.comwrote: Hi Riley, That's a legitimate question, and one that we haven't officially answered yet. It's certainly the direction that things have been moving simply due to the nature of production management. Given that the SLA applies to HRD and not master/slave applications, you are definitely going to get a better quality of service migrating to HRD. In fact, I strongly advise that you do so. One challenge that we have when dealing with issues is to decide whether we should do emergency maintenance that requires downtime. With any production system, it's not always guaranteed that maintenance will result in issues being completely resolved, which would be really bad for app developers. At what threshold do we determine that a downtime with no guarantee of addressing the issues is worthwhile? Global 0.1% error rate? 1%? The call is not always clear cut because those errors may not be evenly distributed, and the impact may be huge, or it may be small. With master/slave applications, we do what we can to address the short term symptoms as well as the underlying system issues without impacting serving, which is often an order of magnitude more difficult (It kind of reminds me of that scene in Indiana Jones where he takes an artifact, swapping it with a bag of sand as quickly as possible to try to avoid setting off traps. Pillaging of historic artifacts is way easier when it's not dangerous, not speaking from personal experience). When your application runs on High Replication, the call is easy: there's no downtime required in 99% of cases, so we perform the maintenance right away because if it doesn't address the issue, there's no serving downtime for users. If you're not subscribed to downtime-notify, I recommend that you do so. Announcements like this will NOT and never will be moving to StackOverflow: https://groups.google.com/forum/?fromgroups#!forum/google-appengine-downtime-notify We may be announcing a maintenance in the very near future that will impact the serving of master/slave applications. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 9:46 AM, Riley rileyl...@gmail.com wrote: Ikai, it sounds like support for HR apps is being prioritized. Is that the case? Should we expect that to be the case in the future? Sorry if that's documented somewhere already~ Riley On Monday, March 12, 2012 11:44:59 AM UTC-5, Riley wrote: Our appid is activegrade, we use the m/s datastore, and get from 0-10 QPS throughout the day. Normally we have 1-4 instances running, but since this seems *mostly* related to startup, we dedicated 10 idle resident instances to run all the time. This covers us a little, but still, when a user triggers a new instance, they get the 60+ second wait and then an error. Ugh! Our costs are relatively minor - about $20 a day now that we are running these 10 ordinarily unnecessary instances - but this is a big cost for us, and embarrassing too. On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/VzKRK5UG96MJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Thanks a lot. FYI: That post says Wednesday the 19th instead of Monday the 19th. Riley On Mon, Mar 12, 2012 at 3:52 PM, Ikai Lan (Google) ika...@google.comwrote: Regarding that maintenance period: https://groups.google.com/forum/?fromgroups#!topic/google-appengine-downtime-notify/CO_x02OF9Ak It's happening next Monday, March 19th at 4pm US/Pacific (19th March, 23:00 GMT). -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 11:52 AM, Ikai Lan (Google) ika...@google.comwrote: Hi Riley, That's a legitimate question, and one that we haven't officially answered yet. It's certainly the direction that things have been moving simply due to the nature of production management. Given that the SLA applies to HRD and not master/slave applications, you are definitely going to get a better quality of service migrating to HRD. In fact, I strongly advise that you do so. One challenge that we have when dealing with issues is to decide whether we should do emergency maintenance that requires downtime. With any production system, it's not always guaranteed that maintenance will result in issues being completely resolved, which would be really bad for app developers. At what threshold do we determine that a downtime with no guarantee of addressing the issues is worthwhile? Global 0.1% error rate? 1%? The call is not always clear cut because those errors may not be evenly distributed, and the impact may be huge, or it may be small. With master/slave applications, we do what we can to address the short term symptoms as well as the underlying system issues without impacting serving, which is often an order of magnitude more difficult (It kind of reminds me of that scene in Indiana Jones where he takes an artifact, swapping it with a bag of sand as quickly as possible to try to avoid setting off traps. Pillaging of historic artifacts is way easier when it's not dangerous, not speaking from personal experience). When your application runs on High Replication, the call is easy: there's no downtime required in 99% of cases, so we perform the maintenance right away because if it doesn't address the issue, there's no serving downtime for users. If you're not subscribed to downtime-notify, I recommend that you do so. Announcements like this will NOT and never will be moving to StackOverflow: https://groups.google.com/forum/?fromgroups#!forum/google-appengine-downtime-notify We may be announcing a maintenance in the very near future that will impact the serving of master/slave applications. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 9:46 AM, Riley rileyl...@gmail.com wrote: Ikai, it sounds like support for HR apps is being prioritized. Is that the case? Should we expect that to be the case in the future? Sorry if that's documented somewhere already~ Riley On Monday, March 12, 2012 11:44:59 AM UTC-5, Riley wrote: Our appid is activegrade, we use the m/s datastore, and get from 0-10 QPS throughout the day. Normally we have 1-4 instances running, but since this seems *mostly* related to startup, we dedicated 10 idle resident instances to run all the time. This covers us a little, but still, when a user triggers a new instance, they get the 60+ second wait and then an error. Ugh! Our costs are relatively minor - about $20 a day now that we are running these 10 ordinarily unnecessary instances - but this is a big cost for us, and embarrassing too. On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/VzKRK5UG96MJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to
Re: [google-appengine] Re: Outages?
GAH, it's like no matter how many times I read these things over I always make at least one mistake. And that's why code review is a Good Thing. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 1:54 PM, Riley Eynon-Lynch rileyl...@gmail.comwrote: Thanks a lot. FYI: That post says Wednesday the 19th instead of Monday the 19th. Riley On Mon, Mar 12, 2012 at 3:52 PM, Ikai Lan (Google) ika...@google.comwrote: Regarding that maintenance period: https://groups.google.com/forum/?fromgroups#!topic/google-appengine-downtime-notify/CO_x02OF9Ak It's happening next Monday, March 19th at 4pm US/Pacific (19th March, 23:00 GMT). -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 11:52 AM, Ikai Lan (Google) ika...@google.comwrote: Hi Riley, That's a legitimate question, and one that we haven't officially answered yet. It's certainly the direction that things have been moving simply due to the nature of production management. Given that the SLA applies to HRD and not master/slave applications, you are definitely going to get a better quality of service migrating to HRD. In fact, I strongly advise that you do so. One challenge that we have when dealing with issues is to decide whether we should do emergency maintenance that requires downtime. With any production system, it's not always guaranteed that maintenance will result in issues being completely resolved, which would be really bad for app developers. At what threshold do we determine that a downtime with no guarantee of addressing the issues is worthwhile? Global 0.1% error rate? 1%? The call is not always clear cut because those errors may not be evenly distributed, and the impact may be huge, or it may be small. With master/slave applications, we do what we can to address the short term symptoms as well as the underlying system issues without impacting serving, which is often an order of magnitude more difficult (It kind of reminds me of that scene in Indiana Jones where he takes an artifact, swapping it with a bag of sand as quickly as possible to try to avoid setting off traps. Pillaging of historic artifacts is way easier when it's not dangerous, not speaking from personal experience). When your application runs on High Replication, the call is easy: there's no downtime required in 99% of cases, so we perform the maintenance right away because if it doesn't address the issue, there's no serving downtime for users. If you're not subscribed to downtime-notify, I recommend that you do so. Announcements like this will NOT and never will be moving to StackOverflow: https://groups.google.com/forum/?fromgroups#!forum/google-appengine-downtime-notify We may be announcing a maintenance in the very near future that will impact the serving of master/slave applications. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 9:46 AM, Riley rileyl...@gmail.com wrote: Ikai, it sounds like support for HR apps is being prioritized. Is that the case? Should we expect that to be the case in the future? Sorry if that's documented somewhere already~ Riley On Monday, March 12, 2012 11:44:59 AM UTC-5, Riley wrote: Our appid is activegrade, we use the m/s datastore, and get from 0-10 QPS throughout the day. Normally we have 1-4 instances running, but since this seems *mostly* related to startup, we dedicated 10 idle resident instances to run all the time. This covers us a little, but still, when a user triggers a new instance, they get the 60+ second wait and then an error. Ugh! Our costs are relatively minor - about $20 a day now that we are running these 10 ordinarily unnecessary instances - but this is a big cost for us, and embarrassing too. On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/VzKRK5UG96MJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at
Re: [google-appengine] Re: Outages?
Hi Ikan, We wouldn't mind moving to HRD from M/S, but isn't it 3X more expensive? Also, what's the minimal way to impact our users when datastore is in read-only mode during downtimes? Consider that every action our users take involves writing to a datastore. Will using memcache help? Will memcache be available without interruption during datastore downtime? thanks! Amit -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/MooASjwFQ28J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
RE: [google-appengine] Re: Outages?
We wouldn't mind moving to HRD from M/S, but isn't it 3X more expensive? No. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Moving to HRD is the safest way to ensure that your users are not impacted during a downtime. Memcache and other mechanisms can be used, but will definitely not scale and aren't guaranteed to be resilient in the face of all downtime scenarios. For the particular issues reported on this thread, we have a few root causes that we're looking into. In terms of a fix though, all of the affected apps are running on M/S and, as a result, our options are much more constrained -- we're not able to move the as apps freely as we can with HRD-based applications. M/S worked really well when it was first rolled out, but given the increase in number of apps and datastore transactions we needed an even better solution -- thus HRD. While the pros and cons of HRD have been discussed and debated within this group, the simple fact is: if you want to minimize your exposure to downtimes you need to move over to HRD. There's an SLA of 99.95%, which we've consistently beathttp://googleappengine.blogspot.com/2012/01/happy-birthday-high-replication.htmlmonth over month. We're committed to resolving the current issue, but I strongly urge anyone running on M/S to make the move over to HRD. It's the quickest and most long-term fix that you can make. -- Chris Ramsdale Product Manager, Google App Engine On Mon, Mar 12, 2012 at 2:33 PM, Amit Sangani amit.sang...@gmail.comwrote: Hi Ikan, We wouldn't mind moving to HRD from M/S, but isn't it 3X more expensive? Also, what's the minimal way to impact our users when datastore is in read-only mode during downtimes? Consider that every action our users take involves writing to a datastore. Will using memcache help? Will memcache be available without interruption during datastore downtime? thanks! Amit -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/MooASjwFQ28J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
HRD is not 3x more expensive. We lowered the cost to make it match the master/slave cost. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 2:33 PM, Amit Sangani amit.sang...@gmail.comwrote: Hi Ikan, We wouldn't mind moving to HRD from M/S, but isn't it 3X more expensive? Also, what's the minimal way to impact our users when datastore is in read-only mode during downtimes? Consider that every action our users take involves writing to a datastore. Will using memcache help? Will memcache be available without interruption during datastore downtime? thanks! Amit -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/MooASjwFQ28J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Quick update: the time has been pushed back 2 hours to 6PM US/Pacific. See the latest message here: https://groups.google.com/forum/?fromgroups#!topic/google-appengine-downtime-notify/CO_x02OF9Ak -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 1:59 PM, Ikai Lan (Google) ika...@google.comwrote: GAH, it's like no matter how many times I read these things over I always make at least one mistake. And that's why code review is a Good Thing. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 1:54 PM, Riley Eynon-Lynch rileyl...@gmail.comwrote: Thanks a lot. FYI: That post says Wednesday the 19th instead of Monday the 19th. Riley On Mon, Mar 12, 2012 at 3:52 PM, Ikai Lan (Google) ika...@google.comwrote: Regarding that maintenance period: https://groups.google.com/forum/?fromgroups#!topic/google-appengine-downtime-notify/CO_x02OF9Ak It's happening next Monday, March 19th at 4pm US/Pacific (19th March, 23:00 GMT). -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 11:52 AM, Ikai Lan (Google) ika...@google.comwrote: Hi Riley, That's a legitimate question, and one that we haven't officially answered yet. It's certainly the direction that things have been moving simply due to the nature of production management. Given that the SLA applies to HRD and not master/slave applications, you are definitely going to get a better quality of service migrating to HRD. In fact, I strongly advise that you do so. One challenge that we have when dealing with issues is to decide whether we should do emergency maintenance that requires downtime. With any production system, it's not always guaranteed that maintenance will result in issues being completely resolved, which would be really bad for app developers. At what threshold do we determine that a downtime with no guarantee of addressing the issues is worthwhile? Global 0.1% error rate? 1%? The call is not always clear cut because those errors may not be evenly distributed, and the impact may be huge, or it may be small. With master/slave applications, we do what we can to address the short term symptoms as well as the underlying system issues without impacting serving, which is often an order of magnitude more difficult (It kind of reminds me of that scene in Indiana Jones where he takes an artifact, swapping it with a bag of sand as quickly as possible to try to avoid setting off traps. Pillaging of historic artifacts is way easier when it's not dangerous, not speaking from personal experience). When your application runs on High Replication, the call is easy: there's no downtime required in 99% of cases, so we perform the maintenance right away because if it doesn't address the issue, there's no serving downtime for users. If you're not subscribed to downtime-notify, I recommend that you do so. Announcements like this will NOT and never will be moving to StackOverflow: https://groups.google.com/forum/?fromgroups#!forum/google-appengine-downtime-notify We may be announcing a maintenance in the very near future that will impact the serving of master/slave applications. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Mon, Mar 12, 2012 at 9:46 AM, Riley rileyl...@gmail.com wrote: Ikai, it sounds like support for HR apps is being prioritized. Is that the case? Should we expect that to be the case in the future? Sorry if that's documented somewhere already~ Riley On Monday, March 12, 2012 11:44:59 AM UTC-5, Riley wrote: Our appid is activegrade, we use the m/s datastore, and get from 0-10 QPS throughout the day. Normally we have 1-4 instances running, but since this seems *mostly* related to startup, we dedicated 10 idle resident instances to run all the time. This covers us a little, but still, when a user triggers a new instance, they get the 60+ second wait and then an error. Ugh! Our costs are relatively minor - about $20 a day now that we are running these 10 ordinarily unnecessary instances - but this is a big cost for us, and embarrassing too. On Tuesday, March 6, 2012 3:17:37 PM UTC-6, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/VzKRK5UG96MJ. To post to this group, send email to google-appengine@googlegroups.com . To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at
Re: [google-appengine] Re: Outages?
Hi Chris, On Monday, March 12, 2012 3:55:00 PM UTC-7, Chris Ramsdale wrote: For the particular issues reported on this thread, we have a few root causes that we're looking into. In terms of a fix though, all of the affected apps are running on M/S Our app prodicta is using HRD, not M/S. We have been seeing a big degradation in performance the last few days. I just ran a test and my client code timed out because it took 47 seconds to load a 200KB static JAR file. It took 34 seconds to load a small PNG file. Customers are complaining. I starred this issue: http://code.google.com/p/googleappengine/issues/detail?id=7093 Is this problem being considered a high priority? The status chart does not seem to reflect the problems being reported. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/tYTjYwmVIZoJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
RE: [google-appengine] Re: Outages?
Should update the message when you start a new app that says it costs 3x as much. From: google-appengine@googlegroups.com [mailto:google-appengine@googlegroups.com] On Behalf Of Ikai Lan (Google) Sent: Monday, March 12, 2012 4:09 PM To: google-appengine@googlegroups.com Subject: Re: [google-appengine] Re: Outages? HRD is not 3x more expensive. We lowered the cost to make it match the master/slave cost. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com http://plus.ikailan.com/ On Mon, Mar 12, 2012 at 2:33 PM, Amit Sangani amit.sang...@gmail.com wrote: Hi Ikan, We wouldn't mind moving to HRD from M/S, but isn't it 3X more expensive? Also, what's the minimal way to impact our users when datastore is in read-only mode during downtimes? Consider that every action our users take involves writing to a datastore. Will using memcache help? Will memcache be available without interruption during datastore downtime? thanks! Amit -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/MooASjwFQ28J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com mailto:google-appengine%2bunsubscr...@googlegroups.com . For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Ikai, Our apps ids: rvaserver rvauser contentfinancial contentsports QPS and error rates differ but they've all been getting a lot of DeadlineExceeded exceptions and the number of instances has been higher than usual over the last couple of days. Regards, Alexey On Friday, March 9, 2012 2:15:35 PM UTC-5, Ikai Lan wrote: Hey everyone, Here are a few things that will help: 1. Application IDs (--- if you have nothing else, at least provide this) 2. What is your QPS? 3. What % of your requests are errors? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 7:24 AM, Ronoaldo José de Lana Pereira rpere...@beneficiofacil.com.br wrote: +1 for seeing the same problems on my app. It started to be worse after maintenance on March 7. Em sexta-feira, 9 de março de 2012 08h33min36s UTC-3, Nikolai escreveu: +1 we had to move to our backup systems. Everything is full of 500 errors or hardcore latency. Most of the 500 errors we see aren't even logged so this seems to be a goole problem one abstraction layer above the app. And yes - sometimes we have got the same feeling, that we are the only ones that use appengine in a production setting. You are not alone ;) regards, nikolai Am Dienstag, 6. März 2012 22:17:37 UTC+1 schrieb Adam Sherman: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/yixu1yAlMs4J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/Qpy-ppdrhrAJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
appid: textyserver Still getting lots of exceptions, mainly: 1) com.google.apphosting.runtime.HardDeadlineExceededError exceptions, 2) Failed startup of context com.google.apphosting.utils.jetty.RuntimeAppEngineWebAppContext 3) javax.jdo.JDOException: Transaction failed to commit at org.datanucleus.jdo.NucleusJDOHelper.getJDOExceptionForNucleusException(NucleusJDOHelper.java:419) at org.datanucleus.jdo.JDOPersistenceManager.close(JDOPersistenceManager.java:281) Status page is saying everything is normal - http://code.google.com/status/appengine which does not seem true. Please let us know if you need more information. thanks! Amit -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/8NS6YYPFVtkJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Best explanation ever. On Wednesday, March 7, 2012 9:45:11 PM UTC+5:30, Brandon Wirtz wrote: So, apparently, we all imagined the problem. The status page no longer admits to anything. In most systems the Uptime is 100% minus the summation of the downtime of all other systems. The exception to this rule is logging. When Logging fails to record the downtime, Uptime goes up. As a result Google has been working hard to build a logging system that goes down just ahead of all other systems, and comes up shortly after. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/LeAHoX7YXR4J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Hey everyone, Here are a few things that will help: 1. Application IDs (--- if you have nothing else, at least provide this) 2. What is your QPS? 3. What % of your requests are errors? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 7:24 AM, Ronoaldo José de Lana Pereira rpere...@beneficiofacil.com.br wrote: +1 for seeing the same problems on my app. It started to be worse after maintenance on March 7. Em sexta-feira, 9 de março de 2012 08h33min36s UTC-3, Nikolai escreveu: +1 we had to move to our backup systems. Everything is full of 500 errors or hardcore latency. Most of the 500 errors we see aren't even logged so this seems to be a goole problem one abstraction layer above the app. And yes - sometimes we have got the same feeling, that we are the only ones that use appengine in a production setting. You are not alone ;) regards, nikolai Am Dienstag, 6. März 2012 22:17:37 UTC+1 schrieb Adam Sherman: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/yixu1yAlMs4J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Hey Ikai, Our app id: petaclasses QPS: 5-20 requests per second Current instances in dashboard: 110 - 160 Usual instances: 8-15 It's hard to say % of failed requests as we have also request that fail for other reasons (e.g. non existing pages, etc) and not sure how easily separate them. By the way, are you guys considering to create a page where we can post/report this data in some structured way and join an issue so you can accumulate and understand the scale of an issue easily. Alex On Friday, 9 March 2012 15:15:35 UTC-4, Ikai Lan wrote: Hey everyone, Here are a few things that will help: 1. Application IDs (--- if you have nothing else, at least provide this) 2. What is your QPS? 3. What % of your requests are errors? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 7:24 AM, Ronoaldo José de Lana Pereira rpere...@beneficiofacil.com.br wrote: +1 for seeing the same problems on my app. It started to be worse after maintenance on March 7. Em sexta-feira, 9 de março de 2012 08h33min36s UTC-3, Nikolai escreveu: +1 we had to move to our backup systems. Everything is full of 500 errors or hardcore latency. Most of the 500 errors we see aren't even logged so this seems to be a goole problem one abstraction layer above the app. And yes - sometimes we have got the same feeling, that we are the only ones that use appengine in a production setting. You are not alone ;) regards, nikolai Am Dienstag, 6. März 2012 22:17:37 UTC+1 schrieb Adam Sherman: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/yixu1yAlMs4J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/ErrbHpuYmWgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Alex, to answer that question: yes. We are looking to revamp the production issues tracker which is far from optimal. When users can join or aggregate issues, it allows us to quickly separate actual infrastructure hiccups from user code issues. Thanks for the info! Is there any other behavior you can report? Does it sound reasonable that you have 110-160 instances because of long startup teams leading to more instances required to serve the same load? Are you Python, Java or Go, and do you have concurrent requests enabled? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 11:48 AM, Alexander Trakhimenok alexander.trakhime...@gmail.com wrote: Hey Ikai, Our app id: petaclasses QPS: 5-20 requests per second Current instances in dashboard: 110 - 160 Usual instances: 8-15 It's hard to say % of failed requests as we have also request that fail for other reasons (e.g. non existing pages, etc) and not sure how easily separate them. By the way, are you guys considering to create a page where we can post/report this data in some structured way and join an issue so you can accumulate and understand the scale of an issue easily. Alex On Friday, 9 March 2012 15:15:35 UTC-4, Ikai Lan wrote: Hey everyone, Here are a few things that will help: 1. Application IDs (--- if you have nothing else, at least provide this) 2. What is your QPS? 3. What % of your requests are errors? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 7:24 AM, Ronoaldo José de Lana Pereira rpere...@beneficiofacil.com.**br rpere...@beneficiofacil.com.br wrote: +1 for seeing the same problems on my app. It started to be worse after maintenance on March 7. Em sexta-feira, 9 de março de 2012 08h33min36s UTC-3, Nikolai escreveu: +1 we had to move to our backup systems. Everything is full of 500 errors or hardcore latency. Most of the 500 errors we see aren't even logged so this seems to be a goole problem one abstraction layer above the app. And yes - sometimes we have got the same feeling, that we are the only ones that use appengine in a production setting. You are not alone ;) regards, nikolai Am Dienstag, 6. März 2012 22:17:37 UTC+1 schrieb Adam Sherman: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/** msg/google-appengine/-/**yixu1yAlMs4Jhttps://groups.google.com/d/msg/google-appengine/-/yixu1yAlMs4J . To post to this group, send email to google-appengine@googlegroups.**comgoogle-appengine@googlegroups.com . To unsubscribe from this group, send email to google-appengine+unsubscribe@**googlegroups.comgoogle-appengine%2bunsubscr...@googlegroups.com . For more options, visit this group at http://groups.google.com/** group/google-appengine?hl=enhttp://groups.google.com/group/google-appengine?hl=en . -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/ErrbHpuYmWgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
We are Python 2.5 (no concurrent). Yes, it seems the start-up time is just crazy high for at least some or all instances. I also noticed that there are lot's of instances that served just 1 request and have average latency 0ms and have QPS=0 average instance age about 8-9 minutes (up to 11 minutes). For me it seems like an instance is created to serve static content and not used anymore and stays here until it die in a while. At the moment we have 264 active instances and it's killing our budget :( - see the screenshot attached. We had 2 hours downtime due to exceeded budget. Alex On Friday, 9 March 2012 15:57:27 UTC-4, Ikai Lan wrote: Alex, to answer that question: yes. We are looking to revamp the production issues tracker which is far from optimal. When users can join or aggregate issues, it allows us to quickly separate actual infrastructure hiccups from user code issues. Thanks for the info! Is there any other behavior you can report? Does it sound reasonable that you have 110-160 instances because of long startup teams leading to more instances required to serve the same load? Are you Python, Java or Go, and do you have concurrent requests enabled? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 11:48 AM, Alexander Trakhimenok alexander.trakhime...@gmail.com wrote: Hey Ikai, Our app id: petaclasses QPS: 5-20 requests per second Current instances in dashboard: 110 - 160 Usual instances: 8-15 It's hard to say % of failed requests as we have also request that fail for other reasons (e.g. non existing pages, etc) and not sure how easily separate them. By the way, are you guys considering to create a page where we can post/report this data in some structured way and join an issue so you can accumulate and understand the scale of an issue easily. Alex On Friday, 9 March 2012 15:15:35 UTC-4, Ikai Lan wrote: Hey everyone, Here are a few things that will help: 1. Application IDs (--- if you have nothing else, at least provide this) 2. What is your QPS? 3. What % of your requests are errors? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 7:24 AM, Ronoaldo José de Lana Pereira rpere...@beneficiofacil.com.**br rpere...@beneficiofacil.com.brwrote: +1 for seeing the same problems on my app. It started to be worse after maintenance on March 7. Em sexta-feira, 9 de março de 2012 08h33min36s UTC-3, Nikolai escreveu: +1 we had to move to our backup systems. Everything is full of 500 errors or hardcore latency. Most of the 500 errors we see aren't even logged so this seems to be a goole problem one abstraction layer above the app. And yes - sometimes we have got the same feeling, that we are the only ones that use appengine in a production setting. You are not alone ;) regards, nikolai Am Dienstag, 6. März 2012 22:17:37 UTC+1 schrieb Adam Sherman: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/** msg/google-appengine/-/**yixu1yAlMs4Jhttps://groups.google.com/d/msg/google-appengine/-/yixu1yAlMs4J . To post to this group, send email to google-appengine@googlegroups.** com google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscribe@**googlegroups.comgoogle-appengine%2bunsubscr...@googlegroups.com . For more options, visit this group at http://groups.google.com/** group/google-appengine?hl=enhttp://groups.google.com/group/google-appengine?hl=en . -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/ErrbHpuYmWgJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/DAWn9eldNmEJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. attachment: Screen Shot 2012-03-09 at 16.12.27.png
Re: [google-appengine] Re: Outages?
Just a follow up: 1. Application Id: oferta-unica 2. QPS: Currently around ~10 dynamic req/sec, overall ~32 req/sec 3. After disabling concurrent requests, ~0.6 errors/sec; before, ~1.5 errors/sec. Like Alexanders said, some of the errors aren't due to this issue, but I can confirm that we have lots of 500 user-facing errors because our custom 500 error page sends events in Google Analytics: https://lh5.googleusercontent.com/-UWJyZCb-zjE/T1pmEnDI3ZI/AC8/_m27Og4FPEg/s1600/appengine-error-rate.png Thanks for your help. Em sexta-feira, 9 de março de 2012 16h15min35s UTC-3, Ikai Lan escreveu: Hey everyone, Here are a few things that will help: 1. Application IDs (--- if you have nothing else, at least provide this) 2. What is your QPS? 3. What % of your requests are errors? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 7:24 AM, Ronoaldo José de Lana Pereira rpere...@beneficiofacil.com.br wrote: +1 for seeing the same problems on my app. It started to be worse after maintenance on March 7. Em sexta-feira, 9 de março de 2012 08h33min36s UTC-3, Nikolai escreveu: +1 we had to move to our backup systems. Everything is full of 500 errors or hardcore latency. Most of the 500 errors we see aren't even logged so this seems to be a goole problem one abstraction layer above the app. And yes - sometimes we have got the same feeling, that we are the only ones that use appengine in a production setting. You are not alone ;) regards, nikolai Am Dienstag, 6. März 2012 22:17:37 UTC+1 schrieb Adam Sherman: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/yixu1yAlMs4J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/X341mSA2KIcJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
appid: i-strive-to java - thread safe set to true. On Friday, March 9, 2012 2:15:35 PM UTC-5, Ikai Lan wrote: Hey everyone, Here are a few things that will help: 1. Application IDs (--- if you have nothing else, at least provide this) 2. What is your QPS? 3. What % of your requests are errors? -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 7:24 AM, Ronoaldo José de Lana Pereira rpere...@beneficiofacil.com.br wrote: +1 for seeing the same problems on my app. It started to be worse after maintenance on March 7. Em sexta-feira, 9 de março de 2012 08h33min36s UTC-3, Nikolai escreveu: +1 we had to move to our backup systems. Everything is full of 500 errors or hardcore latency. Most of the 500 errors we see aren't even logged so this seems to be a goole problem one abstraction layer above the app. And yes - sometimes we have got the same feeling, that we are the only ones that use appengine in a production setting. You are not alone ;) regards, nikolai Am Dienstag, 6. März 2012 22:17:37 UTC+1 schrieb Adam Sherman: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/yixu1yAlMs4J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/BIhSzW__bVUJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Also now getting below exceptions - java.lang.ExceptionInInitializerError at org.datanucleus.jdo.metadata.JDOAnnotationReader.processClassAnnotations(JDOAnnotationReader.java:140) at org.datanucleus.metadata.annotations.AbstractAnnotationReader.getMetaDataForClass(AbstractAnnotationReader.java:122) at org.datanucleus.metadata.annotations.AnnotationManagerImpl.getMetaDataForClass(AnnotationManagerImpl.java:136) at org.datanucleus.metadata.MetaDataManager.loadAnnotationsForClass(MetaDataManager.java:2278) at org.datanucleus.jdo.metadata.JDOMetaDataManager.getMetaDataForClassInternal(JDOMetaDataManager.java:369) at org.datanucleus.metadata.MetaDataManager.getMetaDataForClass(MetaDataManager.java:1125) at org.datanucleus.store.appengine.EntityUtils.idToInternalKey(EntityUtils.java:122) at org.datanucleus.store.appengine.jdo.DatastoreJDOPersistenceManager.getObjectById(DatastoreJDOPersistenceManager.java:63) com.google.apphosting.runtime.HardDeadlineExceededError: This request (a9a7135b6a5f023e) started at 2012/03/09 21:37:22.971 UTC and was still executing at 2012/03/09 21:38:22.911 UTC. at org.datanucleus.store.appengine.DatastoreManager.getDatastoreClass(DatastoreManager.java:765) at org.datanucleus.store.appengine.DatastoreManager.getDatastoreClass(DatastoreManager.java:88) at org.datanucleus.store.appengine.EntityUtils.determineKind(EntityUtils.java:98) at org.datanucleus.store.appengine.EntityUtils.determineKind(EntityUtils.java:94) at org.datanucleus.store.appengine.EntityUtils.idToInternalKey(EntityUtils.java:124) com.google.apphosting.api.DeadlineExceededException: This request (34c3e1d5cfc6d211) started at 2012/03/09 21:36:34.867 UTC and was still executing at 2012/03/09 21:37:34.367 UTC. at com.google.appengine.runtime.Request.process-34c3e1d5cfc6d211(Request.java) at java.util.zip.ZipFile.read(Native Method) at java.util.zip.ZipFile.access$1200(ZipFile.java:57) at java.util.zip.ZipFile$ZipFileInputStream.read(ZipFile.java:476) at java.util.zip.ZipFile$1.fill(ZipFile.java:259) at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:158) at java.io.FilterInputStream.read(FilterInputStream.java:133) at java.io.FilterInputStream.read(FilterInputStream.java:107) -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
I forgot to ask if these were master/slave or high replication apps. I can always check by going to the admin console, but I'm hoping to separate them out. We're looking into the HR apps first (one I figure out which is which). -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 1:53 PM, Amit Sangani amit.sang...@gmail.com wrote: Also now getting below exceptions - java.lang.ExceptionInInitializerError at org.datanucleus.jdo.metadata.JDOAnnotationReader.processClassAnnotations(JDOAnnotationReader.java:140) at org.datanucleus.metadata.annotations.AbstractAnnotationReader.getMetaDataForClass(AbstractAnnotationReader.java:122) at org.datanucleus.metadata.annotations.AnnotationManagerImpl.getMetaDataForClass(AnnotationManagerImpl.java:136) at org.datanucleus.metadata.MetaDataManager.loadAnnotationsForClass(MetaDataManager.java:2278) at org.datanucleus.jdo.metadata.JDOMetaDataManager.getMetaDataForClassInternal(JDOMetaDataManager.java:369) at org.datanucleus.metadata.MetaDataManager.getMetaDataForClass(MetaDataManager.java:1125) at org.datanucleus.store.appengine.EntityUtils.idToInternalKey(EntityUtils.java:122) at org.datanucleus.store.appengine.jdo.DatastoreJDOPersistenceManager.getObjectById(DatastoreJDOPersistenceManager.java:63) com.google.apphosting.runtime.HardDeadlineExceededError: This request (a9a7135b6a5f023e) started at 2012/03/09 21:37:22.971 UTC and was still executing at 2012/03/09 21:38:22.911 UTC. at org.datanucleus.store.appengine.DatastoreManager.getDatastoreClass(DatastoreManager.java:765) at org.datanucleus.store.appengine.DatastoreManager.getDatastoreClass(DatastoreManager.java:88) at org.datanucleus.store.appengine.EntityUtils.determineKind(EntityUtils.java:98) at org.datanucleus.store.appengine.EntityUtils.determineKind(EntityUtils.java:94) at org.datanucleus.store.appengine.EntityUtils.idToInternalKey(EntityUtils.java:124) com.google.apphosting.api.DeadlineExceededException: This request (34c3e1d5cfc6d211) started at 2012/03/09 21:36:34.867 UTC and was still executing at 2012/03/09 21:37:34.367 UTC. at com.google.appengine.runtime.Request.process-34c3e1d5cfc6d211(Request.java) at java.util.zip.ZipFile.read(Native Method) at java.util.zip.ZipFile.access$1200(ZipFile.java:57) at java.util.zip.ZipFile$ZipFileInputStream.read(ZipFile.java:476) at java.util.zip.ZipFile$1.fill(ZipFile.java:259) at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:158) at java.io.FilterInputStream.read(FilterInputStream.java:133) at java.io.FilterInputStream.read(FilterInputStream.java:107) -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
textyserver is on master/slave. On Fri, Mar 9, 2012 at 2:07 PM, Ikai Lan (Google) ika...@google.com wrote: I forgot to ask if these were master/slave or high replication apps. I can always check by going to the admin console, but I'm hoping to separate them out. We're looking into the HR apps first (one I figure out which is which). -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 1:53 PM, Amit Sangani amit.sang...@gmail.comwrote: Also now getting below exceptions - java.lang.ExceptionInInitializerError at org.datanucleus.jdo.metadata.JDOAnnotationReader.processClassAnnotations(JDOAnnotationReader.java:140) at org.datanucleus.metadata.annotations.AbstractAnnotationReader.getMetaDataForClass(AbstractAnnotationReader.java:122) at org.datanucleus.metadata.annotations.AnnotationManagerImpl.getMetaDataForClass(AnnotationManagerImpl.java:136) at org.datanucleus.metadata.MetaDataManager.loadAnnotationsForClass(MetaDataManager.java:2278) at org.datanucleus.jdo.metadata.JDOMetaDataManager.getMetaDataForClassInternal(JDOMetaDataManager.java:369) at org.datanucleus.metadata.MetaDataManager.getMetaDataForClass(MetaDataManager.java:1125) at org.datanucleus.store.appengine.EntityUtils.idToInternalKey(EntityUtils.java:122) at org.datanucleus.store.appengine.jdo.DatastoreJDOPersistenceManager.getObjectById(DatastoreJDOPersistenceManager.java:63) com.google.apphosting.runtime.HardDeadlineExceededError: This request (a9a7135b6a5f023e) started at 2012/03/09 21:37:22.971 UTC and was still executing at 2012/03/09 21:38:22.911 UTC. at org.datanucleus.store.appengine.DatastoreManager.getDatastoreClass(DatastoreManager.java:765) at org.datanucleus.store.appengine.DatastoreManager.getDatastoreClass(DatastoreManager.java:88) at org.datanucleus.store.appengine.EntityUtils.determineKind(EntityUtils.java:98) at org.datanucleus.store.appengine.EntityUtils.determineKind(EntityUtils.java:94) at org.datanucleus.store.appengine.EntityUtils.idToInternalKey(EntityUtils.java:124) com.google.apphosting.api.DeadlineExceededException: This request (34c3e1d5cfc6d211) started at 2012/03/09 21:36:34.867 UTC and was still executing at 2012/03/09 21:37:34.367 UTC. at com.google.appengine.runtime.Request.process-34c3e1d5cfc6d211(Request.java) at java.util.zip.ZipFile.read(Native Method) at java.util.zip.ZipFile.access$1200(ZipFile.java:57) at java.util.zip.ZipFile$ZipFileInputStream.read(ZipFile.java:476) at java.util.zip.ZipFile$1.fill(ZipFile.java:259) at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:158) at java.io.FilterInputStream.read(FilterInputStream.java:133) at java.io.FilterInputStream.read(FilterInputStream.java:107) -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Yep, I figured it out (when you look at an app in the admin console, if the app ID has a s~ prefix, that means it runs in High Replication). I was just pointing it out for people who hadn't yet reported application IDs. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 2:12 PM, Amit Sangani amit.sang...@gmail.com wrote: textyserver is on master/slave. On Fri, Mar 9, 2012 at 2:07 PM, Ikai Lan (Google) ika...@google.comwrote: I forgot to ask if these were master/slave or high replication apps. I can always check by going to the admin console, but I'm hoping to separate them out. We're looking into the HR apps first (one I figure out which is which). -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 1:53 PM, Amit Sangani amit.sang...@gmail.comwrote: Also now getting below exceptions - java.lang.ExceptionInInitializerError at org.datanucleus.jdo.metadata.JDOAnnotationReader.processClassAnnotations(JDOAnnotationReader.java:140) at org.datanucleus.metadata.annotations.AbstractAnnotationReader.getMetaDataForClass(AbstractAnnotationReader.java:122) at org.datanucleus.metadata.annotations.AnnotationManagerImpl.getMetaDataForClass(AnnotationManagerImpl.java:136) at org.datanucleus.metadata.MetaDataManager.loadAnnotationsForClass(MetaDataManager.java:2278) at org.datanucleus.jdo.metadata.JDOMetaDataManager.getMetaDataForClassInternal(JDOMetaDataManager.java:369) at org.datanucleus.metadata.MetaDataManager.getMetaDataForClass(MetaDataManager.java:1125) at org.datanucleus.store.appengine.EntityUtils.idToInternalKey(EntityUtils.java:122) at org.datanucleus.store.appengine.jdo.DatastoreJDOPersistenceManager.getObjectById(DatastoreJDOPersistenceManager.java:63) com.google.apphosting.runtime.HardDeadlineExceededError: This request (a9a7135b6a5f023e) started at 2012/03/09 21:37:22.971 UTC and was still executing at 2012/03/09 21:38:22.911 UTC. at org.datanucleus.store.appengine.DatastoreManager.getDatastoreClass(DatastoreManager.java:765) at org.datanucleus.store.appengine.DatastoreManager.getDatastoreClass(DatastoreManager.java:88) at org.datanucleus.store.appengine.EntityUtils.determineKind(EntityUtils.java:98) at org.datanucleus.store.appengine.EntityUtils.determineKind(EntityUtils.java:94) at org.datanucleus.store.appengine.EntityUtils.idToInternalKey(EntityUtils.java:124) com.google.apphosting.api.DeadlineExceededException: This request (34c3e1d5cfc6d211) started at 2012/03/09 21:36:34.867 UTC and was still executing at 2012/03/09 21:37:34.367 UTC. at com.google.appengine.runtime.Request.process-34c3e1d5cfc6d211(Request.java) at java.util.zip.ZipFile.read(Native Method) at java.util.zip.ZipFile.access$1200(ZipFile.java:57) at java.util.zip.ZipFile$ZipFileInputStream.read(ZipFile.java:476) at java.util.zip.ZipFile$1.fill(ZipFile.java:259) at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:158) at java.io.FilterInputStream.read(FilterInputStream.java:133) at java.io.FilterInputStream.read(FilterInputStream.java:107) -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Okay, at least with this thread, it seems like the common thread is that the applications are master/slave applications. We're going to try to tweak a few things on our end to lessen the pain, but pay attention to the downtime-notify@ list ( https://groups.google.com/forum/?fromgroups#!forum/google-appengine-downtime-notify). We may announce another maintenance soon as a slightly longer-term fix (while the real long term fix is on its way). I'm going to switch to take a look at some of the issues with error code 20x - those seem to be impacting the High Replication applications as well. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 2:14 PM, Ikai Lan (Google) ika...@google.com wrote: Yep, I figured it out (when you look at an app in the admin console, if the app ID has a s~ prefix, that means it runs in High Replication). I was just pointing it out for people who hadn't yet reported application IDs. -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 2:12 PM, Amit Sangani amit.sang...@gmail.comwrote: textyserver is on master/slave. On Fri, Mar 9, 2012 at 2:07 PM, Ikai Lan (Google) ika...@google.comwrote: I forgot to ask if these were master/slave or high replication apps. I can always check by going to the admin console, but I'm hoping to separate them out. We're looking into the HR apps first (one I figure out which is which). -- Ikai Lan Developer Programs Engineer, Google App Engine plus.ikailan.com On Fri, Mar 9, 2012 at 1:53 PM, Amit Sangani amit.sang...@gmail.comwrote: Also now getting below exceptions - java.lang.ExceptionInInitializerError at org.datanucleus.jdo.metadata.JDOAnnotationReader.processClassAnnotations(JDOAnnotationReader.java:140) at org.datanucleus.metadata.annotations.AbstractAnnotationReader.getMetaDataForClass(AbstractAnnotationReader.java:122) at org.datanucleus.metadata.annotations.AnnotationManagerImpl.getMetaDataForClass(AnnotationManagerImpl.java:136) at org.datanucleus.metadata.MetaDataManager.loadAnnotationsForClass(MetaDataManager.java:2278) at org.datanucleus.jdo.metadata.JDOMetaDataManager.getMetaDataForClassInternal(JDOMetaDataManager.java:369) at org.datanucleus.metadata.MetaDataManager.getMetaDataForClass(MetaDataManager.java:1125) at org.datanucleus.store.appengine.EntityUtils.idToInternalKey(EntityUtils.java:122) at org.datanucleus.store.appengine.jdo.DatastoreJDOPersistenceManager.getObjectById(DatastoreJDOPersistenceManager.java:63) com.google.apphosting.runtime.HardDeadlineExceededError: This request (a9a7135b6a5f023e) started at 2012/03/09 21:37:22.971 UTC and was still executing at 2012/03/09 21:38:22.911 UTC. at org.datanucleus.store.appengine.DatastoreManager.getDatastoreClass(DatastoreManager.java:765) at org.datanucleus.store.appengine.DatastoreManager.getDatastoreClass(DatastoreManager.java:88) at org.datanucleus.store.appengine.EntityUtils.determineKind(EntityUtils.java:98) at org.datanucleus.store.appengine.EntityUtils.determineKind(EntityUtils.java:94) at org.datanucleus.store.appengine.EntityUtils.idToInternalKey(EntityUtils.java:124) com.google.apphosting.api.DeadlineExceededException: This request (34c3e1d5cfc6d211) started at 2012/03/09 21:36:34.867 UTC and was still executing at 2012/03/09 21:37:34.367 UTC. at com.google.appengine.runtime.Request.process-34c3e1d5cfc6d211(Request.java) at java.util.zip.ZipFile.read(Native Method) at java.util.zip.ZipFile.access$1200(ZipFile.java:57) at java.util.zip.ZipFile$ZipFileInputStream.read(ZipFile.java:476) at java.util.zip.ZipFile$1.fill(ZipFile.java:259) at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:158) at java.io.FilterInputStream.read(FilterInputStream.java:133) at java.io.FilterInputStream.read(FilterInputStream.java:107) -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at
Re: [google-appengine] Re: Outages?
My app is responding, again. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/pJUdXZmD8uwJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Any update on this? Still seeing many errors exceptions in the logs. On Fri, Mar 9, 2012 at 4:18 PM, John jwb...@gmail.com wrote: My app is responding, again. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/pJUdXZmD8uwJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Our apps seem to be better since about 15:30 PT. On Friday, March 9, 2012 5:56:02 PM UTC-8, Amit Sangani wrote: Any update on this? Still seeing many errors exceptions in the logs. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/EN-dWbASmZQJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
Same on my side. Normally our app is booting for 6-7 seconds, now the new instance requests are taking 40-50 seconds, which are causing request to timeout. The status page is not displaying any errors. Any ideas ? On Thursday, March 8, 2012 3:07:58 AM UTC+1, Nick wrote: I'm getting the same errors :( On Wednesday, March 7, 2012 11:18:39 AM UTC-5, Adam Sherman wrote: On Wed, Mar 7, 2012 at 11:15 AM, Brandon Wirtz drak...@digerat.com wrote: In most systems the Uptime is 100% minus the summation of the downtime of all other systems. The exception to this rule is logging. When Logging fails to record the downtime, Uptime goes up. As a result Google has been working hard to build a logging system that goes down just ahead of all other systems, and comes up shortly after. Well said sir! I'm still laughing. A. -- Adam Sherman, CTO Versature Corp. / +1.877.498.3772 x113 Follow us on Twitter - http://twitter.com/Versature Check out the Versature Blog - http://inside.versature.com -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/hLgMAd0SOP4J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
So, apparently, we all imagined the problem. The status page no longer admits to anything. A. On Tue, Mar 6, 2012 at 4:44 PM, Francois Masurel f.masu...@gmail.com wrote: Yep, getting quite a few errors on loading requests lately like this one for example : 2012-03-06 20:26:42.834 Uncaught exception from servlet org.apache.xerces.parsers.ObjectFactory$ConfigurationError: Provider org.apache.xerces.parsers.XIncludeAwareParserConfiguration could not be instantiated: com.google.apphosting.api.DeadlineExceededException: This request (c2d42bb1d5647665) started at 2012/03/06 19:25:43.000 UTC and was still executing at 2012/03/06 19:26:42.782 UTC. at org.apache.xerces.parsers.ObjectFactory.newInstance(Unknown Source) at org.apache.xerces.parsers.ObjectFactory.createObject(Unknown Source) at org.apache.xerces.parsers.ObjectFactory.createObject(Unknown Source) at org.apache.xerces.parsers.SAXParser.init(Unknown Source) at org.apache.xerces.parsers.SAXParser.init(Unknown Source) at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.init(Unknown Source) at org.apache.xerces.jaxp.SAXParserImpl.init(Unknown Source) at org.apache.xerces.jaxp.SAXParserImpl.init(Unknown Source) at org.apache.xerces.jaxp.SAXParserFactoryImpl.newSAXParserImpl(Unknown Source) at org.apache.xerces.jaxp.SAXParserFactoryImpl.setFeature(Unknown Source) at org.mortbay.xml.XmlParser.makeFactorySecure(XmlParser.java:162) at org.mortbay.xml.XmlParser.setValidating(XmlParser.java:102) at org.mortbay.xml.XmlParser.init(XmlParser.java:91) at org.mortbay.jetty.webapp.TagLibConfiguration.configureWebApp(TagLibConfiguration.java:210) at org.mortbay.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1247) at org.mortbay.jetty.handler.ContextHandler.doStart(ContextHandler.java:517) at org.mortbay.jetty.webapp.WebAppContext.doStart(WebAppContext.java:467) at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50) at com.google.apphosting.runtime.jetty.AppVersionHandlerMap.createHandler(AppVersionHandlerMap.java:202) at com.google.apphosting.runtime.jetty.AppVersionHandlerMap.getHandler(AppVersionHandlerMap.java:171) at com.google.apphosting.runtime.jetty.JettyServletEngineAdapter.serviceRequest(JettyServletEngineAdapter.java:123) at com.google.apphosting.runtime.JavaRuntime$RequestRunnable.run(JavaRuntime.java:422) at com.google.tracing.TraceContext$TraceContextRunnable.runInContext(TraceContext.java:449) at com.google.tracing.TraceContext$TraceContextRunnable$1.run(TraceContext.java:455) at com.google.tracing.TraceContext.runInContext(TraceContext.java:695) at com.google.tracing.TraceContext$AbstractTraceContextCallback.runInInheritedContextNoUnref(TraceContext.java:333) at com.google.tracing.TraceContext$AbstractTraceContextCallback.runInInheritedContext(TraceContext.java:325) at com.google.tracing.TraceContext$TraceContextRunnable.run(TraceContext.java:453) at com.google.apphosting.runtime.ThreadGroupPool$PoolEntry.run(ThreadGroupPool.java:251) at java.lang.Thread.run(Thread.java:679) I2012-03-06 20:26:42.879 This request caused a new process to be started for your application, and thus caused your application code to be loaded for the first time. This request may thus take longer and use more CPU than a typical request for your application. W2012-03-06 20:26:42.879 A problem was encountered with the process that handled this request, causing it to exit. This is likely to cause a new process to be used for the next request to your application. If you see this message frequently, you may be throwing exceptions during the initialization of your application. (Error code 104) On Tuesday, March 6, 2012 10:17:37 PM UTC+1, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/BCIjV778ufoJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- Adam Sherman, CTO Versature Corp. / +1.877.498.3772 x113 Follow us on Twitter - http://twitter.com/Versature Check out the Versature Blog - http://inside.versature.com -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from
RE: [google-appengine] Re: Outages?
So, apparently, we all imagined the problem. The status page no longer admits to anything. In most systems the Uptime is 100% minus the summation of the downtime of all other systems. The exception to this rule is logging. When Logging fails to record the downtime, Uptime goes up. As a result Google has been working hard to build a logging system that goes down just ahead of all other systems, and comes up shortly after. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
On Wed, Mar 7, 2012 at 11:15 AM, Brandon Wirtz drak...@digerat.com wrote: In most systems the Uptime is 100% minus the summation of the downtime of all other systems. The exception to this rule is logging. When Logging fails to record the downtime, Uptime goes up. As a result Google has been working hard to build a logging system that goes down just ahead of all other systems, and comes up shortly after. Well said sir! I'm still laughing. A. -- Adam Sherman, CTO Versature Corp. / +1.877.498.3772 x113 Follow us on Twitter - http://twitter.com/Versature Check out the Versature Blog - http://inside.versature.com -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
I'm getting the same errors :( On Wednesday, March 7, 2012 11:18:39 AM UTC-5, Adam Sherman wrote: On Wed, Mar 7, 2012 at 11:15 AM, Brandon Wirtz drak...@digerat.com wrote: In most systems the Uptime is 100% minus the summation of the downtime of all other systems. The exception to this rule is logging. When Logging fails to record the downtime, Uptime goes up. As a result Google has been working hard to build a logging system that goes down just ahead of all other systems, and comes up shortly after. Well said sir! I'm still laughing. A. -- Adam Sherman, CTO Versature Corp. / +1.877.498.3772 x113 Follow us on Twitter - http://twitter.com/Versature Check out the Versature Blog - http://inside.versature.com -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/zGUBscnczeMJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
I see a lot of errors on app startup like this: Failed startup of context com.google.apphosting.utils.jetty.RuntimeAppEngineWebAppContext@15a2dc4{/,/base/data/home/apps/s~voostip/1.357238701206702102} com.google.apphosting.api.DeadlineExceededException: This request (71d5265cd8f687bc) started at 2012/03/06 21:22:53.913 UTC and was still executing at 2012/03/06 21:23:53.699 UTC. at com.google.appengine.runtime.Request.process-71d5265cd8f687bc(Request.java) at java.util.zip.ZipFile.read(Native Method) at java.util.zip.ZipFile.access$1200(ZipFile.java:57) at java.util.zip.ZipFile$ZipFileInputStream.read(ZipFile.java:476) at java.util.zip.ZipFile$1.fill(ZipFile.java:259) at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:158) at sun.misc.Resource.getBytes(Resource.java:124) at java.net.URLClassLoader.defineClass(URLClassLoader.java:273) at sun.reflect.GeneratedMethodAccessor5.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:616) at java.lang.ClassLoader.loadClass(ClassLoader.java:266) It's not 100% of the time, but it's often enough to be scary. If an instance gets up it seems to stay up. But getting there is a problem. Jeff On Tue, Mar 6, 2012 at 4:22 PM, Cesium cesiumpic...@gmail.com wrote: I've got nothing but errors. On Tuesday, March 6, 2012 2:17:37 PM UTC-7, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/AwHg5a7-EPoJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
That is also what I am seeing. On Tue, Mar 6, 2012 at 4:27 PM, Jeff Schnitzer j...@infohazard.org wrote: I see a lot of errors on app startup like this: Failed startup of context com.google.apphosting.utils.jetty.RuntimeAppEngineWebAppContext@15a2dc4{/,/base/data/home/apps/s~voostip/1.357238701206702102} com.google.apphosting.api.DeadlineExceededException: This request (71d5265cd8f687bc) started at 2012/03/06 21:22:53.913 UTC and was still executing at 2012/03/06 21:23:53.699 UTC. at com.google.appengine.runtime.Request.process-71d5265cd8f687bc(Request.java) at java.util.zip.ZipFile.read(Native Method) at java.util.zip.ZipFile.access$1200(ZipFile.java:57) at java.util.zip.ZipFile$ZipFileInputStream.read(ZipFile.java:476) at java.util.zip.ZipFile$1.fill(ZipFile.java:259) at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:158) at sun.misc.Resource.getBytes(Resource.java:124) at java.net.URLClassLoader.defineClass(URLClassLoader.java:273) at sun.reflect.GeneratedMethodAccessor5.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:616) at java.lang.ClassLoader.loadClass(ClassLoader.java:266) It's not 100% of the time, but it's often enough to be scary. If an instance gets up it seems to stay up. But getting there is a problem. Jeff On Tue, Mar 6, 2012 at 4:22 PM, Cesium cesiumpic...@gmail.com wrote: I've got nothing but errors. On Tuesday, March 6, 2012 2:17:37 PM UTC-7, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/AwHg5a7-EPoJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- Adam Sherman, CTO Versature Corp. / +1.877.498.3772 x113 Follow us on Twitter - http://twitter.com/Versature Check out the Versature Blog - http://inside.versature.com -- You received this message because you are subscribed to the Google Groups Google App Engine group. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Re: [google-appengine] Re: Outages?
In addition to those, I've been getting logs with a single line of text reading Request was aborted after waiting too long to attempt to service your request. Too long seems to be about ten seconds. On Tuesday, March 6, 2012 10:27:16 PM UTC+1, Jeff Schnitzer wrote: I see a lot of errors on app startup like this: Failed startup of context com.google.apphosting.utils.jetty.RuntimeAppEngineWebAppContext@15a2dc4{/,/base/data/home/apps/s~voostip/1.357238701206702102} com.google.apphosting.api.DeadlineExceededException: This request (71d5265cd8f687bc) started at 2012/03/06 21:22:53.913 UTC and was still executing at 2012/03/06 21:23:53.699 UTC. at com.google.appengine.runtime.Request.process-71d5265cd8f687bc(Request.java) at java.util.zip.ZipFile.read(Native Method) at java.util.zip.ZipFile.access$1200(ZipFile.java:57) at java.util.zip.ZipFile$ZipFileInputStream.read(ZipFile.java:476) at java.util.zip.ZipFile$1.fill(ZipFile.java:259) at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:158) at sun.misc.Resource.getBytes(Resource.java:124) at java.net.URLClassLoader.defineClass(URLClassLoader.java:273) at sun.reflect.GeneratedMethodAccessor5.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:616) at java.lang.ClassLoader.loadClass(ClassLoader.java:266) It's not 100% of the time, but it's often enough to be scary. If an instance gets up it seems to stay up. But getting there is a problem. Jeff On Tue, Mar 6, 2012 at 4:22 PM, Cesium cesiumpic...@gmail.com wrote: I've got nothing but errors. On Tuesday, March 6, 2012 2:17:37 PM UTC-7, Adam Sherman wrote: Am I the only one seeing short duration outages? They are being reflected at: http://code.google.com/status/**appenginehttp://code.google.com/status/appengine But I don't see anyone else complaining anywhere, so it makes me worried. A. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/AwHg5a7-EPoJ. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en. -- You received this message because you are subscribed to the Google Groups Google App Engine group. To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/8Y7e3_REEO8J. To post to this group, send email to google-appengine@googlegroups.com. To unsubscribe from this group, send email to google-appengine+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.