Re: [Sequoia] Controller Timeout Issue?

Christopher Ekberg Thu, 08 Jan 2009 11:30:42 -0800

Ha, we struggled with that too.  Still are.

1.  Is this on Linux?
2.  Are you using Java 5?

3. Do you notice that your system load shoots through the roof? (We saw system load numbers of like 300 on our systems, which became unresponsive for over a minute.)

We suspect this is a GC bug in the VM; we mostly eliminated this problem by commenting out an explicit call to System.gc() in Sequoia's code (can't remember quite where right now). We still see the problem on rare occasions under load; our backends get trashed and we have to restore the databases from another copy. We were also able to trigger this "load bomb" by hitting the "Perform GC" button in jconsole a few times in rapid succession; it's as if this VM gets very unhappy if something triggers a GC while it's already doing GC.
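
If you want to check whether GC activity lines up with the stalls, here is a rough sketch using the standard java.lang.management beans (available since Java 5). The class and method names (GcStatsSketch, totalGcCount, totalGcTimeMs) are made up for illustration and are not part of Sequoia; the System.gc() call in main is only there to mimic the explicit call we commented out.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;

// Hypothetical helper, not Sequoia code: sums GC counters across all collectors.
class GcStatsSketch {

    // Total number of collections reported by the VM so far.
    static long totalGcCount() {
        long total = 0L;
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            long count = gc.getCollectionCount(); // -1 if the collector does not report it
            if (count > 0) {
                total += count;
            }
        }
        return total;
    }

    // Approximate accumulated collection time in milliseconds.
    static long totalGcTimeMs() {
        long total = 0L;
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            long time = gc.getCollectionTime(); // -1 if the collector does not report it
            if (time > 0) {
                total += time;
            }
        }
        return total;
    }

    public static void main(String[] args) {
        long countBefore = totalGcCount();
        long timeBefore = totalGcTimeMs();

        System.gc(); // explicit GC request, similar to the call we removed

        System.out.println("collections since start of test: " + (totalGcCount() - countBefore));
        System.out.println("GC time since start of test (ms): " + (totalGcTimeMs() - timeBefore));
    }
}
```

Logging these two numbers periodically next to the controller log should show whether the stalls coincide with a jump in GC time. As far as I know, on Sun's HotSpot VM the -XX:+DisableExplicitGC option turns System.gc() calls into no-ops, which might be an alternative to patching the source; I have not verified that it cures this particular problem.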

You can also see those warnings if your system clock changed, but I'm hoping you are having the same problem we are. We're hoping that this is a GC bug in Java 5 and are considering switching to Java 6, which apparently rewrote a lot of the GC stuff. Has anyone else had this problem and worked around it?
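
To illustrate why a long pause can look like a clock change, here is a minimal sketch of a sleep-and-compare check. This is an assumption about how such a detector could work, not Sequoia's actual code; the class name, method name, sleep interval and tolerance are all hypothetical.

```java
// Hypothetical sketch, not Sequoia code: flags a "time shift" when a sleep
// takes much longer (or shorter) than requested according to the wall clock.
class TimeShiftSketch {

    // Returns the unexpected drift in ms, or 0 if it is within the tolerance.
    // A negative result means the wall clock appeared to move backwards.
    static long detectShift(long beforeMs, long afterMs, long expectedSleepMs, long toleranceMs) {
        long drift = (afterMs - beforeMs) - expectedSleepMs;
        return Math.abs(drift) > toleranceMs ? drift : 0L;
    }

    public static void main(String[] args) throws InterruptedException {
        final long sleepMs = 1000L;     // assumed polling interval
        final long toleranceMs = 5000L; // assumed tolerance before warning

        for (int i = 0; i < 3; i++) {
            long before = System.currentTimeMillis();
            Thread.sleep(sleepMs);
            long after = System.currentTimeMillis();

            long shift = detectShift(before, after, sleepMs, toleranceMs);
            if (shift != 0) {
                System.out.println((shift / 1000) + "s time shift detected");
            }
        }
    }
}
```

A check like this cannot tell a real clock adjustment apart from the thread simply not being scheduled for 25 seconds, so a long GC pause or a badly overloaded box would produce the same warning even with NTP running normally.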


-Chris


On Jan 7, 2009, at 10:23 PM, Tyler Shen wrote:

G’day,

We have been running Sequoia in our production environment for weeks
until this morning our application crashed with the following
messages:
2009-01-07 10:44:16,490 INFO
[org.karora.moomba.ui.internal.FolderLayout] - Adding view to
layout: com.xbio.nba.abdr.site.ui.view.memberdetailsv...@224d0286
25s time shift detected, from Wed Jan 07 10:44:29 EST 2009 to Wed
Jan 07 10:44:56 EST 2009
Brutal changes of date/time can lead to erroneous controller failure
detections!
2009-01-07 10:44:59,141 INFO
[org.karora.moomba.ui.internal.FolderLayout] - Adding view to
layout:
com.xbio.nba.abdr.checklist.ui.view.tasktypesetupviewp...@6729246b
2009-01-07 10:44:59,148 INFO
[org.karora.moomba.ui.internal.PageLayout] - Adding view to layout:
View(com.xbio.nba.abdr.site.ui.view.MemberNavigator)
2009-01-07 10:44:59,172 WARN
[org.hibernate.util.JDBCExceptionReporter] - SQL Error: 0, SQLState:
null
2009-01-07 10:44:59,172 ERROR
[org.hibernate.util.JDBCExceptionReporter] - Protocol corruption
while trying to send command: 0. Check the previous command
18s time shift detected, from Wed Jan 07 10:44:40 EST 2009 to Wed
Jan 07 10:45:00 EST 2009
Brutal changes of date/time can lead to erroneous controller failure
detections!
29s time shift detected, from Wed Jan 07 10:44:29 EST 2009 to Wed
Jan 07 10:45:00 EST 2009
Brutal changes of date/time can lead to erroneous controller failure
detections!
2009-01-07 10:45:32,600 INFO
[com.xbio.nba.abdr.patienttransition.service.impl.PatientTransitionServiceImpl] -
completeOutstandingTransitions executed
2009-01-07 10:45:32,713 WARN
[org.hibernate.util.JDBCExceptionReporter] - SQL Error: 0, SQLState:
null
2009-01-07 10:45:32,713 ERROR
[org.hibernate.util.JDBCExceptionReporter] - Protocol corruption
while trying to send command: 0. Check the previous command

We have the server running on NTP. How such a time shift occurred is
beyond me. Having briefly gone through the source code, I noticed
that tweaking ControllerTimeout can make Sequoia more tolerant
once a time shift is detected.
Where can I change ControllerTimeout, and is this the proper way to
work around the issue? Thanks!

Regards,

Tyler Shen





_______________________________________________
Sequoia mailing list
[email protected]
https://forge.continuent.org/mailman/listinfo/sequoia