Re: [jira] Commented: (OFBIZ-3867) JobManager.poll() enters an endless loop when it can't get a connection

Robert Morley Tue, 20 Jul 2010 10:15:10 -0700

We have made a number of changes to the JobScheduler to properly workwith multi-tenancy. In this spot we created a list of the databasesthat were down and when polling for jobs we would exclude these jobs.We then had a separate polling period (default 5 minutes) that wouldcheck the offline databases to see if they have gone back online.

This might not match what you are trying to do exactly because we havea technique of storing all persisted jobs in our "main" database whichhas a "delegatorName" column (which represents the tenant). Jobs thatare destined to run for all tenants would be "exploded" into a job pertenant (targeted for it). This allows a "sendEmail" job (for example)to execute on all tenant databases that are online, and safely skipnon-online tenants until they go back online. This also creates asingleton jobManager so you do not have one running for each tenant ...


Anyway those are my thoughts on it :)

On Jul 20, 2010, at 1:06 PM, Adrian Crum (JIRA) wrote:

[ https://issues.apache.org/jira/browse/OFBIZ-3867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890329#action_12890329 ]
Adrian Crum commented on OFBIZ-3867:
------------------------------------
One idea off the top of my head and without looking at the codewould be to give the JobManager a state. It could enter a suspendedstate and then try switching to an active state from time to time.State change log entries would be informational, not warnings.
Having a hook where outside events could monitor/trigger statechanges could be useful. A process monitoring the request load couldsuspend the JobManager during peak traffic times.
JobManager.poll() enters an endless loop when it can't get aconnection
-----------------------------------------------------------------------

               Key: OFBIZ-3867
               URL: https://issues.apache.org/jira/browse/OFBIZ-3867
           Project: OFBiz
        Issue Type: Bug
          Reporter: Adam Heath
          Assignee: Adam Heath
JobManager.poll(), line 157, where it calls storeByCondition, canfail when there is no connection available from the database(due toa connection leak, or just load, or whatever). An exception thengets thrown by storeByCondition(deep inside ofbiz/commons-dbcp/postgres). The catch(Throwable) then logs the error, and the looptries again. Since pollDone never gets set to true, this loop is*very* tight, and the log file starts to fill up *very* fast, eacheach thread of JobPoller tries the same thing over and over.I'm filing this bug mainly to see if anyone else works on it, butif not, it's a reminder for me.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Re: [jira] Commented: (OFBIZ-3867) JobManager.poll() enters an endless loop when it can't get a connection

Reply via email to