Hi Oliver, here is the stack trace, it happened again :( The interesting thing is that when i artificially slow down our throughoutput, it never happens. When i set it loose (with highest throughoutput) it usually happens. This stack trace is from i386 Linux 2.4, but we noticed the same on Sun Solaris HW. Btw, we made the needed changes to LockManager (to prevent lock accumulation is lock manager, a memory leak), coming a patch for review soon. The log output is a bit messy, it seems that when FRM declares DIRTY for database, and all my threads are dying in row.... it is ok. But why ERR_SYSTEM? Remember, that commit occures BEFORE the filename is sent over Queue, and every Stage is working in it's own workspace (eg. FRM is set to storeDir=/ips/store, workDir=/ips/work), and Stage1 works with files ONLY in "/Stage1", Stage2 in "/Stage2" accessed thru FRM... so, it is theoretically IMPOSSIBLE for two different stage/thread to work on the SAME file. Just to remeber: Stage1 reads using FRM from the inputWorkarea (eg. /stage1), process the file and spits it out to outputWorkarea (/stage2), the Stage2 gets the filename over Queue and reads the file from it's inputarea (/stage2) and writes to outputarea (/stage3) and so on... Real stages are: CRBExploder, InputReader, CDRProcessor, CRBPacker 2005-11-30 05:20:36.693 INFO [CRBPacker:Stage CRBPacker] - hu.siemens.tmd.CRBPackerTask: Tariff Switch CDR written into CRB file ipms_node1_20051130_042036_00001.cdr 2005-11-30 05:20:36.693 INFO [CRBPacker:Stage CRBPacker] - hu.siemens.tmd.CRBPackerTask: Tariff Switch CDR written into CRB file ipms_node1_20051130_042036_00001.cdr 2005-11-30 05:20:36.696 INFO [CRBPacker:Stage CRBPacker] - hu.siemens.tmd.CRBPackerTask: Wrote out a new CRB file: ipms_node1_20051130_042036_00001.cdr 2005-11-30 05:20:36.825 ERROR [InputReader:Stage InputReader] - hu.siemens.tmk.transaction.fs.FSTransactionalResource: Fatal error during critical commit/rollback of transaction 1133324406295_129327, setting database to dirty. org.apache.commons.transaction.file.ResourceManagerSystemException: 1133324406295_129327: Commit failed (ERR_SYSTEM) Caused by: java.io.FileNotFoundException: /users/barath/ips-test-1.2.14/consolidator/txfs/store/inputreader/20051115122953_87327151000_mdqs_20051115_122957_00000.cdr_0083_ber.cdr (No such file or directory) at java.io.FileOutputStream.open(Native Method) at java.io.FileOutputStream.<init>(FileOutputStream.java:179) at java.io.FileOutputStream.<init>(FileOutputStream.java:131) at org.apache.commons.transaction.util.FileHelper.copy(FileHelper.java:235) at org.apache.commons.transaction.util.FileHelper.moveRec(FileHelper.java:130) at org.apache.commons.transaction.util.FileHelper.moveRec(FileHelper.java:111) at org.apache.commons.transaction.util.FileHelper.moveRec(FileHelper.java:135) at org.apache.commons.transaction.util.FileHelper.moveRec(FileHelper.java:111) at org.apache.commons.transaction.file.FileResourceManager$TransactionContext.commit(FileResourceManager.java:1366) at org.apache.commons.transaction.file.FileResourceManager.commitTransaction(FileResourceManager.java:648) at hu.siemens.tmk.transaction.fs.FSManager.doCommit(FSManager.java:128) at hu.siemens.tmk.transaction.AbstractTransactionManager.commitOrRollback(AbstractTransactionManager.java:99) at hu.siemens.tmk.transaction.AbstractTransactionManager.commit(AbstractTransactionManager.java:79) at hu.siemens.tmk.AbstractBridgingTransactionalTask.commitTransaction(AbstractBridgingTransactionalTask.java:125) at hu.siemens.tmd.AbstractBridgingTMDTask.doRunBridged(AbstractBridgingTMDTask.java:105) at hu.siemens.tmk.AbstractBridgingTransactionalTask.doRun(AbstractBridgingTransactionalTask.java:75) at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:47) at hu.siemens.tmk.InterconnectedSTStage.doTheWork(InterconnectedSTStage.java:40) at hu.siemens.tmk.AbstractStage.run(AbstractStage.java:80) at java.lang.Thread.run(Thread.java:595) at org.apache.commons.transaction.file.FileResourceManager$TransactionContext.commit(FileResourceManager.java:1368) at org.apache.commons.transaction.file.FileResourceManager.commitTransaction(FileResourceManager.java:648) at hu.siemens.tmk.transaction.fs.FSManager.doCommit(FSManager.java:128) at hu.siemens.tmk.transaction.AbstractTransactionManager.commitOrRollback(AbstractTransactionManager.java:99) at hu.siemens.tmk.transaction.AbstractTransactionManager.commit(AbstractTransactionManager.java:79) at hu.siemens.tmk.AbstractBridgingTransactionalTask.commitTransaction(AbstractBridgingTransactionalTask.java:125) at hu.siemens.tmd.AbstractBridgingTMDTask.doRunBridged(AbstractBridgingTMDTask.java:105) at hu.siemens.tmk.AbstractBridgingTransactionalTask.doRun(AbstractBridgingTransactionalTask.java:75) at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:47) at hu.siemens.tmk.InterconnectedSTStage.doTheWork(InterconnectedSTStage.java:40) at hu.siemens.tmk.AbstractStage.run(AbstractStage.java:80) at java.lang.Thread.run(Thread.java:595) 2005-11-30 05:20:36.830 ERROR [InputReader:Stage InputReader] - hu.siemens.tmk.transaction.fs.FSManager: Exception during finishing transaction FSTx:1133324406295_129327 hu.siemens.tmk.transaction.TransactionException: FileResourceManager threw exception during commit. at hu.siemens.tmk.transaction.fs.FSManager.doCommit(FSManager.java:130) at hu.siemens.tmk.transaction.AbstractTransactionManager.commitOrRollback(AbstractTransactionManager.java:99) at hu.siemens.tmk.transaction.AbstractTransactionManager.commit(AbstractTransactionManager.java:79) at hu.siemens.tmk.AbstractBridgingTransactionalTask.commitTransaction(AbstractBridgingTransactionalTask.java:125) at hu.siemens.tmd.AbstractBridgingTMDTask.doRunBridged(AbstractBridgingTMDTask.java:105) at hu.siemens.tmk.AbstractBridgingTransactionalTask.doRun(AbstractBridgingTransactionalTask.java:75) at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:47) at hu.siemens.tmk.InterconnectedSTStage.doTheWork(InterconnectedSTStage.java:40) at hu.siemens.tmk.AbstractStage.run(AbstractStage.java:80) at java.lang.Thread.run(Thread.java:595) Caused by: org.apache.commons.transaction.file.ResourceManagerSystemException: 1133324406295_129327: Commit failed (ERR_SYSTEM) Caused by: java.io.FileNotFoundException: /users/barath/ips-test-1.2.14/consolidator/txfs/store/inputreader/20051115122953_87327151000_mdqs_20051115_122957_00000.cdr_0083_ber.cdr (No such file or directory) at java.io.FileOutputStream.open(Native Method) at java.io.FileOutputStream.<init>(FileOutputStream.java:179) at java.io.FileOutputStream.<init>(FileOutputStream.java:131) at org.apache.commons.transaction.util.FileHelper.copy(FileHelper.java:235) at org.apache.commons.transaction.util.FileHelper.moveRec(FileHelper.java:130) at org.apache.commons.transaction.util.FileHelper.moveRec(FileHelper.java:111) at org.apache.commons.transaction.util.FileHelper.moveRec(FileHelper.java:135) at org.apache.commons.transaction.util.FileHelper.moveRec(FileHelper.java:111) at org.apache.commons.transaction.file.FileResourceManager$TransactionContext.commit(FileResourceManager.java:1366) at org.apache.commons.transaction.file.FileResourceManager.commitTransaction(FileResourceManager.java:648) at hu.siemens.tmk.transaction.fs.FSManager.doCommit(FSManager.java:128) at hu.siemens.tmk.transaction.AbstractTransactionManager.commitOrRollback(AbstractTransactionManager.java:99) at hu.siemens.tmk.transaction.AbstractTransactionManager.commit(AbstractTransactionManager.java:79) at hu.siemens.tmk.AbstractBridgingTransactionalTask.commitTransaction(AbstractBridgingTransactionalTask.java:125) at hu.siemens.tmd.AbstractBridgingTMDTask.doRunBridged(AbstractBridgingTMDTask.java:105) at hu.siemens.tmk.AbstractBridgingTransactionalTask.doRun(AbstractBridgingTransactionalTask.java:75) at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:47) at hu.siemens.tmk.InterconnectedSTStage.doTheWork(InterconnectedSTStage.java:40) at hu.siemens.tmk.AbstractStage.run(AbstractStage.java:80) at java.lang.Thread.run(Thread.java:595) at org.apache.commons.transaction.file.FileResourceManager$TransactionContext.commit(FileResourceManager.java:1368) at org.apache.commons.transaction.file.FileResourceManager.commitTransaction(FileResourceManager.java:648) at hu.siemens.tmk.transaction.fs.FSManager.doCommit(FSManager.java:128) ... 9 more 2005-11-30 05:20:36.831 ERROR [InputReader:Stage InputReader] - hu.siemens.tmk.transaction.fs.FSManager: Exception during finishing transaction FSTx:1133324406295_129327 hu.siemens.tmk.transaction.TransactionException: FileResourceManager threw exception during rollback. at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:148) at hu.siemens.tmk.transaction.AbstractTransactionManager.commitOrRollback(AbstractTransactionManager.java:102) at hu.siemens.tmk.transaction.AbstractTransactionManager.rollback(AbstractTransactionManager.java:84) at hu.siemens.tmk.AbstractBridgingTransactionalTask.rollbackTransaction(AbstractBridgingTransactionalTask.java:141) at hu.siemens.tmd.AbstractBridgingTMDTask.doRunBridged(AbstractBridgingTMDTask.java:108) at hu.siemens.tmk.AbstractBridgingTransactionalTask.doRun(AbstractBridgingTransactionalTask.java:75) at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:47) at hu.siemens.tmk.InterconnectedSTStage.doTheWork(InterconnectedSTStage.java:40) at hu.siemens.tmk.AbstractStage.run(AbstractStage.java:80) at java.lang.Thread.run(Thread.java:595) Caused by: org.apache.commons.transaction.file.ResourceManagerSystemException: 1133324406295_129327: Database is set to dirty, this *may* mean it is corrupt. No modifications are allowed until a recovery run has been performed! (ERR_SYSTEM) at org.apache.commons.transaction.file.FileResourceManager.txInitialSaneCheckForWriting(FileResourceManager.java:932) at org.apache.commons.transaction.file.FileResourceManager.rollbackTransaction(FileResourceManager.java:600) at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:146) ... 9 more 2005-11-30 05:20:36.831 ERROR [InputReader:Stage InputReader] - hu.siemens.tmk.InterconnectedSTStage: Stage InputReader throw Exception, exiting. hu.siemens.tmk.ExecutionException: hu.siemens.tmk.transaction.TransactionException: FileResourceManager threw exception during rollback. at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:53) at hu.siemens.tmk.InterconnectedSTStage.doTheWork(InterconnectedSTStage.java:40) at hu.siemens.tmk.AbstractStage.run(AbstractStage.java:80) at java.lang.Thread.run(Thread.java:595) Caused by: hu.siemens.tmk.transaction.TransactionException: FileResourceManager threw exception during rollback. at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:148) at hu.siemens.tmk.transaction.AbstractTransactionManager.commitOrRollback(AbstractTransactionManager.java:102) at hu.siemens.tmk.transaction.AbstractTransactionManager.rollback(AbstractTransactionManager.java:84) at hu.siemens.tmk.AbstractBridgingTransactionalTask.rollbackTransaction(AbstractBridgingTransactionalTask.java:141) at hu.siemens.tmd.AbstractBridgingTMDTask.doRunBridged(AbstractBridgingTMDTask.java:108) at hu.siemens.tmk.AbstractBridgingTransactionalTask.doRun(AbstractBridgingTransactionalTask.java:75) at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:47) ... 3 more Caused by: org.apache.commons.transaction.file.ResourceManagerSystemException: 1133324406295_129327: Database is set to dirty, this *may* mean it is corrupt. No modifications are allowed until a recovery run has been performed! (ERR_SYSTEM) at org.apache.commons.transaction.file.FileResourceManager.txInitialSaneCheckForWriting(FileResourceManager.java:932) at org.apache.commons.transaction.file.FileResourceManager.rollbackTransaction(FileResourceManager.java:600) at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:146) ... 9 more 2005-11-30 05:20:36.834 ERROR [TMKCmd:TMKCmd] - hu.siemens.tmk.TMKernel: Got ERROR from Stage InputReader 2005-11-30 05:20:36.834 ERROR [InputReader:Stage InputReader] - hu.siemens.tmk.transaction.fs.FSManager: Exception during finishing transaction FSTx:1133324406295_129327 hu.siemens.tmk.transaction.TransactionException: FileResourceManager threw exception during rollback. at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:148) at hu.siemens.tmk.transaction.AbstractTransactionManager.commitOrRollback(AbstractTransactionManager.java:102) at hu.siemens.tmk.transaction.AbstractTransactionManager.rollback(AbstractTransactionManager.java:84) at hu.siemens.tmk.AbstractBridgingTransactionalTask.rollbackTransaction(AbstractBridgingTransactionalTask.java:141) at hu.siemens.tmd.AbstractBridgingTMDTask.doRunBridged(AbstractBridgingTMDTask.java:108) at hu.siemens.tmk.AbstractBridgingTransactionalTask.doRun(AbstractBridgingTransactionalTask.java:75) at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:47) at hu.siemens.tmk.InterconnectedSTStage.doTheWork(InterconnectedSTStage.java:40) at hu.siemens.tmk.AbstractStage.run(AbstractStage.java:80) at java.lang.Thread.run(Thread.java:595) Caused by: org.apache.commons.transaction.file.ResourceManagerSystemException: 1133324406295_129327: Database is set to dirty, this *may* mean it is corrupt. No modifications are allowed until a recovery run has been performed! (ERR_SYSTEM) at org.apache.commons.transaction.file.FileResourceManager.txInitialSaneCheckForWriting(FileResourceManager.java:932) at org.apache.commons.transaction.file.FileResourceManager.rollbackTransaction(FileResourceManager.java:600) at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:146) ... 9 more 2005-11-30 05:20:36.834 INFO [TMKCmd:TMKCmd] - hu.siemens.tmk.TMKernel: Alert [Internal kernel error] [Component Stage InputReader reported ERROR.] sent by Stage InputReader(hu.siemens.tmk.InterconnectedSTStage) 2005-11-30 05:20:36.834 ERROR [InputReader:Stage InputReader] - hu.siemens.tmk.InterconnectedSTStage: Stage InputReader throw Exception, exiting. hu.siemens.tmk.ExecutionException: hu.siemens.tmk.transaction.TransactionException: FileResourceManager threw exception during rollback. at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:53) at hu.siemens.tmk.InterconnectedSTStage.doTheWork(InterconnectedSTStage.java:40) at hu.siemens.tmk.AbstractStage.run(AbstractStage.java:80) at java.lang.Thread.run(Thread.java:595) Caused by: hu.siemens.tmk.transaction.TransactionException: FileResourceManager threw exception during rollback. at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:148) at hu.siemens.tmk.transaction.AbstractTransactionManager.commitOrRollback(AbstractTransactionManager.java:102) at hu.siemens.tmk.transaction.AbstractTransactionManager.rollback(AbstractTransactionManager.java:84) at hu.siemens.tmk.AbstractBridgingTransactionalTask.rollbackTransaction(AbstractBridgingTransactionalTask.java:141) at hu.siemens.tmd.AbstractBridgingTMDTask.doRunBridged(AbstractBridgingTMDTask.java:108) at hu.siemens.tmk.AbstractBridgingTransactionalTask.doRun(AbstractBridgingTransactionalTask.java:75) at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:47) ... 3 more Caused by: org.apache.commons.transaction.file.ResourceManagerSystemException: 1133324406295_129327: Database is set to dirty, this *may* mean it is corrupt. No modifications are allowed until a recovery run has been performed! (ERR_SYSTEM) at org.apache.commons.transaction.file.FileResourceManager.txInitialSaneCheckForWriting(FileResourceManager.java:932) at org.apache.commons.transaction.file.FileResourceManager.rollbackTransaction(FileResourceManager.java:600) at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:146) ... 9 more 2005-11-30 05:20:36.835 ERROR [InputReader:Stage InputReader] - hu.siemens.tmk.transaction.fs.FSManager: Exception during finishing transaction FSTx:1133324406295_129327 hu.siemens.tmk.transaction.TransactionException: FileResourceManager threw exception during rollback. at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:148) at hu.siemens.tmk.transaction.AbstractTransactionManager.commitOrRollback(AbstractTransactionManager.java:102) at hu.siemens.tmk.transaction.AbstractTransactionManager.rollback(AbstractTransactionManager.java:84) at hu.siemens.tmk.AbstractBridgingTransactionalTask.rollbackTransaction(AbstractBridgingTransactionalTask.java:141) at hu.siemens.tmd.AbstractBridgingTMDTask.doRunBridged(AbstractBridgingTMDTask.java:108) at hu.siemens.tmk.AbstractBridgingTransactionalTask.doRun(AbstractBridgingTransactionalTask.java:75) at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:47) at hu.siemens.tmk.InterconnectedSTStage.doTheWork(InterconnectedSTStage.java:40) at hu.siemens.tmk.AbstractStage.run(AbstractStage.java:80) at java.lang.Thread.run(Thread.java:595) Caused by: org.apache.commons.transaction.file.ResourceManagerSystemException: 1133324406295_129327: Database is set to dirty, this *may* mean it is corrupt. No modifications are allowed until a recovery run has been performed! (ERR_SYSTEM) at org.apache.commons.transaction.file.FileResourceManager.txInitialSaneCheckForWriting(FileResourceManager.java:932) at org.apache.commons.transaction.file.FileResourceManager.rollbackTransaction(FileResourceManager.java:600) at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:146) ... 9 more 2005-11-30 05:20:36.836 ERROR [InputReader:Stage InputReader] - hu.siemens.tmk.InterconnectedSTStage: Stage InputReader throw Exception, exiting. hu.siemens.tmk.ExecutionException: hu.siemens.tmk.transaction.TransactionException: FileResourceManager threw exception during rollback. at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:53) at hu.siemens.tmk.InterconnectedSTStage.doTheWork(InterconnectedSTStage.java:40) at hu.siemens.tmk.AbstractStage.run(AbstractStage.java:80) at java.lang.Thread.run(Thread.java:595) Caused by: hu.siemens.tmk.transaction.TransactionException: FileResourceManager threw exception during rollback. at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:148) at hu.siemens.tmk.transaction.AbstractTransactionManager.commitOrRollback(AbstractTransactionManager.java:102) at hu.siemens.tmk.transaction.AbstractTransactionManager.rollback(AbstractTransactionManager.java:84) at hu.siemens.tmk.AbstractBridgingTransactionalTask.rollbackTransaction(AbstractBridgingTransactionalTask.java:141) at hu.siemens.tmd.AbstractBridgingTMDTask.doRunBridged(AbstractBridgingTMDTask.java:108) at hu.siemens.tmk.AbstractBridgingTransactionalTask.doRun(AbstractBridgingTransactionalTask.java:75) at hu.siemens.tmk.AbstractTask.run(AbstractTask.java:47) ... 3 more Caused by: org.apache.commons.transaction.file.ResourceManagerSystemException: 1133324406295_129327: Database is set to dirty, this *may* mean it is corrupt. No modifications are allowed until a recovery run has been performed! (ERR_SYSTEM) at org.apache.commons.transaction.file.FileResourceManager.txInitialSaneCheckForWriting(FileResourceManager.java:932) at org.apache.commons.transaction.file.FileResourceManager.rollbackTransaction(FileResourceManager.java:600) at hu.siemens.tmk.transaction.fs.FSManager.doRollback(FSManager.java:146) ... 9 more 2005-11-30 05:20:36.836 INFO [TMKCmd:TMKCmd] - hu.siemens.tmk.TMKernel: Initiating shutdown by call from Stage InputReader 2005-11-30 05:20:36.844 ERROR [TMKCmd:TMKCmd] - hu.siemens.tmk.TMKernel: Got ERROR from Stage InputReader 2005-11-30 05:20:36.845 INFO [TMKCmd:TMKCmd] - hu.siemens.tmk.TMKernel: Alert [Internal kernel error] [Component Stage InputReader reported ERROR.] sent by Stage InputReader(hu.siemens.tmk.InterconnectedSTStage) 2005-11-30 05:20:36.845 INFO [TMKCmd:TMKCmd] - hu.siemens.tmk.InterconnectedSTStage: Stopping Stage CRBExploder 2005-11-30 05:20:36.845 INFO [TMKCmd:TMKCmd] - hu.siemens.tmk.InterconnectedSTStage: Stopping Stage InputReader 2005-11-30 05:20:36.845 INFO [TMKCmd:TMKCmd] - hu.siemens.tmk.InterconnectedSTStage: Stopping Stage Algorithm 2005-11-30 05:20:36.845 INFO [TMKCmd:TMKCmd] - hu.siemens.tmk.InterconnectedSTStage: Stopping Stage CRBPacker thanx, ~t~ Oliver Zeigermann wrote: commit and not when you synchronize commit it is most likely that two commits clash in a way they should not.By the way synchronizing the commit takes a lot of possible concurrency from the manager as at most one tx can commit at it time in this case. |
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]