No, we are using atomicity mode ATOMIC. One of the caches we are querying holds 30GB of data on a single node, and we have a heap size of 24GB.
Does it cause any problem? Do we need to keep the heap larger than the data we have in the cache?

Thanks

On Sat, 9 Jul, 2022, 12:47 am Stephen Darlington, <
stephen.darling...@gridgain.com> wrote:

> Most of the logs just show other nodes disconnecting.
>
> How are your tables configured? Are you using cache mode
> TRANSACTIONAL_SNAPSHOT by any chance?
>
> On 7 Jul 2022, at 12:08, Farhan Abdul Shakoor <farhan.c...@gmail.com>
> wrote:
>
> Hey Stephen,
>
> Thanks for your reply, please find attached logs from 2 nodes. We have
> seen similar logs on all the nodes before the crash.
>
> Thanks
>
> On Thu, 7 Jul, 2022, 2:03 pm Stephen Darlington, <
> stephen.darling...@gridgain.com> wrote:
>
>> We’d need to see more of the log to figure out what the problem is.
>> That’s just the end of a thread dump and not the error itself.
>>
>> On 6 Jul 2022, at 19:12, Farhan Abdul Shakoor <farhan.c...@gmail.com>
>> wrote:
>>
>> Hi Folks,
>>
>> We are running into strange issues when running queries against Ignite.
>> Here is our current setup:
>>
>> - 8 Node ignite on 128 GB VMs deployed on Azure kubernetes
>> - Persistence enabled with 30GB Data region size
>>
>> With following node configuration:
>>
>> <property name="dataStorageConfiguration">
>>     <bean class="org.apache.ignite.configuration.DataStorageConfiguration">
>>         <property name="metricsEnabled" value="true"/>
>>         <property name="pageSize" value="#{8 * 1024}"/>
>>         <property name="defaultDataRegionConfiguration">
>>             <bean class="org.apache.ignite.configuration.DataRegionConfiguration">
>>                 <property name="persistenceEnabled" value="true"/>
>>                 <property name="maxSize" value="#{30L * 1024 * 1024 * 1024}"/>
>>                 <property name="pageReplacementMode" value="SEGMENTED_LRU"/>
>>                 <property name="pageEvictionMode" value="RANDOM_2_LRU"/>
>>                 <property name="metricsEnabled" value="true"/>
>>             </bean>
>>         </property>
>>         <property name="walSegmentSize" value="#{128L * 1024 * 1024}"/>
>>         <property name="walPath" value="/ignite/wal"/>
>>         <property name="walArchivePath" value="/ignite/walarchive"/>
>>         <property name="walMode" value="FSYNC"/>
>>     </bean>
>> </property>
>> <property name="failureHandler">
>>     <bean class="org.apache.ignite.failure.RestartProcessFailureHandler"/>
>> </property>
>>
>> When the query exceptions start, we get multiple waiting-thread entries
>> like this:
>>
>> Thread [name="main", id=1, state=WAITING, blockCnt=5, waitCnt=2636]
>>     Lock [object=java.util.concurrent.CountDownLatch$Sync@b027ad0, ownerName=null, ownerId=-1]
>>         at sun.misc.Unsafe.park(Native Method)
>>         at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
>>         at java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
>>         at java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:997)
>>         at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
>>         at java.util.concurrent.CountDownLatch.await(CountDownLatch.java:231)
>>         at o.a.i.startup.cmdline.CommandLineStartup.main(CommandLineStartup.java:398)
>> [14:25:07,980][SEVERE][disco-event-worker-#67][FailureProcessor] Ignite
>> node is in invalid state due to a critical failure.
>>
>> And then all the nodes crash.
>>
>> Please suggest if there is any config value we can change to terminate
>> long running queries.
>>
>> Thanks
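
On the question at the end of the quoted thread about a config value to terminate long running queries: a minimal sketch, assuming Ignite 2.9 or later, where SqlConfiguration has a defaultQueryTimeout property given in milliseconds. The 30-second value is only an illustration, not a recommendation, and this has not been tried against the cluster described above. It would sit next to the dataStorageConfiguration and failureHandler properties on the IgniteConfiguration bean:

```xml
<!-- Assumption: Ignite 2.9+; the 30000 ms value is illustrative only. -->
<property name="sqlConfiguration">
    <bean class="org.apache.ignite.configuration.SqlConfiguration">
        <!-- Default timeout for SQL queries, in milliseconds. -->
        <property name="defaultQueryTimeout" value="30000"/>
    </bean>
</property>
```

A limit can also be set per query from Java with SqlFieldsQuery.setTimeout(int, TimeUnit), which may be the safer first step if only a few queries are known to run long.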