Hi,
We have an application with ignite thick clients which writes to ignite
caches on ignite grid deployed separately. Below is the ignite
configuration per node

With this configuration, we see throttling happening and checkpointing time
is between 20-30 seconds. Did we miss something in configuration or any
other settings we can enable. Any suggestions will be of great help.

* 100-200 concurrent writes to 25 node cluster
* #partitions 512
* cache backups = 2
* cache mode partitioned
* syncronizationMode : primary Sync
* Off Heap caches
* Server nodes : 25
* RAM : 64G
* maxmemoryDirectSize :  4G
* Heap: 25G

* persistenceEnabled: true
* data region size : 24GB
* checkPointingBufferSize: 6gb
* walSegmentSize: 1G
* walBufferSize :  256MB
* walarchiveSize: 24G
* writeThrotlingEnabled: true
* checkPointingfreq :  60 sec
* checkPointingThreads: 4
* DirectIO enabled: true

SSDs atatched:
work volume : 20G
wal volume : 15G
Wal archive volume : 26G


Checkpointing logs:

[10:27:13,237][INFO][db-checkpoint-thread-#230][Checkpointer] Checkpoint
started [checkpointId=11749dc0-fd0d-4b5f-8b9a-510e774fec38,
startPtr=WALPointer [idx=26, fileOff=385214751, len=16683],
checkpointBeforeLockTime=29ms, checkpointLockWait=0ms,
checkpointListenersExecuteTime=2ms, checkpointLockHoldTime=3ms,
walCpRecordFsyncDuration=11ms, writeCheckpointEntryDuration=3ms,
splitAndSortCpPagesDuration=30ms, pages=40505, reason='timeout']
[10:27:13,242][INFO][sys-stripe-7-#8][PageMemoryImpl] Throttling is applied
to page modifications [percentOfPartTime=0.88, markDirty=2121 pages/sec,
checkpointWrite=1219 pages/sec, estIdealMarkDirty=0 pages/sec,
curDirty=0.00, maxDirty=0.02, avgParkTime=410172 ns, pages: (total=40505,
evicted=0, written=10, synced=0, cpBufUsed=133, cpBufTotal=1554645)]
[10:27:29,935][INFO][grid-timeout-worker-#30][IgniteKernal]
Metrics for local node (to disable set 'metricsLogFrequency' to 0)
    ^-- Node [id=214f3c2b, uptime=00:45:00.227]
    ^-- Cluster [hosts=45, CPUs=540, servers=25, clients=20, topVer=75,
minorTopVer=0]
    ^-- Network [addrs=[127.0.0.1, 192.168.98.141], discoPort=47500,
commPort=47100]
    ^-- CPU [CPUs=12, curLoad=3.67%, avgLoad=0.82%, GC=0%]
    ^-- Heap [used=5330MB, free=79.18%, comm=20480MB]
    ^-- Off-heap memory [used=1019MB, free=95.92%, allocated=24775MB]
    ^-- Page memory [pages=257976]
    ^--   sysMemPlc region [type=internal, persistence=true,
lazyAlloc=false,
      ...  initCfg=40MB, maxCfg=100MB, usedRam=0MB, freeRam=99.99%,
allocRam=99MB, allocTotal=0MB]
    ^--   default region [type=default, persistence=true, lazyAlloc=true,
      ...  initCfg=24576MB, maxCfg=24576MB, usedRam=1018MB, freeRam=95.86%,
allocRam=24576MB, allocTotal=3820MB]
    ^--   metastoreMemPlc region [type=internal, persistence=true,
lazyAlloc=false,
      ...  initCfg=40MB, maxCfg=100MB, usedRam=1MB, freeRam=98.78%,
allocRam=0MB, allocTotal=1MB]
    ^--   TxLog region [type=internal, persistence=true, lazyAlloc=false,
      ...  initCfg=40MB, maxCfg=100MB, usedRam=0MB, freeRam=100%,
allocRam=99MB, allocTotal=0MB]
    ^--   volatileDsMemPlc region [type=user, persistence=false,
lazyAlloc=true,
      ...  initCfg=40MB, maxCfg=100MB, usedRam=0MB, freeRam=100%,
allocRam=0MB]
    ^-- Ignite persistence [used=3821MB]
    ^-- Outbound messages queue [size=0]
    ^-- Public thread pool [active=0, idle=0, qSize=0]
    ^-- System thread pool [active=0, idle=7, qSize=0]
    ^-- Striped thread pool [active=0, idle=12, qSize=0]
[10:27:38,261][INFO][db-checkpoint-thread-#230][Checkpointer] Checkpoint
finished [cpId=11749dc0-fd0d-4b5f-8b9a-510e774fec38, pages=40505,
markPos=WALPointer [idx=26, fileOff=385214751, len=16683],
walSegmentsCovered=[], markDuration=47ms, pagesWrite=25018ms, fsync=6ms,
total=25100ms]

Reply via email to