Hi
I deployed 5 node Ignite 2.9.0 on k8s with below configuration
Total RAM per instance 64 GB
JVM 32 GB
Default data region 12 GB
Persistence storage 500GB volume
WAL + WAL archive 30 GB volume
After this I started ingesting data to 3 tables created, the data ingestion
is using basic JDBC batch insertions.
After around 14 hours it generated around 100GB of persistence data on each
node in 3 tables (each having backup of 1).
But suddenly 2 PODs crashed and when I check the logs, *there was errors
which says no space left on the storage volume* configured for WAL+WAL
archive.
I'm not sure what exactly caused this issue, but i couldn't recover from
this POD crash on K8S as i cannot expand the volume attached to ignite PODS.
The only operation I did when pods crashed was select count(*) from table;
and there were around 21 crore records in that table.
Does WAL archive is needed ? how I can avoid these kind of issues, which end
up cluster in unusable state.
Your help is greatly appreciated
Thank you
Shiva
--
Sent from: http://apache-ignite-users.70518.x6.nabble.com/