Re: Ignite Snapshots take minutes to complete

2022-05-31 Thread Surinder Mehra
Hey,
I have one more query on this one. Currently statefulset
"podManagementPolicy" is "OrderedReady". It means kubernetes will start one
pod at a time until it reaches the replica count given.
If we change  "podManagementPolicy" to "Parallel", all pods will start in
parallel and hence I can control init containers as intended to clean up
and copy from snapshot before any main container starts.

Question is, is it fine to start Ignite pods in parallel or should they
always start one after another. Documentation doesn't say anything about
this and I just want to make sure this has no side effects before I start
implementing my approach to restore data.

On Fri, May 27, 2022 at 11:33 PM Surinder Mehra  wrote:

> Hi Maxim,
> As I explained in original email, I do cleanup as part of init container.
> Since ignite nodes starts after one another, init container would also run
> in sequence.
> So when ignite node 1 completes startup(init container clean up work
> directory and copy data from snapshot) but node 2 is still running init
> container which might be still copying data. And node 3 since it is yet to
> start, it's wal, cp, and work directory hasn't been cleaned yet.
>
> So my question was, is there a way I can do cleanup of work directory  and
> copy from snapshot for all nodes before first ignite nodes starts.
>
> On Fri, 27 May 2022, 23:11 Maxim Muzafarov,  wrote:
>
>> Hello,
>>
>> If you're copying a snapshot part to the new node, then you have to be
>> sure that the /ignite/work/cp, /ignite/wal, /ignite/walarchive
>> directories are empty prior to the node start. Is it true for your
>> case?
>>
>> On Fri, 27 May 2022 at 10:29, Surinder Mehra  wrote:
>> >
>> > Hi,
>> > Please find ignite config and error log below
>> >
>> > config :
>> > 
>> > 
>> > > value="/opt/ignite/apache-ignite/config/ignite-log4j.xml"/>
>> > 
>> > 
>> > 
>> > 
>> > 
>> > 
>> > 
>> >
>> > 
>> > > class="org.apache.ignite.configuration.DataStorageConfiguration">
>> > 
>> > 
>> > 
>> > 
>> > 
>> > 
>> > > class="org.apache.ignite.configuration.DataRegionConfiguration">
>> > > value="true"/>
>> > 
>> > 
>> > 
>> > 
>> > 
>> > 
>> > > value="/ignite/walarchive"/>
>> > 
>> > 
>> >
>> >
>> > Error log:
>> >
>> > at
>> org.apache.ignite.internal.processors.cache.persistence.wal.FileWriteAheadLogManager$RecordsIterator.access$1000(FileWriteAheadLogManager.java:2763)
>> > at
>> org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.readMetastore(GridCacheDatabaseSharedManager.java:870)
>> > at
>> org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.notifyMetaStorageSubscribersOnReadyForRead(GridCacheDatabaseSharedManager.java:3200)
>> > at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1116)
>> > at
>> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1799)
>> > at
>> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1721)
>> > at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1160)
>> > at
>> org.apache.ignite.internal.IgnitionEx.startConfigurations(IgnitionEx.java:1054)
>> > at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:940)
>> > at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:839)
>> > at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:709)
>> > at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:678)
>> > at org.apache.ignite.Ignition.start(Ignition.java:353)
>> > ... 1 more
>> > Failed to start grid: WAL history is too short [descs=[FileDescriptor
>> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0060.wal,
>> idx=60], FileDescriptor
>> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0061.wal,
>> idx=61], FileDescriptor
>> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0062.wal,
>> idx=62], FileDescriptor
>> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0063.wal,
>> idx=63], FileDescriptor
>> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0064.wal,
>> idx=64], FileDescriptor
>> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0065.wal,
>> idx=65], FileDescriptor
>> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0066.wal,
>> idx=66], FileDescriptor
>> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0067.wal,
>> idx=67], FileDescriptor
>> [file=/ignite/walarchive

Re: Ignite Snapshots take minutes to complete

2022-05-27 Thread Surinder Mehra
Hi Maxim,
As I explained in original email, I do cleanup as part of init container.
Since ignite nodes starts after one another, init container would also run
in sequence.
So when ignite node 1 completes startup(init container clean up work
directory and copy data from snapshot) but node 2 is still running init
container which might be still copying data. And node 3 since it is yet to
start, it's wal, cp, and work directory hasn't been cleaned yet.

So my question was, is there a way I can do cleanup of work directory  and
copy from snapshot for all nodes before first ignite nodes starts.

On Fri, 27 May 2022, 23:11 Maxim Muzafarov,  wrote:

> Hello,
>
> If you're copying a snapshot part to the new node, then you have to be
> sure that the /ignite/work/cp, /ignite/wal, /ignite/walarchive
> directories are empty prior to the node start. Is it true for your
> case?
>
> On Fri, 27 May 2022 at 10:29, Surinder Mehra  wrote:
> >
> > Hi,
> > Please find ignite config and error log below
> >
> > config :
> > 
> > 
> >  value="/opt/ignite/apache-ignite/config/ignite-log4j.xml"/>
> > 
> > 
> > 
> > 
> > 
> > 
> > 
> >
> > 
> >  class="org.apache.ignite.configuration.DataStorageConfiguration">
> > 
> > 
> > 
> > 
> > 
> > 
> >  class="org.apache.ignite.configuration.DataRegionConfiguration">
> >  value="true"/>
> > 
> > 
> > 
> > 
> > 
> > 
> >  value="/ignite/walarchive"/>
> > 
> > 
> >
> >
> > Error log:
> >
> > at
> org.apache.ignite.internal.processors.cache.persistence.wal.FileWriteAheadLogManager$RecordsIterator.access$1000(FileWriteAheadLogManager.java:2763)
> > at
> org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.readMetastore(GridCacheDatabaseSharedManager.java:870)
> > at
> org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.notifyMetaStorageSubscribersOnReadyForRead(GridCacheDatabaseSharedManager.java:3200)
> > at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1116)
> > at
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1799)
> > at
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1721)
> > at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1160)
> > at
> org.apache.ignite.internal.IgnitionEx.startConfigurations(IgnitionEx.java:1054)
> > at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:940)
> > at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:839)
> > at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:709)
> > at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:678)
> > at org.apache.ignite.Ignition.start(Ignition.java:353)
> > ... 1 more
> > Failed to start grid: WAL history is too short [descs=[FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0060.wal,
> idx=60], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0061.wal,
> idx=61], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0062.wal,
> idx=62], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0063.wal,
> idx=63], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0064.wal,
> idx=64], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0065.wal,
> idx=65], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0066.wal,
> idx=66], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0067.wal,
> idx=67], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0068.wal,
> idx=68], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0069.wal,
> idx=69], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0070.wal,
> idx=70], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0071.wal,
> idx=71], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0072.wal,
> idx=72], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0073.wal,
> idx=73], FileDescriptor
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0074.wal,
> idx=74], FileDescriptor
> [file=/ignite/walarchive

Re: Ignite Snapshots take minutes to complete

2022-05-27 Thread Maxim Muzafarov
Hello,

If you're copying a snapshot part to the new node, then you have to be
sure that the /ignite/work/cp, /ignite/wal, /ignite/walarchive
directories are empty prior to the node start. Is it true for your
case?

On Fri, 27 May 2022 at 10:29, Surinder Mehra  wrote:
>
> Hi,
> Please find ignite config and error log below
>
> config :
> 
> 
>  value="/opt/ignite/apache-ignite/config/ignite-log4j.xml"/>
> 
> 
> 
> 
> 
> 
> 
>
> 
>  class="org.apache.ignite.configuration.DataStorageConfiguration">
> 
> 
> 
> 
> 
> 
>  class="org.apache.ignite.configuration.DataRegionConfiguration">
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
>
>
> Error log:
>
> at 
> org.apache.ignite.internal.processors.cache.persistence.wal.FileWriteAheadLogManager$RecordsIterator.access$1000(FileWriteAheadLogManager.java:2763)
> at 
> org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.readMetastore(GridCacheDatabaseSharedManager.java:870)
> at 
> org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.notifyMetaStorageSubscribersOnReadyForRead(GridCacheDatabaseSharedManager.java:3200)
> at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1116)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1799)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1721)
> at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1160)
> at 
> org.apache.ignite.internal.IgnitionEx.startConfigurations(IgnitionEx.java:1054)
> at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:940)
> at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:839)
> at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:709)
> at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:678)
> at org.apache.ignite.Ignition.start(Ignition.java:353)
> ... 1 more
> Failed to start grid: WAL history is too short [descs=[FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0060.wal,
>  idx=60], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0061.wal,
>  idx=61], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0062.wal,
>  idx=62], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0063.wal,
>  idx=63], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0064.wal,
>  idx=64], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0065.wal,
>  idx=65], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0066.wal,
>  idx=66], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0067.wal,
>  idx=67], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0068.wal,
>  idx=68], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0069.wal,
>  idx=69], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0070.wal,
>  idx=70], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0071.wal,
>  idx=71], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0072.wal,
>  idx=72], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0073.wal,
>  idx=73], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0074.wal,
>  idx=74], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0075.wal,
>  idx=75], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0076.wal,
>  idx=76], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0077.wal,
>  idx=77], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0078.wal,
>  idx=78], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0079.wal,
>  idx=79], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0080.wal,
>  idx=80], FileDescriptor 
> [file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0

Re: Ignite Snapshots take minutes to complete

2022-05-27 Thread Surinder Mehra
Hi,
Please find ignite config and error log below

config :
































Error log:

at
org.apache.ignite.internal.processors.cache.persistence.wal.FileWriteAheadLogManager$RecordsIterator.access$1000(FileWriteAheadLogManager.java:2763)
at
org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.readMetastore(GridCacheDatabaseSharedManager.java:870)
at
org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.notifyMetaStorageSubscribersOnReadyForRead(GridCacheDatabaseSharedManager.java:3200)
at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1116)
at
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1799)
at
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1721)
at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1160)
at
org.apache.ignite.internal.IgnitionEx.startConfigurations(IgnitionEx.java:1054)
at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:940)
at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:839)
at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:709)
at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:678)
at org.apache.ignite.Ignition.start(Ignition.java:353)
... 1 more
Failed to start grid: WAL history is too short [descs=[FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0060.wal,
idx=60], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0061.wal,
idx=61], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0062.wal,
idx=62], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0063.wal,
idx=63], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0064.wal,
idx=64], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0065.wal,
idx=65], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0066.wal,
idx=66], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0067.wal,
idx=67], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0068.wal,
idx=68], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0069.wal,
idx=69], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0070.wal,
idx=70], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0071.wal,
idx=71], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0072.wal,
idx=72], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0073.wal,
idx=73], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0074.wal,
idx=74], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0075.wal,
idx=75], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0076.wal,
idx=76], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0077.wal,
idx=77], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0078.wal,
idx=78], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0079.wal,
idx=79], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0080.wal,
idx=80], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0081.wal,
idx=81], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0082.wal,
idx=82], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0083.wal,
idx=83], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0084.wal,
idx=84], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0085.wal,
idx=85], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0086.wal,
idx=86], FileDescriptor
[file=/ignite/walarchive/node00-44a0ade8-60c2-4190-aac3-7fb465129efe/0087.wal,
idx=87

Re: Ignite Snapshots take minutes to complete

2022-05-26 Thread Николай Ижиков
Can you, please, send your config and full log file that contains error message.

> 26 мая 2022 г., в 17:50, Surinder Mehra  написал(а):
> 
> Hello,
> I upgraded to 2.13.0 and I am able to take sync snapshots now. However, I ran 
> into another problem while restoring from snapshot using manual steps 
> mentioned in documentation.
> 
> We run ignite statefulset on kubernetes cluster so when we scale it to N 
> nodes, it brings up one node at a time.
> 
> Now I am trying to attach init container which will copy /db directory from 
> snapshots to work directory after clearing db directory from work directory 
> and then start main container which runs ignite.
> 
> It works well on single node, it's able to start cluster with snapshot Data.
> 
> When I start multiple nodes, init container will run each one of those as 
> first step. Since nodes starts one at a time, it's runs into error saying 
> "too small WAL segments data" 
> 
> I suppose that could be because 2nd node is still in init step while first 
> one is in running mode. There are few which haven't started yet, waiting for 
> 2nd node to be in running state.
> 
> Any idea how can we make main containers wait until all init containers are 
> completed
> 
> Asking this here as its related to ignite setup in kubernetes. 
> 
> Any help wil be appreciated. Thanks
> 
> On Wed, 25 May 2022, 00:04 Surinder Mehra,  > wrote:
> Thanks a lot. I will try this.
> 
> On Tue, 24 May 2022, 23:50 Николай Ижиков,  > wrote:
> > Does it ensure consistency while copying data which is parallely getting 
> > updated by application writes
> 
> Yes.
> 
> From the documentation:
> 
> «An Ignite snapshot includes a consistent cluster-wide copy of all data 
> records persisted on disk and some other files needed for a restore 
> procedure.» 
> 
> > will this be a stop the world process
> 
> No.
> 
> 
>> 24 мая 2022 г., в 21:17, Surinder Mehra > > написал(а):
>> 
>> Hi 
>> Thanks for reply.
>> 
>> #1:  So it's not a stop the world task. Does it ensure consistency while 
>> copying data which is parallely getting updated by application writes. Or 
>> does it mark the data to copied and ignore further updates on it.
>> 
>> #2: 
>> I will try sync snapshot. But just to confirm, will this be a stop the world 
>> process. Couldn't find anything on Documentation page about it
>> 
>> On Tue, 24 May 2022, 23:12 Николай Ижиков, > > wrote:
>> Hello, Mehra.
>> 
>> > 1. Is it stop the world process.
>> 
>> No, you can perform any actions.
>> Note, topology changes will cancel snapshot create process.
>> 
>> > 2. If so, is it stop the world only during command execution (500millis) 
>> > or until snapshot Dara is fully copied(takes many minutes) to complete.
>> 
>> Please, take a look at `—sync` option of create snapshot command (you can 
>> see help in `control.sh` output).
>> `EVT_CLUSTER_SNAPSHOT_FINISHED` raise on snapshot create finish.
>> 
>> > 3. Is there a way around to speed up this other than increasing snapshot 
>> > threads
>> 
>> Stop write operations.
>> The less you change the quicker snapshot will be created.
>> 
>>> 24 мая 2022 г., в 20:12, Surinder Mehra >> > написал(а):
>>> 
>>> Hi,
>>> I have 3 node ignite cluster each node contains 60G work directory(ebs) and 
>>> I need to create snapshots.
>>> I followed steps to create snapshots and run create snapshot command using 
>>> control utility. Command completed in 500millis but snapshot directory only 
>>> had 400Mb data. Later I realised directory size grew up 30G. I suppose it 
>>> would reach size of work directory.
>>> 
>>> 
>>> I have few questions.
>>> 1. Is it stop the world process.
>>> 2. If so, is it stop the world only during command execution (500millis) or 
>>> until snapshot Dara is fully copied(takes many minutes) to complete.
>>> 3. Is there a way around to speed up this other than increasing snapshot 
>>> threads
>> 
> 



Re: Ignite Snapshots take minutes to complete

2022-05-26 Thread Surinder Mehra
Hello,
I upgraded to 2.13.0 and I am able to take sync snapshots now. However, I
ran into another problem while restoring from snapshot using manual steps
mentioned in documentation.

We run ignite statefulset on kubernetes cluster so when we scale it to N
nodes, it brings up one node at a time.

Now I am trying to attach init container which will copy /db directory from
snapshots to work directory after clearing db directory from work directory
and then start main container which runs ignite.

It works well on single node, it's able to start cluster with snapshot Data.

When I start multiple nodes, init container will run each one of those as
first step. Since nodes starts one at a time, it's runs into error saying
"too small WAL segments data"

I suppose that could be because 2nd node is still in init step while first
one is in running mode. There are few which haven't started yet, waiting
for 2nd node to be in running state.

Any idea how can we make main containers wait until all init containers are
completed

Asking this here as its related to ignite setup in kubernetes.

Any help wil be appreciated. Thanks

On Wed, 25 May 2022, 00:04 Surinder Mehra,  wrote:

> Thanks a lot. I will try this.
>
> On Tue, 24 May 2022, 23:50 Николай Ижиков,  wrote:
>
>> > Does it ensure consistency while copying data which is parallely
>> getting updated by application writes
>>
>> Yes.
>>
>> From the documentation:
>>
>> «An Ignite snapshot includes a consistent cluster-wide copy of all data
>> records persisted on disk and some other files needed for a restore
>> procedure.»
>>
>> > will this be a stop the world process
>>
>> No.
>>
>>
>> 24 мая 2022 г., в 21:17, Surinder Mehra  написал(а):
>>
>> Hi
>> Thanks for reply.
>>
>> #1:  So it's not a stop the world task. Does it ensure consistency while
>> copying data which is parallely getting updated by application writes. Or
>> does it mark the data to copied and ignore further updates on it.
>>
>> #2:
>> I will try sync snapshot. But just to confirm, will this be a stop the
>> world process. Couldn't find anything on Documentation page about it
>>
>> On Tue, 24 May 2022, 23:12 Николай Ижиков,  wrote:
>>
>>> Hello, Mehra.
>>>
>>> > 1. Is it stop the world process.
>>>
>>> No, you can perform any actions.
>>> Note, topology changes will cancel snapshot create process.
>>>
>>> > 2. If so, is it stop the world only during command execution
>>> (500millis) or until snapshot Dara is fully copied(takes many minutes) to
>>> complete.
>>>
>>> Please, take a look at `—sync` option of create snapshot command (you
>>> can see help in `control.sh` output).
>>> `EVT_CLUSTER_SNAPSHOT_FINISHED` raise on snapshot create finish.
>>>
>>> > 3. Is there a way around to speed up this other than increasing
>>> snapshot threads
>>>
>>> Stop write operations.
>>> The less you change the quicker snapshot will be created.
>>>
>>> 24 мая 2022 г., в 20:12, Surinder Mehra  написал(а):
>>>
>>> Hi,
>>> I have 3 node ignite cluster each node contains 60G work directory(ebs)
>>> and I need to create snapshots.
>>> I followed steps to create snapshots and run create snapshot command
>>> using control utility. Command completed in 500millis but snapshot
>>> directory only had 400Mb data. Later I realised directory size grew up 30G.
>>> I suppose it would reach size of work directory.
>>>
>>>
>>> I have few questions.
>>> 1. Is it stop the world process.
>>> 2. If so, is it stop the world only during command execution (500millis)
>>> or until snapshot Dara is fully copied(takes many minutes) to complete.
>>> 3. Is there a way around to speed up this other than increasing snapshot
>>> threads
>>>
>>>
>>>
>>


Re: Ignite Snapshots take minutes to complete

2022-05-24 Thread Surinder Mehra
Thanks a lot. I will try this.

On Tue, 24 May 2022, 23:50 Николай Ижиков,  wrote:

> > Does it ensure consistency while copying data which is parallely getting
> updated by application writes
>
> Yes.
>
> From the documentation:
>
> «An Ignite snapshot includes a consistent cluster-wide copy of all data
> records persisted on disk and some other files needed for a restore
> procedure.»
>
> > will this be a stop the world process
>
> No.
>
>
> 24 мая 2022 г., в 21:17, Surinder Mehra  написал(а):
>
> Hi
> Thanks for reply.
>
> #1:  So it's not a stop the world task. Does it ensure consistency while
> copying data which is parallely getting updated by application writes. Or
> does it mark the data to copied and ignore further updates on it.
>
> #2:
> I will try sync snapshot. But just to confirm, will this be a stop the
> world process. Couldn't find anything on Documentation page about it
>
> On Tue, 24 May 2022, 23:12 Николай Ижиков,  wrote:
>
>> Hello, Mehra.
>>
>> > 1. Is it stop the world process.
>>
>> No, you can perform any actions.
>> Note, topology changes will cancel snapshot create process.
>>
>> > 2. If so, is it stop the world only during command execution
>> (500millis) or until snapshot Dara is fully copied(takes many minutes) to
>> complete.
>>
>> Please, take a look at `—sync` option of create snapshot command (you can
>> see help in `control.sh` output).
>> `EVT_CLUSTER_SNAPSHOT_FINISHED` raise on snapshot create finish.
>>
>> > 3. Is there a way around to speed up this other than increasing
>> snapshot threads
>>
>> Stop write operations.
>> The less you change the quicker snapshot will be created.
>>
>> 24 мая 2022 г., в 20:12, Surinder Mehra  написал(а):
>>
>> Hi,
>> I have 3 node ignite cluster each node contains 60G work directory(ebs)
>> and I need to create snapshots.
>> I followed steps to create snapshots and run create snapshot command
>> using control utility. Command completed in 500millis but snapshot
>> directory only had 400Mb data. Later I realised directory size grew up 30G.
>> I suppose it would reach size of work directory.
>>
>>
>> I have few questions.
>> 1. Is it stop the world process.
>> 2. If so, is it stop the world only during command execution (500millis)
>> or until snapshot Dara is fully copied(takes many minutes) to complete.
>> 3. Is there a way around to speed up this other than increasing snapshot
>> threads
>>
>>
>>
>


Re: Ignite Snapshots take minutes to complete

2022-05-24 Thread Николай Ижиков
> Does it ensure consistency while copying data which is parallely getting 
> updated by application writes

Yes.

From the documentation:

«An Ignite snapshot includes a consistent cluster-wide copy of all data records 
persisted on disk and some other files needed for a restore procedure.» 

> will this be a stop the world process

No.


> 24 мая 2022 г., в 21:17, Surinder Mehra  написал(а):
> 
> Hi 
> Thanks for reply.
> 
> #1:  So it's not a stop the world task. Does it ensure consistency while 
> copying data which is parallely getting updated by application writes. Or 
> does it mark the data to copied and ignore further updates on it.
> 
> #2: 
> I will try sync snapshot. But just to confirm, will this be a stop the world 
> process. Couldn't find anything on Documentation page about it
> 
> On Tue, 24 May 2022, 23:12 Николай Ижиков,  > wrote:
> Hello, Mehra.
> 
> > 1. Is it stop the world process.
> 
> No, you can perform any actions.
> Note, topology changes will cancel snapshot create process.
> 
> > 2. If so, is it stop the world only during command execution (500millis) or 
> > until snapshot Dara is fully copied(takes many minutes) to complete.
> 
> Please, take a look at `—sync` option of create snapshot command (you can see 
> help in `control.sh` output).
> `EVT_CLUSTER_SNAPSHOT_FINISHED` raise on snapshot create finish.
> 
> > 3. Is there a way around to speed up this other than increasing snapshot 
> > threads
> 
> Stop write operations.
> The less you change the quicker snapshot will be created.
> 
>> 24 мая 2022 г., в 20:12, Surinder Mehra > > написал(а):
>> 
>> Hi,
>> I have 3 node ignite cluster each node contains 60G work directory(ebs) and 
>> I need to create snapshots.
>> I followed steps to create snapshots and run create snapshot command using 
>> control utility. Command completed in 500millis but snapshot directory only 
>> had 400Mb data. Later I realised directory size grew up 30G. I suppose it 
>> would reach size of work directory.
>> 
>> 
>> I have few questions.
>> 1. Is it stop the world process.
>> 2. If so, is it stop the world only during command execution (500millis) or 
>> until snapshot Dara is fully copied(takes many minutes) to complete.
>> 3. Is there a way around to speed up this other than increasing snapshot 
>> threads
> 



Re: Ignite Snapshots take minutes to complete

2022-05-24 Thread Surinder Mehra
Hi
Thanks for reply.

#1:  So it's not a stop the world task. Does it ensure consistency while
copying data which is parallely getting updated by application writes. Or
does it mark the data to copied and ignore further updates on it.

#2:
I will try sync snapshot. But just to confirm, will this be a stop the
world process. Couldn't find anything on Documentation page about it

On Tue, 24 May 2022, 23:12 Николай Ижиков,  wrote:

> Hello, Mehra.
>
> > 1. Is it stop the world process.
>
> No, you can perform any actions.
> Note, topology changes will cancel snapshot create process.
>
> > 2. If so, is it stop the world only during command execution (500millis)
> or until snapshot Dara is fully copied(takes many minutes) to complete.
>
> Please, take a look at `—sync` option of create snapshot command (you can
> see help in `control.sh` output).
> `EVT_CLUSTER_SNAPSHOT_FINISHED` raise on snapshot create finish.
>
> > 3. Is there a way around to speed up this other than increasing
> snapshot threads
>
> Stop write operations.
> The less you change the quicker snapshot will be created.
>
> 24 мая 2022 г., в 20:12, Surinder Mehra  написал(а):
>
> Hi,
> I have 3 node ignite cluster each node contains 60G work directory(ebs)
> and I need to create snapshots.
> I followed steps to create snapshots and run create snapshot command using
> control utility. Command completed in 500millis but snapshot directory only
> had 400Mb data. Later I realised directory size grew up 30G. I suppose it
> would reach size of work directory.
>
>
> I have few questions.
> 1. Is it stop the world process.
> 2. If so, is it stop the world only during command execution (500millis)
> or until snapshot Dara is fully copied(takes many minutes) to complete.
> 3. Is there a way around to speed up this other than increasing snapshot
> threads
>
>
>


Re: Ignite Snapshots take minutes to complete

2022-05-24 Thread Николай Ижиков
Hello, Mehra.

> 1. Is it stop the world process.

No, you can perform any actions.
Note, topology changes will cancel snapshot create process.

> 2. If so, is it stop the world only during command execution (500millis) or 
> until snapshot Dara is fully copied(takes many minutes) to complete.

Please, take a look at `—sync` option of create snapshot command (you can see 
help in `control.sh` output).
`EVT_CLUSTER_SNAPSHOT_FINISHED` raise on snapshot create finish.

> 3. Is there a way around to speed up this other than increasing snapshot 
> threads

Stop write operations.
The less you change the quicker snapshot will be created.

> 24 мая 2022 г., в 20:12, Surinder Mehra  написал(а):
> 
> Hi,
> I have 3 node ignite cluster each node contains 60G work directory(ebs) and I 
> need to create snapshots.
> I followed steps to create snapshots and run create snapshot command using 
> control utility. Command completed in 500millis but snapshot directory only 
> had 400Mb data. Later I realised directory size grew up 30G. I suppose it 
> would reach size of work directory.
> 
> 
> I have few questions.
> 1. Is it stop the world process.
> 2. If so, is it stop the world only during command execution (500millis) or 
> until snapshot Dara is fully copied(takes many minutes) to complete.
> 3. Is there a way around to speed up this other than increasing snapshot 
> threads