Aaron,

It’s safe to preload the data on a changing cluster topology. Both data stream 
and CacheStore approaches handle this:
https://apacheignite.readme.io/docs/data-loading

During the rebalancing a node evicts data it’s no longer primary or backup one. 
You don’t need to worry about this, it’s Ignite’s job.

—
Denis M.

> On Sep 26, 2017, at 1:46 AM, [email protected] wrote:
> 
> Thanks Denis, 
> 
> The real tough issue is we not sure when the entire cluster may be ready, as 
> we may increase or decrease the nodes at run-time. 
> 
> Another question is , if I load the data once on first started node,  after 
> other nodes bring up, and after re-balance, will the primary nodes evict the 
> entries not below to it?
> 
> As we have regular aggregated run locally on each nodes, we do not want this 
> will be too heavy on the first node. 
> 
> 
> Regards
> Aaron
> [email protected] <mailto:[email protected]>
>  
> From: Denis Mekhanikov <mailto:[email protected]>
> Date: 2017-09-25 19:46
> To: user <mailto:[email protected]>
> Subject: Re: What's the best practice to init the cache when in cluster env?
> Hi Aaron!
> 
> There are two good options for data loading: using DataStreamer or 
> IgniteCache.loadCache(...) 
> <https://apacheignite.readme.io/docs/3rd-party-store#section-loadcache->. The 
> second option is good when initial data is stored in some database.
> 
> If you worry about overhead on data rebalancing, you can start the cluster 
> and start streaming data once all nodes are up. In this case records will 
> appear at their final destination at once, without need to move to other 
> nodes.
> 
> Denis
> 
> пн, 25 сент. 2017 г. в 14:31, [email protected] <mailto:[email protected]> 
> <[email protected] <mailto:[email protected]>>:
> hi All, 
> 
> If we have dozen of nodes to cache millions data from DB;
> 
> When init,  what's the best way to loading those data? we use the data 
> streamer to load data, while all our entry include a partition ID when insert 
> into DB. 
> 
> As the nodes are started one by one, if loading from one Node and then 
> re-balance this seems impossible & wasting. 
> 
> Not sure whether there any guideline or best practice/advice for such 
> scenario.
> 
> Thanks for our time!
> 
> 
> Regards
> Aaron
> [email protected] <mailto:[email protected]>

Reply via email to