Yes you will have all the data in two nodes provided there is no mutation drop at node level or data is repaired
For example if you data A,B,C and D. with RF=3 and 4 nodes (node1, node2, node3 and node4) Data A is in node1, node2 and node3 Data B is in node2, node3, and node4 Data C is in node3, node4 and node1 Data D is in node4, node1 and node2 With this configuration, any *two nodes combined* will give all the data. Regards Manish On Fri, Feb 7, 2020 at 12:53 AM Voytek Jarnot <voytek.jar...@gmail.com> wrote: > Been thinking about it, and I can't really see how with 4 nodes and RF=3, > any 2 nodes would *not* have all the data; but am more than willing to > learn. > > On the other thing: that's an attractive option, but in our case, the > target cluster will likely come into use before the source-cluster data is > available to load. Seemed to me the safest approach was sstableloader. > > Thanks > > On Wed, Feb 5, 2020 at 6:56 PM Erick Ramirez <flightc...@gmail.com> wrote: > >> Unfortunately, there isn't a guarantee that 2 nodes alone will have the >> full copy of data. I'd rather not say "it depends". 😁 >> >> TIP: If the nodes in the target cluster have identical tokens allocated, >> you can just do a straight copy of the sstables node-for-node then do >> nodetool >> refresh. If the target cluster is already built and you can't assign the >> same tokens then sstableloader is your only option. Cheers! >> >> P.S. No need to apologise for asking questions. That's what we're all >> here for. Just keep them coming. 👍 >> >>>