Hi. I try to understand tricks that I can use with the SSTables, for faster manipulation of datas in clusters.
I learn I how copy a keyspaces from data directories to a new node and change replicationfactor (thx Jonathan). If I understood, Each SSTable have 3 files : ColumnFamily-ID-Datas.db ColumnFamily-ID-Index.db ColumnFamily-ID-Filter.db If I want merge datas from 2 clusters, with differents keys (each key is only in one cluster) but with the same ColumnFamily. Can I copy all the files from SSTables with the same methode ? > 1. nodetool drain & stop original node > 2. copy everything ***files sstables*** in data/ directories (but not system > keyspace!) to new node > 3. restart and autobootstrap=false [the default] Thx. On Tue, Jun 8, 2010 at 7:12 AM, xavier manach <x...@tekio.org> wrote: > Hi. > > I have a cluster with only 1 node with a lot of datas (500 Go) . > I want add a new node with the same datas (with a ReplicationFactor > 2) > > The method normal is : > stop node. > add a node. > change replication factor to 2. > start nodes > use nodetool repair > > But , I didn't know if this other method is valid, and if it's can > be faster : > stop nodes. > copy all SSTables > change replication factor. > start nodes > and > use nodetool repair > > Have you an idea for the faster valid method ? > > Thx. >