> I copied all the snapshots from each individual nodes where the snapshot > data size was around 12Gb on each node to a common folder(one folder alone). > > Strangely I found duplicate file names in multiple snapshots and > more strangely the data size was different of each duplicate file which lead > to the total data size to close to 13Gb(else have to be overwritten) where > as the expectation was 12*6 = 72Gb.
You have detected via experimentation that the namespacing of sstable filenames per CF per node is not unique. In order to do the operation you are doing, you have to rename them to be globally unique. Just inflate the integer part is the easiest way. https://issues.apache.org/jira/browse/CASSANDRA-1983 =Rob -- =Robert Coli AIM>ALK - rc...@palominodb.com YAHOO - rcoli.palominob SKYPE - rcoli_palominodb