Re: Streaming performance with 1.2.6

2013-07-02 Thread sankalp kohli
I don't know much about streaming with vnodes, but you might be hitting this: https://issues.apache.org/jira/browse/CASSANDRA-4650 On Tue, Jul 2, 2013 at 12:43 PM, Mike Heffner wrote: > As a test, adding a 7th node in the first AZ will stream from both the two > existing nodes in the same AZ. > > Ag

Re: Streaming performance with 1.2.6

2013-07-02 Thread Mike Heffner
As a test, adding a 7th node in the first AZ streams from both of the two existing nodes in the same AZ. Aggregate streaming bandwidth at the 7th node is approximately 12 MB/sec when all limits are set at 800 MB/sec, or about double what I saw streaming from a single node. This would seem to indi
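
To put the figures in this message in proportion, here is a minimal sketch of the arithmetic. It uses only the numbers quoted above (about 12 MB/sec aggregate, two source nodes, limits of 800 MB/sec as written); the variable names are illustrative, and the even split between sources is an assumption, not something the message states.

```python
# Figures quoted in the message above; the unit of the limit is taken as written.
aggregate_mb_per_sec = 12.0   # approximate rate observed at the 7th node
source_nodes = 2              # both existing nodes in the same AZ
limit_mb_per_sec = 800.0      # configured limit as stated in the message

# Assumption: the two sources contribute roughly equally.
per_source = aggregate_mb_per_sec / source_nodes
fraction_of_limit = aggregate_mb_per_sec / limit_mb_per_sec

print(f"per-source rate: {per_source:.1f} MB/sec")
print(f"fraction of configured limit: {fraction_of_limit:.1%}")
```

Under these assumptions each source contributes roughly 6 MB/sec, and the aggregate is about 1.5% of the configured limit.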

Re: Streaming performance with 1.2.6

2013-07-02 Thread Mike Heffner
Sankalp, Parallel sstableloader streaming would definitely be valuable. However, this ring is currently using vnodes and I was surprised to see that a bootstrapping node only streamed from one node in the ring. My understanding was that a bootstrapping node would stream from multiple nodes in the

Re: Streaming performance with 1.2.6

2013-07-02 Thread sankalp kohli
This was a problem pre-vnodes. I filed several JIRA tickets for that, but some of them were voted down on the grounds that performance would improve with vnodes. The main problem is that it streams one sstable at a time and not in parallel. JIRA 4784 can speed up the bootstrap performance. You can also do a zero copy

Re: Streaming performance with 1.2.6

2013-07-02 Thread Mike Heffner
On Mon, Jul 1, 2013 at 10:06 PM, Mike Heffner wrote: > > The only changes we've made to the config (aside from dirs/hosts) are: > I forgot to mention that we've changed this as well: -partitioner: org.apache.cassandra.dht.Murmur3Partitioner +partitioner: org.apache.cassandra.dht.RandomPartitioner Ch

Streaming performance with 1.2.6

2013-07-01 Thread Mike Heffner
Hi, We've recently been testing some of the higher performance instance classes on EC2, specifically the hi1.4xlarge, with Cassandra. For those who are not familiar with them, they have two SSD disks and 10 GigE networking. While we have observed much improved raw performance over our current instances, we