[jira] [Commented] (CASSANDRA-5272) Hinted Handoff Throttle based on cluster size

2013-05-29 Thread Jonathan Ellis (JIRA)

[ https://issues.apache.org/jira/browse/CASSANDRA-5272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13669639#comment-13669639 ]

Jonathan Ellis commented on CASSANDRA-5272:
---

I dunno, then you have to worry about flooding again with edge cases like 
CASSANDRA-3533.  I'll go ahead and commit this and we can try to be more 
sophisticated later if warranted.

> Hinted Handoff Throttle based on cluster size
> -
>
> Key: CASSANDRA-5272
> URL: https://issues.apache.org/jira/browse/CASSANDRA-5272
> Project: Cassandra
> Issue Type: Improvement
> Components: Core
> Affects Versions: 1.2.0
> Reporter: Rick Branson
> Assignee: Jonathan Ellis
> Priority: Minor
> Labels: lhf
> Fix For: 1.2.6
>
> Attachments: 5272.txt
>
>
> For a 12-node EC2 m1.xlarge cluster, restarting a node causes it to get 
> completely overloaded with the default 2-thread, 1024KB setting in 1.2.x. 
> This seemed to be a smaller problem when it was 6 nodes, but still required 
> us to abort handoffs. The old defaults in 1.1.x were WAY more conservative. 
> I've dropped this way down to 128KB on our production cluster which is really 
> conservative, but appears to have solved it. The default seems way too high 
> on any cluster that is non-trivial in size.
> After putting some thought to this, it seems that this should really be based 
> on cluster size, making the throttle a "target" for how much write load a 
> single node can swallow. As the cluster grows, the amount of hints that can 
> be delivered by each other node in the cluster goes down, so the throttle 
> should self-adjust to take that into account.
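
The self-adjusting throttle described above can be sketched roughly as follows. This is only an illustration of the idea, not the attached patch: the function name per_sender_throttle_kb and its parameters are hypothetical, and it assumes every other node in the cluster may be delivering hints to the restarted node at the same time.

```python
def per_sender_throttle_kb(target_kb_per_sec: int, cluster_size: int) -> int:
    """Hypothetical helper: split a target hint-delivery rate across senders.

    target_kb_per_sec is the total hint write load one recovering node
    should have to absorb; cluster_size is the total number of nodes.
    """
    # Every node except the recovering one is a potential hint sender.
    # Clamp to 1 so a single-node or two-node cluster keeps the full rate.
    senders = max(1, cluster_size - 1)
    # Never throttle all the way down to zero.
    return max(1, target_kb_per_sec // senders)


# Per-sender rate for the cluster sizes mentioned in the report.
for size in (2, 6, 12):
    print(size, per_sender_throttle_kb(1024, size))
```

With a 1024KB target, this gives each sender 204KB in a 6-node cluster and 93KB in a 12-node cluster, so the aggregate load on the restarted node stays roughly constant as the cluster grows.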

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CASSANDRA-5272) Hinted Handoff Throttle based on cluster size

2013-05-28 Thread Brandon Williams (JIRA)

[ https://issues.apache.org/jira/browse/CASSANDRA-5272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13668609#comment-13668609 ]

Brandon Williams commented on CASSANDRA-5272:
-

Would it be better perhaps to throttle by the number of live nodes?  It won't 
always be accurate, but if you throttle by total cluster size and lose a WAN 
link between DCs, you'd be throttling a lot more than needed at the time.
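
To make the trade-off concrete, here is a small sketch comparing the two divisors. The numbers are assumptions chosen for illustration (a 12-node cluster split evenly across two DCs, with the WAN link down so only the 6 local nodes are seen as live), and throttle_kb is a hypothetical helper, not a Cassandra API.

```python
def throttle_kb(target_kb_per_sec: int, sender_count: int) -> int:
    """Hypothetical helper: per-sender rate for a given number of senders."""
    return max(1, target_kb_per_sec // max(1, sender_count))


TARGET_KB = 1024   # assumed target rate for one recovering node
total_nodes = 12   # assumed: two DCs of 6 nodes each
live_nodes = 6     # assumed: only the local DC is reachable

# Dividing by total cluster size counts senders that cannot reach us.
by_total = throttle_kb(TARGET_KB, total_nodes - 1)
# Dividing by live nodes counts only senders that can deliver right now.
by_live = throttle_kb(TARGET_KB, live_nodes - 1)

print(by_total, by_live)
```

Under these assumptions the total-size divisor limits each sender to 93KB while the live-node divisor allows 204KB. The live-node figure is only as good as the current view of which nodes are up, which is where the flooding concern comes in.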



[jira] [Commented] (CASSANDRA-5272) Hinted Handoff Throttle based on cluster size

2013-05-28 Thread Rick Branson (JIRA)

[ https://issues.apache.org/jira/browse/CASSANDRA-5272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13668584#comment-13668584 ]

Rick Branson commented on CASSANDRA-5272:
-

There's a small spelling error "eeach" in the cassandra.yaml. Also, should we 
add a line to NEWS to describe this behavior since it's going to change the 
setting in place? (i.e. we will need to bump our throttle up a bunch when 
rolling this out).



[jira] [Commented] (CASSANDRA-5272) Hinted Handoff Throttle based on cluster size

2013-05-22 Thread Jonathan Ellis (JIRA)

[ https://issues.apache.org/jira/browse/CASSANDRA-5272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13664482#comment-13664482 ]

Jonathan Ellis commented on CASSANDRA-5272:
---

Correction; patch is against trunk but should be trivial to retrofit if 
necessary.
