[ 
https://issues.apache.org/jira/browse/CASSANDRA-2170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107233#comment-13107233
 ] 

Peter Schuller commented on CASSANDRA-2170:
-------------------------------------------

Wow, interesting. Are you sure it's 0.8.5 though? The stack trace is not 
matching what I see in the 0.8.5 tag (mismatched line number for 
MessagingService.addCallback()).

We've been seeing load spikes on 0.7, but havent reported it because it's such 
an old version. However we were never able to grab stacks because no JMX query 
would ever succeed during this condition.

The stack trace indicates it's stuck doing resize operations on the NBHM where 
each thread is trying to help the resizing operation along by performing 
potentially duplicate (for forward progress producing) work.

Do you have a list of all stacks? Do you find any thread (should be 0 or 1) 
that are executing in ExpiringMap.CacheMonitor.run() at the time of the load 
spikes?

I guess we're seeing some kind of fallen-and-cant-get-up senario having to do 
with the resize. Maybe dogpiling the resize is making it overall slow enough 
that it never gets unstuck without a temporary stop in incoming requests. Or 
some such. That's gut feely speculation without having actually looked at it 
carefully, so take it with a grain of salt :)


> Load spikes
> -----------
>
>                 Key: CASSANDRA-2170
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2170
>             Project: Cassandra
>          Issue Type: Bug
>    Affects Versions: 0.6.11
>            Reporter: Jonathan Ellis
>            Assignee: Brandon Williams
>
> as reported on CASSANDRA-2058, some users are still seeing load spikes on 
> 0.6.11, even with fairly low-volume read workloads.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to