[jira] [Comment Edited] (CASSANDRA-6774) Cleanup fails with assertion error after stopping previous run

2014-03-17 Thread Marcus Eriksson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13937541#comment-13937541
 ] 

Marcus Eriksson edited comment on CASSANDRA-6774 at 3/17/14 8:31 AM:
-

hmm, i actually think we should throw an exception if we fail, so that the 
nodetool user gets some feedback without having to check logs, attached


was (Author: krummas):
hmm, i think we should throw an exception if we fail, so that the nodetool user 
gets some feedback without having to check logs, attached

> Cleanup fails with assertion error after stopping previous run
> --
>
> Key: CASSANDRA-6774
> URL: https://issues.apache.org/jira/browse/CASSANDRA-6774
> Project: Cassandra
>  Issue Type: Bug
>  Components: Core
> Environment: 2.0.5
>Reporter: Keith Wright
>Assignee: Marcus Eriksson
> Fix For: 2.0.7
>
> Attachments: 0001-6774-wip.patch, 
> 0001-Dont-continue-after-failing-to-cancel-in-progress-co.patch, 6774-v2.txt
>
>
> I am stress testing a new 2.0.5 cluster and did the following:
> - start decommission during heavy write, moderate read load
> - trigger cleanup on non-decommissioning node (nodetool cleanup)
> - Started to see higher GC load stop stopped cleanup via nodetool stop CLEANUP
> - attempt to launch cleanup now fails with the following message in console.
> Cassandra log shows: 
> http://aep.appspot.com/display/cKmlMcDuKD72iYAcBykDuVZkRWY/



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Comment Edited] (CASSANDRA-6774) Cleanup fails with assertion error after stopping previous run

2014-03-12 Thread Dmitrij Koniajev (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13931563#comment-13931563
 ] 

Dmitrij Koniajev edited comment on CASSANDRA-6774 at 3/12/14 6:36 PM:
--

Same issue here on LCS - after decreasing RF cleanup fails with the same 
stacktrace. Restart didn't help, after running "nodetool disableautocompaction" 
command and waiting for compaction get finished cleanup fails with the 
following stacktrace:
{quote}
Error occurred during cleanup
java.util.concurrent.ExecutionException: java.lang.IndexOutOfBoundsException: 
Index: 1, Size: 1
at java.util.concurrent.FutureTask.report(FutureTask.java:122)
at java.util.concurrent.FutureTask.get(FutureTask.java:188)
at 
org.apache.cassandra.db.compaction.CompactionManager.performAllSSTableOperation(CompactionManager.java:227)
at 
org.apache.cassandra.db.compaction.CompactionManager.performCleanup(CompactionManager.java:265)
at 
org.apache.cassandra.db.ColumnFamilyStore.forceCleanup(ColumnFamilyStore.java:1115)
at 
org.apache.cassandra.service.StorageService.forceKeyspaceCleanup(StorageService.java:2152)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at sun.reflect.misc.Trampoline.invoke(MethodUtil.java:75)
at sun.reflect.GeneratedMethodAccessor12.invoke(Unknown Source)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at sun.reflect.misc.MethodUtil.invoke(MethodUtil.java:279)
at 
com.sun.jmx.mbeanserver.StandardMBeanIntrospector.invokeM2(StandardMBeanIntrospector.java:112)
at 
com.sun.jmx.mbeanserver.StandardMBeanIntrospector.invokeM2(StandardMBeanIntrospector.java:46)
at 
com.sun.jmx.mbeanserver.MBeanIntrospector.invokeM(MBeanIntrospector.java:237)
at com.sun.jmx.mbeanserver.PerInterface.invoke(PerInterface.java:138)
at com.sun.jmx.mbeanserver.MBeanSupport.invoke(MBeanSupport.java:252)
at 
com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.invoke(DefaultMBeanServerInterceptor.java:819)
at 
com.sun.jmx.mbeanserver.JmxMBeanServer.invoke(JmxMBeanServer.java:801)
at 
javax.management.remote.rmi.RMIConnectionImpl.doOperation(RMIConnectionImpl.java:1487)
at 
javax.management.remote.rmi.RMIConnectionImpl.access$300(RMIConnectionImpl.java:97)
at 
javax.management.remote.rmi.RMIConnectionImpl$PrivilegedOperation.run(RMIConnectionImpl.java:1328)
at 
javax.management.remote.rmi.RMIConnectionImpl.doPrivilegedOperation(RMIConnectionImpl.java:1420)
at 
javax.management.remote.rmi.RMIConnectionImpl.invoke(RMIConnectionImpl.java:848)
at sun.reflect.GeneratedMethodAccessor20.invoke(Unknown Source)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at sun.rmi.server.UnicastServerRef.dispatch(UnicastServerRef.java:322)
at sun.rmi.transport.Transport$1.run(Transport.java:177)
at sun.rmi.transport.Transport$1.run(Transport.java:174)
at java.security.AccessController.doPrivileged(Native Method)
at sun.rmi.transport.Transport.serviceCall(Transport.java:173)
at 
sun.rmi.transport.tcp.TCPTransport.handleMessages(TCPTransport.java:556)
at 
sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(TCPTransport.java:811)
at 
sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(TCPTransport.java:670)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:744)
Caused by: java.lang.IndexOutOfBoundsException: Index: 1, Size: 1
at java.util.ArrayList.rangeCheck(ArrayList.java:635)
at java.util.ArrayList.get(ArrayList.java:411)
at 
org.apache.cassandra.db.compaction.CompactionManager.needsCleanup(CompactionManager.java:502)
at 
org.apache.cassandra.db.compaction.CompactionManager.doCleanupCompaction(CompactionManager.java:540)
at 
org.apache.cassandra.db.compaction.CompactionManager.access$400(CompactionManager.java:62)
at 
org.apache.cassandra.db.compaction.CompactionManager$5.perform(CompactionManager.java:274)
at 
org.apache.cassandra.db.compaction.CompactionManager$2.call(CompactionManager.java:222)
at java.util.concurrent.FutureTask.run(FutureTask.java:262)
... 3 more
{quot

[jira] [Comment Edited] (CASSANDRA-6774) Cleanup fails with assertion error after stopping previous run

2014-02-26 Thread Keith Wright (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13913311#comment-13913311
 ] 

Keith Wright edited comment on CASSANDRA-6774 at 2/26/14 6:55 PM:
--

Additional info:

cqlsh:system> select * from compactions_in_progress limit 10;

 id   | columnfamily_name  | inputs 


  | keyspace_name
--++--+---
 9a20d760-9f15-11e3-bfe7-fb46089edbb0 |global_user | {16796, 17068, 
17073, 17077, 17118, 17135, 17164, 17193, 17219, 17281, 17293, 17365, 17553, 
17575, 17606, 17668, 17695, 17701, 17906, 17983, 18016, 18017, 18069, 18089, 
18098, 18099, 18108, 18114, 18119, 18123, 18188, 18449} | users
 93778f70-9f16-11e3-bfe7-fb46089edbb0 | client_end_user_lookup |
 
{3280, 3281, 3283, 3284, 3286, 3293, 3297, 3298, 3301, 3302, 3303, 3306, 3323, 
3324, 3326, 3328, 3329, 3330, 3331, 3332, , 3334} | users
 96d23e10-9f14-11e3-bfe7-fb46089edbb0 | cookie_user_lookup |

 {9072, 9078, 9083, 9088, 9094, 9100, 9106, 9112, 9118, 
9124, 9131, 9146, 9369, 9370, 9371, 9372, 9373, 9393} | users

(3 rows)


was (Author: keithwrightbos):
Additional info:

cqlsh:system> select * from compactions_in_progress limit 10;

 id   | columnfamily_name  | inputs 


  | keyspace_name
--++--+---
 d56eeb80-9f16-11e3-8115-2981d742 | cookie_user_lookup |

{8203, 8206, 
9990, 9991, 9992, 9993, 9994, 9995, 9996, 10409} | users
 33807e50-9f17-11e3-8115-2981d742 |  shard_user_lookup |

   {6684, 
6687, 6690, 6693, 6697, 6700, 7109, 7110, 7486} | users
 576eefd0-9f13-11e3-8115-2981d742 |global_user | {17107, 17115, 
17123, 17132, 17142, 17150, 17209, 17218, 17409, 17417, 18276, 18322, 18334, 
18335, 18337, 18351, 18369, 18371, 18379, 18439, 18445, 18446, 18451, 19515, 
19519, 19538, 19555, 19580, 19624, 19753, 19754, 19755} | users

(3 rows)

[kwright@castest06 ~]$ nodetool compactionstats
pending tasks: 33
  compaction typekeyspace   table   completed   
total  unit  progress
   Compaction   users global_user 14478355576 
17165883943 bytes84.34%
   Compaction   usersshard_user_lookup   705875461  
3249369496 bytes21.72%
   Compaction   userscookie_user_lookup  1723865787 
 3489228069 bytes49.41%
Active compaction remaining time :n/a

> Cleanup fails with assertion error after stopping previous run
> --
>
> Key: CASSANDRA-6774
> URL: https://issues.apache.org/jira/browse/CASSANDRA-6774
> Project: Cassandra
>  Issue Type: Bug
>  Components: Core
> Environment: 2.0.5
>Reporter: Keith Wright
>
> I am stress testing a new 2.0.5 cluster and did the following:
> - start decommission during heavy write, moderate read load
> - trigger cleanup on non-decommissioning node (nodetool cleanup)
> - Started to see higher GC load stop stopped cleanup via nodetool stop CLEANUP
> - attempt to launch cleanup now fails with the following message in console.
> Cassandra log shows:
> WARN 18:45:38,420 Unable to cancel in-progress compactions for 
>