[jira] [Commented] (CASSANDRA-10714) tcp retransmission issue seen in cassandra cluster

2016-05-03 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-10714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15269110#comment-15269110
 ] 

Jeff Liu commented on CASSANDRA-10714:
--

[~JoshuaMcKenzie], Josh, I think it's related to the aws network. I'm seeing it 
on other hosts as well.

> tcp retransmission issue seen in cassandra cluster
> --
>
> Key: CASSANDRA-10714
> URL: https://issues.apache.org/jira/browse/CASSANDRA-10714
> Project: Cassandra
>  Issue Type: Bug
>  Components: Streaming and Messaging
>Reporter: Jeff Liu
>
> I have been seen tcp package retransmission issue in various stacks in our 
> environment. ( AWS with no VPC). I'm currently using hsha rpc server type on 
> cassandra 2.1.6 version. The information captured by wireshark shows that the 
> retransmission happened both between client-to-server and server-to-server. 
> Even within a cluster that doesn't have any client traffic, sporadic 
> retransmission still happens.
> It's pretty easy to reproduce this issue by watching "netstat -s | grep 
> retrans". 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9625) GraphiteReporter not reporting

2016-02-22 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15157368#comment-15157368
 ] 

Jeff Liu commented on CASSANDRA-9625:
-

I'm also running a test for 2.1.13. Will keep you all posted.

> GraphiteReporter not reporting
> --
>
> Key: CASSANDRA-9625
> URL: https://issues.apache.org/jira/browse/CASSANDRA-9625
> Project: Cassandra
>  Issue Type: Bug
> Environment: Debian Jessie, 7u79-2.5.5-1~deb8u1, Cassandra 2.1.3
>Reporter: Eric Evans
>Assignee: T Jake Luciani
> Attachments: metrics.yaml, thread-dump.log
>
>
> When upgrading from 2.1.3 to 2.1.6, the Graphite metrics reporter stops 
> working.  The usual startup is logged, and one batch of samples is sent, but 
> the reporting interval comes and goes, and no other samples are ever sent.  
> The logs are free from errors.
> Frustratingly, metrics reporting works in our smaller (staging) environment 
> on 2.1.6; We are able to reproduce this on all 6 of production nodes, but not 
> on a 3 node (otherwise identical) staging cluster (maybe it takes a certain 
> level of concurrency?).
> Attached is a thread dump, and our metrics.yaml.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9625) GraphiteReporter not reporting

2016-02-09 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15139488#comment-15139488
 ] 

Jeff Liu commented on CASSANDRA-9625:
-

anyone has update on this ticket ? is it resolved on any later version?

> GraphiteReporter not reporting
> --
>
> Key: CASSANDRA-9625
> URL: https://issues.apache.org/jira/browse/CASSANDRA-9625
> Project: Cassandra
>  Issue Type: Bug
> Environment: Debian Jessie, 7u79-2.5.5-1~deb8u1, Cassandra 2.1.3
>Reporter: Eric Evans
> Attachments: metrics.yaml, thread-dump.log
>
>
> When upgrading from 2.1.3 to 2.1.6, the Graphite metrics reporter stops 
> working.  The usual startup is logged, and one batch of samples is sent, but 
> the reporting interval comes and goes, and no other samples are ever sent.  
> The logs are free from errors.
> Frustratingly, metrics reporting works in our smaller (staging) environment 
> on 2.1.6; We are able to reproduce this on all 6 of production nodes, but not 
> on a 3 node (otherwise identical) staging cluster (maybe it takes a certain 
> level of concurrency?).
> Attached is a thread dump, and our metrics.yaml.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9945) Add transparent data encryption core classes

2016-02-09 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15139291#comment-15139291
 ] 

Jeff Liu commented on CASSANDRA-9945:
-

[~jasobrown], should I expect it to be ready in 3.4 version?

> Add transparent data encryption core classes
> 
>
> Key: CASSANDRA-9945
> URL: https://issues.apache.org/jira/browse/CASSANDRA-9945
> Project: Cassandra
>  Issue Type: Improvement
>Reporter: Jason Brown
>Assignee: Jason Brown
>  Labels: encryption
> Fix For: 3.2
>
>
> This patch will add the core infrastructure classes necessary for transparent 
> data encryption (file-level encryption), as required for CASSANDRA-6018 and 
> CASSANDRA-9633.  The phrase "transparent data encryption", while not the most 
> aesthetically pleasing, seems to be used throughout the database industry 
> (Oracle, SQLQServer, Datastax Enterprise) to describe file level encryption, 
> so we'll go with that, as well. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9945) Add transparent data encryption core classes

2016-02-08 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15137372#comment-15137372
 ] 

Jeff Liu commented on CASSANDRA-9945:
-

Thanks, [~jjordan], do you know when should I expect the transparent encryption 
to be fully available? we have some business requirement that comes in later 
this year that requires encryption for data at rest. I'm trying to align up the 
schedule.

> Add transparent data encryption core classes
> 
>
> Key: CASSANDRA-9945
> URL: https://issues.apache.org/jira/browse/CASSANDRA-9945
> Project: Cassandra
>  Issue Type: Improvement
>Reporter: Jason Brown
>Assignee: Jason Brown
>  Labels: encryption
> Fix For: 3.2
>
>
> This patch will add the core infrastructure classes necessary for transparent 
> data encryption (file-level encryption), as required for CASSANDRA-6018 and 
> CASSANDRA-9633.  The phrase "transparent data encryption", while not the most 
> aesthetically pleasing, seems to be used throughout the database industry 
> (Oracle, SQLQServer, Datastax Enterprise) to describe file level encryption, 
> so we'll go with that, as well. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-9945) Add transparent data encryption core classes

2016-02-05 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135221#comment-15135221
 ] 

Jeff Liu edited comment on CASSANDRA-9945 at 2/5/16 11:09 PM:
--

[~jasobrown] With this ticket being resolved, does it mean we can do 
transparent data encryption in 3.2 as what we can do with datastax enterprise 
version? More specifically, can I run the following command to encrypt data?

ALTER TABLE dept
WITH compression_parameters:sstable_compression = 'Encryptor'
AND compression_parameters:cipher_algorithm = 'AES/ECB/PKCS5Padding'
AND compression_parameters:secret_key_strength = 128; 


Thanks,
Jeff


was (Author: jeffl):
[~jasobrown] With this ticket resolved, does it mean we can do transparent data 
encryption in 3.2 as what we can do with datastax enterprise version? More 
specifically, can I run the following command to encrypt data?

ALTER TABLE dept
WITH compression_parameters:sstable_compression = 'Encryptor'
AND compression_parameters:cipher_algorithm = 'AES/ECB/PKCS5Padding'
AND compression_parameters:secret_key_strength = 128; 


Thanks,
Jeff

> Add transparent data encryption core classes
> 
>
> Key: CASSANDRA-9945
> URL: https://issues.apache.org/jira/browse/CASSANDRA-9945
> Project: Cassandra
>  Issue Type: Improvement
>Reporter: Jason Brown
>Assignee: Jason Brown
>  Labels: encryption
> Fix For: 3.2
>
>
> This patch will add the core infrastructure classes necessary for transparent 
> data encryption (file-level encryption), as required for CASSANDRA-6018 and 
> CASSANDRA-9633.  The phrase "transparent data encryption", while not the most 
> aesthetically pleasing, seems to be used throughout the database industry 
> (Oracle, SQLQServer, Datastax Enterprise) to describe file level encryption, 
> so we'll go with that, as well. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-9945) Add transparent data encryption core classes

2016-02-05 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135221#comment-15135221
 ] 

Jeff Liu edited comment on CASSANDRA-9945 at 2/5/16 11:09 PM:
--

[~jasobrown] With this ticket resolved, does it mean we can do transparent data 
encryption in 3.2 as what we can do with datastax enterprise version? More 
specifically, can I run the following command to encrypt data?

ALTER TABLE dept
WITH compression_parameters:sstable_compression = 'Encryptor'
AND compression_parameters:cipher_algorithm = 'AES/ECB/PKCS5Padding'
AND compression_parameters:secret_key_strength = 128; 


Thanks,
Jeff


was (Author: jeffl):
With this ticket resolved, does it mean we can do transparent data encryption 
in 3.2 as what we can do with datastax enterprise version? More specifically, 
can I run the following command to encrypt data?

ALTER TABLE dept
WITH compression_parameters:sstable_compression = 'Encryptor'
AND compression_parameters:cipher_algorithm = 'AES/ECB/PKCS5Padding'
AND compression_parameters:secret_key_strength = 128; 


Thanks,
Jeff

> Add transparent data encryption core classes
> 
>
> Key: CASSANDRA-9945
> URL: https://issues.apache.org/jira/browse/CASSANDRA-9945
> Project: Cassandra
>  Issue Type: Improvement
>Reporter: Jason Brown
>Assignee: Jason Brown
>  Labels: encryption
> Fix For: 3.2
>
>
> This patch will add the core infrastructure classes necessary for transparent 
> data encryption (file-level encryption), as required for CASSANDRA-6018 and 
> CASSANDRA-9633.  The phrase "transparent data encryption", while not the most 
> aesthetically pleasing, seems to be used throughout the database industry 
> (Oracle, SQLQServer, Datastax Enterprise) to describe file level encryption, 
> so we'll go with that, as well. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9945) Add transparent data encryption core classes

2016-02-05 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135221#comment-15135221
 ] 

Jeff Liu commented on CASSANDRA-9945:
-

With this ticket resolved, does it mean we can do transparent data encryption 
in 3.2 as what we can do with datastax enterprise version? More specifically, 
can I run the following command to encrypt data?

ALTER TABLE dept
WITH compression_parameters:sstable_compression = 'Encryptor'
AND compression_parameters:cipher_algorithm = 'AES/ECB/PKCS5Padding'
AND compression_parameters:secret_key_strength = 128; 


Thanks,
Jeff

> Add transparent data encryption core classes
> 
>
> Key: CASSANDRA-9945
> URL: https://issues.apache.org/jira/browse/CASSANDRA-9945
> Project: Cassandra
>  Issue Type: Improvement
>Reporter: Jason Brown
>Assignee: Jason Brown
>  Labels: encryption
> Fix For: 3.2
>
>
> This patch will add the core infrastructure classes necessary for transparent 
> data encryption (file-level encryption), as required for CASSANDRA-6018 and 
> CASSANDRA-9633.  The phrase "transparent data encryption", while not the most 
> aesthetically pleasing, seems to be used throughout the database industry 
> (Oracle, SQLQServer, Datastax Enterprise) to describe file level encryption, 
> so we'll go with that, as well. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-10714) tcp retransmission issue seen in cassandra cluster

2015-11-17 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-10714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15009217#comment-15009217
 ] 

Jeff Liu commented on CASSANDRA-10714:
--

[~mshuler], I checked the NIC status and it shows errors:0 dropped:0 overruns:0 
frame:0 on both rx and tx. I even increased the tx queue size to 5000 from 
default 1000 without any improvement. Specifically for linux networking 
settings, I also increased both net.core.rmem, net.core.wmem,, 
net.ipv4.tcp_rmem, net.ipv4.tcp_wmem to 32M. increasd 
net.core.netdev_max_backlog to 3 with no luck to eliminate the 
retransmission. 

> tcp retransmission issue seen in cassandra cluster
> --
>
> Key: CASSANDRA-10714
> URL: https://issues.apache.org/jira/browse/CASSANDRA-10714
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Streaming and Messaging
>Reporter: Jeff Liu
>
> I have been seen tcp package retransmission issue in various stacks in our 
> environment. ( AWS with no VPC). I'm currently using hsha rpc server type on 
> cassandra 2.1.6 version. The information captured by wireshark shows that the 
> retransmission happened both between client-to-server and server-to-server. 
> Even within a cluster that doesn't have any client traffic, sporadic 
> retransmission still happens.
> It's pretty easy to reproduce this issue by watching "netstat -s | grep 
> retrans". 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CASSANDRA-10714) tcp retransmission issue seen in cassandra cluster

2015-11-16 Thread Jeff Liu (JIRA)
Jeff Liu created CASSANDRA-10714:


 Summary: tcp retransmission issue seen in cassandra cluster
 Key: CASSANDRA-10714
 URL: https://issues.apache.org/jira/browse/CASSANDRA-10714
 Project: Cassandra
  Issue Type: Improvement
  Components: Streaming and Messaging
Reporter: Jeff Liu


I have been seen tcp package retransmission issue in various stacks in our 
environment. ( AWS with no VPC). I'm currently using hsha rpc server type on 
cassandra 2.1.6 version. The information captured by wireshark shows that the 
retransmission happened both between client-to-server and server-to-server. 
Even within a cluster that doesn't have any client traffic, sporadic 
retransmission still happens.

It's pretty easy to reproduce this issue by watching "netstat -s | grep 
retrans". 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-10713) Gate keeper to do rate limiter and qps cap

2015-11-16 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-10713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-10713:
-
Description: 
As cassandra becomes the primary noSQL data store in more and more production 
environment, have we thought about adding gate keeper features like rate 
limiter and qps cap, which will give a greater integration experience when 
implementing cassandra in a large services infrastructure.

Reliability has become more and more important in those days for service 
providers. cassandra, together with other SQL or noSQL solutions, provides the 
data layers that power the complete application stack. In today's distributed 
system frameworks, dependences across application layers have been largely 
reduced, however, as the central data repository, data store system sits on the 
critical path, and become more fragile especially for the real-time, large 
scale systems. Companies like Twitter, facebook has built gate keeper 
internally to protect their data store systems, what should we do for cassandra?

Thoughts?



  was:
As cassandra becomes the primary noSQL data store in more and more production 
environment, have we thought about adding gate keeper features like rate 
limiter and qps cap, which will give a greater integration experience when 
implementing cassandra in a large services infrastructure.

Thoughts?


> Gate keeper to do rate limiter and qps cap
> --
>
> Key: CASSANDRA-10713
> URL: https://issues.apache.org/jira/browse/CASSANDRA-10713
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Configuration
>Reporter: Jeff Liu
>
> As cassandra becomes the primary noSQL data store in more and more production 
> environment, have we thought about adding gate keeper features like rate 
> limiter and qps cap, which will give a greater integration experience when 
> implementing cassandra in a large services infrastructure.
> Reliability has become more and more important in those days for service 
> providers. cassandra, together with other SQL or noSQL solutions, provides 
> the data layers that power the complete application stack. In today's 
> distributed system frameworks, dependences across application layers have 
> been largely reduced, however, as the central data repository, data store 
> system sits on the critical path, and become more fragile especially for the 
> real-time, large scale systems. Companies like Twitter, facebook has built 
> gate keeper internally to protect their data store systems, what should we do 
> for cassandra?
> Thoughts?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CASSANDRA-10713) Gate keeper to do rate limiter and qps cap

2015-11-16 Thread Jeff Liu (JIRA)
Jeff Liu created CASSANDRA-10713:


 Summary: Gate keeper to do rate limiter and qps cap
 Key: CASSANDRA-10713
 URL: https://issues.apache.org/jira/browse/CASSANDRA-10713
 Project: Cassandra
  Issue Type: Improvement
  Components: Configuration
Reporter: Jeff Liu


As cassandra becomes the primary noSQL data store in more and more production 
environment, have we thought about adding gate keeper features like rate 
limiter and qps cap, which will give a greater integration experience when 
implementing cassandra in a large services infrastructure.

Thoughts?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CASSANDRA-9804) Periodically Client Timeout Seen in Cassandra 1.2.9

2015-07-14 Thread Jeff Liu (JIRA)
Jeff Liu created CASSANDRA-9804:
---

 Summary: Periodically Client Timeout Seen in Cassandra 1.2.9
 Key: CASSANDRA-9804
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9804
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Reporter: Jeff Liu


We are seeing some Timout exception from various clients that connect to 
cassandra in our environment. It happens very randomly at a frequency about one 
very short 2 mini-seconds in around 36 hours. 
The TimeoutException from client side looks like the following:
{noformat}
WARNING impl.Slf4jConnectionPoolMonitorImpl [tid-394] 
[request_id=a8f48aa0-2a35-11e5-bce6-22000b4b0a4b] : OperationTimeoutException: 
[host=10.136.6.98(10.136.6.98):9160, latency=10001(10001), 
attempts=1]TimedOutException()
{noformat}

Checked system.log on server side and no log entries at all the time duration 
when the client timeout happened.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-05-06 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14530863#comment-14530863
 ] 

Jeff Liu commented on CASSANDRA-8723:
-

Just returned from vacation. 
We have been running with 2.0.12 quite stable while waiting for 2.1.3. I'll 
schedule an upgrade sometime soon. 

Thanks,

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.x
>
> Attachments: cassandra.yaml
>
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar:
>  -XX:HeapDumpPath=/var/lib/cassandra/java_1421511248.hprof 
> -XX:ErrorFile=/var/lib/cassandra/hs_err_1421511248.log 
> org.apache.cassandra.service.CassandraDaemon
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8899) cqlsh - not able to get row count with select(*) for large table

2015-04-20 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14503876#comment-14503876
 ] 

Jeff Liu commented on CASSANDRA-8899:
-

Yes, [~blerer]. The problem is worked around by increasing timeout value.

> cqlsh - not able to get row count with select(*) for large table
> 
>
> Key: CASSANDRA-8899
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8899
> Project: Cassandra
>  Issue Type: Bug
> Environment: Cassandra 2.1.2 ubuntu12.04
>Reporter: Jeff Liu
>Assignee: Benjamin Lerer
>
>  I'm getting errors when running a query that looks at a large number of rows.
> {noformat}
> cqlsh:events> select count(*) from catalog;
>  count
> ---
>  1
> (1 rows)
> cqlsh:events> select count(*) from catalog limit 11000;
>  count
> ---
>  11000
> (1 rows)
> cqlsh:events> select count(*) from catalog limit 5;
> errors={}, last_host=127.0.0.1
> cqlsh:events> 
> {noformat}
> We are not able to make the select * query to get row count.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces return java.lang.RuntimeException: Could not create snapshot

2015-03-18 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14367499#comment-14367499
 ] 

Jeff Liu commented on CASSANDRA-8696:
-

Unfortunately I had to rebuild the cluster and keyspace and I don't have an 
environment handy to debug this issue. [~ranrub] you asked about the syntax to 
enable DEBUG in logback.xml.  Do you still have the environment ready to debug? 
Thanks!

> nodetool repair on cassandra 2.1.2 keyspaces return 
> java.lang.RuntimeException: Could not create snapshot
> -
>
> Key: CASSANDRA-8696
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> When trying to run nodetool repair -pr on cassandra node ( 2.1.2), cassandra 
> throw java exceptions: cannot create snapshot. 
> the error log from system.log:
> {noformat}
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
> StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
> files(632105 bytes)
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> Session with /10.97.9.110 is complete
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
> [Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
> Repair
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
> StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Starting streaming to /10.66.187.201
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
> StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
> ID#0] Beginning stream session with /10.66.187.201
> INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
> StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
> files(632105 bytes)
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
> StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Session with /10.66.187.201 is complete
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamResultFuture.java:212 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> ERROR [RepairJobTask:1] 2015-01-28 02:07:39,444 RepairJob.java:127 - Error 
> occurred during snapshot phase
> java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
> at 
> org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
> ~[na:1.7.0_45]
> at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
> ~[na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  [na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  [na:1.7.0_45]
> at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
> INFO  [AntiEntropySessions:6] 2015-01-28 02:07:39,445 RepairSession.java:260 
> - [repair #6f85e740-a692-11e4-9973-070e938df227] new session: will sync 
> /10.98.194.68, /10.66.187.201, /10.226.218.135 on range 
> (12817179804668051873746972069086
> 2638799,12863540308359254031520865977436165] for events.[bigint0text, 
> bigint0boolean, bigint0int, dataset_catalog, column_categories, 
> bigint0double, bigint0bigint]
> ERROR [AntiEntropySessions:5] 2015-01-28 02:07:39,445 RepairSession.java:303 
> - [repair #685e3d00-a692-11e4-9973-070e938df227] session completed with the 
> following error
> java.io.IOException: Failed during snapshot creation.
> at 
> org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.repair.RepairJob$2.onFailure(R

[jira] [Commented] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces return java.lang.RuntimeException: Could not create snapshot

2015-03-18 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14367470#comment-14367470
 ] 

Jeff Liu commented on CASSANDRA-8696:
-

[~benedict] and team, there have been several people reporting the same issue 
since I created the ticket. Any progress on reproducing and identifying the 
root cause? Thanks.

> nodetool repair on cassandra 2.1.2 keyspaces return 
> java.lang.RuntimeException: Could not create snapshot
> -
>
> Key: CASSANDRA-8696
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> When trying to run nodetool repair -pr on cassandra node ( 2.1.2), cassandra 
> throw java exceptions: cannot create snapshot. 
> the error log from system.log:
> {noformat}
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
> StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
> files(632105 bytes)
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> Session with /10.97.9.110 is complete
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
> [Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
> Repair
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
> StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Starting streaming to /10.66.187.201
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
> StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
> ID#0] Beginning stream session with /10.66.187.201
> INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
> StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
> files(632105 bytes)
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
> StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Session with /10.66.187.201 is complete
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamResultFuture.java:212 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> ERROR [RepairJobTask:1] 2015-01-28 02:07:39,444 RepairJob.java:127 - Error 
> occurred during snapshot phase
> java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
> at 
> org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
> ~[na:1.7.0_45]
> at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
> ~[na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  [na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  [na:1.7.0_45]
> at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
> INFO  [AntiEntropySessions:6] 2015-01-28 02:07:39,445 RepairSession.java:260 
> - [repair #6f85e740-a692-11e4-9973-070e938df227] new session: will sync 
> /10.98.194.68, /10.66.187.201, /10.226.218.135 on range 
> (12817179804668051873746972069086
> 2638799,12863540308359254031520865977436165] for events.[bigint0text, 
> bigint0boolean, bigint0int, dataset_catalog, column_categories, 
> bigint0double, bigint0bigint]
> ERROR [AntiEntropySessions:5] 2015-01-28 02:07:39,445 RepairSession.java:303 
> - [repair #685e3d00-a692-11e4-9973-070e938df227] session completed with the 
> following error
> java.io.IOException: Failed during snapshot creation.
> at 
> org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at

[jira] [Resolved] (CASSANDRA-8968) Cassandra cqlsh query return different result randomly

2015-03-13 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu resolved CASSANDRA-8968.
-
Resolution: Fixed

> Cassandra cqlsh query return different result randomly
> --
>
> Key: CASSANDRA-8968
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8968
> Project: Cassandra
>  Issue Type: Wish
> Environment: Cassandra 2.0
>Reporter: Jeff Liu
>
> Noticed that a select query in cqlsh returns different result randomly. It 
> would be nice to get consistent result when same query is performed.
> {noformat}
> cqlsh> select * from cass_dc.cass_dc ;
>  key | value
> -+---
>c |   ccc
>b |   bbb
> Tracing session: d20490f0-c9ac-11e4-a527-23b8e6fcc12b
>  activity| timestamp| 
> source   | source_elapsed
> -+--+--+
>   execute_cql3_query | 18:14:42,305 | 
> 54.92.168.12 |  0
>  Parsing select * from cass_dc.cass_dc  LIMIT 1; | 18:14:42,306 | 
> 54.92.168.12 |855
>  Preparing statement | 18:14:42,307 | 
> 54.92.168.12 |   1950
>Determining replicas to query | 18:14:42,307 | 
> 54.92.168.12 |   2101
>  Message received from /x.x.x.12 | 18:14:42,308 | 
> 54.82.42.121 | 56
>   Enqueuing request to /x.x.x.121 | 18:14:42,308 | 
> 54.92.168.12 |   2685
> Sending message to /x.x.x.121 | 18:14:42,308 | 
> 54.92.168.12 |   2825
>  Executing seq scan across 0 sstables for [min(-1), min(-1)] | 18:14:42,309 | 
> 54.82.42.121 |556
>   Read 1 live and 0 tombstoned cells | 18:14:42,309 | 
> 54.82.42.121 |868
>   Read 1 live and 0 tombstoned cells | 18:14:42,309 | 
> 54.82.42.121 |956
> Scanned 2 rows and matched 2 | 18:14:42,309 | 
> 54.82.42.121 |989
>  Enqueuing response to /x.x.x.12 | 18:14:42,309 | 
> 54.82.42.121 |   1007
> Sending message to /x.x.x.12 | 18:14:42,310 | 
> 54.82.42.121 |   1287
>  Message received from /x.x.x.121 | 18:14:42,319 | 
> 54.92.168.12 |  13656
>   Processing response from /x.x.x.121 | 18:14:42,319 | 
> 54.92.168.12 |  14133
> Request complete | 18:14:42,319 | 
> 54.92.168.12 |  14808
> cqlsh> select * from cass_dc.cass_dc ;
>  key | value
> -+---
>a |   aaa
>c |   ccc
>b |   bbb
> Tracing session: d4ecbcc0-c9ac-11e4-a527-23b8e6fcc12b
>  activity| timestamp| 
> source   | source_elapsed
> -+--+--+
>   execute_cql3_query | 18:14:47,180 | 
> 54.92.168.12 |  0
>  Parsing select * from cass_dc.cass_dc  LIMIT 1; | 18:14:47,180 | 
> 54.92.168.12 | 81
>  Preparing statement | 18:14:47,181 | 
> 54.92.168.12 |224
>Determining replicas to query | 18:14:47,181 | 
> 54.92.168.12 |383
>  Executing seq scan across 0 sstables for [min(-1), min(-1)] | 18:14:47,184 | 
> 54.92.168.12 |   3611
>   Read 1 live and 0 tombstoned cells | 18:14:47,184 | 
> 54.92.168.12 |   4073
>   Read 1 live and 0 tombstoned cells | 18:14:47,185 | 
> 54.92.168.12 |   4239
>   Read 1 live and 0 tombstoned cells | 18:14:47,185 | 
> 54.92.168.12 |   4559
> Scanned 3 rows and matched 3 | 18:14:47,185 | 
> 54.92.168.12 |   4601
> Request complete | 18:14:47,185 | 
> 54.92.168.12 |   5812
> cqlsh> select * from cass_dc.cass_dc ;
>  key | value
> -+---
>a |   aaa
>c |   ccc
>b |   bbb
> Tracing session: 10247f30-c9ad-11e4-a527-23b8e6fcc12b
>  activity| timestamp| 
> source   | source_elapsed
> -+--+--+
>   execute_cql3_query | 18:16:26,531 | 
> 54.92.168.12 |  0
>  Parsing select * from cass_dc.cass_dc 

[jira] [Commented] (CASSANDRA-8968) Cassandra cqlsh query return different result randomly

2015-03-13 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14360948#comment-14360948
 ] 

Jeff Liu commented on CASSANDRA-8968:
-

That works.  Thanks.

> Cassandra cqlsh query return different result randomly
> --
>
> Key: CASSANDRA-8968
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8968
> Project: Cassandra
>  Issue Type: Wish
> Environment: Cassandra 2.0
>Reporter: Jeff Liu
>
> Noticed that a select query in cqlsh returns different result randomly. It 
> would be nice to get consistent result when same query is performed.
> {noformat}
> cqlsh> select * from cass_dc.cass_dc ;
>  key | value
> -+---
>c |   ccc
>b |   bbb
> Tracing session: d20490f0-c9ac-11e4-a527-23b8e6fcc12b
>  activity| timestamp| 
> source   | source_elapsed
> -+--+--+
>   execute_cql3_query | 18:14:42,305 | 
> 54.92.168.12 |  0
>  Parsing select * from cass_dc.cass_dc  LIMIT 1; | 18:14:42,306 | 
> 54.92.168.12 |855
>  Preparing statement | 18:14:42,307 | 
> 54.92.168.12 |   1950
>Determining replicas to query | 18:14:42,307 | 
> 54.92.168.12 |   2101
>  Message received from /x.x.x.12 | 18:14:42,308 | 
> 54.82.42.121 | 56
>   Enqueuing request to /x.x.x.121 | 18:14:42,308 | 
> 54.92.168.12 |   2685
> Sending message to /x.x.x.121 | 18:14:42,308 | 
> 54.92.168.12 |   2825
>  Executing seq scan across 0 sstables for [min(-1), min(-1)] | 18:14:42,309 | 
> 54.82.42.121 |556
>   Read 1 live and 0 tombstoned cells | 18:14:42,309 | 
> 54.82.42.121 |868
>   Read 1 live and 0 tombstoned cells | 18:14:42,309 | 
> 54.82.42.121 |956
> Scanned 2 rows and matched 2 | 18:14:42,309 | 
> 54.82.42.121 |989
>  Enqueuing response to /x.x.x.12 | 18:14:42,309 | 
> 54.82.42.121 |   1007
> Sending message to /x.x.x.12 | 18:14:42,310 | 
> 54.82.42.121 |   1287
>  Message received from /x.x.x.121 | 18:14:42,319 | 
> 54.92.168.12 |  13656
>   Processing response from /x.x.x.121 | 18:14:42,319 | 
> 54.92.168.12 |  14133
> Request complete | 18:14:42,319 | 
> 54.92.168.12 |  14808
> cqlsh> select * from cass_dc.cass_dc ;
>  key | value
> -+---
>a |   aaa
>c |   ccc
>b |   bbb
> Tracing session: d4ecbcc0-c9ac-11e4-a527-23b8e6fcc12b
>  activity| timestamp| 
> source   | source_elapsed
> -+--+--+
>   execute_cql3_query | 18:14:47,180 | 
> 54.92.168.12 |  0
>  Parsing select * from cass_dc.cass_dc  LIMIT 1; | 18:14:47,180 | 
> 54.92.168.12 | 81
>  Preparing statement | 18:14:47,181 | 
> 54.92.168.12 |224
>Determining replicas to query | 18:14:47,181 | 
> 54.92.168.12 |383
>  Executing seq scan across 0 sstables for [min(-1), min(-1)] | 18:14:47,184 | 
> 54.92.168.12 |   3611
>   Read 1 live and 0 tombstoned cells | 18:14:47,184 | 
> 54.92.168.12 |   4073
>   Read 1 live and 0 tombstoned cells | 18:14:47,185 | 
> 54.92.168.12 |   4239
>   Read 1 live and 0 tombstoned cells | 18:14:47,185 | 
> 54.92.168.12 |   4559
> Scanned 3 rows and matched 3 | 18:14:47,185 | 
> 54.92.168.12 |   4601
> Request complete | 18:14:47,185 | 
> 54.92.168.12 |   5812
> cqlsh> select * from cass_dc.cass_dc ;
>  key | value
> -+---
>a |   aaa
>c |   ccc
>b |   bbb
> Tracing session: 10247f30-c9ad-11e4-a527-23b8e6fcc12b
>  activity| timestamp| 
> source   | source_elapsed
> -+--+--+
>   execute_cql3_query | 18:16:26,531 | 
> 54.92.168.12 |  

[jira] [Updated] (CASSANDRA-8968) Cassandra cqlsh query return different result randomly

2015-03-13 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8968:

Description: 
Noticed that a select query in cqlsh returns different result randomly. It 
would be nice to get consistent result when same query is performed.

{noformat}

cqlsh> select * from cass_dc.cass_dc ;

 key | value
-+---
   c |   ccc
   b |   bbb


Tracing session: d20490f0-c9ac-11e4-a527-23b8e6fcc12b

 activity| timestamp| 
source   | source_elapsed
-+--+--+
  execute_cql3_query | 18:14:42,305 | 
54.92.168.12 |  0
 Parsing select * from cass_dc.cass_dc  LIMIT 1; | 18:14:42,306 | 
54.92.168.12 |855
 Preparing statement | 18:14:42,307 | 
54.92.168.12 |   1950
   Determining replicas to query | 18:14:42,307 | 
54.92.168.12 |   2101
 Message received from /x.x.x.12 | 18:14:42,308 | 
54.82.42.121 | 56
  Enqueuing request to /x.x.x.121 | 18:14:42,308 | 
54.92.168.12 |   2685
Sending message to /x.x.x.121 | 18:14:42,308 | 
54.92.168.12 |   2825
 Executing seq scan across 0 sstables for [min(-1), min(-1)] | 18:14:42,309 | 
54.82.42.121 |556
  Read 1 live and 0 tombstoned cells | 18:14:42,309 | 
54.82.42.121 |868
  Read 1 live and 0 tombstoned cells | 18:14:42,309 | 
54.82.42.121 |956
Scanned 2 rows and matched 2 | 18:14:42,309 | 
54.82.42.121 |989
 Enqueuing response to /x.x.x.12 | 18:14:42,309 | 
54.82.42.121 |   1007
Sending message to /x.x.x.12 | 18:14:42,310 | 
54.82.42.121 |   1287
 Message received from /x.x.x.121 | 18:14:42,319 | 
54.92.168.12 |  13656
  Processing response from /x.x.x.121 | 18:14:42,319 | 
54.92.168.12 |  14133
Request complete | 18:14:42,319 | 
54.92.168.12 |  14808

cqlsh> select * from cass_dc.cass_dc ;

 key | value
-+---
   a |   aaa
   c |   ccc
   b |   bbb


Tracing session: d4ecbcc0-c9ac-11e4-a527-23b8e6fcc12b

 activity| timestamp| 
source   | source_elapsed
-+--+--+
  execute_cql3_query | 18:14:47,180 | 
54.92.168.12 |  0
 Parsing select * from cass_dc.cass_dc  LIMIT 1; | 18:14:47,180 | 
54.92.168.12 | 81
 Preparing statement | 18:14:47,181 | 
54.92.168.12 |224
   Determining replicas to query | 18:14:47,181 | 
54.92.168.12 |383
 Executing seq scan across 0 sstables for [min(-1), min(-1)] | 18:14:47,184 | 
54.92.168.12 |   3611
  Read 1 live and 0 tombstoned cells | 18:14:47,184 | 
54.92.168.12 |   4073
  Read 1 live and 0 tombstoned cells | 18:14:47,185 | 
54.92.168.12 |   4239
  Read 1 live and 0 tombstoned cells | 18:14:47,185 | 
54.92.168.12 |   4559
Scanned 3 rows and matched 3 | 18:14:47,185 | 
54.92.168.12 |   4601
Request complete | 18:14:47,185 | 
54.92.168.12 |   5812

cqlsh> select * from cass_dc.cass_dc ;

 key | value
-+---
   a |   aaa
   c |   ccc
   b |   bbb


Tracing session: 10247f30-c9ad-11e4-a527-23b8e6fcc12b

 activity| timestamp| 
source   | source_elapsed
-+--+--+
  execute_cql3_query | 18:16:26,531 | 
54.92.168.12 |  0
 Parsing select * from cass_dc.cass_dc  LIMIT 1; | 18:16:26,531 | 
54.92.168.12 |116
 Preparing statement | 18:16:26,531 | 
54.92.168.12 |237
   Determining replicas to query | 18:16:26,531 | 
54.92.168.12 |386
 Executing seq scan across 0 sstables for [min(-1), min(-1)] | 18:16:26,532 | 
54.92.168.12 |   1280
  Read 1 live and 0 tombstoned cells | 18:16:26,533 | 
54.92.168.12 |   1447
  Rea

[jira] [Created] (CASSANDRA-8968) Cassandra cqlsh query return different result randomly

2015-03-13 Thread Jeff Liu (JIRA)
Jeff Liu created CASSANDRA-8968:
---

 Summary: Cassandra cqlsh query return different result randomly
 Key: CASSANDRA-8968
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8968
 Project: Cassandra
  Issue Type: Wish
 Environment: Cassandra 2.0
Reporter: Jeff Liu


Noticed that a select query in cqlsh returns different result randomly. It 
would be nice to get consistent result when same query is performed.

{noformat}

cqlsh> select * from cass_dc.cass_dc ;

 key | value
-+---
   c |   ccc
   b |   bbb


Tracing session: d20490f0-c9ac-11e4-a527-23b8e6fcc12b

 activity| timestamp| 
source   | source_elapsed
-+--+--+
  execute_cql3_query | 18:14:42,305 | 
54.92.168.12 |  0
 Parsing select * from cass_dc.cass_dc  LIMIT 1; | 18:14:42,306 | 
54.92.168.12 |855
 Preparing statement | 18:14:42,307 | 
54.92.168.12 |   1950
   Determining replicas to query | 18:14:42,307 | 
54.92.168.12 |   2101
 Message received from /54.92.168.12 | 18:14:42,308 | 
54.82.42.121 | 56
  Enqueuing request to /54.82.42.121 | 18:14:42,308 | 
54.92.168.12 |   2685
Sending message to /54.82.42.121 | 18:14:42,308 | 
54.92.168.12 |   2825
 Executing seq scan across 0 sstables for [min(-1), min(-1)] | 18:14:42,309 | 
54.82.42.121 |556
  Read 1 live and 0 tombstoned cells | 18:14:42,309 | 
54.82.42.121 |868
  Read 1 live and 0 tombstoned cells | 18:14:42,309 | 
54.82.42.121 |956
Scanned 2 rows and matched 2 | 18:14:42,309 | 
54.82.42.121 |989
 Enqueuing response to /54.92.168.12 | 18:14:42,309 | 
54.82.42.121 |   1007
Sending message to /54.92.168.12 | 18:14:42,310 | 
54.82.42.121 |   1287
 Message received from /54.82.42.121 | 18:14:42,319 | 
54.92.168.12 |  13656
  Processing response from /54.82.42.121 | 18:14:42,319 | 
54.92.168.12 |  14133
Request complete | 18:14:42,319 | 
54.92.168.12 |  14808

cqlsh> select * from cass_dc.cass_dc ;

 key | value
-+---
   a |   aaa
   c |   ccc
   b |   bbb


Tracing session: d4ecbcc0-c9ac-11e4-a527-23b8e6fcc12b

 activity| timestamp| 
source   | source_elapsed
-+--+--+
  execute_cql3_query | 18:14:47,180 | 
54.92.168.12 |  0
 Parsing select * from cass_dc.cass_dc  LIMIT 1; | 18:14:47,180 | 
54.92.168.12 | 81
 Preparing statement | 18:14:47,181 | 
54.92.168.12 |224
   Determining replicas to query | 18:14:47,181 | 
54.92.168.12 |383
 Executing seq scan across 0 sstables for [min(-1), min(-1)] | 18:14:47,184 | 
54.92.168.12 |   3611
  Read 1 live and 0 tombstoned cells | 18:14:47,184 | 
54.92.168.12 |   4073
  Read 1 live and 0 tombstoned cells | 18:14:47,185 | 
54.92.168.12 |   4239
  Read 1 live and 0 tombstoned cells | 18:14:47,185 | 
54.92.168.12 |   4559
Scanned 3 rows and matched 3 | 18:14:47,185 | 
54.92.168.12 |   4601
Request complete | 18:14:47,185 | 
54.92.168.12 |   5812

cqlsh> select * from cass_dc.cass_dc ;

 key | value
-+---
   a |   aaa
   c |   ccc
   b |   bbb


Tracing session: 10247f30-c9ad-11e4-a527-23b8e6fcc12b

 activity| timestamp| 
source   | source_elapsed
-+--+--+
  execute_cql3_query | 18:16:26,531 | 
54.92.168.12 |  0
 Parsing select * from cass_dc.cass_dc  LIMIT 1; | 18:16:26,531 | 
54.92.168.12 |116
 Preparing statement | 18:16:26,531 | 
54.92.168.12 |237
   Determining replicas to query | 18:16:26,531 | 
54.92.168.12 |386
 Executing seq scan across 0 sstables for [min(-1), 

[jira] [Created] (CASSANDRA-8899) cqlsh not able to get row count with select(*) with large table

2015-03-03 Thread Jeff Liu (JIRA)
Jeff Liu created CASSANDRA-8899:
---

 Summary: cqlsh not able to get row count with select(*) with large 
table
 Key: CASSANDRA-8899
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8899
 Project: Cassandra
  Issue Type: Bug
 Environment: Cassandra 2.1.2 ubuntu12.04
Reporter: Jeff Liu


 I'm getting errors when running a query that looks at a large number of rows.
{noformat}
cqlsh:events> select count(*) from catalog;

 count
---
 1

(1 rows)

cqlsh:events> select count(*) from catalog limit 11000;

 count
---
 11000

(1 rows)

cqlsh:events> select count(*) from catalog limit 5;
errors={}, last_host=127.0.0.1
cqlsh:events> 
{noformat}

We don't make queries w/o a WHERE clause in Chisel itself but I can't validate 
the correct number of rows are being inserted into the table.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8899) cqlsh - not able to get row count with select(*) for large table

2015-03-03 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8899:

Description: 
 I'm getting errors when running a query that looks at a large number of rows.
{noformat}
cqlsh:events> select count(*) from catalog;

 count
---
 1

(1 rows)

cqlsh:events> select count(*) from catalog limit 11000;

 count
---
 11000

(1 rows)

cqlsh:events> select count(*) from catalog limit 5;
errors={}, last_host=127.0.0.1
cqlsh:events> 
{noformat}

We are not able to make the select * query to get row count.

  was:
 I'm getting errors when running a query that looks at a large number of rows.
{noformat}
cqlsh:events> select count(*) from catalog;

 count
---
 1

(1 rows)

cqlsh:events> select count(*) from catalog limit 11000;

 count
---
 11000

(1 rows)

cqlsh:events> select count(*) from catalog limit 5;
errors={}, last_host=127.0.0.1
cqlsh:events> 
{noformat}

We are not able to make the select(*) query to get row count.


> cqlsh - not able to get row count with select(*) for large table
> 
>
> Key: CASSANDRA-8899
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8899
> Project: Cassandra
>  Issue Type: Bug
> Environment: Cassandra 2.1.2 ubuntu12.04
>Reporter: Jeff Liu
>
>  I'm getting errors when running a query that looks at a large number of rows.
> {noformat}
> cqlsh:events> select count(*) from catalog;
>  count
> ---
>  1
> (1 rows)
> cqlsh:events> select count(*) from catalog limit 11000;
>  count
> ---
>  11000
> (1 rows)
> cqlsh:events> select count(*) from catalog limit 5;
> errors={}, last_host=127.0.0.1
> cqlsh:events> 
> {noformat}
> We are not able to make the select * query to get row count.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8899) cqlsh - not able to get row count with select(*) for large table

2015-03-03 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8899:

Description: 
 I'm getting errors when running a query that looks at a large number of rows.
{noformat}
cqlsh:events> select count(*) from catalog;

 count
---
 1

(1 rows)

cqlsh:events> select count(*) from catalog limit 11000;

 count
---
 11000

(1 rows)

cqlsh:events> select count(*) from catalog limit 5;
errors={}, last_host=127.0.0.1
cqlsh:events> 
{noformat}

We are not able to make the select(*) query to get row count.

  was:
 I'm getting errors when running a query that looks at a large number of rows.
{noformat}
cqlsh:events> select count(*) from catalog;

 count
---
 1

(1 rows)

cqlsh:events> select count(*) from catalog limit 11000;

 count
---
 11000

(1 rows)

cqlsh:events> select count(*) from catalog limit 5;
errors={}, last_host=127.0.0.1
cqlsh:events> 
{noformat}

We don't make queries w/o a WHERE clause in Chisel itself but I can't validate 
the correct number of rows are being inserted into the table.


> cqlsh - not able to get row count with select(*) for large table
> 
>
> Key: CASSANDRA-8899
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8899
> Project: Cassandra
>  Issue Type: Bug
> Environment: Cassandra 2.1.2 ubuntu12.04
>Reporter: Jeff Liu
>
>  I'm getting errors when running a query that looks at a large number of rows.
> {noformat}
> cqlsh:events> select count(*) from catalog;
>  count
> ---
>  1
> (1 rows)
> cqlsh:events> select count(*) from catalog limit 11000;
>  count
> ---
>  11000
> (1 rows)
> cqlsh:events> select count(*) from catalog limit 5;
> errors={}, last_host=127.0.0.1
> cqlsh:events> 
> {noformat}
> We are not able to make the select(*) query to get row count.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8899) cqlsh - not able to get row count with select(*) with large table

2015-03-03 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8899:

Summary: cqlsh - not able to get row count with select(*) with large table  
(was: cqlsh not able to get row count with select(*) with large table)

> cqlsh - not able to get row count with select(*) with large table
> -
>
> Key: CASSANDRA-8899
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8899
> Project: Cassandra
>  Issue Type: Bug
> Environment: Cassandra 2.1.2 ubuntu12.04
>Reporter: Jeff Liu
>
>  I'm getting errors when running a query that looks at a large number of rows.
> {noformat}
> cqlsh:events> select count(*) from catalog;
>  count
> ---
>  1
> (1 rows)
> cqlsh:events> select count(*) from catalog limit 11000;
>  count
> ---
>  11000
> (1 rows)
> cqlsh:events> select count(*) from catalog limit 5;
> errors={}, last_host=127.0.0.1
> cqlsh:events> 
> {noformat}
> We don't make queries w/o a WHERE clause in Chisel itself but I can't 
> validate the correct number of rows are being inserted into the table.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8899) cqlsh - not able to get row count with select(*) for large table

2015-03-03 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8899:

Summary: cqlsh - not able to get row count with select(*) for large table  
(was: cqlsh - not able to get row count with select(*) with large table)

> cqlsh - not able to get row count with select(*) for large table
> 
>
> Key: CASSANDRA-8899
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8899
> Project: Cassandra
>  Issue Type: Bug
> Environment: Cassandra 2.1.2 ubuntu12.04
>Reporter: Jeff Liu
>
>  I'm getting errors when running a query that looks at a large number of rows.
> {noformat}
> cqlsh:events> select count(*) from catalog;
>  count
> ---
>  1
> (1 rows)
> cqlsh:events> select count(*) from catalog limit 11000;
>  count
> ---
>  11000
> (1 rows)
> cqlsh:events> select count(*) from catalog limit 5;
> errors={}, last_host=127.0.0.1
> cqlsh:events> 
> {noformat}
> We don't make queries w/o a WHERE clause in Chisel itself but I can't 
> validate the correct number of rows are being inserted into the table.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8870) Tombstone overwhelming issue aborts client queries

2015-03-03 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345461#comment-14345461
 ] 

Jeff Liu commented on CASSANDRA-8870:
-

Another question I have been curious about is that why we would see those 
tombstone errors. in our application, we are doing insert and update only. Will 
update ops generate tombstones?

> Tombstone overwhelming issue aborts client queries
> --
>
> Key: CASSANDRA-8870
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8870
> Project: Cassandra
>  Issue Type: Bug
> Environment: cassandra 2.1.2 ubunbtu 12.04
>Reporter: Jeff Liu
>
> We are getting client queries timeout issues on the clients who are trying to 
> query data from cassandra cluster. 
> Nodetool status shows that all nodes are still up regardless.
> Logs from client side:
> {noformat}
> com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) 
> tried for query failed (tried: 
> cass-chisel01.abc01.abc02.abc.abc.com/10.66.182.113:9042 
> (com.datastax.driver.core.TransportException: 
> [cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042] Connection 
> has been closed))
> at 
> com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:108) 
> ~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
> at 
> com.datastax.driver.core.RequestHandler$1.run(RequestHandler.java:179) 
> ~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  ~[na:1.7.0_55]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  ~[na:1.7.0_55]
> at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
> {noformat}
> Logs from cassandra/system.log
> {noformat}
> ERROR [HintedHandoff:2] 2015-02-23 23:46:28,410 SliceQueryFilter.java:212 - 
> Scanned over 10 tombstones in system.hints; query aborted (see 
> tombstone_failure_threshold)
> ERROR [HintedHandoff:2] 2015-02-23 23:46:28,417 CassandraDaemon.java:153 - 
> Exception in thread Thread[HintedHandoff:2,1,main]
> org.apache.cassandra.db.filter.TombstoneOverwhelmingException: null
> at 
> org.apache.cassandra.db.filter.SliceQueryFilter.collectReducedColumns(SliceQueryFilter.java:214)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateColumns(QueryFilter.java:107)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:81)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:69)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.CollationController.collectAllData(CollationController.java:310)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.CollationController.getTopLevelColumns(CollationController.java:60)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.ColumnFamilyStore.getTopLevelColumns(ColumnFamilyStore.java:1858)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.ColumnFamilyStore.getColumnFamily(ColumnFamilyStore.java:1666)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.doDeliverHintsToEndpoint(HintedHandOffManager.java:385)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.deliverHintsToEndpoint(HintedHandOffManager.java:344)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.access$400(HintedHandOffManager.java:94)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager$5.run(HintedHandOffManager.java:555)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  ~[na:1.7.0_55]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  ~[na:1.7.0_55]
> at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8870) Tombstone overwhelming issue aborts client queries

2015-02-26 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8870:

Description: 
We are getting client queries timeout issues on the clients who are trying to 
query data from cassandra cluster. 
Nodetool status shows that all nodes are still up regardless.

Logs from client side:
{noformat}
com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried 
for query failed (tried: 
cass-chisel01.abc01.abc02.abc.abc.com/10.66.182.113:9042 
(com.datastax.driver.core.TransportException: 
[cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042] Connection 
has been closed))
at 
com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:108) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
com.datastax.driver.core.RequestHandler$1.run(RequestHandler.java:179) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_55]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
~[na:1.7.0_55]
at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
{noformat}

Logs from cassandra/system.log

{noformat}
ERROR [HintedHandoff:2] 2015-02-23 23:46:28,410 SliceQueryFilter.java:212 - 
Scanned over 10 tombstones in system.hints; query aborted (see 
tombstone_failure_threshold)
ERROR [HintedHandoff:2] 2015-02-23 23:46:28,417 CassandraDaemon.java:153 - 
Exception in thread Thread[HintedHandoff:2,1,main]
org.apache.cassandra.db.filter.TombstoneOverwhelmingException: null
at 
org.apache.cassandra.db.filter.SliceQueryFilter.collectReducedColumns(SliceQueryFilter.java:214)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateColumns(QueryFilter.java:107) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:81)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:69)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.CollationController.collectAllData(CollationController.java:310)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.CollationController.getTopLevelColumns(CollationController.java:60)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.ColumnFamilyStore.getTopLevelColumns(ColumnFamilyStore.java:1858)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.ColumnFamilyStore.getColumnFamily(ColumnFamilyStore.java:1666)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.doDeliverHintsToEndpoint(HintedHandOffManager.java:385)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.deliverHintsToEndpoint(HintedHandOffManager.java:344)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.access$400(HintedHandOffManager.java:94)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager$5.run(HintedHandOffManager.java:555)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_55]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
~[na:1.7.0_55]
at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
{noformat}

  was:
We are getting client queries timeout issues on the clients who are trying to 
query data from cassandra cluster. 
Nodetool status shows that all nodes are still up regardless.

Logs from client side:
{noformat}
com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried 
for query failed (tried: 
cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042 
(com.datastax.driver.core.TransportException: 
[cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042] Connection 
has been closed))
at 
com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:108) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
com.datastax.driver.core.RequestHandler$1.run(RequestHandler.java:179) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_55]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
~[na:1.7.0_55]
at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
{noformat}

Logs from cassandra/system.log

{noformat}
ERROR [HintedHandoff:2] 2015-02-23 23:46:28,410 SliceQueryFilter.java:212 - 
Scanned over 10 tombstones in system.hints; query aborted (see 
tombst

[jira] [Comment Edited] (CASSANDRA-8870) Tombstone overwhelming issue aborts client queries

2015-02-26 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14338890#comment-14338890
 ] 

Jeff Liu edited comment on CASSANDRA-8870 at 2/26/15 6:49 PM:
--

Thanks Brandon. Just want to point it out that we are running into this issue 
in version 2.1.2. CASSANDRA-6998 said to have been fixed at 2.1.1


was (Author: jeffl):
we are running into this issue in version 2.1.2. CASSANDRA-6998 said to have 
been fixed at 2.1.1

> Tombstone overwhelming issue aborts client queries
> --
>
> Key: CASSANDRA-8870
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8870
> Project: Cassandra
>  Issue Type: Bug
> Environment: cassandra 2.1.2 ubunbtu 12.04
>Reporter: Jeff Liu
>
> We are getting client queries timeout issues on the clients who are trying to 
> query data from cassandra cluster. 
> Nodetool status shows that all nodes are still up regardless.
> Logs from client side:
> {noformat}
> com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) 
> tried for query failed (tried: 
> cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042 
> (com.datastax.driver.core.TransportException: 
> [cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042] Connection 
> has been closed))
> at 
> com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:108) 
> ~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
> at 
> com.datastax.driver.core.RequestHandler$1.run(RequestHandler.java:179) 
> ~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  ~[na:1.7.0_55]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  ~[na:1.7.0_55]
> at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
> {noformat}
> Logs from cassandra/system.log
> {noformat}
> ERROR [HintedHandoff:2] 2015-02-23 23:46:28,410 SliceQueryFilter.java:212 - 
> Scanned over 10 tombstones in system.hints; query aborted (see 
> tombstone_failure_threshold)
> ERROR [HintedHandoff:2] 2015-02-23 23:46:28,417 CassandraDaemon.java:153 - 
> Exception in thread Thread[HintedHandoff:2,1,main]
> org.apache.cassandra.db.filter.TombstoneOverwhelmingException: null
> at 
> org.apache.cassandra.db.filter.SliceQueryFilter.collectReducedColumns(SliceQueryFilter.java:214)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateColumns(QueryFilter.java:107)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:81)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:69)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.CollationController.collectAllData(CollationController.java:310)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.CollationController.getTopLevelColumns(CollationController.java:60)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.ColumnFamilyStore.getTopLevelColumns(ColumnFamilyStore.java:1858)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.ColumnFamilyStore.getColumnFamily(ColumnFamilyStore.java:1666)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.doDeliverHintsToEndpoint(HintedHandOffManager.java:385)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.deliverHintsToEndpoint(HintedHandOffManager.java:344)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.access$400(HintedHandOffManager.java:94)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager$5.run(HintedHandOffManager.java:555)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  ~[na:1.7.0_55]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  ~[na:1.7.0_55]
> at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8870) Tombstone overwhelming issue aborts client queries

2015-02-26 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14338890#comment-14338890
 ] 

Jeff Liu commented on CASSANDRA-8870:
-

we are running into this issue in version 2.1.2. CASSANDRA-6998 said to have 
been fixed at 2.1.1

> Tombstone overwhelming issue aborts client queries
> --
>
> Key: CASSANDRA-8870
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8870
> Project: Cassandra
>  Issue Type: Bug
> Environment: cassandra 2.1.2 ubunbtu 12.04
>Reporter: Jeff Liu
>
> We are getting client queries timeout issues on the clients who are trying to 
> query data from cassandra cluster. 
> Nodetool status shows that all nodes are still up regardless.
> Logs from client side:
> {noformat}
> com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) 
> tried for query failed (tried: 
> cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042 
> (com.datastax.driver.core.TransportException: 
> [cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042] Connection 
> has been closed))
> at 
> com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:108) 
> ~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
> at 
> com.datastax.driver.core.RequestHandler$1.run(RequestHandler.java:179) 
> ~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  ~[na:1.7.0_55]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  ~[na:1.7.0_55]
> at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
> {noformat}
> Logs from cassandra/system.log
> {noformat}
> ERROR [HintedHandoff:2] 2015-02-23 23:46:28,410 SliceQueryFilter.java:212 - 
> Scanned over 10 tombstones in system.hints; query aborted (see 
> tombstone_failure_threshold)
> ERROR [HintedHandoff:2] 2015-02-23 23:46:28,417 CassandraDaemon.java:153 - 
> Exception in thread Thread[HintedHandoff:2,1,main]
> org.apache.cassandra.db.filter.TombstoneOverwhelmingException: null
> at 
> org.apache.cassandra.db.filter.SliceQueryFilter.collectReducedColumns(SliceQueryFilter.java:214)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateColumns(QueryFilter.java:107)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:81)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:69)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.CollationController.collectAllData(CollationController.java:310)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.CollationController.getTopLevelColumns(CollationController.java:60)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.ColumnFamilyStore.getTopLevelColumns(ColumnFamilyStore.java:1858)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.ColumnFamilyStore.getColumnFamily(ColumnFamilyStore.java:1666)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.doDeliverHintsToEndpoint(HintedHandOffManager.java:385)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.deliverHintsToEndpoint(HintedHandOffManager.java:344)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.access$400(HintedHandOffManager.java:94)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager$5.run(HintedHandOffManager.java:555)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  ~[na:1.7.0_55]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  ~[na:1.7.0_55]
> at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8870) Tombstone overwhelming issue aborts client queries

2015-02-26 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8870:

Summary: Tombstone overwhelming issue aborts client queries  (was: 
Tombstone overwhelming issue abort client queries)

> Tombstone overwhelming issue aborts client queries
> --
>
> Key: CASSANDRA-8870
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8870
> Project: Cassandra
>  Issue Type: Bug
> Environment: cassandra 2.1.2 ubunbtu 12.04
>Reporter: Jeff Liu
>
> We are getting client queries timeout issues on the clients who are trying to 
> query data from cassandra cluster. 
> Nodetool status shows that all nodes are still up regardless.
> Logs from client side:
> {noformat}
> com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) 
> tried for query failed (tried: 
> cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042 
> (com.datastax.driver.core.TransportException: 
> [cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042] Connection 
> has been closed))
> at 
> com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:108) 
> ~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
> at 
> com.datastax.driver.core.RequestHandler$1.run(RequestHandler.java:179) 
> ~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  ~[na:1.7.0_55]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  ~[na:1.7.0_55]
> at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
> {noformat}
> Logs from cassandra/system.log
> {noformat}
> ERROR [HintedHandoff:2] 2015-02-23 23:46:28,410 SliceQueryFilter.java:212 - 
> Scanned over 10 tombstones in system.hints; query aborted (see 
> tombstone_failure_threshold)
> ERROR [HintedHandoff:2] 2015-02-23 23:46:28,417 CassandraDaemon.java:153 - 
> Exception in thread Thread[HintedHandoff:2,1,main]
> org.apache.cassandra.db.filter.TombstoneOverwhelmingException: null
> at 
> org.apache.cassandra.db.filter.SliceQueryFilter.collectReducedColumns(SliceQueryFilter.java:214)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateColumns(QueryFilter.java:107)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:81)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:69)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.CollationController.collectAllData(CollationController.java:310)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.CollationController.getTopLevelColumns(CollationController.java:60)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.ColumnFamilyStore.getTopLevelColumns(ColumnFamilyStore.java:1858)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.ColumnFamilyStore.getColumnFamily(ColumnFamilyStore.java:1666)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.doDeliverHintsToEndpoint(HintedHandOffManager.java:385)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.deliverHintsToEndpoint(HintedHandOffManager.java:344)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager.access$400(HintedHandOffManager.java:94)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.db.HintedHandOffManager$5.run(HintedHandOffManager.java:555)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  ~[na:1.7.0_55]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  ~[na:1.7.0_55]
> at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8870) Tombstone overwhelming issue abort client queries

2015-02-26 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8870:

Description: 
We are getting client queries timeout issues on the clients who are trying to 
query data from cassandra cluster. 
Nodetool status shows that all nodes are still up regardless.

Logs from client side:
{noformat}
com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried 
for query failed (tried: 
cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042 
(com.datastax.driver.core.TransportException: 
[cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042] Connection 
has been closed))
at 
com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:108) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
com.datastax.driver.core.RequestHandler$1.run(RequestHandler.java:179) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_55]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
~[na:1.7.0_55]
at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
{noformat}

Logs from cassandra/system.log

{noformat}
ERROR [HintedHandoff:2] 2015-02-23 23:46:28,410 SliceQueryFilter.java:212 - 
Scanned over 10 tombstones in system.hints; query aborted (see 
tombstone_failure_threshold)
ERROR [HintedHandoff:2] 2015-02-23 23:46:28,417 CassandraDaemon.java:153 - 
Exception in thread Thread[HintedHandoff:2,1,main]
org.apache.cassandra.db.filter.TombstoneOverwhelmingException: null
at 
org.apache.cassandra.db.filter.SliceQueryFilter.collectReducedColumns(SliceQueryFilter.java:214)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateColumns(QueryFilter.java:107) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:81)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:69)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.CollationController.collectAllData(CollationController.java:310)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.CollationController.getTopLevelColumns(CollationController.java:60)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.ColumnFamilyStore.getTopLevelColumns(ColumnFamilyStore.java:1858)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.ColumnFamilyStore.getColumnFamily(ColumnFamilyStore.java:1666)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.doDeliverHintsToEndpoint(HintedHandOffManager.java:385)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.deliverHintsToEndpoint(HintedHandOffManager.java:344)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.access$400(HintedHandOffManager.java:94)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager$5.run(HintedHandOffManager.java:555)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_55]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
~[na:1.7.0_55]
at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]
{noformat}

  was:
We are getting client queries timeout issues on the clients who are trying to 
query data from cassandra cluster. 
Nodetool status shows that all nodes are still up regardless.

Logs from client side:

com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried 
for query failed (tried: 
cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042 
(com.datastax.driver.core.TransportException: 
[cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042] Connection 
has been closed))
at 
com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:108) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
com.datastax.driver.core.RequestHandler$1.run(RequestHandler.java:179) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_55]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
~[na:1.7.0_55]
at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]


Logs from cassandra/system.log


ERROR [HintedHandoff:2] 2015-02-23 23:46:28,410 SliceQueryFilter.java:212 - 
Scanned over 10 tombstones in system.hints; query aborted (see 
tombstone_failure_threshold)

[jira] [Updated] (CASSANDRA-8870) Tombstone overwhelming issue abort client queries

2015-02-26 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8870:

Description: 
We are getting client queries timeout issues on the clients who are trying to 
query data from cassandra cluster. 
Nodetool status shows that all nodes are still up regardless.

Logs from client side:

com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried 
for query failed (tried: 
cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042 
(com.datastax.driver.core.TransportException: 
[cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042] Connection 
has been closed))
at 
com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:108) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
com.datastax.driver.core.RequestHandler$1.run(RequestHandler.java:179) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_55]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
~[na:1.7.0_55]
at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]


Logs from cassandra/system.log


ERROR [HintedHandoff:2] 2015-02-23 23:46:28,410 SliceQueryFilter.java:212 - 
Scanned over 10 tombstones in system.hints; query aborted (see 
tombstone_failure_threshold)
ERROR [HintedHandoff:2] 2015-02-23 23:46:28,417 CassandraDaemon.java:153 - 
Exception in thread Thread[HintedHandoff:2,1,main]
org.apache.cassandra.db.filter.TombstoneOverwhelmingException: null
at 
org.apache.cassandra.db.filter.SliceQueryFilter.collectReducedColumns(SliceQueryFilter.java:214)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateColumns(QueryFilter.java:107) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:81)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:69)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.CollationController.collectAllData(CollationController.java:310)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.CollationController.getTopLevelColumns(CollationController.java:60)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.ColumnFamilyStore.getTopLevelColumns(ColumnFamilyStore.java:1858)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.ColumnFamilyStore.getColumnFamily(ColumnFamilyStore.java:1666)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.doDeliverHintsToEndpoint(HintedHandOffManager.java:385)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.deliverHintsToEndpoint(HintedHandOffManager.java:344)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.access$400(HintedHandOffManager.java:94)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager$5.run(HintedHandOffManager.java:555)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_55]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
~[na:1.7.0_55]
at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]


  was:
We are getting client queries timeout issues on the clients who are trying to 
query data from cassandra cluster. 
Nodetool status shows that all nodes are still up regardless.

Logs from client side:

com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried 
for query failed (tried: 
cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042 
(com.datastax.driver.core.TransportException: 
[cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042] Connection 
has been closed))
at 
com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:108) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
com.datastax.driver.core.RequestHandler$1.run(RequestHandler.java:179) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_55]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
~[na:1.7.0_55]
at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]


Logs from cassandra/system.log


ERROR [HintedHandoff:2] 2015-02-23 23:46:28,410 SliceQueryFilter.java:212 - 
Scanned over 10 tombstones in system.hints; query aborted (see 
tombstone_failure_threshold)
ERROR [HintedHandoff:2] 2015-02-23 23:46

[jira] [Created] (CASSANDRA-8870) Tombstone overwhelming issue abort client queries

2015-02-26 Thread Jeff Liu (JIRA)
Jeff Liu created CASSANDRA-8870:
---

 Summary: Tombstone overwhelming issue abort client queries
 Key: CASSANDRA-8870
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8870
 Project: Cassandra
  Issue Type: Bug
 Environment: cassandra 2.1.2 ubunbtu 12.04
Reporter: Jeff Liu


We are getting client queries timeout issues on the clients who are trying to 
query data from cassandra cluster. 
Nodetool status shows that all nodes are still up regardless.

Logs from client side:

com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried 
for query failed (tried: 
cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042 
(com.datastax.driver.core.TransportException: 
[cass-chisel01.tgr01.iad02.testd.nestlabs.com/10.66.182.113:9042] Connection 
has been closed))
at 
com.datastax.driver.core.RequestHandler.sendRequest(RequestHandler.java:108) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
com.datastax.driver.core.RequestHandler$1.run(RequestHandler.java:179) 
~[com.datastax.cassandra.cassandra-driver-core-2.1.3.jar:na]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_55]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
~[na:1.7.0_55]
at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]


Logs from cassandra/system.log


ERROR [HintedHandoff:2] 2015-02-23 23:46:28,410 SliceQueryFilter.java:212 - 
Scanned over 10 tombstones in system.hints; query aborted (see 
tombstone_failure_threshold)
ERROR [HintedHandoff:2] 2015-02-23 23:46:28,417 CassandraDaemon.java:153 - 
Exception in thread Thread[HintedHandoff:2,1,main]
org.apache.cassandra.db.filter.TombstoneOverwhelmingException: null
at 
org.apache.cassandra.db.filter.SliceQueryFilter.collectReducedColumns(SliceQueryFilter.java:214)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateColumns(QueryFilter.java:107) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:81)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.filter.QueryFilter.collateOnDiskAtom(QueryFilter.java:69)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.CollationController.collectAllData(CollationController.java:310)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.CollationController.getTopLevelColumns(CollationController.java:60)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.ColumnFamilyStore.getTopLevelColumns(ColumnFamilyStore.java:1858)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.ColumnFamilyStore.getColumnFamily(ColumnFamilyStore.java:1666)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.doDeliverHintsToEndpoint(HintedHandOffManager.java:385)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.deliverHintsToEndpoint(HintedHandOffManager.java:344)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager.access$400(HintedHandOffManager.java:94)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.db.HintedHandOffManager$5.run(HintedHandOffManager.java:555)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_55]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
~[na:1.7.0_55]
at java.lang.Thread.run(Thread.java:745) ~[na:1.7.0_55]




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-23 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14333686#comment-14333686
 ] 

Jeff Liu commented on CASSANDRA-8723:
-

[~benedict] what is the release schedule for 2.1.3 and 2.1.4? if an estimate is 
possible at this time?

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.4
>
> Attachments: cassandra.yaml
>
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar:
>  -XX:HeapDumpPath=/var/lib/cassandra/java_1421511248.hprof 
> -XX:ErrorFile=/var/lib/cassandra/hs_err_1421511248.log 
> org.apache.cassandra.service.CassandraDaemon
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-08 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14311779#comment-14311779
 ] 

Jeff Liu commented on CASSANDRA-8723:
-

Thank you for your input [~thebrenthaines]. Glad that you were able to work 
around it. How were you able to identify the problematic CF in the first place?

I haven't been able to find any issue, data related or not, in my cluster 
before it went into OOM killing java situation. we are just steaming data into 
the cluster at this stage. And because of this issue, we had to rebuild the 
cluster couple times but still no luck to resolve the OOM killing issue.

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
> Attachments: cassandra.yaml
>
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/ca

[jira] [Commented] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-06 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14310075#comment-14310075
 ] 

Jeff Liu commented on CASSANDRA-8723:
-

yes, we know c1.xlarge is relatively small in terms of the memory resource for 
cassandra. We haven't identified any other resource contention on this instance 
type with our application is the reason that we haven't turned to another high 
capacity instance type. Also c1.xlarge gave us a cheap solution with enough 
cheap storage. 

The swap is enabled and swappiness is set at 60%. 

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
> Attachments: cassandra.yaml
>
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar:
>  -XX:HeapDumpPath=/var/lib/cassandra/java_1421511248.hp

[jira] [Comment Edited] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-06 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14309943#comment-14309943
 ] 

Jeff Liu edited comment on CASSANDRA-8723 at 2/6/15 9:27 PM:
-

hi Alan,
  thanks for your relay and the answer is yes, we have been able to reproduce 
it several times. The first cassandra version we observed this issue was 2.1.0, 
and reproduced in 2.1.1 and 2.1.3.

1). java version 1.7.0_55
2). ubuntu 3.2.0-70-virtual . AWS c1.xlarge
3) process resource usage with 1.7 reserved for jvm heap:
{noformat}
 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
21279 cassandr  20   0 21.1g 3.7g  23m S  149 55.1  12001:58 java
{noformat}
4. nodetool status show that the nodes are having over 40GB data/load:
{noformat}
--  Address Load   Tokens  OwnsHost ID  
 Rack
UN  10.98.xxx.5343.62 GB   256 ?   
1ab1fec7-383b-4801-b899-059ea0c18def  us-east-1d
UN  10.182.xxx.24   46.53 GB   256 ?   
07a9849b-73f2-4166-a8e9-bedd40a1f30e  us-east-1e
UN  10.32.xxx.222   62.36 GB   256 ?   
fb8c38fc-0eb1-41c2-96c3-fc597aeede10  us-east-1d
UN  10.66.xxx.48 45.03 GB   256 ?   
45820ccb-ce05-4a1b-9fd5-23d1c1512dd2  us-east-1c
UN  10.232.xxx.154   50.57 GB   256 ?   
a3ccf3ba-2649-4d40-8992-41d38dcab493  us-east-1e
UN  10.98.xxx.31 43.71 GB   256 ?   
290a48fc-9efe-4e3d-9237-c363894c7b13  us-east-1d
UN  10.99.xxx.18739.76 GB   256 ?   
8d9ca84a-4abe-40b4-b7a6-5424f84cab8b  us-east-1d
{noformat}



was (Author: jeffl):
hi Alan,
  thanks for your relay and the answer is yes, we have been able to reproduce 
it several times. The first cassandra version we observed this issue was 2.1.0, 
and reproduced in 2.1.1 and 2.1.3.

1). java version 1.7.0_55
2). ubuntu 3.2.0-70-virtual . AWS c1.xlarge
3) process resource usage with 1.7 reserved for jvm heap:
{noformat}
 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
21279 cassandr  20   0 21.1g 3.7g  23m S  149 55.1  12001:58 java
{noformat}
4. nodetool status show that the nodes are having over 40GB data/load:
{noformat}
--  Address Load   Tokens  OwnsHost ID  
 Rack
UN  10.98.xxx.5343.62 GB   256 ?   
1ab1fec7-383b-4801-b899-059ea0c18def  us-east-1d
UN  10.182.xxx.24   46.53 GB   256 ?   
07a9849b-73f2-4166-a8e9-bedd40a1f30e  us-east-1e
UN  10.32.xxx.222   62.36 GB   256 ?   
fb8c38fc-0eb1-41c2-96c3-fc597aeede10  us-east-1d
UN  10.66.xxx.48 45.03 GB   256 ?   
45820ccb-ce05-4a1b-9fd5-23d1c1512dd2  us-east-1c
UN  10.232.xxx.154   50.57 GB   256 ?   
a3ccf3ba-2649-4d40-8992-41d38dcab493  us-east-1e
UN  10.98.xxx.31 43.71 GB   256 ?   
290a48fc-9efe-4e3d-9237-c363894c7b13  us-east-1d
UN  10.99.79.18739.76 GB   256 ?   
8d9ca84a-4abe-40b4-b7a6-5424f84cab8b  us-east-1d
{noformat}


> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
> Attachments: cassandra.yaml
>
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxre

[jira] [Comment Edited] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-06 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14309943#comment-14309943
 ] 

Jeff Liu edited comment on CASSANDRA-8723 at 2/6/15 9:26 PM:
-

hi Alan,
  thanks for your relay and the answer is yes, we have been able to reproduce 
it several times. The first cassandra version we observed this issue was 2.1.0, 
and reproduced in 2.1.1 and 2.1.3.

1). java version 1.7.0_55
2). ubuntu 3.2.0-70-virtual . AWS c1.xlarge
3) process resource usage with 1.7 reserved for jvm heap:
{noformat}
 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
21279 cassandr  20   0 21.1g 3.7g  23m S  149 55.1  12001:58 java
{noformat}
4. nodetool status show that the nodes are having over 40GB data/load:
{noformat}
--  Address Load   Tokens  OwnsHost ID  
 Rack
UN  10.98.xxx.5343.62 GB   256 ?   
1ab1fec7-383b-4801-b899-059ea0c18def  us-east-1d
UN  10.182.xxx.24   46.53 GB   256 ?   
07a9849b-73f2-4166-a8e9-bedd40a1f30e  us-east-1e
UN  10.32.xxx.222   62.36 GB   256 ?   
fb8c38fc-0eb1-41c2-96c3-fc597aeede10  us-east-1d
UN  10.66.xxx.48 45.03 GB   256 ?   
45820ccb-ce05-4a1b-9fd5-23d1c1512dd2  us-east-1c
UN  10.232.xxx.154   50.57 GB   256 ?   
a3ccf3ba-2649-4d40-8992-41d38dcab493  us-east-1e
UN  10.98.xxx.31 43.71 GB   256 ?   
290a48fc-9efe-4e3d-9237-c363894c7b13  us-east-1d
UN  10.99.79.18739.76 GB   256 ?   
8d9ca84a-4abe-40b4-b7a6-5424f84cab8b  us-east-1d
{noformat}



was (Author: jeffl):
hi Alan,
  thanks for your relay and the answer is yes, we have been able to reproduce 
it several times. The first cassandra version we observed this issue was 2.1.0, 
and reproduced in 2.1.1 and 2.1.3.

1). java version 1.7.0_55
2). ubuntu 3.2.0-70-virtual . AWS c1.xlarge
3) process resource usage with 1.7 reserved for jvm heap:
{noformat}
 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
21279 cassandr  20   0 21.1g 3.7g  23m S  149 55.1  12001:58 java
{noformat}
4. nodetool status show that the nodes are having 40GB data/load on average:
{noformat}
--  Address Load   Tokens  OwnsHost ID  
 Rack
UN  10.98.xxx.5343.62 GB   256 ?   
1ab1fec7-383b-4801-b899-059ea0c18def  us-east-1d
UN  10.182.xxx.24   46.53 GB   256 ?   
07a9849b-73f2-4166-a8e9-bedd40a1f30e  us-east-1e
UN  10.32.xxx.222   62.36 GB   256 ?   
fb8c38fc-0eb1-41c2-96c3-fc597aeede10  us-east-1d
UN  10.66.xxx.48 45.03 GB   256 ?   
45820ccb-ce05-4a1b-9fd5-23d1c1512dd2  us-east-1c
UN  10.232.xxx.154   50.57 GB   256 ?   
a3ccf3ba-2649-4d40-8992-41d38dcab493  us-east-1e
UN  10.98.xxx.31 43.71 GB   256 ?   
290a48fc-9efe-4e3d-9237-c363894c7b13  us-east-1d
UN  10.99.79.18739.76 GB   256 ?   
8d9ca84a-4abe-40b4-b7a6-5424f84cab8b  us-east-1d
{noformat}


> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
> Attachments: cassandra.yaml
>
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.

[jira] [Comment Edited] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-06 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14309943#comment-14309943
 ] 

Jeff Liu edited comment on CASSANDRA-8723 at 2/6/15 9:26 PM:
-

hi Alan,
  thanks for your relay and the answer is yes, we have been able to reproduce 
it several times. The first cassandra version we observed this issue was 2.1.0, 
and reproduced in 2.1.1 and 2.1.3.

1). java version 1.7.0_55
2). ubuntu 3.2.0-70-virtual . AWS c1.xlarge
3) process resource usage with 1.7 reserved for jvm heap:
{noformat}
 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
21279 cassandr  20   0 21.1g 3.7g  23m S  149 55.1  12001:58 java
{noformat}
4. nodetool status show that the nodes are having 40GB data/load on average:
{noformat}
--  Address Load   Tokens  OwnsHost ID  
 Rack
UN  10.98.xxx.5343.62 GB   256 ?   
1ab1fec7-383b-4801-b899-059ea0c18def  us-east-1d
UN  10.182.xxx.24   46.53 GB   256 ?   
07a9849b-73f2-4166-a8e9-bedd40a1f30e  us-east-1e
UN  10.32.xxx.222   62.36 GB   256 ?   
fb8c38fc-0eb1-41c2-96c3-fc597aeede10  us-east-1d
UN  10.66.xxx.48 45.03 GB   256 ?   
45820ccb-ce05-4a1b-9fd5-23d1c1512dd2  us-east-1c
UN  10.232.xxx.154   50.57 GB   256 ?   
a3ccf3ba-2649-4d40-8992-41d38dcab493  us-east-1e
UN  10.98.xxx.31 43.71 GB   256 ?   
290a48fc-9efe-4e3d-9237-c363894c7b13  us-east-1d
UN  10.99.79.18739.76 GB   256 ?   
8d9ca84a-4abe-40b4-b7a6-5424f84cab8b  us-east-1d
{noformat}



was (Author: jeffl):
hi Alan,
  thanks for your relay and the answer is yes, we have been able to reproduce 
it several times. The first cassandra version we observed this issue was 2.1.0, 
and reproduced in 2.1.1 and 2.1.3.

1). java version 1.7.0_55
2). ubuntu 3.2.0-70-virtual . AWS c1.xlarge
3) process resource usage with 1.7 reserved for jvm heap:
{noformat}
 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
21279 cassandr  20   0 21.1g 3.7g  23m S  149 55.1  12001:58 java
{noformat}

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
> Attachments: cassandra.yaml
>
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor

[jira] [Comment Edited] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-06 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14309943#comment-14309943
 ] 

Jeff Liu edited comment on CASSANDRA-8723 at 2/6/15 9:24 PM:
-

hi Alan,
  thanks for your relay and the answer is yes, we have been able to reproduce 
it several times. The first cassandra version we observed this issue was 2.1.0, 
and reproduced in 2.1.1 and 2.1.3.

1). java version 1.7.0_55
2). ubuntu 3.2.0-70-virtual . AWS c1.xlarge
3) process resource usage with 1.7 reserved for jvm heap:
{noformat}
 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
21279 cassandr  20   0 21.1g 3.7g  23m S  149 55.1  12001:58 java
{noformat}


was (Author: jeffl):
hi Alan,
  thanks for your relay and the answer is yes, we have been able to reproduce 
it several times. The first cassandra version we observed this issue was 2.1.0, 
and reproduced in 2.1.1 and 2.1.3.

1). java version 1.7.0_55
2). ubuntu 3.2.0-70-virtual . AWS c1.xlarge
3) process resource usage:
{noformat}
 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
21279 cassandr  20   0 21.1g 3.7g  23m S  149 55.1  12001:58 java
{noformat}

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
> Attachments: cassandra.yaml
>
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib

[jira] [Commented] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-06 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14309943#comment-14309943
 ] 

Jeff Liu commented on CASSANDRA-8723:
-

hi Alan,
  thanks for your relay and the answer is yes, we have been able to reproduce 
it several times. The first cassandra version we observed this issue was 2.1.0, 
and reproduced in 2.1.1 and 2.1.3.

1). java version 1.7.0_55
2). ubuntu 3.2.0-70-virtual . AWS c1.xlarge
3) process resource usage:
{noformat}
 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
21279 cassandr  20   0 21.1g 3.7g  23m S  149 55.1  12001:58 java
{noformat}

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
> Attachments: cassandra.yaml
>
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassand

[jira] [Commented] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces return java.lang.RuntimeException: Could not create snapshot

2015-02-06 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14309563#comment-14309563
 ] 

Jeff Liu commented on CASSANDRA-8696:
-

[~yukim], [~benedict], any additional thoughts on this issue?

> nodetool repair on cassandra 2.1.2 keyspaces return 
> java.lang.RuntimeException: Could not create snapshot
> -
>
> Key: CASSANDRA-8696
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> When trying to run nodetool repair -pr on cassandra node ( 2.1.2), cassandra 
> throw java exceptions: cannot create snapshot. 
> the error log from system.log:
> {noformat}
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
> StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
> files(632105 bytes)
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> Session with /10.97.9.110 is complete
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
> [Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
> Repair
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
> StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Starting streaming to /10.66.187.201
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
> StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
> ID#0] Beginning stream session with /10.66.187.201
> INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
> StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
> files(632105 bytes)
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
> StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Session with /10.66.187.201 is complete
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamResultFuture.java:212 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> ERROR [RepairJobTask:1] 2015-01-28 02:07:39,444 RepairJob.java:127 - Error 
> occurred during snapshot phase
> java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
> at 
> org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
> ~[na:1.7.0_45]
> at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
> ~[na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  [na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  [na:1.7.0_45]
> at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
> INFO  [AntiEntropySessions:6] 2015-01-28 02:07:39,445 RepairSession.java:260 
> - [repair #6f85e740-a692-11e4-9973-070e938df227] new session: will sync 
> /10.98.194.68, /10.66.187.201, /10.226.218.135 on range 
> (12817179804668051873746972069086
> 2638799,12863540308359254031520865977436165] for events.[bigint0text, 
> bigint0boolean, bigint0int, dataset_catalog, column_categories, 
> bigint0double, bigint0bigint]
> ERROR [AntiEntropySessions:5] 2015-01-28 02:07:39,445 RepairSession.java:303 
> - [repair #685e3d00-a692-11e4-9973-070e938df227] session completed with the 
> following error
> java.io.IOException: Failed during snapshot creation.
> at 
> org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at com.google.common.util.concurrent.Futures$4.run(Futures.java:1172) 
> ~[guava-16.0.jar:na]
> at 
> java.u

[jira] [Commented] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-06 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14309559#comment-14309559
 ] 

Jeff Liu commented on CASSANDRA-8723:
-

[~benedict] and the team, have you seen any others reporting this issue? This 
is driving us nuts here validating cassandra 2.1.3. Without the memory 
stability, we cannot even use this version for dev environment. 

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
> Attachments: cassandra.yaml
>
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar:
>  -XX:HeapDumpPath=/var/lib/cassandra/java_1421511248.hprof 
> -XX:ErrorFile=/var/lib/cassandra/hs_err_1421511248.log 
> org.apache.cassandra.service.CassandraDaemon
> {noformat}



--
This message was sent by Atlassian JIRA
(

[jira] [Commented] (CASSANDRA-8711) cassandra 2.1.2 cqlsh not able to connect when ssl client encryption enabled

2015-02-05 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14307704#comment-14307704
 ] 

Jeff Liu commented on CASSANDRA-8711:
-

Here is some additional information:

When I don't have a cqlshrc file:
{noformat}
/home/jliu# cqlsh --ssl
Validation is enabled; SSL transport factory requires a valid certfile to be 
specified. Please provide path to the certfile in [ssl] section as 'certfile' 
option in /root/.cassandra/cqlshrc (or use [certfiles] section) or set 
SSL_CERTFILE environment variable.
{noformat}

when I have a cqlshrc file(contents as pasted in description)
{noformat}
/home/jliu# cqlsh --ssl
Password:
Connection error: ('Unable to connect to any servers', {'localhost': error(8, 
'ENOEXEC')})
{noformat}

> cassandra 2.1.2 cqlsh not able to connect when ssl client encryption enabled
> 
>
> Key: CASSANDRA-8711
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8711
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
>
> I have been trying to setup client encryption on a three nodes 2.1.2 version 
> cassandra cluster and keep getting the following error:
> {noformat}
> Connection error: ('Unable to connect to any servers', {'localhost': 
> ConnectionShutdown('Connection  (closed)> is already closed',)})
> {noformat}
> I tried with both cqlsh and datatax python cassandra-driver and no luck to 
> login.
> I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
> {noformat}
> [authentication]
> username =
> password =
> [connection]
> hostname = localhost
> port = 9160
> factory = cqlshlib.ssl.ssl_transport_factory
> [ssl]
> certfile = /root/.cassandra/localhost_user1.pem
> validate = false ## Optional, true by default
> {noformat}
> my cassandra.yaml configuration related to client_encryptions:
> {noformat}
> client_encryption_options:
> enabled: True
> keystore: /etc/cassandra/conf/.keystore
> keystore_password: cassnest
> {noformat}
> the keystore, truststore, cert/pem (localhost_user1.pem) key have been 
> verified to be working fine for datastax enterprise version.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8711) cassandra 2.1.2 cqlsh not able to connect when ssl client encryption enabled

2015-02-05 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14307700#comment-14307700
 ] 

Jeff Liu commented on CASSANDRA-8711:
-

Which port should be used? 

> cassandra 2.1.2 cqlsh not able to connect when ssl client encryption enabled
> 
>
> Key: CASSANDRA-8711
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8711
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
>
> I have been trying to setup client encryption on a three nodes 2.1.2 version 
> cassandra cluster and keep getting the following error:
> {noformat}
> Connection error: ('Unable to connect to any servers', {'localhost': 
> ConnectionShutdown('Connection  (closed)> is already closed',)})
> {noformat}
> I tried with both cqlsh and datatax python cassandra-driver and no luck to 
> login.
> I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
> {noformat}
> [authentication]
> username =
> password =
> [connection]
> hostname = localhost
> port = 9160
> factory = cqlshlib.ssl.ssl_transport_factory
> [ssl]
> certfile = /root/.cassandra/localhost_user1.pem
> validate = false ## Optional, true by default
> {noformat}
> my cassandra.yaml configuration related to client_encryptions:
> {noformat}
> client_encryption_options:
> enabled: True
> keystore: /etc/cassandra/conf/.keystore
> keystore_password: cassnest
> {noformat}
> the keystore, truststore, cert/pem (localhost_user1.pem) key have been 
> verified to be working fine for datastax enterprise version.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8711) cassandra 2.1.2 cqlsh not able to connect when ssl client encryption enabled

2015-02-05 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14307696#comment-14307696
 ] 

Jeff Liu commented on CASSANDRA-8711:
-

hi [~mishail].
Yes. I did use the --ssl flag. Actually after I configure ~/.cassandra/cqlshrc 
file and specify ssl configuration, cqlsh will pick the ssl configuration and 
try to connect with ssl even without --ssl flag. 
However, either one worked for me in terms of connecting to cassandra.

> cassandra 2.1.2 cqlsh not able to connect when ssl client encryption enabled
> 
>
> Key: CASSANDRA-8711
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8711
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
>
> I have been trying to setup client encryption on a three nodes 2.1.2 version 
> cassandra cluster and keep getting the following error:
> {noformat}
> Connection error: ('Unable to connect to any servers', {'localhost': 
> ConnectionShutdown('Connection  (closed)> is already closed',)})
> {noformat}
> I tried with both cqlsh and datatax python cassandra-driver and no luck to 
> login.
> I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
> {noformat}
> [authentication]
> username =
> password =
> [connection]
> hostname = localhost
> port = 9160
> factory = cqlshlib.ssl.ssl_transport_factory
> [ssl]
> certfile = /root/.cassandra/localhost_user1.pem
> validate = false ## Optional, true by default
> {noformat}
> my cassandra.yaml configuration related to client_encryptions:
> {noformat}
> client_encryption_options:
> enabled: True
> keystore: /etc/cassandra/conf/.keystore
> keystore_password: cassnest
> {noformat}
> the keystore, truststore, cert/pem (localhost_user1.pem) key have been 
> verified to be working fine for datastax enterprise version.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8711) cassandra 2.1.2 cqlsh not able to connect when ssl client encryption enabled

2015-02-04 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8711:

Summary: cassandra 2.1.2 cqlsh not able to connect when ssl client 
encryption enabled  (was: cassandra 2.1.2 ssl client encryption not working )

> cassandra 2.1.2 cqlsh not able to connect when ssl client encryption enabled
> 
>
> Key: CASSANDRA-8711
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8711
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
>
> I have been trying to setup client encryption on a three nodes 2.1.2 version 
> cassandra cluster and keep getting the following error:
> {noformat}
> Connection error: ('Unable to connect to any servers', {'localhost': 
> ConnectionShutdown('Connection  (closed)> is already closed',)})
> {noformat}
> I tried with both cqlsh and datatax python cassandra-driver and no luck to 
> login.
> I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
> {noformat}
> [authentication]
> username =
> password =
> [connection]
> hostname = localhost
> port = 9160
> factory = cqlshlib.ssl.ssl_transport_factory
> [ssl]
> certfile = /root/.cassandra/localhost_user1.pem
> validate = false ## Optional, true by default
> {noformat}
> my cassandra.yaml configuration related to client_encryptions:
> {noformat}
> client_encryption_options:
> enabled: True
> keystore: /etc/cassandra/conf/.keystore
> keystore_password: cassnest
> {noformat}
> the keystore, truststore, cert/pem (localhost_user1.pem) key have been 
> verified to be working fine for datastax enterprise version.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8711) cassandra 2.1.2 ssl client encryption not working

2015-02-04 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305716#comment-14305716
 ] 

Jeff Liu commented on CASSANDRA-8711:
-

After spending a little more time, I wrote some quick ruby code to test client 
encryption/authentication and managed to get it working! 
I think my problem was really the cqlsh not able to connect to cassandra nodes 
with client encryption or authentication or both enabled. 
here is my cqlshrc file contents:
{noformat}
[authentication]
username = cassandra
password = cassandra

[connection]
hostname = localhost
port = 9160
factory = cqlshlib.ssl.ssl_transport_factory

[ssl]
certfile = /root/.cassandra/private1.pem
validate = false ## Optional, true by default
{noformat}

the private1.pem is the private key that I was able to connect from client 
(ruby script) with. 


> cassandra 2.1.2 ssl client encryption not working 
> --
>
> Key: CASSANDRA-8711
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8711
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
>
> I have been trying to setup client encryption on a three nodes 2.1.2 version 
> cassandra cluster and keep getting the following error:
> {noformat}
> Connection error: ('Unable to connect to any servers', {'localhost': 
> ConnectionShutdown('Connection  (closed)> is already closed',)})
> {noformat}
> I tried with both cqlsh and datatax python cassandra-driver and no luck to 
> login.
> I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
> {noformat}
> [authentication]
> username =
> password =
> [connection]
> hostname = localhost
> port = 9160
> factory = cqlshlib.ssl.ssl_transport_factory
> [ssl]
> certfile = /root/.cassandra/localhost_user1.pem
> validate = false ## Optional, true by default
> {noformat}
> my cassandra.yaml configuration related to client_encryptions:
> {noformat}
> client_encryption_options:
> enabled: True
> keystore: /etc/cassandra/conf/.keystore
> keystore_password: cassnest
> {noformat}
> the keystore, truststore, cert/pem (localhost_user1.pem) key have been 
> verified to be working fine for datastax enterprise version.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-03 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8723:

Attachment: cassandra.yaml

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
> Attachments: cassandra.yaml
>
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar:
>  -XX:HeapDumpPath=/var/lib/cassandra/java_1421511248.hprof 
> -XX:ErrorFile=/var/lib/cassandra/hs_err_1421511248.log 
> org.apache.cassandra.service.CassandraDaemon
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-03 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14303751#comment-14303751
 ] 

Jeff Liu commented on CASSANDRA-8723:
-

cassandra.yaml

{noformat}
# Cassandra storage config YAML 

# NOTE:
#   See http://wiki.apache.org/cassandra/StorageConfiguration for
#   full explanations of configuration directives
# /NOTE

# The name of the cluster. This is mainly used to prevent machines in
# one logical cluster from joining another.
cluster_name: ch_cassandra

# This defines the number of tokens randomly assigned to this node on the ring
# The more tokens, relative to other nodes, the larger the proportion of data
# that this node will store. You probably want all nodes to have the same number
# of tokens assuming they have equal hardware capability.
#
# If you leave this unspecified, Cassandra will use the default of 1 token for 
legacy compatibility,
# and will use the initial_token as described below.
#
# Specifying initial_token will override this setting.
#
# If you already have a cluster with 1 token per node, and wish to migrate to 
# multiple tokens per node, see http://wiki.apache.org/cassandra/Operations

num_tokens: 256

# initial_token allows you to specify tokens manually.  While you can use # it 
with
# vnodes (num_tokens > 1, above) -- in which case you should provide a 
# comma-separated list -- it's primarily used when adding nodes # to legacy 
clusters 
# that do not have vnodes enabled.
# initial_token:
initial_token: 

# See http://wiki.apache.org/cassandra/HintedHandoff
hinted_handoff_enabled: true
# this defines the maximum amount of time a dead host will have hints
# generated.  After it has been dead this long, new hints for it will not be
# created until it has been seen alive and gone down again.
max_hint_window_in_ms: 1080 # 3 hours

# Maximum throttle in KBs per second, per delivery thread.  This will be
# reduced proportionally to the number of nodes in the cluster.  (If there
# are two nodes in the cluster, each delivery thread will use the maximum
# rate; if there are three, each will throttle to half of the maximum,
# since we expect two nodes to be delivering hints simultaneously.)
hinted_handoff_throttle_in_kb: 1024
# Number of threads with which to deliver hints;
# Consider increasing this number when you have multi-dc deployments, since
# cross-dc handoff tends to be slower
max_hints_delivery_threads: 2




# The following setting populates the page cache on memtable flush and 
compaction
# WARNING: Enable this setting only when the whole node's data fits in memory.
# Defaults to: false
# populate_io_cache_on_flush: false

# Authentication backend, implementing IAuthenticator; used to identify users
# Out of the box, Cassandra provides 
org.apache.cassandra.auth.{AllowAllAuthenticator,
# PasswordAuthenticator}.
#
# - AllowAllAuthenticator performs no checks - set it to disable authentication.
# - PasswordAuthenticator relies on username/password pairs to authenticate
#   users. It keeps usernames and hashed passwords in system_auth.credentials 
table.
#   Please increase system_auth keyspace replication factor if you use this 
authenticator.

authenticator: AllowAllAuthenticator



# Authorization backend, implementing IAuthorizer; used to limit access/provide 
permissions
# Out of the box, Cassandra provides 
org.apache.cassandra.auth.{AllowAllAuthorizer,
# CassandraAuthorizer}.
#
# - AllowAllAuthorizer allows any action to any user - set it to disable 
authorization.
# - CassandraAuthorizer stores permissions in system_auth.permissions table. 
Please
#   increase system_auth keyspace replication factor if you use this authorizer.
authorizer: AllowAllAuthorizer



# Validity period for permissions cache (fetching permissions can be an
# expensive operation depending on the authorizer, CassandraAuthorizer is
# one example). Defaults to 2000, set to 0 to disable.
# Will be disabled automatically for AllowAllAuthorizer.
permissions_validity_in_ms: 2000


# The partitioner is responsible for distributing rows (by key) across
# nodes in the cluster.  Any IPartitioner may be used, including your
# own as long as it is on the classpath.  Out of the box, Cassandra
# provides org.apache.cassandra.dht.{Murmur3Partitioner, RandomPartitioner
# ByteOrderedPartitioner, OrderPreservingPartitioner (deprecated)}.
# 
# - RandomPartitioner distributes rows across the cluster evenly by md5.
#   This is the default prior to 1.2 and is retained for compatibility.
# - Murmur3Partitioner is similar to RandomPartioner but uses Murmur3_128
#   Hash Function instead of md5.  When in doubt, this is the best option.
# - ByteOrderedPartitioner orders rows lexically by key bytes.  BOP allows
#   scanning rows in key order, but the ordering can generate hot spots
#   for sequential insertion workloads.
# - OrderPreservingPartitioner is an obsolete form of BOP, that stores

[jira] [Updated] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until process is killed by OOM killer

2015-02-02 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8723:

Summary: Cassandra 2.1.2 Memory issue - java process memory usage 
continuously increases until process is killed by OOM killer  (was: Cassandra 
2.1.2 Memory issue - java process memory usage continuously increases until 
killed by OOM killer)

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until process is killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar:
>  -XX:HeapDumpPath=/var/lib/cassandra/java_1421511248.hprof 
> -XX:ErrorFile=/var/lib/cassandra/hs_err_1421511248.log 
> org.apache.cassandra.service.CassandraDaemon
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increase until killed by OOM killer

2015-02-02 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8723:

Summary: Cassandra 2.1.2 Memory issue - java process memory usage 
continuously increase until killed by OOM killer  (was: Cassandra 2.1.2 Memory 
issue - java process memory continuously increase until killed by OOM killer)

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increase until killed by OOM killer
> -
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar:
>  -XX:HeapDumpPath=/var/lib/cassandra/java_1421511248.hprof 
> -XX:ErrorFile=/var/lib/cassandra/hs_err_1421511248.log 
> org.apache.cassandra.service.CassandraDaemon
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory usage continuously increases until killed by OOM killer

2015-02-02 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8723:

Summary: Cassandra 2.1.2 Memory issue - java process memory usage 
continuously increases until killed by OOM killer  (was: Cassandra 2.1.2 Memory 
issue - java process memory usage continuously increase until killed by OOM 
killer)

> Cassandra 2.1.2 Memory issue - java process memory usage continuously 
> increases until killed by OOM killer
> --
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar:
>  -XX:HeapDumpPath=/var/lib/cassandra/java_1421511248.hprof 
> -XX:ErrorFile=/var/lib/cassandra/hs_err_1421511248.log 
> org.apache.cassandra.service.CassandraDaemon
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - java process memory continuously increase until killed by OOM killer

2015-02-02 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8723:

Summary: Cassandra 2.1.2 Memory issue - java process memory continuously 
increase until killed by OOM killer  (was: Cassandra 2.1.2 Memory issue - 
Continuously process memory increase until killed by OOM killer)

> Cassandra 2.1.2 Memory issue - java process memory continuously increase 
> until killed by OOM killer
> ---
>
> Key: CASSANDRA-8723
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> Issue:
> We have an on-going issue with cassandra nodes running with continuously 
> increasing memory until killed by OOM.
> {noformat}
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
> process 13919 (java) score 911 or sacrifice child
> Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
> (java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
> {noformat}
> System Profile:
> cassandra version 2.1.2
> system: aws c1.xlarge instance with 8 cores, 7.1G memory.
> cassandra jvm:
> -Xms1792M -Xmx1792M -Xmn400M -Xss256k
> {noformat}
> java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
> -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
> -Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
> -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
> -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
> -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
> -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
> -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
> -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
> -XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
> -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
> -Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
> -Dcom.sun.management.jmxremote.ssl=false 
> -Dcom.sun.management.jmxremote.authenticate=false 
> -javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
>  -Dlogback.configurationFile=logback.xml 
> -Dcassandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
> -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
> /etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar:
>  -XX:HeapDumpPath=/var/lib/cassandra/java_1421511248.hprof 
> -XX:ErrorFile=/var/lib/cassandra/hs_err_1421511248.log 
> org.apache.cassandra.service.CassandraDaemon
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CASSANDRA-8723) Cassandra 2.1.2 Memory issue - Continuously process memory increase until killed by OOM killer

2015-02-02 Thread Jeff Liu (JIRA)
Jeff Liu created CASSANDRA-8723:
---

 Summary: Cassandra 2.1.2 Memory issue - Continuously process 
memory increase until killed by OOM killer
 Key: CASSANDRA-8723
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8723
 Project: Cassandra
  Issue Type: Bug
Reporter: Jeff Liu


Issue:
We have an on-going issue with cassandra nodes running with continuously 
increasing memory until killed by OOM.


{noformat}
Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783481] Out of memory: Kill 
process 13919 (java) score 911 or sacrifice child
Jan 29 10:15:41 cass-chisel19 kernel: [24533109.783557] Killed process 13919 
(java) total-vm:18366340kB, anon-rss:6461472kB, file-rss:6684kB
{noformat}

System Profile:
cassandra version 2.1.2
system: aws c1.xlarge instance with 8 cores, 7.1G memory.

cassandra jvm:
-Xms1792M -Xmx1792M -Xmn400M -Xss256k


{noformat}
java -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar 
-XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M 
-Xmn400M -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
-XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled 
-XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 
-XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly 
-XX:+UseTLAB -XX:+CMSClassUnloadingEnabled -XX:+UseCondCardMark 
-XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC 
-XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
-XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1421511249.log 
-XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
-Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=7199 
-Dcom.sun.management.jmxremote.ssl=false 
-Dcom.sun.management.jmxremote.authenticate=false 
-javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
 -Dlogback.configurationFile=logback.xml -Dcassandra.logdir=/var/log/cassandra 
-Dcassandra.storagedir= -Dcassandra-pidfile=/var/run/cassandra/cassandra.pid 
-cp 
/etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cassandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.jar:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassandra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassandra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cassandra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/usr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar:
 -XX:HeapDumpPath=/var/lib/cassandra/java_1421511248.hprof 
-XX:ErrorFile=/var/lib/cassandra/hs_err_1421511248.log 
org.apache.cassandra.service.CassandraDaemon
{noformat}




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8689) Assertion error in 2.1.2: ERROR [IndexSummaryManager:1]

2015-02-02 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14301596#comment-14301596
 ] 

Jeff Liu commented on CASSANDRA-8689:
-

Surprised to know that there are so much information behind this blur java 
assertion error. Thank you Benedict for the info. What is the impact of this 
issue without fix? are in some kind of compacting danger ? I guess I'm really 
asking for the criticality of this issue. Thanks.

> Assertion error in 2.1.2: ERROR [IndexSummaryManager:1]
> ---
>
> Key: CASSANDRA-8689
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8689
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
> Fix For: 2.1.3
>
>
> After upgrading a 6 nodes cassandra from 2.1.0 to 2.1.2, start getting the 
> following assertion error.
> {noformat}
> ERROR [IndexSummaryManager:1] 2015-01-26 20:55:40,451 
> CassandraDaemon.java:153 - Exception in thread 
> Thread[IndexSummaryManager:1,1,main]
> java.lang.AssertionError: null
> at org.apache.cassandra.io.util.Memory.size(Memory.java:307) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.io.sstable.IndexSummary.getOffHeapSize(IndexSummary.java:192)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.io.sstable.SSTableReader.getIndexSummaryOffHeapSize(SSTableReader.java:1070)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:292)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:238)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.io.sstable.IndexSummaryManager$1.runMayThrow(IndexSummaryManager.java:139)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.concurrent.DebuggableScheduledThreadPoolExecutor$UncomplainingRunnable.run(DebuggableScheduledThreadPoolExecutor.java:77)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
> [na:1.7.0_45]
> at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:304) 
> [na:1.7.0_45]
> at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:178)
>  [na:1.7.0_45]
> at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
>  [na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  [na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  [na:1.7.0_45]
> at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
> {noformat}
> cassandra service is still running despite the issue. Node has total 8G 
> memory with 2G allocated to heap. We are basically running read queries to 
> retrieve data out of cassandra.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8711) cassandra 2.1.2 ssl client encryption not working

2015-01-30 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8711:

Description: 
I have been trying to setup client encryption on a three nodes 2.1.2 version 
cassandra cluster and keep getting the following error:
{noformat}
Connection error: ('Unable to connect to any servers', {'localhost': 
ConnectionShutdown('Connection  is already closed',)})
{noformat}

I tried with both cqlsh and datatax python cassandra-driver and no luck to 
login.

I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
{noformat}
[authentication]
username =
password =

[connection]
hostname = localhost
port = 9160
factory = cqlshlib.ssl.ssl_transport_factory

[ssl]
certfile = /root/.cassandra/localhost_user1.pem
validate = false ## Optional, true by default
{noformat}

my cassandra.yaml configuration related to client_encryptions:
{noformat}
client_encryption_options:
enabled: True
keystore: /etc/cassandra/conf/.keystore
keystore_password: cassnest
{noformat}

the keystore, truststore, cert/pem (localhost_user1.pem) key have been verified 
to be working fine for datastax enterprise version.


  was:
I have been trying to setup client encryption on a three nodes 2.1.2 version 
cassandra cluster and keep getting the following error:
{noformat}
Connection error: ('Unable to connect to any servers', {'localhost': 
ConnectionShutdown('Connection  is already closed',)})
{noformat}

I tried with both cqlsh and datatax python cassandra-driver and no luck to 
login.

I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
{noformat}
[authentication]
username =
password =

[connection]
hostname = localhost
port = 9160
factory = cqlshlib.ssl.ssl_transport_factory

[ssl]
certfile = /root/.cassandra/localhost_user1.pem
validate = false ## Optional, true by default
{noformat}

my cassandra.yaml configuration related to client_encryptions:
{noformat}
client_encryption_options:
enabled: True
keystore: /etc/cassandra/conf/.keystore
keystore_password: cassnest
{noformat}

the keystore, truststore, cert/pem key have been verified to be working fine 
for datastax enterprise version.



> cassandra 2.1.2 ssl client encryption not working 
> --
>
> Key: CASSANDRA-8711
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8711
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> I have been trying to setup client encryption on a three nodes 2.1.2 version 
> cassandra cluster and keep getting the following error:
> {noformat}
> Connection error: ('Unable to connect to any servers', {'localhost': 
> ConnectionShutdown('Connection  (closed)> is already closed',)})
> {noformat}
> I tried with both cqlsh and datatax python cassandra-driver and no luck to 
> login.
> I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
> {noformat}
> [authentication]
> username =
> password =
> [connection]
> hostname = localhost
> port = 9160
> factory = cqlshlib.ssl.ssl_transport_factory
> [ssl]
> certfile = /root/.cassandra/localhost_user1.pem
> validate = false ## Optional, true by default
> {noformat}
> my cassandra.yaml configuration related to client_encryptions:
> {noformat}
> client_encryption_options:
> enabled: True
> keystore: /etc/cassandra/conf/.keystore
> keystore_password: cassnest
> {noformat}
> the keystore, truststore, cert/pem (localhost_user1.pem) key have been 
> verified to be working fine for datastax enterprise version.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CASSANDRA-8711) cassandra 2.1.2 ssl client encryption not working

2015-01-30 Thread Jeff Liu (JIRA)
Jeff Liu created CASSANDRA-8711:
---

 Summary: cassandra 2.1.2 ssl client encryption not working 
 Key: CASSANDRA-8711
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8711
 Project: Cassandra
  Issue Type: Bug
Reporter: Jeff Liu


I have been trying to setup client encryption on a three nodes 2.1.2 version 
cassandra cluster and keep getting the following error:
{noformat}
Connection error: ('Unable to connect to any servers', {'localhost': 
ConnectionShutdown('Connection  is already closed',)})
{noformat}

I tried with both cqlsh and datatax python cassandra-driver and no luck to 
login.

I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
{/noformat}
[authentication]
username =
password =

[connection]
hostname = localhost
port = 9160
factory = cqlshlib.ssl.ssl_transport_factory

[ssl]
certfile = /root/.cassandra/localhost_user1.pem
validate = false ## Optional, true by default
{/noformat}

my cassandra.yaml configuration related to client_encryptions:
{noformat}
client_encryption_options:
enabled: True
keystore: /etc/cassandra/conf/.keystore
keystore_password: cassnest
{noformat}

the keystore, truststore, cert/pem key have been verified to be working fine 
for datastax enterprise version.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8711) cassandra 2.1.2 ssl client encryption not working

2015-01-30 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8711:

Description: 
I have been trying to setup client encryption on a three nodes 2.1.2 version 
cassandra cluster and keep getting the following error:
{noformat}
Connection error: ('Unable to connect to any servers', {'localhost': 
ConnectionShutdown('Connection  is already closed',)})
{noformat}

I tried with both cqlsh and datatax python cassandra-driver and no luck to 
login.

I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
{noformat}
[authentication]
username =
password =

[connection]
hostname = localhost
port = 9160
factory = cqlshlib.ssl.ssl_transport_factory

[ssl]
certfile = /root/.cassandra/localhost_user1.pem
validate = false ## Optional, true by default
{noformat}

my cassandra.yaml configuration related to client_encryptions:
{noformat}
client_encryption_options:
enabled: True
keystore: /etc/cassandra/conf/.keystore
keystore_password: cassnest
{noformat}

the keystore, truststore, cert/pem key have been verified to be working fine 
for datastax enterprise version.


  was:
I have been trying to setup client encryption on a three nodes 2.1.2 version 
cassandra cluster and keep getting the following error:
{noformat}
Connection error: ('Unable to connect to any servers', {'localhost': 
ConnectionShutdown('Connection  is already closed',)})
{noformat}

I tried with both cqlsh and datatax python cassandra-driver and no luck to 
login.

I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
{noformat}
[authentication]
username =
password =

[connection]
hostname = localhost
port = 9160
factory = cqlshlib.ssl.ssl_transport_factory

[ssl]
certfile = /root/.cassandra/localhost_user1.pem
validate = false ## Optional, true by default
{/noformat}

my cassandra.yaml configuration related to client_encryptions:
{noformat}
client_encryption_options:
enabled: True
keystore: /etc/cassandra/conf/.keystore
keystore_password: cassnest
{noformat}

the keystore, truststore, cert/pem key have been verified to be working fine 
for datastax enterprise version.



> cassandra 2.1.2 ssl client encryption not working 
> --
>
> Key: CASSANDRA-8711
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8711
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> I have been trying to setup client encryption on a three nodes 2.1.2 version 
> cassandra cluster and keep getting the following error:
> {noformat}
> Connection error: ('Unable to connect to any servers', {'localhost': 
> ConnectionShutdown('Connection  (closed)> is already closed',)})
> {noformat}
> I tried with both cqlsh and datatax python cassandra-driver and no luck to 
> login.
> I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
> {noformat}
> [authentication]
> username =
> password =
> [connection]
> hostname = localhost
> port = 9160
> factory = cqlshlib.ssl.ssl_transport_factory
> [ssl]
> certfile = /root/.cassandra/localhost_user1.pem
> validate = false ## Optional, true by default
> {noformat}
> my cassandra.yaml configuration related to client_encryptions:
> {noformat}
> client_encryption_options:
> enabled: True
> keystore: /etc/cassandra/conf/.keystore
> keystore_password: cassnest
> {noformat}
> the keystore, truststore, cert/pem key have been verified to be working fine 
> for datastax enterprise version.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8711) cassandra 2.1.2 ssl client encryption not working

2015-01-30 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8711:

Description: 
I have been trying to setup client encryption on a three nodes 2.1.2 version 
cassandra cluster and keep getting the following error:
{noformat}
Connection error: ('Unable to connect to any servers', {'localhost': 
ConnectionShutdown('Connection  is already closed',)})
{noformat}

I tried with both cqlsh and datatax python cassandra-driver and no luck to 
login.

I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
{noformat}
[authentication]
username =
password =

[connection]
hostname = localhost
port = 9160
factory = cqlshlib.ssl.ssl_transport_factory

[ssl]
certfile = /root/.cassandra/localhost_user1.pem
validate = false ## Optional, true by default
{/noformat}

my cassandra.yaml configuration related to client_encryptions:
{noformat}
client_encryption_options:
enabled: True
keystore: /etc/cassandra/conf/.keystore
keystore_password: cassnest
{noformat}

the keystore, truststore, cert/pem key have been verified to be working fine 
for datastax enterprise version.


  was:
I have been trying to setup client encryption on a three nodes 2.1.2 version 
cassandra cluster and keep getting the following error:
{noformat}
Connection error: ('Unable to connect to any servers', {'localhost': 
ConnectionShutdown('Connection  is already closed',)})
{noformat}

I tried with both cqlsh and datatax python cassandra-driver and no luck to 
login.

I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
{/noformat}
[authentication]
username =
password =

[connection]
hostname = localhost
port = 9160
factory = cqlshlib.ssl.ssl_transport_factory

[ssl]
certfile = /root/.cassandra/localhost_user1.pem
validate = false ## Optional, true by default
{/noformat}

my cassandra.yaml configuration related to client_encryptions:
{noformat}
client_encryption_options:
enabled: True
keystore: /etc/cassandra/conf/.keystore
keystore_password: cassnest
{noformat}

the keystore, truststore, cert/pem key have been verified to be working fine 
for datastax enterprise version.



> cassandra 2.1.2 ssl client encryption not working 
> --
>
> Key: CASSANDRA-8711
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8711
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> I have been trying to setup client encryption on a three nodes 2.1.2 version 
> cassandra cluster and keep getting the following error:
> {noformat}
> Connection error: ('Unable to connect to any servers', {'localhost': 
> ConnectionShutdown('Connection  (closed)> is already closed',)})
> {noformat}
> I tried with both cqlsh and datatax python cassandra-driver and no luck to 
> login.
> I created /rooot/.cassandra/cqlshrc file for cqlsh settings, the content is:
> {noformat}
> [authentication]
> username =
> password =
> [connection]
> hostname = localhost
> port = 9160
> factory = cqlshlib.ssl.ssl_transport_factory
> [ssl]
> certfile = /root/.cassandra/localhost_user1.pem
> validate = false ## Optional, true by default
> {/noformat}
> my cassandra.yaml configuration related to client_encryptions:
> {noformat}
> client_encryption_options:
> enabled: True
> keystore: /etc/cassandra/conf/.keystore
> keystore_password: cassnest
> {noformat}
> the keystore, truststore, cert/pem key have been verified to be working fine 
> for datastax enterprise version.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces return java.lang.RuntimeException: Could not create snapshot

2015-01-29 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297272#comment-14297272
 ] 

Jeff Liu edited comment on CASSANDRA-8696 at 1/29/15 6:21 PM:
--

It's getting worse today after having the "failed to create snapshot" error. 4 
nodes of 6 are showing "UN" in nodetool status. I checked the system.log and 
syslog and couldn't find anything that tells why the node was down. The java 
cassandra process is also gone.

I tried to restart cassandra service on one node and watch system.log. Shortly 
after the service start running, cassandra starts to throw "failed to create 
snapshot" error for a while before it hung with no log output, nodetool status 
show error:"error: No nodes present in the cluster. Has this node finished 
starting up?". However the cassandra java process is still running in the 
system.

{noformat}
ps -ef | grep cass
110  18634 1 19 18:01 ?00:01:39 java -ea 
-javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar -XX:+UseThreadPriorities 
-XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M -Xmn400M 
-XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
-XX:+UseParNewGC -XX
:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled -XX:SurvivorRatio=8 
-XX:MaxTenuringThreshold=1 -XX:CMSInitiatingOccupancyFraction=75 
-XX:+UseCMSInitiatingOccupancyOnly -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled 
-XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC -XX
:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
-XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1422554504.log 
-XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
-Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=719
9 -Dcom.sun.management.jmxremote.ssl=false 
-Dcom.sun.management.jmxremote.authenticate=false 
-javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
 -Dlogback.configurationFile=logback.xml -Dca
ssandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
-Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
/etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cas
sandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.ja
r:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar
:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassan
dra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassan
dra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cass
andra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/u
sr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar: 
-XX:HeapDumpPath=/var/lib/cassandra/java_1422554504.hprof 
-XX:ErrorFile=/var/lib/cassandra/hs_err_1422554504.log 
org.apache.cassandra.service.CassandraDaemon
root 19317 19074  0 18:10 pts/100:00:00 grep --color=auto cass
#:/home/jliu# less /var/log/cassandra/system.log
{noformat}

{noformat}
#:/home/jliu# nodetool status
error: No nodes present in the cluster. Has this node finished starting up?
-- StackTrace --
java.lang.RuntimeException: No nodes present in the cluster. Has this node 
finished starting up?
at 
org.apache.cassandra.dht.RandomPartitioner.describeOwnership(RandomPartitioner.java:143)
at 
org.apache.cassandra.service.StorageService.getOwnership(StorageService.java:3702)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethod

[jira] [Comment Edited] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces return java.lang.RuntimeException: Could not create snapshot

2015-01-29 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295881#comment-14295881
 ] 

Jeff Liu edited comment on CASSANDRA-8696 at 1/29/15 6:20 PM:
--

The replica nodes are not busy at all. Actually I have stopped all client 
connection while trying to run a nodetool repair. 

nodetool tpstats also shows that there isn't much activities going on.


{noformat}
Pool NameActive   Pending  Completed   Blocked  All 
time blocked
CounterMutationStage  0 0  0 0  
   0
ReadStage 0 0  35441 0  
   0
RequestResponseStage  0 0   26940702 0  
   0
MutationStage 1 0   42298652 0  
   0
ReadRepairStage   0 0101 0  
   0
GossipStage   0 0  77379 0  
   0
CacheCleanupExecutor  0 0  0 0  
   0
AntiEntropyStage  0 0333 0  
   0
MigrationStage0 0   1664 0  
   0
ValidationExecutor0 0 51 0  
   0
CommitLogArchiver 0 0  0 0  
   0
MiscStage 0 0  0 0  
   0
MemtableFlushWriter   1 1853 0  
   0
MemtableReclaimMemory 0 0853 0  
   0
PendingRangeCalculator0 0 12 0  
   0
MemtablePostFlush 0 0   1365 0  
   0
CompactionExecutor2 9244 0  
   0
InternalResponseStage 0 0   1679 0  
   0
HintedHandoff 0 0 13 0  
   0

Message type   Dropped
RANGE_SLICE  0
READ_REPAIR  0
PAGED_RANGE  0
BINARY   0
READ  6067
MUTATION 8
_TRACE   0
REQUEST_RESPONSE 0
COUNTER_MUTATION 0
{noformat}




was (Author: jeffl):
The replica nodes are not busy at all. Actually I have stopped all client 
connection while trying to run a nodetool repair. 

nodetool tpstats also shows that there isn't much activities going on.


{noformat}
Pool NameActive   Pending  Completed   Blocked  All 
time blocked
CounterMutationStage  0 0  0 0  
   0
ReadStage 0 0  35441 0  
   0
RequestResponseStage  0 0   26940702 0  
   0
MutationStage 1 0   42298652 0  
   0
ReadRepairStage   0 0101 0  
   0
GossipStage   0 0  77379 0  
   0
CacheCleanupExecutor  0 0  0 0  
   0
AntiEntropyStage  0 0333 0  
   0
MigrationStage0 0   1664 0  
   0
ValidationExecutor0 0 51 0  
   0
CommitLogArchiver 0 0  0 0  
   0
MiscStage 0 0  0 0  
   0
MemtableFlushWriter   1 1853 0  
   0
MemtableReclaimMemory 0 0853 0  
   0
PendingRangeCalculator0 0 12 0  
   0
MemtablePostFlush 0 0   1365 0  
   0
CompactionExecutor2 9244 0  
   0
InternalResponseStage 0 0   1679 0  
   0
HintedHandoff 0 0 13 0  
   0

Message type   Dropped
RANGE_SLICE  0
READ_REPAIR  0
PAGED_RANGE  0
BINARY   0
READ  6067
MUTATION 8
_TRACE   0
REQUEST_RESPONSE 0
COUNTER_MUTATION 0
{noformat}

ta

[jira] [Comment Edited] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces return java.lang.RuntimeException: Could not create snapshot

2015-01-29 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295881#comment-14295881
 ] 

Jeff Liu edited comment on CASSANDRA-8696 at 1/29/15 6:19 PM:
--

The replica nodes are not busy at all. Actually I have stopped all client 
connection while trying to run a nodetool repair. 

nodetool tpstats also shows that there isn't much activities going on.


{noformat}
Pool NameActive   Pending  Completed   Blocked  All 
time blocked
CounterMutationStage  0 0  0 0  
   0
ReadStage 0 0  35441 0  
   0
RequestResponseStage  0 0   26940702 0  
   0
MutationStage 1 0   42298652 0  
   0
ReadRepairStage   0 0101 0  
   0
GossipStage   0 0  77379 0  
   0
CacheCleanupExecutor  0 0  0 0  
   0
AntiEntropyStage  0 0333 0  
   0
MigrationStage0 0   1664 0  
   0
ValidationExecutor0 0 51 0  
   0
CommitLogArchiver 0 0  0 0  
   0
MiscStage 0 0  0 0  
   0
MemtableFlushWriter   1 1853 0  
   0
MemtableReclaimMemory 0 0853 0  
   0
PendingRangeCalculator0 0 12 0  
   0
MemtablePostFlush 0 0   1365 0  
   0
CompactionExecutor2 9244 0  
   0
InternalResponseStage 0 0   1679 0  
   0
HintedHandoff 0 0 13 0  
   0

Message type   Dropped
RANGE_SLICE  0
READ_REPAIR  0
PAGED_RANGE  0
BINARY   0
READ  6067
MUTATION 8
_TRACE   0
REQUEST_RESPONSE 0
COUNTER_MUTATION 0
{noformat}

tail -30 /var/log/cassandra/system.log where you can see the log suddenly 
stopped.

{noformat}
at 
org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at com.google.common.util.concurrent.Futures$4.run(Futures.java:1172) 
~[guava-16.0.jar:na]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
ERROR [AntiEntropySessions:38] 2015-01-28 19:47:39,746 CassandraDaemon.java:153 
- Exception in thread Thread[AntiEntropySessions:38,5,RMI Runtime]
java.lang.RuntimeException: java.io.IOException: Failed during snapshot 
creation.
at com.google.common.base.Throwables.propagate(Throwables.java:160) 
~[guava-16.0.jar:na]
at 
org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:32) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
~[na:1.7.0_45]
at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
~[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
Caused by: java.io.IOException: Failed during snapshot creation.
at 
org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at com.google.common.util.concurrent.Futures$4.run(Futures.java:1172) 
~[guava-16.0.jar:na]
... 3 common frames omitted
WARN  [HintedHandoff:2] 2015-01-28 19:47:39,749 SliceQueryFilter.java:236 - 
Read 0 live and 4525 tombstoned cells in system.hints (see 
tombstone_warn_threshold). 128 columns was requested, slices=[-], 
delInfo={deletedAt=-9223372036854775808, localDeletion=2147483647}
INFO 

[jira] [Commented] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces return java.lang.RuntimeException: Could not create snapshot

2015-01-29 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297272#comment-14297272
 ] 

Jeff Liu commented on CASSANDRA-8696:
-

It's getting worse today after having the "failed to create snapshot" error. 4 
nodes of 6 are showing "UN" in nodetool status. I checked the system.log and 
syslog and couldn't find anything that tells why the node was down. The java 
cassandra process is also gone.

I tried to restart cassandra service on one node and watch system.log. Shortly 
after the service start running, cassandra starts to throw "failed to create 
snapshot" error for a while before it hung with no log output, nodetool status 
show error:"error: No nodes present in the cluster. Has this node finished 
starting up?". However the cassandra java process is still running in the 
system.

{noformat}
ps -ef | grep cass
110  18634 1 19 18:01 ?00:01:39 java -ea 
-javaagent:/usr/share/cassandra/lib/jamm-0.2.8.jar -XX:+UseThreadPriorities 
-XX:ThreadPriorityPolicy=42 -Xms1792M -Xmx1792M -Xmn400M 
-XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTableSize=103 
-XX:+UseParNewGC -XX
:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled -XX:SurvivorRatio=8 
-XX:MaxTenuringThreshold=1 -XX:CMSInitiatingOccupancyFraction=75 
-XX:+UseCMSInitiatingOccupancyOnly -XX:+UseTLAB -XX:+CMSClassUnloadingEnabled 
-XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHeapAtGC -XX
:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime 
-XX:+PrintPromotionFailure -Xloggc:/var/log/cassandra/gc-1422554504.log 
-XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=48M 
-Djava.net.preferIPv4Stack=true -Dcom.sun.management.jmxremote.port=719
9 -Dcom.sun.management.jmxremote.ssl=false 
-Dcom.sun.management.jmxremote.authenticate=false 
-javaagent:/usr/share/java/graphite-reporter-agent-1.0-SNAPSHOT.jar=graphiteServer=metrics-a.hq.nest.com;graphitePort=2003;graphitePollInt=60
 -Dlogback.configurationFile=logback.xml -Dca
ssandra.logdir=/var/log/cassandra -Dcassandra.storagedir= 
-Dcassandra-pidfile=/var/run/cassandra/cassandra.pid -cp 
/etc/cassandra:/usr/share/cassandra/lib/airline-0.6.jar:/usr/share/cassandra/lib/antlr-runtime-3.5.2.jar:/usr/share/cassandra/lib/commons-cli-1.1.jar:/usr/share/cas
sandra/lib/commons-codec-1.2.jar:/usr/share/cassandra/lib/commons-lang3-3.1.jar:/usr/share/cassandra/lib/commons-math3-3.2.jar:/usr/share/cassandra/lib/compress-lzf-0.8.4.jar:/usr/share/cassandra/lib/concurrentlinkedhashmap-lru-1.4.jar:/usr/share/cassandra/lib/disruptor-3.0.1.ja
r:/usr/share/cassandra/lib/guava-16.0.jar:/usr/share/cassandra/lib/high-scale-lib-1.0.6.jar:/usr/share/cassandra/lib/jackson-core-asl-1.9.2.jar:/usr/share/cassandra/lib/jackson-mapper-asl-1.9.2.jar:/usr/share/cassandra/lib/jamm-0.2.8.jar:/usr/share/cassandra/lib/javax.inject.jar
:/usr/share/cassandra/lib/jbcrypt-0.3m.jar:/usr/share/cassandra/lib/jline-1.0.jar:/usr/share/cassandra/lib/jna-4.0.0.jar:/usr/share/cassandra/lib/json-simple-1.1.jar:/usr/share/cassandra/lib/libthrift-0.9.1.jar:/usr/share/cassandra/lib/logback-classic-1.1.2.jar:/usr/share/cassan
dra/lib/logback-core-1.1.2.jar:/usr/share/cassandra/lib/lz4-1.2.0.jar:/usr/share/cassandra/lib/metrics-core-2.2.0.jar:/usr/share/cassandra/lib/metrics-graphite-2.2.0.jar:/usr/share/cassandra/lib/mx4j-tools.jar:/usr/share/cassandra/lib/netty-all-4.0.23.Final.jar:/usr/share/cassan
dra/lib/reporter-config-2.1.0.jar:/usr/share/cassandra/lib/slf4j-api-1.7.2.jar:/usr/share/cassandra/lib/snakeyaml-1.11.jar:/usr/share/cassandra/lib/snappy-java-1.0.5.2.jar:/usr/share/cassandra/lib/stream-2.5.2.jar:/usr/share/cassandra/lib/stringtemplate-4.0.2.jar:/usr/share/cass
andra/lib/super-csv-2.1.0.jar:/usr/share/cassandra/lib/thrift-server-0.3.7.jar:/usr/share/cassandra/apache-cassandra-2.1.2.jar:/usr/share/cassandra/apache-cassandra-thrift-2.1.2.jar:/usr/share/cassandra/apache-cassandra.jar:/usr/share/cassandra/cassandra-driver-core-2.0.5.jar:/u
sr/share/cassandra/netty-3.9.0.Final.jar:/usr/share/cassandra/stress.jar: 
-XX:HeapDumpPath=/var/lib/cassandra/java_1422554504.hprof 
-XX:ErrorFile=/var/lib/cassandra/hs_err_1422554504.log 
org.apache.cassandra.service.CassandraDaemon
root 19317 19074  0 18:10 pts/100:00:00 grep --color=auto cass
#:/home/jliu# less /var/log/cassandra/system.log
{noformat}

{noformat}
#:/home/jliu# nodetool status
error: No nodes present in the cluster. Has this node finished starting up?
-- StackTrace --
java.lang.RuntimeException: No nodes present in the cluster. Has this node 
finished starting up?
at 
org.apache.cassandra.dht.RandomPartitioner.describeOwnership(RandomPartitioner.java:143)
at 
org.apache.cassandra.service.StorageService.getOwnership(StorageService.java:3702)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java

[jira] [Commented] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces return java.lang.RuntimeException: Could not create snapshot

2015-01-28 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295881#comment-14295881
 ] 

Jeff Liu commented on CASSANDRA-8696:
-

The replica nodes are not busy at all. Actually I have stopped all client 
connection while trying to run a nodetool repair. 

nodetool tpstats also shows that there isn't much activities going on.


{noformat}
Pool NameActive   Pending  Completed   Blocked  All 
time blocked
CounterMutationStage  0 0  0 0  
   0
ReadStage 0 0  35441 0  
   0
RequestResponseStage  0 0   26940702 0  
   0
MutationStage 1 0   42298652 0  
   0
ReadRepairStage   0 0101 0  
   0
GossipStage   0 0  77379 0  
   0
CacheCleanupExecutor  0 0  0 0  
   0
AntiEntropyStage  0 0333 0  
   0
MigrationStage0 0   1664 0  
   0
ValidationExecutor0 0 51 0  
   0
CommitLogArchiver 0 0  0 0  
   0
MiscStage 0 0  0 0  
   0
MemtableFlushWriter   1 1853 0  
   0
MemtableReclaimMemory 0 0853 0  
   0
PendingRangeCalculator0 0 12 0  
   0
MemtablePostFlush 0 0   1365 0  
   0
CompactionExecutor2 9244 0  
   0
InternalResponseStage 0 0   1679 0  
   0
HintedHandoff 0 0 13 0  
   0

Message type   Dropped
RANGE_SLICE  0
READ_REPAIR  0
PAGED_RANGE  0
BINARY   0
READ  6067
MUTATION 8
_TRACE   0
REQUEST_RESPONSE 0
COUNTER_MUTATION 0
{noformat}

> nodetool repair on cassandra 2.1.2 keyspaces return 
> java.lang.RuntimeException: Could not create snapshot
> -
>
> Key: CASSANDRA-8696
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> When trying to run nodetool repair -pr on cassandra node ( 2.1.2), cassandra 
> throw java exceptions: cannot create snapshot. 
> the error log from system.log:
> {noformat}
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
> StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
> files(632105 bytes)
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> Session with /10.97.9.110 is complete
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
> [Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
> Repair
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
> StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Starting streaming to /10.66.187.201
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
> StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
> ID#0] Beginning stream session with /10.66.187.201
> INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
> StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
> files(632105 bytes)
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
> StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Session with /10.66.187.201 is complete
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamResultFuture.ja

[jira] [Updated] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces return java.lang.RuntimeException: Could not create snapshot

2015-01-28 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8696:

Summary: nodetool repair on cassandra 2.1.2 keyspaces return 
java.lang.RuntimeException: Could not create snapshot  (was: nodetool repair on 
cassandra 2.1.2 keyspaces returnjava.lang.RuntimeException: Could not create 
snapshot)

> nodetool repair on cassandra 2.1.2 keyspaces return 
> java.lang.RuntimeException: Could not create snapshot
> -
>
> Key: CASSANDRA-8696
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> When trying to run nodetool repair -pr on cassandra node ( 2.1.2), cassandra 
> throw java exceptions: cannot create snapshot. 
> the error log from system.log:
> {noformat}
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
> StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
> files(632105 bytes)
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> Session with /10.97.9.110 is complete
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
> [Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
> Repair
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
> StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Starting streaming to /10.66.187.201
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
> StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
> ID#0] Beginning stream session with /10.66.187.201
> INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
> StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
> files(632105 bytes)
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
> StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Session with /10.66.187.201 is complete
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamResultFuture.java:212 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> ERROR [RepairJobTask:1] 2015-01-28 02:07:39,444 RepairJob.java:127 - Error 
> occurred during snapshot phase
> java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
> at 
> org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
> ~[na:1.7.0_45]
> at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
> ~[na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  [na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  [na:1.7.0_45]
> at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
> INFO  [AntiEntropySessions:6] 2015-01-28 02:07:39,445 RepairSession.java:260 
> - [repair #6f85e740-a692-11e4-9973-070e938df227] new session: will sync 
> /10.98.194.68, /10.66.187.201, /10.226.218.135 on range 
> (12817179804668051873746972069086
> 2638799,12863540308359254031520865977436165] for events.[bigint0text, 
> bigint0boolean, bigint0int, dataset_catalog, column_categories, 
> bigint0double, bigint0bigint]
> ERROR [AntiEntropySessions:5] 2015-01-28 02:07:39,445 RepairSession.java:303 
> - [repair #685e3d00-a692-11e4-9973-070e938df227] session completed with the 
> following error
> java.io.IOException: Failed during snapshot creation.
> at 
> org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> a

[jira] [Commented] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces returnjava.lang.RuntimeException: Could not create snapshot

2015-01-28 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295581#comment-14295581
 ] 

Jeff Liu commented on CASSANDRA-8696:
-

nodetool repair returns: "Lost notification.You should check server log for 
repair status of keyspace events"

# nodetool repair -pr
[2015-01-28 18:23:29,890] Starting repair command #1, repairing 256 ranges for 
keyspace events (seq=true, full=true)
[2015-01-28 18:26:29,913] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:29:29,918] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:30:29,920] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:32:29,923] Lost notification. You should check server log for 
repair status of keyspace events

> nodetool repair on cassandra 2.1.2 keyspaces 
> returnjava.lang.RuntimeException: Could not create snapshot
> 
>
> Key: CASSANDRA-8696
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> When trying to run nodetool repair -pr on cassandra node ( 2.1.2), cassandra 
> throw java exceptions: cannot create snapshot. 
> the error log from system.log:
> {noformat}
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
> StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
> files(632105 bytes)
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> Session with /10.97.9.110 is complete
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
> [Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
> Repair
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
> StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Starting streaming to /10.66.187.201
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
> StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
> ID#0] Beginning stream session with /10.66.187.201
> INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
> StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
> files(632105 bytes)
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
> StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Session with /10.66.187.201 is complete
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamResultFuture.java:212 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> ERROR [RepairJobTask:1] 2015-01-28 02:07:39,444 RepairJob.java:127 - Error 
> occurred during snapshot phase
> java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
> at 
> org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
> ~[na:1.7.0_45]
> at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
> ~[na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  [na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  [na:1.7.0_45]
> at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
> INFO  [AntiEntropySessions:6] 2015-01-28 02:07:39,445 RepairSession.java:260 
> - [repair #6f85e740-a692-11e4-9973-070e938df227] new session: will sync 
> /10.98.194.68, /10.66.187.201, /10.226.218.135 on range 
> (12817179804668051873746972069086
> 2638799,12863540308359254031520865977436165] for events.[bigint0text, 
> bigint0boolean, bigint0int, dataset_catalog, column_categories, 
> bigint0double, big

[jira] [Comment Edited] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces returnjava.lang.RuntimeException: Could not create snapshot

2015-01-28 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295581#comment-14295581
 ] 

Jeff Liu edited comment on CASSANDRA-8696 at 1/28/15 6:37 PM:
--

nodetool repair returns: "Lost notification.You should check server log for 
repair status of keyspace events"

{noformat}
# nodetool repair -pr
[2015-01-28 18:23:29,890] Starting repair command #1, repairing 256 ranges for 
keyspace events (seq=true, full=true)
[2015-01-28 18:26:29,913] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:29:29,918] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:30:29,920] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:32:29,923] Lost notification. You should check server log for 
repair status of keyspace events
{noformat}


was (Author: jeffl):
nodetool repair returns: "Lost notification.You should check server log for 
repair status of keyspace events"

{format}
# nodetool repair -pr
[2015-01-28 18:23:29,890] Starting repair command #1, repairing 256 ranges for 
keyspace events (seq=true, full=true)
[2015-01-28 18:26:29,913] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:29:29,918] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:30:29,920] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:32:29,923] Lost notification. You should check server log for 
repair status of keyspace events
{format}

> nodetool repair on cassandra 2.1.2 keyspaces 
> returnjava.lang.RuntimeException: Could not create snapshot
> 
>
> Key: CASSANDRA-8696
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> When trying to run nodetool repair -pr on cassandra node ( 2.1.2), cassandra 
> throw java exceptions: cannot create snapshot. 
> the error log from system.log:
> {noformat}
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
> StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
> files(632105 bytes)
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> Session with /10.97.9.110 is complete
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
> [Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
> Repair
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
> StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Starting streaming to /10.66.187.201
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
> StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
> ID#0] Beginning stream session with /10.66.187.201
> INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
> StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
> files(632105 bytes)
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
> StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Session with /10.66.187.201 is complete
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamResultFuture.java:212 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> ERROR [RepairJobTask:1] 2015-01-28 02:07:39,444 RepairJob.java:127 - Error 
> occurred during snapshot phase
> java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
> at 
> org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
> 

[jira] [Comment Edited] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces returnjava.lang.RuntimeException: Could not create snapshot

2015-01-28 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295581#comment-14295581
 ] 

Jeff Liu edited comment on CASSANDRA-8696 at 1/28/15 6:37 PM:
--

nodetool repair returns: "Lost notification.You should check server log for 
repair status of keyspace events"

{format}
# nodetool repair -pr
[2015-01-28 18:23:29,890] Starting repair command #1, repairing 256 ranges for 
keyspace events (seq=true, full=true)
[2015-01-28 18:26:29,913] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:29:29,918] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:30:29,920] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:32:29,923] Lost notification. You should check server log for 
repair status of keyspace events
{format}


was (Author: jeffl):
nodetool repair returns: "Lost notification.You should check server log for 
repair status of keyspace events"

# nodetool repair -pr
[2015-01-28 18:23:29,890] Starting repair command #1, repairing 256 ranges for 
keyspace events (seq=true, full=true)
[2015-01-28 18:26:29,913] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:29:29,918] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:30:29,920] Lost notification. You should check server log for 
repair status of keyspace events
[2015-01-28 18:32:29,923] Lost notification. You should check server log for 
repair status of keyspace events

> nodetool repair on cassandra 2.1.2 keyspaces 
> returnjava.lang.RuntimeException: Could not create snapshot
> 
>
> Key: CASSANDRA-8696
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> When trying to run nodetool repair -pr on cassandra node ( 2.1.2), cassandra 
> throw java exceptions: cannot create snapshot. 
> the error log from system.log:
> {noformat}
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
> StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
> files(632105 bytes)
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> Session with /10.97.9.110 is complete
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
> [Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
> Repair
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
> StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Starting streaming to /10.66.187.201
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
> StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
> ID#0] Beginning stream session with /10.66.187.201
> INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
> StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
> files(632105 bytes)
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
> StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Session with /10.66.187.201 is complete
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamResultFuture.java:212 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> ERROR [RepairJobTask:1] 2015-01-28 02:07:39,444 RepairJob.java:127 - Error 
> occurred during snapshot phase
> java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
> at 
> org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
> ~[na:1.7.0_45]
>  

[jira] [Commented] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces returnjava.lang.RuntimeException: Could not create snapshot

2015-01-28 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295578#comment-14295578
 ] 

Jeff Liu commented on CASSANDRA-8696:
-

[~yukim]

I continued to hop on 10.66.187.201 and run nodetool repair -pr on this node 
and observed the similar failed to create snapshot error.

{/noformat}
INFO  [RepairJobTask:15] 2015-01-28 18:29:25,378 StreamResultFuture.java:86 - 
[Stream #93af91f0-a71b-11e4-820f-edcf08b136c7] Executing streaming plan for 
Repair
INFO  [StreamConnectionEstablisher:5] 2015-01-28 18:29:25,379 
StreamSession.java:213 - [Stream #93af91f0-a71b-11e4-820f-edcf08b136c7] 
Starting streaming to /10.98.194.68
INFO  [StreamConnectionEstablisher:5] 2015-01-28 18:29:25,384 
StreamCoordinator.java:209 - [Stream #93af91f0-a71b-11e4-820f-edcf08b136c7, 
ID#0] Beginning stream session with /10.98.194.68
INFO  [STREAM-IN-/10.98.194.68] 2015-01-28 18:29:25,590 
StreamResultFuture.java:166 - [Stream #93af91f0-a71b-11e4-820f-edcf08b136c7 
ID#0] Prepare completed. Receiving 47 files(40043719 bytes), sending 1100 
files(151144862 bytes)
ERROR [RepairJobTask:15] 2015-01-28 18:29:31,654 RepairJob.java:127 - Error 
occurred during snapshot phase
java.lang.RuntimeException: Could not create snapshot at /10.66.187.201
at 
org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
~[na:1.7.0_45]
at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
~[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
ERROR [AntiEntropySessions:1] 2015-01-28 18:29:31,656 RepairSession.java:303 - 
[repair #c35edd80-a71a-11e4-820f-edcf08b136c7] session completed with the 
following error
java.io.IOException: Failed during snapshot creation.
at 
org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at com.google.common.util.concurrent.Futures$4.run(Futures.java:1172) 
~[guava-16.0.jar:na]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
INFO  [AntiEntropySessions:5] 2015-01-28 18:29:31,656 RepairSession.java:260 - 
[repair #99efe970-a71b-11e4-820f-edcf08b136c7] new session: will sync 
/10.66.187.201, /10.66.121.76, /10.97.9.110 on range 
(1578063643453800039554350043004094,15825842670805971884119537801885597891] 
for events.[column_categories, bigint0double, bigint0text, bigint0bigint, 
bigint0int, bigint0boolean, dataset_catalog]
ERROR [AntiEntropySessions:1] 2015-01-28 18:29:31,658 CassandraDaemon.java:153 
- Exception in thread Thread[AntiEntropySessions:1,5,RMI Runtime]
java.lang.RuntimeException: java.io.IOException: Failed during snapshot 
creation.
at com.google.common.base.Throwables.propagate(Throwables.java:160) 
~[guava-16.0.jar:na]
at 
org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:32) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
~[na:1.7.0_45]
at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
~[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
Caused by: java.io.IOException: Failed during snapshot creation.
at 
org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at com.google.common.util.concurrent.Futures$4.run(Futures.java:1172) 
~[guava-16.0.jar:na]
... 3 common frames omitted
ERROR [RepairJobTask:3] 2015-01-28 18:29:41,664 RepairJob.java:127 - Error 
occurred during snapshot phase
java.lang.RuntimeException: Could not create snapshot at /10.66.187.201
at 
org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
 ~[apache-cassandra-2.1

[jira] [Comment Edited] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces returnjava.lang.RuntimeException: Could not create snapshot

2015-01-28 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295578#comment-14295578
 ] 

Jeff Liu edited comment on CASSANDRA-8696 at 1/28/15 6:35 PM:
--

[~yukim]

I continued to hop on 10.66.187.201 and run nodetool repair -pr on this node 
and observed the similar failed to create snapshot error.

{noformat}
INFO  [RepairJobTask:15] 2015-01-28 18:29:25,378 StreamResultFuture.java:86 - 
[Stream #93af91f0-a71b-11e4-820f-edcf08b136c7] Executing streaming plan for 
Repair
INFO  [StreamConnectionEstablisher:5] 2015-01-28 18:29:25,379 
StreamSession.java:213 - [Stream #93af91f0-a71b-11e4-820f-edcf08b136c7] 
Starting streaming to /10.98.194.68
INFO  [StreamConnectionEstablisher:5] 2015-01-28 18:29:25,384 
StreamCoordinator.java:209 - [Stream #93af91f0-a71b-11e4-820f-edcf08b136c7, 
ID#0] Beginning stream session with /10.98.194.68
INFO  [STREAM-IN-/10.98.194.68] 2015-01-28 18:29:25,590 
StreamResultFuture.java:166 - [Stream #93af91f0-a71b-11e4-820f-edcf08b136c7 
ID#0] Prepare completed. Receiving 47 files(40043719 bytes), sending 1100 
files(151144862 bytes)
ERROR [RepairJobTask:15] 2015-01-28 18:29:31,654 RepairJob.java:127 - Error 
occurred during snapshot phase
java.lang.RuntimeException: Could not create snapshot at /10.66.187.201
at 
org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
~[na:1.7.0_45]
at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
~[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
ERROR [AntiEntropySessions:1] 2015-01-28 18:29:31,656 RepairSession.java:303 - 
[repair #c35edd80-a71a-11e4-820f-edcf08b136c7] session completed with the 
following error
java.io.IOException: Failed during snapshot creation.
at 
org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at com.google.common.util.concurrent.Futures$4.run(Futures.java:1172) 
~[guava-16.0.jar:na]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
INFO  [AntiEntropySessions:5] 2015-01-28 18:29:31,656 RepairSession.java:260 - 
[repair #99efe970-a71b-11e4-820f-edcf08b136c7] new session: will sync 
/10.66.187.201, /10.66.121.76, /10.97.9.110 on range 
(1578063643453800039554350043004094,15825842670805971884119537801885597891] 
for events.[column_categories, bigint0double, bigint0text, bigint0bigint, 
bigint0int, bigint0boolean, dataset_catalog]
ERROR [AntiEntropySessions:1] 2015-01-28 18:29:31,658 CassandraDaemon.java:153 
- Exception in thread Thread[AntiEntropySessions:1,5,RMI Runtime]
java.lang.RuntimeException: java.io.IOException: Failed during snapshot 
creation.
at com.google.common.base.Throwables.propagate(Throwables.java:160) 
~[guava-16.0.jar:na]
at 
org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:32) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
~[na:1.7.0_45]
at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
~[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
~[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
Caused by: java.io.IOException: Failed during snapshot creation.
at 
org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at com.google.common.util.concurrent.Futures$4.run(Futures.java:1172) 
~[guava-16.0.jar:na]
... 3 common frames omitted
ERROR [RepairJobTask:3] 2015-01-28 18:29:41,664 RepairJob.java:127 - Error 
occurred during snapshot phase
java.lang.RuntimeException: Could not create snapshot at /10.66.187.201
at 
org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailu

[jira] [Commented] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces returnjava.lang.RuntimeException: Could not create snapshot

2015-01-28 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295547#comment-14295547
 ] 

Jeff Liu commented on CASSANDRA-8696:
-

Interesting thing is that I haven't seen any repair error in node 10.97.9.110 
system.log. From another post on serverfault website, Some other people 
suggested that additional logs will show if manually running nodetool snapshot 
on the problematic node. but I was able to successfully run nodetool snapshot 
on node 10.97.9.110 without seeing any error log related to snapshot creation.

> nodetool repair on cassandra 2.1.2 keyspaces 
> returnjava.lang.RuntimeException: Could not create snapshot
> 
>
> Key: CASSANDRA-8696
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> When trying to run nodetool repair -pr on cassandra node ( 2.1.2), cassandra 
> throw java exceptions: cannot create snapshot. 
> the error log from system.log:
> {noformat}
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
> StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
> files(632105 bytes)
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> Session with /10.97.9.110 is complete
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
> [Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
> Repair
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
> StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Starting streaming to /10.66.187.201
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
> StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
> ID#0] Beginning stream session with /10.66.187.201
> INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
> StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
> files(632105 bytes)
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
> StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Session with /10.66.187.201 is complete
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamResultFuture.java:212 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> ERROR [RepairJobTask:1] 2015-01-28 02:07:39,444 RepairJob.java:127 - Error 
> occurred during snapshot phase
> java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
> at 
> org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
> ~[na:1.7.0_45]
> at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
> ~[na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  [na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  [na:1.7.0_45]
> at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
> INFO  [AntiEntropySessions:6] 2015-01-28 02:07:39,445 RepairSession.java:260 
> - [repair #6f85e740-a692-11e4-9973-070e938df227] new session: will sync 
> /10.98.194.68, /10.66.187.201, /10.226.218.135 on range 
> (12817179804668051873746972069086
> 2638799,12863540308359254031520865977436165] for events.[bigint0text, 
> bigint0boolean, bigint0int, dataset_catalog, column_categories, 
> bigint0double, bigint0bigint]
> ERROR [AntiEntropySessions:5] 2015-01-28 02:07:39,445 RepairSession.java:303 
> - [repair #685e3d00-a692-11e4-9973-070e938df227] session completed with the 
> following error
> java.io.IOException: Failed during snapshot creation.
> at 
> org.apache.cassandra.repair.RepairSession.fai

[jira] [Commented] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces returnjava.lang.RuntimeException: Could not create snapshot

2015-01-28 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295552#comment-14295552
 ] 

Jeff Liu commented on CASSANDRA-8696:
-

I take my previous reply back.
I do see similar error log of failing to create snapshot now.

{noformat}
INFO  [RepairJobTask:14] 2015-01-28 18:15:07,253 Differencer.java:67 - [repair 
#c4fa7700-a718-11e4-a4fa-21b31e869e75] Endpoints /10.115.61.85 and /10.97.9.110 
are consistent for bigint0bigint
INFO  [RepairJobTask:15] 2015-01-28 18:15:07,254 Differencer.java:74 - [repair 
#c4fa7700-a718-11e4-a4fa-21b31e869e75] Endpoints /10.66.187.201 and 
/10.97.9.110 have 1 range(s) out of sync for bigint0bigint
INFO  [RepairJobTask:13] 2015-01-28 18:15:07,254 Differencer.java:74 - [repair 
#c4fa7700-a718-11e4-a4fa-21b31e869e75] Endpoints /10.115.61.85 and 
/10.66.187.201 have 1 range(s) out of sync for bigint0bigint
INFO  [RepairJobTask:13] 2015-01-28 18:15:07,254 StreamingRepairTask.java:81 - 
[repair #c4fa7700-a718-11e4-a4fa-21b31e869e75] Forwarding streaming repair of 1 
ranges to /10.115.61.85 (to be streamed with /10.66.187.201)
INFO  [AntiEntropyStage:1] 2015-01-28 18:15:07,256 RepairSession.java:171 - 
[repair #634aaed0-a718-11e4-a4fa-21b31e869e75] Received merkle tree for 
bigint0bigint from /10.97.9.110
INFO  [RepairJobTask:14] 2015-01-28 18:15:07,257 Differencer.java:67 - [repair 
#634aaed0-a718-11e4-a4fa-21b31e869e75] Endpoints /10.115.61.85 and /10.97.9.110 
are consistent for bigint0bigint
INFO  [RepairJobTask:13] 2015-01-28 18:15:07,257 Differencer.java:74 - [repair 
#634aaed0-a718-11e4-a4fa-21b31e869e75] Endpoints /10.115.61.85 and 
/10.66.187.201 have 1 range(s) out of sync for bigint0bigint
INFO  [RepairJobTask:15] 2015-01-28 18:15:07,257 Differencer.java:74 - [repair 
#634aaed0-a718-11e4-a4fa-21b31e869e75] Endpoints /10.66.187.201 and 
/10.97.9.110 have 1 range(s) out of sync for bigint0bigint
INFO  [RepairJobTask:13] 2015-01-28 18:15:07,257 StreamingRepairTask.java:81 - 
[repair #634aaed0-a718-11e4-a4fa-21b31e869e75] Forwarding streaming repair of 1 
ranges to /10.115.61.85 (to be streamed with /10.66.187.201)
INFO  [RepairJobTask:15] 2015-01-28 18:15:07,330 StreamingRepairTask.java:68 - 
[streaming task #c4fa7700-a718-11e4-a4fa-21b31e869e75] Performing streaming 
repair of 1 ranges with /10.66.187.201
INFO  [RepairJobTask:15] 2015-01-28 18:15:07,330 StreamingRepairTask.java:68 - 
[streaming task #634aaed0-a718-11e4-a4fa-21b31e869e75] Performing streaming 
repair of 1 ranges with /10.66.187.201
INFO  [RepairJobTask:15] 2015-01-28 18:15:09,823 StreamResultFuture.java:86 - 
[Stream #96c28930-a719-11e4-a4fa-21b31e869e75] Executing streaming plan for 
Repair
INFO  [RepairJobTask:15] 2015-01-28 18:15:09,823 StreamResultFuture.java:86 - 
[Stream #96c28931-a719-11e4-a4fa-21b31e869e75] Executing streaming plan for 
Repair
INFO  [StreamConnectionEstablisher:2] 2015-01-28 18:15:09,823 
StreamSession.java:213 - [Stream #96c28930-a719-11e4-a4fa-21b31e869e75] 
Starting streaming to /10.66.187.201
INFO  [StreamConnectionEstablisher:3] 2015-01-28 18:15:09,824 
StreamSession.java:213 - [Stream #96c28931-a719-11e4-a4fa-21b31e869e75] 
Starting streaming to /10.66.187.201
INFO  [StreamConnectionEstablisher:3] 2015-01-28 18:15:09,826 
StreamCoordinator.java:209 - [Stream #96c28931-a719-11e4-a4fa-21b31e869e75, 
ID#0] Beginning stream session with /10.66.187.201
INFO  [StreamConnectionEstablisher:2] 2015-01-28 18:15:09,826 
StreamCoordinator.java:209 - [Stream #96c28930-a719-11e4-a4fa-21b31e869e75, 
ID#0] Beginning stream session with /10.66.187.201
ERROR [RepairJobTask:15] 2015-01-28 18:15:17,905 RepairJob.java:127 - Error 
occurred during snapshot phase
java.lang.RuntimeException: Could not create snapshot at /10.66.187.201
at 
org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
~[na:1.7.0_45]
at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
~[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
ERROR [RepairJobTask:15] 2015-01-28 18:15:17,905 RepairJob.java:127 - Error 
occurred during snapshot phase
java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
at 
org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
~[apache-cassandra-2.1.2.jar:2.

[jira] [Comment Edited] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces returnjava.lang.RuntimeException: Could not create snapshot

2015-01-28 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295552#comment-14295552
 ] 

Jeff Liu edited comment on CASSANDRA-8696 at 1/28/15 6:17 PM:
--

I take my words saying no error log in my previous reply back.
I do see similar error log of failing to create snapshot now.

{noformat}
INFO  [RepairJobTask:14] 2015-01-28 18:15:07,253 Differencer.java:67 - [repair 
#c4fa7700-a718-11e4-a4fa-21b31e869e75] Endpoints /10.115.61.85 and /10.97.9.110 
are consistent for bigint0bigint
INFO  [RepairJobTask:15] 2015-01-28 18:15:07,254 Differencer.java:74 - [repair 
#c4fa7700-a718-11e4-a4fa-21b31e869e75] Endpoints /10.66.187.201 and 
/10.97.9.110 have 1 range(s) out of sync for bigint0bigint
INFO  [RepairJobTask:13] 2015-01-28 18:15:07,254 Differencer.java:74 - [repair 
#c4fa7700-a718-11e4-a4fa-21b31e869e75] Endpoints /10.115.61.85 and 
/10.66.187.201 have 1 range(s) out of sync for bigint0bigint
INFO  [RepairJobTask:13] 2015-01-28 18:15:07,254 StreamingRepairTask.java:81 - 
[repair #c4fa7700-a718-11e4-a4fa-21b31e869e75] Forwarding streaming repair of 1 
ranges to /10.115.61.85 (to be streamed with /10.66.187.201)
INFO  [AntiEntropyStage:1] 2015-01-28 18:15:07,256 RepairSession.java:171 - 
[repair #634aaed0-a718-11e4-a4fa-21b31e869e75] Received merkle tree for 
bigint0bigint from /10.97.9.110
INFO  [RepairJobTask:14] 2015-01-28 18:15:07,257 Differencer.java:67 - [repair 
#634aaed0-a718-11e4-a4fa-21b31e869e75] Endpoints /10.115.61.85 and /10.97.9.110 
are consistent for bigint0bigint
INFO  [RepairJobTask:13] 2015-01-28 18:15:07,257 Differencer.java:74 - [repair 
#634aaed0-a718-11e4-a4fa-21b31e869e75] Endpoints /10.115.61.85 and 
/10.66.187.201 have 1 range(s) out of sync for bigint0bigint
INFO  [RepairJobTask:15] 2015-01-28 18:15:07,257 Differencer.java:74 - [repair 
#634aaed0-a718-11e4-a4fa-21b31e869e75] Endpoints /10.66.187.201 and 
/10.97.9.110 have 1 range(s) out of sync for bigint0bigint
INFO  [RepairJobTask:13] 2015-01-28 18:15:07,257 StreamingRepairTask.java:81 - 
[repair #634aaed0-a718-11e4-a4fa-21b31e869e75] Forwarding streaming repair of 1 
ranges to /10.115.61.85 (to be streamed with /10.66.187.201)
INFO  [RepairJobTask:15] 2015-01-28 18:15:07,330 StreamingRepairTask.java:68 - 
[streaming task #c4fa7700-a718-11e4-a4fa-21b31e869e75] Performing streaming 
repair of 1 ranges with /10.66.187.201
INFO  [RepairJobTask:15] 2015-01-28 18:15:07,330 StreamingRepairTask.java:68 - 
[streaming task #634aaed0-a718-11e4-a4fa-21b31e869e75] Performing streaming 
repair of 1 ranges with /10.66.187.201
INFO  [RepairJobTask:15] 2015-01-28 18:15:09,823 StreamResultFuture.java:86 - 
[Stream #96c28930-a719-11e4-a4fa-21b31e869e75] Executing streaming plan for 
Repair
INFO  [RepairJobTask:15] 2015-01-28 18:15:09,823 StreamResultFuture.java:86 - 
[Stream #96c28931-a719-11e4-a4fa-21b31e869e75] Executing streaming plan for 
Repair
INFO  [StreamConnectionEstablisher:2] 2015-01-28 18:15:09,823 
StreamSession.java:213 - [Stream #96c28930-a719-11e4-a4fa-21b31e869e75] 
Starting streaming to /10.66.187.201
INFO  [StreamConnectionEstablisher:3] 2015-01-28 18:15:09,824 
StreamSession.java:213 - [Stream #96c28931-a719-11e4-a4fa-21b31e869e75] 
Starting streaming to /10.66.187.201
INFO  [StreamConnectionEstablisher:3] 2015-01-28 18:15:09,826 
StreamCoordinator.java:209 - [Stream #96c28931-a719-11e4-a4fa-21b31e869e75, 
ID#0] Beginning stream session with /10.66.187.201
INFO  [StreamConnectionEstablisher:2] 2015-01-28 18:15:09,826 
StreamCoordinator.java:209 - [Stream #96c28930-a719-11e4-a4fa-21b31e869e75, 
ID#0] Beginning stream session with /10.66.187.201
ERROR [RepairJobTask:15] 2015-01-28 18:15:17,905 RepairJob.java:127 - Error 
occurred during snapshot phase
java.lang.RuntimeException: Could not create snapshot at /10.66.187.201
at 
org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
~[na:1.7.0_45]
at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
~[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
ERROR [RepairJobTask:15] 2015-01-28 18:15:17,905 RepairJob.java:127 - Error 
occurred during snapshot phase
java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
at 
org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.net.Mes

[jira] [Comment Edited] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces returnjava.lang.RuntimeException: Could not create snapshot

2015-01-28 Thread Jeff Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295547#comment-14295547
 ] 

Jeff Liu edited comment on CASSANDRA-8696 at 1/28/15 6:15 PM:
--

Interesting thing is that I haven't seen any repair error in node 10.97.9.110 
system.log. From another post on serverfault website, another people suggested 
that additional logs will show if manually running nodetool snapshot on the 
problematic node. but I was able to successfully run nodetool snapshot on node 
10.97.9.110 without seeing any error log related to snapshot creation.


was (Author: jeffl):
Interesting thing is that I haven't seen any repair error in node 10.97.9.110 
system.log. From another post on serverfault website, Some other people 
suggested that additional logs will show if manually running nodetool snapshot 
on the problematic node. but I was able to successfully run nodetool snapshot 
on node 10.97.9.110 without seeing any error log related to snapshot creation.

> nodetool repair on cassandra 2.1.2 keyspaces 
> returnjava.lang.RuntimeException: Could not create snapshot
> 
>
> Key: CASSANDRA-8696
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Jeff Liu
>
> When trying to run nodetool repair -pr on cassandra node ( 2.1.2), cassandra 
> throw java exceptions: cannot create snapshot. 
> the error log from system.log:
> {noformat}
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
> StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
> files(632105 bytes)
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> Session with /10.97.9.110 is complete
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
> StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
> [Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
> Repair
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
> StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Starting streaming to /10.66.187.201
> INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
> StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
> ID#0] Beginning stream session with /10.66.187.201
> INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
> StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
> ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
> files(632105 bytes)
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
> StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> Session with /10.66.187.201 is complete
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamResultFuture.java:212 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
> All sessions completed
> INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
> StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
> streaming task succeed, returning response to /10.98.194.68
> ERROR [RepairJobTask:1] 2015-01-28 02:07:39,444 RepairJob.java:127 - Error 
> occurred during snapshot phase
> java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
> at 
> org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
>  ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
> ~[apache-cassandra-2.1.2.jar:2.1.2]
> at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
> ~[na:1.7.0_45]
> at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
> ~[na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  [na:1.7.0_45]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  [na:1.7.0_45]
> at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
> INFO  [AntiEntropySessions:6] 2015-01-28 02:07:39,445 RepairSession.java:260 
> - [repair #6f85e740-a692-11e4-9973-070e938df227] new session: will sync 
> /10.98.194.68, /10.66.187.201, /10.226.218.135 on range 
> (12817179804668051873746972069086
> 2638799,1

[jira] [Created] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces returnjava.lang.RuntimeException: Could not create snapshot

2015-01-27 Thread Jeff Liu (JIRA)
Jeff Liu created CASSANDRA-8696:
---

 Summary: nodetool repair on cassandra 2.1.2 keyspaces 
returnjava.lang.RuntimeException: Could not create snapshot
 Key: CASSANDRA-8696
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
 Project: Cassandra
  Issue Type: Bug
Reporter: Jeff Liu


When trying to run nodetool repair -pr on cassandra node ( 2.1.2), cassandra 
throw java exceptions: cannot create snapshot. 

the error log from system.log:

{noformat}
INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
files(632105 bytes)
INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
Session with /10.97.9.110 is complete
INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
All sessions completed
INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
streaming task succeed, returning response to /10.98.194.68
INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
[Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
Repair
INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
Starting streaming to /10.66.187.201
INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
ID#0] Beginning stream session with /10.66.187.201
INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
files(632105 bytes)
INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
Session with /10.66.187.201 is complete
INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
StreamResultFuture.java:212 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
All sessions completed
INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
streaming task succeed, returning response to /10.98.194.68
ERROR [RepairJobTask:1] 2015-01-28 02:07:39,444 RepairJob.java:127 - Error 
occurred during snapshot phase
java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
at 
org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
~[na:1.7.0_45]
at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
~[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
INFO  [AntiEntropySessions:6] 2015-01-28 02:07:39,445 RepairSession.java:260 - 
[repair #6f85e740-a692-11e4-9973-070e938df227] new session: will sync 
/10.98.194.68, /10.66.187.201, /10.226.218.135 on range 
(12817179804668051873746972069086
2638799,12863540308359254031520865977436165] for events.[bigint0text, 
bigint0boolean, bigint0int, dataset_catalog, column_categories, bigint0double, 
bigint0bigint]
ERROR [AntiEntropySessions:5] 2015-01-28 02:07:39,445 RepairSession.java:303 - 
[repair #685e3d00-a692-11e4-9973-070e938df227] session completed with the 
following error
java.io.IOException: Failed during snapshot creation.
at 
org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at com.google.common.util.concurrent.Futures$4.run(Futures.java:1172) 
~[guava-16.0.jar:na]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
ERROR [AntiEntropySessions:5] 2015-01-28 02:07:39,446 CassandraDaemon.java:153 
- Exception in thread Thread[AntiEntropySessions:5,5,RMI Runtime]
java.lang.RuntimeException: java.io.IOException: Failed during snapshot 
creation.
at com.goog

[jira] [Updated] (CASSANDRA-8689) Assertion error in 2.1.2: ERROR [IndexSummaryManager:1]

2015-01-26 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8689:

Description: 
After upgrading a 6 nodes cassandra from 2.1.0 to 2.1.2, start getting the 
following assertion error.

{noformat}
ERROR [IndexSummaryManager:1] 2015-01-26 20:55:40,451 CassandraDaemon.java:153 
- Exception in thread Thread[IndexSummaryManager:1,1,main]
java.lang.AssertionError: null
at org.apache.cassandra.io.util.Memory.size(Memory.java:307) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummary.getOffHeapSize(IndexSummary.java:192)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.SSTableReader.getIndexSummaryOffHeapSize(SSTableReader.java:1070)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:292)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:238)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager$1.runMayThrow(IndexSummaryManager.java:139)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.concurrent.DebuggableScheduledThreadPoolExecutor$UncomplainingRunnable.run(DebuggableScheduledThreadPoolExecutor.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
[na:1.7.0_45]
at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:304) 
[na:1.7.0_45]
at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:178)
 [na:1.7.0_45]
at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
 [na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
{noformat}

cassandra service is still running despite the issue. Node has total 8G memory 
with 2G allocated to heap. We are basically running read queries to retrieve 
data out of cassandra.




  was:
After upgrading a 6 nodes cassandra from 2.1.0 to 2.1.2, start getting the 
following assertion error.

{noformat}
ERROR [IndexSummaryManager:1] 2015-01-26 20:55:40,451 CassandraDaemon.java:153 
- Exception in thread Thread[IndexSummaryManager:1,1,main]
java.lang.AssertionError: null
at org.apache.cassandra.io.util.Memory.size(Memory.java:307) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummary.getOffHeapSize(IndexSummary.java:192)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.SSTableReader.getIndexSummaryOffHeapSize(SSTableReader.java:1070)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:292)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:238)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager$1.runMayThrow(IndexSummaryManager.java:139)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.concurrent.DebuggableScheduledThreadPoolExecutor$UncomplainingRunnable.run(DebuggableScheduledThreadPoolExecutor.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
[na:1.7.0_45]
at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:304) 
[na:1.7.0_45]
at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:178)
 [na:1.7.0_45]
at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
 [na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
{noformat}

cassandra service is still running despite the issue. Node has total 8G memory 
with 2G allocated to heap.



> Assertion error in 2.1.2: ERROR [IndexSummaryManager:1]
> -

[jira] [Updated] (CASSANDRA-8689) Assertion error in 2.1.2: ERROR [IndexSummaryManager:1]

2015-01-26 Thread Jeff Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Liu updated CASSANDRA-8689:

Description: 
After upgrading a 6 nodes cassandra from 2.1.0 to 2.1.2, start getting the 
following assertion error.

{noformat}
ERROR [IndexSummaryManager:1] 2015-01-26 20:55:40,451 CassandraDaemon.java:153 
- Exception in thread Thread[IndexSummaryManager:1,1,main]
java.lang.AssertionError: null
at org.apache.cassandra.io.util.Memory.size(Memory.java:307) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummary.getOffHeapSize(IndexSummary.java:192)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.SSTableReader.getIndexSummaryOffHeapSize(SSTableReader.java:1070)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:292)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:238)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager$1.runMayThrow(IndexSummaryManager.java:139)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.concurrent.DebuggableScheduledThreadPoolExecutor$UncomplainingRunnable.run(DebuggableScheduledThreadPoolExecutor.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
[na:1.7.0_45]
at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:304) 
[na:1.7.0_45]
at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:178)
 [na:1.7.0_45]
at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
 [na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
{noformat}

cassandra service is still running despite the issue. Node has total 8G memory 
with 2G allocated to heap.


  was:
After upgrading a 6 nodes cassandra from 2.1.0 to 2.1.2, start getting the 
following assertion error.


ERROR [IndexSummaryManager:1] 2015-01-26 20:55:40,451 CassandraDaemon.java:153 
- Exception in thread Thread[IndexSummaryManager:1,1,main]
java.lang.AssertionError: null
at org.apache.cassandra.io.util.Memory.size(Memory.java:307) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummary.getOffHeapSize(IndexSummary.java:192)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.SSTableReader.getIndexSummaryOffHeapSize(SSTableReader.java:1070)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:292)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:238)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager$1.runMayThrow(IndexSummaryManager.java:139)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.concurrent.DebuggableScheduledThreadPoolExecutor$UncomplainingRunnable.run(DebuggableScheduledThreadPoolExecutor.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
[na:1.7.0_45]
at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:304) 
[na:1.7.0_45]
at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:178)
 [na:1.7.0_45]
at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
 [na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]



> Assertion error in 2.1.2: ERROR [IndexSummaryManager:1]
> ---
>
> Key: CASSANDRA-8689
> URL: https://issues.apache.org/jira/browse/CASSANDRA-8689
> Project: Cassandra
>  Issue Type: Bug
> 

[jira] [Created] (CASSANDRA-8689) Assertion error in 2.1.2: ERROR [IndexSummaryManager:1]

2015-01-26 Thread Jeff Liu (JIRA)
Jeff Liu created CASSANDRA-8689:
---

 Summary: Assertion error in 2.1.2: ERROR [IndexSummaryManager:1]
 Key: CASSANDRA-8689
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8689
 Project: Cassandra
  Issue Type: Bug
Reporter: Jeff Liu


After upgrading a 6 nodes cassandra from 2.1.0 to 2.1.2, start getting the 
following assertion error.


ERROR [IndexSummaryManager:1] 2015-01-26 20:55:40,451 CassandraDaemon.java:153 
- Exception in thread Thread[IndexSummaryManager:1,1,main]
java.lang.AssertionError: null
at org.apache.cassandra.io.util.Memory.size(Memory.java:307) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummary.getOffHeapSize(IndexSummary.java:192)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.SSTableReader.getIndexSummaryOffHeapSize(SSTableReader.java:1070)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:292)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager.redistributeSummaries(IndexSummaryManager.java:238)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.io.sstable.IndexSummaryManager$1.runMayThrow(IndexSummaryManager.java:139)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) 
~[apache-cassandra-2.1.2.jar:2.1.2]
at 
org.apache.cassandra.concurrent.DebuggableScheduledThreadPoolExecutor$UncomplainingRunnable.run(DebuggableScheduledThreadPoolExecutor.java:77)
 ~[apache-cassandra-2.1.2.jar:2.1.2]
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
[na:1.7.0_45]
at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:304) 
[na:1.7.0_45]
at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:178)
 [na:1.7.0_45]
at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
 [na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_45]
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)