[jira] [Created] (CASSANDRA-15541) Add dch release version to debian packages

2020-02-03 Thread Michael Semb Wever (Jira)
Michael Semb Wever created CASSANDRA-15541:
--

 Summary: Add dch release version to debian packages
 Key: CASSANDRA-15541
 URL: https://issues.apache.org/jira/browse/CASSANDRA-15541
 Project: Cassandra
  Issue Type: Task
  Components: Packaging
Reporter: Michael Semb Wever


Add to the prepare_release.sh script the `dch -r ` step that adds the version.

Currently the script will generate an UNRELEASED version.

Reference: 
https://lists.apache.org/thread.html/r7f8addef88553955b1057a61b29526cb974cc9037f20ce46fb26fb61%40%3Cdev.cassandra.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Created] (CASSANDRA-15540) Use the ASF process for packaging and uploading deb files

2020-02-03 Thread Michael Semb Wever (Jira)
Michael Semb Wever created CASSANDRA-15540:
--

 Summary: Use the ASF process for packaging and uploading deb files
 Key: CASSANDRA-15540
 URL: https://issues.apache.org/jira/browse/CASSANDRA-15540
 Project: Cassandra
  Issue Type: Task
  Components: Packaging
Reporter: Michael Semb Wever


Use the ASF process for packaging and uploading deb files.

https://cwiki.apache.org/confluence/display/INFRA/Packaging+a+deb+and+uploading+to+Bintray

The current process is found in 
https://github.com/apache/cassandra-builds/tree/master/cassandra-release



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Comment Edited] (CASSANDRA-15034) cassandra-stress fails to retry user profile insert and query operations

2020-02-03 Thread Dmitry Kropachev (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029596#comment-17029596
 ] 

Dmitry Kropachev edited comment on CASSANDRA-15034 at 2/4/20 6:21 AM:
--

[~djoshi] , could you please look into it, this issue is really annoying.


was (Author: dkropachev):
Dinesh Joshi, could you please look into it, this issue is really annoying.

> cassandra-stress fails to retry user profile insert and query operations 
> -
>
> Key: CASSANDRA-15034
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15034
> Project: Cassandra
>  Issue Type: Bug
>  Components: Tool/stress
>Reporter: Shlomi Livne
>Assignee: Shlomi Livne
>Priority: Normal
> Attachments: 0001-Fix-retry-of-userdefined-operations.patch, 
> stress.yaml
>
>
> cassandra-stress that is run with a user profile against a cluster will fail 
> to trey operations when a node is killed.
> To reproduce:
> # Create a 3 ndoe cluster with ccm
> {code:java}
> ccm create cas-3 --vnodes -n 1 --version=3.11.3``{code}
> # start the cluster
> {code:java}
>  ccm start{code}
> # run
> {code:java}
> tools/bin/cassandra-stress user profile=stress.yaml n=10 ops(insert=1) 
> no-warmup cl=QUORUM -node 127.0.0.2 -rate threads=10{code}
> or run
> {code:java}
> tools/bin/cassandra-stress user profile=stress.yaml n=10 ops(simple=1) 
> no-warmup cl=QUORUM -node 127.0.0.2 -rate threads=10{code}
> # while the stress is progressing kill a node
> {code:java}
> ccm node1 stop --not-gently{code}
> # wait for cassandra-stress to end
> for the insert case check
> {code:java}
> ccm node2 cqlsh -e " select count(*) FROM stresscql.tb ;"{code}
> we are missing rows
> for the query case (simple) the following errors will be reported
> {code:java}
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.io.IOException: Operation x10 on key(s) [49a26f35297303469236]: Error 
> executing: (NoSuchElementException)
> at 
> org.apache.cassandra.stress.Operation.error(Operation.java:127)java.util.NoSuchElementException
> at 
> org.apache.cassandra.stress.Operation.timeWithRetry(Operation.java:105)com.datastax.driver.core.exceptions.TransportException:
>  [/127.0.0.1:9042] Connection has been closed
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException{code}
> profile: stress.yaml[^stress.yaml]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Commented] (CASSANDRA-15034) cassandra-stress fails to retry user profile insert and query operations

2020-02-03 Thread Dmitry Kropachev (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029596#comment-17029596
 ] 

Dmitry Kropachev commented on CASSANDRA-15034:
--

Dinesh Joshi, could you please look into it, this issue is really annoying.

> cassandra-stress fails to retry user profile insert and query operations 
> -
>
> Key: CASSANDRA-15034
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15034
> Project: Cassandra
>  Issue Type: Bug
>  Components: Tool/stress
>Reporter: Shlomi Livne
>Assignee: Shlomi Livne
>Priority: Normal
> Attachments: 0001-Fix-retry-of-userdefined-operations.patch, 
> stress.yaml
>
>
> cassandra-stress that is run with a user profile against a cluster will fail 
> to trey operations when a node is killed.
> To reproduce:
> # Create a 3 ndoe cluster with ccm
> {code:java}
> ccm create cas-3 --vnodes -n 1 --version=3.11.3``{code}
> # start the cluster
> {code:java}
>  ccm start{code}
> # run
> {code:java}
> tools/bin/cassandra-stress user profile=stress.yaml n=10 ops(insert=1) 
> no-warmup cl=QUORUM -node 127.0.0.2 -rate threads=10{code}
> or run
> {code:java}
> tools/bin/cassandra-stress user profile=stress.yaml n=10 ops(simple=1) 
> no-warmup cl=QUORUM -node 127.0.0.2 -rate threads=10{code}
> # while the stress is progressing kill a node
> {code:java}
> ccm node1 stop --not-gently{code}
> # wait for cassandra-stress to end
> for the insert case check
> {code:java}
> ccm node2 cqlsh -e " select count(*) FROM stresscql.tb ;"{code}
> we are missing rows
> for the query case (simple) the following errors will be reported
> {code:java}
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.io.IOException: Operation x10 on key(s) [49a26f35297303469236]: Error 
> executing: (NoSuchElementException)
> at 
> org.apache.cassandra.stress.Operation.error(Operation.java:127)java.util.NoSuchElementException
> at 
> org.apache.cassandra.stress.Operation.timeWithRetry(Operation.java:105)com.datastax.driver.core.exceptions.TransportException:
>  [/127.0.0.1:9042] Connection has been closed
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException
> java.util.NoSuchElementException{code}
> profile: stress.yaml[^stress.yaml]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Comment Edited] (CASSANDRA-15529) AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 to 3.11.4

2020-02-03 Thread Pooja Nair (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029577#comment-17029577
 ] 

Pooja Nair edited comment on CASSANDRA-15529 at 2/4/20 4:47 AM:


Hi [~ifesdjeen] 
 I have attached the sstabledump of the table to this ticket. 
 We are getting an exception while trying to take a sstabledump for the above 
mentioned sal_purge table where range deletes are carried out.
 We tried taking dump on multiple sstables but was thrown with similar 
exception every time. 


was (Author: pooja_nair):
Hi [~ifesdjeen] 
 I have attached the sstabledump of the table to this ticket. 
 We are getting an exception while trying to take a sstabledump for the above 
mentioned sal_purge table where range deletes are carried out.
 We tried taking dump on multiple sstables but was thrown the similar exception 
every time. 

> AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 
> to 3.11.4
> -
>
> Key: CASSANDRA-15529
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15529
> Project: Cassandra
>  Issue Type: Bug
>  Components: Cluster/Schema
>Reporter: Pooja Nair
>Priority: Urgent
>  Labels: 2.1.16, 3.11.4
> Attachments: sstable_dump.txt
>
>
> Hello Team, 
> We have cluster running on cassandra 3.11.4
> Following is the table schema of the tables that is being used in our system.
> {code:java}
> cqlsh> desc KEYSPACE "SAL"
>   
>   CREATE KEYSPACE "SAL" WITH replication = {'class': 
> 'NetworkTopologyStrategy', 'DC_EAST': '3', 'DC_WEST': '3'}  AND 
> durable_writes = true;
>   
>   CREATE TABLE "SAL".sal_purge (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.1
>   AND caching = '{"keys":"NONE", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds items to be removed as 
> [shardid][salid][timestamp]. The table records SALIDs to be deleted along 
> with their deletion times (which may be modified)'
>   AND compaction = {'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.1
>   AND speculative_retry = '99.0PERCENTILE';
>   
>   CREATE TABLE "SAL".sal_ref (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.025
>   AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds owner references to content as [salid][lcid/opid]'
>   AND compaction = {'sstable_size_in_mb': '180', 'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.0
>   AND speculative_retry = '99.0PERCENTILE';
> {code}
> Things to note:
>  # The column2 is always passed a null value during insertion 
>  # column2 is a part of primary key
>  #  Range select and Range delete is done through our app.    
> Iniatally the cluster was on casssandra version 2.1.16  and have been 
> recently upgraded to 3.11.4 post the upgrade, we see that the nodes are going 
> down, and log the below exceptions during startup and even after node is up. 
> This one node is causing the whole cluster to behave improperly.
> {code:java}
> WARN [Native-Transport-Requests-47] 2020-01-29 13:49:05,190 
> AbstractLocalAwareExecutorService.java:167 - Uncaught exception on thread 
> Thread[Native-Transport-Requests-47,5,main]: {} java.lang.RuntimeException: 
> java.lang.IllegalStateException: UnfilteredRowIterator for SAL.sal_purge has 
> an open RT bound as its last item at 
> org.apache.cassandra.service.StorageProxy$DroppableRunnable.run(StorageProxy.java:2588)
>  ~[apache-cassandra-3.11.4.jar:3.11.4] at 
> 

[jira] [Comment Edited] (CASSANDRA-15529) AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 to 3.11.4

2020-02-03 Thread Pooja Nair (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029577#comment-17029577
 ] 

Pooja Nair edited comment on CASSANDRA-15529 at 2/4/20 4:47 AM:


Hi [~ifesdjeen] 
 I have attached the sstabledump of the table to this ticket. 
 We are getting an exception while trying to take a sstabledump for the above 
mentioned sal_purge table where range deletes are carried out.
 We tried taking dump on multiple sstables but was thrown the similar exception 
every time. 


was (Author: pooja_nair):
Hi [~ifesdjeen] 
 I have attached the sstabledump of the table to this ticket. 
 We are getting an exception while trying to take a sstabledump for the above 
mentioned sal_purge table where range deletes are carried out.
We tried taking dump on multiple sstables but was thrown the same exception 
every time. 

> AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 
> to 3.11.4
> -
>
> Key: CASSANDRA-15529
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15529
> Project: Cassandra
>  Issue Type: Bug
>  Components: Cluster/Schema
>Reporter: Pooja Nair
>Priority: Urgent
>  Labels: 2.1.16, 3.11.4
> Attachments: sstable_dump.txt
>
>
> Hello Team, 
> We have cluster running on cassandra 3.11.4
> Following is the table schema of the tables that is being used in our system.
> {code:java}
> cqlsh> desc KEYSPACE "SAL"
>   
>   CREATE KEYSPACE "SAL" WITH replication = {'class': 
> 'NetworkTopologyStrategy', 'DC_EAST': '3', 'DC_WEST': '3'}  AND 
> durable_writes = true;
>   
>   CREATE TABLE "SAL".sal_purge (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.1
>   AND caching = '{"keys":"NONE", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds items to be removed as 
> [shardid][salid][timestamp]. The table records SALIDs to be deleted along 
> with their deletion times (which may be modified)'
>   AND compaction = {'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.1
>   AND speculative_retry = '99.0PERCENTILE';
>   
>   CREATE TABLE "SAL".sal_ref (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.025
>   AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds owner references to content as [salid][lcid/opid]'
>   AND compaction = {'sstable_size_in_mb': '180', 'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.0
>   AND speculative_retry = '99.0PERCENTILE';
> {code}
> Things to note:
>  # The column2 is always passed a null value during insertion 
>  # column2 is a part of primary key
>  #  Range select and Range delete is done through our app.    
> Iniatally the cluster was on casssandra version 2.1.16  and have been 
> recently upgraded to 3.11.4 post the upgrade, we see that the nodes are going 
> down, and log the below exceptions during startup and even after node is up. 
> This one node is causing the whole cluster to behave improperly.
> {code:java}
> WARN [Native-Transport-Requests-47] 2020-01-29 13:49:05,190 
> AbstractLocalAwareExecutorService.java:167 - Uncaught exception on thread 
> Thread[Native-Transport-Requests-47,5,main]: {} java.lang.RuntimeException: 
> java.lang.IllegalStateException: UnfilteredRowIterator for SAL.sal_purge has 
> an open RT bound as its last item at 
> org.apache.cassandra.service.StorageProxy$DroppableRunnable.run(StorageProxy.java:2588)
>  ~[apache-cassandra-3.11.4.jar:3.11.4] at 
> 

[jira] [Comment Edited] (CASSANDRA-15529) AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 to 3.11.4

2020-02-03 Thread Pooja Nair (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029577#comment-17029577
 ] 

Pooja Nair edited comment on CASSANDRA-15529 at 2/4/20 4:46 AM:


Hi [~ifesdjeen] 
 I have attached the sstabledump of the table to this ticket. 
 We are getting an exception while trying to take a sstabledump for the above 
mentioned sal_purge table where range deletes are carried out.
We tried taking dump on multiple sstables but was thrown the same exception 
every time. 


was (Author: pooja_nair):
Hi [~ifesdjeen] 
I have attached the sstabledump of the table to this ticket. 
We are getting an exception while trying to take a sstabledump for the above 
mentioned sal_purge table where range deletes are carried out.

> AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 
> to 3.11.4
> -
>
> Key: CASSANDRA-15529
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15529
> Project: Cassandra
>  Issue Type: Bug
>  Components: Cluster/Schema
>Reporter: Pooja Nair
>Priority: Urgent
>  Labels: 2.1.16, 3.11.4
> Attachments: sstable_dump.txt
>
>
> Hello Team, 
> We have cluster running on cassandra 3.11.4
> Following is the table schema of the tables that is being used in our system.
> {code:java}
> cqlsh> desc KEYSPACE "SAL"
>   
>   CREATE KEYSPACE "SAL" WITH replication = {'class': 
> 'NetworkTopologyStrategy', 'DC_EAST': '3', 'DC_WEST': '3'}  AND 
> durable_writes = true;
>   
>   CREATE TABLE "SAL".sal_purge (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.1
>   AND caching = '{"keys":"NONE", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds items to be removed as 
> [shardid][salid][timestamp]. The table records SALIDs to be deleted along 
> with their deletion times (which may be modified)'
>   AND compaction = {'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.1
>   AND speculative_retry = '99.0PERCENTILE';
>   
>   CREATE TABLE "SAL".sal_ref (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.025
>   AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds owner references to content as [salid][lcid/opid]'
>   AND compaction = {'sstable_size_in_mb': '180', 'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.0
>   AND speculative_retry = '99.0PERCENTILE';
> {code}
> Things to note:
>  # The column2 is always passed a null value during insertion 
>  # column2 is a part of primary key
>  #  Range select and Range delete is done through our app.    
> Iniatally the cluster was on casssandra version 2.1.16  and have been 
> recently upgraded to 3.11.4 post the upgrade, we see that the nodes are going 
> down, and log the below exceptions during startup and even after node is up. 
> This one node is causing the whole cluster to behave improperly.
> {code:java}
> WARN [Native-Transport-Requests-47] 2020-01-29 13:49:05,190 
> AbstractLocalAwareExecutorService.java:167 - Uncaught exception on thread 
> Thread[Native-Transport-Requests-47,5,main]: {} java.lang.RuntimeException: 
> java.lang.IllegalStateException: UnfilteredRowIterator for SAL.sal_purge has 
> an open RT bound as its last item at 
> org.apache.cassandra.service.StorageProxy$DroppableRunnable.run(StorageProxy.java:2588)
>  ~[apache-cassandra-3.11.4.jar:3.11.4] at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) 
> [na:1.8.0-internal] at 
> 

[jira] [Updated] (CASSANDRA-15529) AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 to 3.11.4

2020-02-03 Thread Pooja Nair (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pooja Nair updated CASSANDRA-15529:
---
Attachment: sstable_dump.txt

> AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 
> to 3.11.4
> -
>
> Key: CASSANDRA-15529
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15529
> Project: Cassandra
>  Issue Type: Bug
>  Components: Cluster/Schema
>Reporter: Pooja Nair
>Priority: Urgent
>  Labels: 2.1.16, 3.11.4
> Attachments: sstable_dump.txt
>
>
> Hello Team, 
> We have cluster running on cassandra 3.11.4
> Following is the table schema of the tables that is being used in our system.
> {code:java}
> cqlsh> desc KEYSPACE "SAL"
>   
>   CREATE KEYSPACE "SAL" WITH replication = {'class': 
> 'NetworkTopologyStrategy', 'DC_EAST': '3', 'DC_WEST': '3'}  AND 
> durable_writes = true;
>   
>   CREATE TABLE "SAL".sal_purge (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.1
>   AND caching = '{"keys":"NONE", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds items to be removed as 
> [shardid][salid][timestamp]. The table records SALIDs to be deleted along 
> with their deletion times (which may be modified)'
>   AND compaction = {'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.1
>   AND speculative_retry = '99.0PERCENTILE';
>   
>   CREATE TABLE "SAL".sal_ref (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.025
>   AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds owner references to content as [salid][lcid/opid]'
>   AND compaction = {'sstable_size_in_mb': '180', 'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.0
>   AND speculative_retry = '99.0PERCENTILE';
> {code}
> Things to note:
>  # The column2 is always passed a null value during insertion 
>  # column2 is a part of primary key
>  #  Range select and Range delete is done through our app.    
> Iniatally the cluster was on casssandra version 2.1.16  and have been 
> recently upgraded to 3.11.4 post the upgrade, we see that the nodes are going 
> down, and log the below exceptions during startup and even after node is up. 
> This one node is causing the whole cluster to behave improperly.
> {code:java}
> WARN [Native-Transport-Requests-47] 2020-01-29 13:49:05,190 
> AbstractLocalAwareExecutorService.java:167 - Uncaught exception on thread 
> Thread[Native-Transport-Requests-47,5,main]: {} java.lang.RuntimeException: 
> java.lang.IllegalStateException: UnfilteredRowIterator for SAL.sal_purge has 
> an open RT bound as its last item at 
> org.apache.cassandra.service.StorageProxy$DroppableRunnable.run(StorageProxy.java:2588)
>  ~[apache-cassandra-3.11.4.jar:3.11.4] at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) 
> [na:1.8.0-internal] at 
> org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$FutureTask.run(AbstractLocalAwareExecutorService.java:162)
>  [apache-cassandra-3.11.4.jar:3.11.4] at 
> org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$LocalSessionFutureTask.run(AbstractLocalAwareExecutorService.java:134)
>  [apache-cassandra-3.11.4.jar:3.11.4] at 
> org.apache.cassandra.concurrent.SEPExecutor.maybeExecuteImmediately(SEPExecutor.java:194)
>  [apache-cassandra-3.11.4.jar:3.11.4] at 
> org.apache.cassandra.service.AbstractReadExecutor.makeRequests(AbstractReadExecutor.java:117)
>  [apache-cassandra-3.11.4.jar:3.11.4] at 
> 

[jira] [Commented] (CASSANDRA-15529) AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 to 3.11.4

2020-02-03 Thread Pooja Nair (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029577#comment-17029577
 ] 

Pooja Nair commented on CASSANDRA-15529:


Hi [~ifesdjeen] 
I have attached the sstabledump of the table to this ticket. 
We are getting an exception while trying to take a sstabledump for the above 
mentioned sal_purge table where range deletes are carried out.

> AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 
> to 3.11.4
> -
>
> Key: CASSANDRA-15529
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15529
> Project: Cassandra
>  Issue Type: Bug
>  Components: Cluster/Schema
>Reporter: Pooja Nair
>Priority: Urgent
>  Labels: 2.1.16, 3.11.4
> Attachments: sstable_dump.txt
>
>
> Hello Team, 
> We have cluster running on cassandra 3.11.4
> Following is the table schema of the tables that is being used in our system.
> {code:java}
> cqlsh> desc KEYSPACE "SAL"
>   
>   CREATE KEYSPACE "SAL" WITH replication = {'class': 
> 'NetworkTopologyStrategy', 'DC_EAST': '3', 'DC_WEST': '3'}  AND 
> durable_writes = true;
>   
>   CREATE TABLE "SAL".sal_purge (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.1
>   AND caching = '{"keys":"NONE", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds items to be removed as 
> [shardid][salid][timestamp]. The table records SALIDs to be deleted along 
> with their deletion times (which may be modified)'
>   AND compaction = {'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.1
>   AND speculative_retry = '99.0PERCENTILE';
>   
>   CREATE TABLE "SAL".sal_ref (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.025
>   AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds owner references to content as [salid][lcid/opid]'
>   AND compaction = {'sstable_size_in_mb': '180', 'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.0
>   AND speculative_retry = '99.0PERCENTILE';
> {code}
> Things to note:
>  # The column2 is always passed a null value during insertion 
>  # column2 is a part of primary key
>  #  Range select and Range delete is done through our app.    
> Iniatally the cluster was on casssandra version 2.1.16  and have been 
> recently upgraded to 3.11.4 post the upgrade, we see that the nodes are going 
> down, and log the below exceptions during startup and even after node is up. 
> This one node is causing the whole cluster to behave improperly.
> {code:java}
> WARN [Native-Transport-Requests-47] 2020-01-29 13:49:05,190 
> AbstractLocalAwareExecutorService.java:167 - Uncaught exception on thread 
> Thread[Native-Transport-Requests-47,5,main]: {} java.lang.RuntimeException: 
> java.lang.IllegalStateException: UnfilteredRowIterator for SAL.sal_purge has 
> an open RT bound as its last item at 
> org.apache.cassandra.service.StorageProxy$DroppableRunnable.run(StorageProxy.java:2588)
>  ~[apache-cassandra-3.11.4.jar:3.11.4] at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) 
> [na:1.8.0-internal] at 
> org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$FutureTask.run(AbstractLocalAwareExecutorService.java:162)
>  [apache-cassandra-3.11.4.jar:3.11.4] at 
> org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$LocalSessionFutureTask.run(AbstractLocalAwareExecutorService.java:134)
>  [apache-cassandra-3.11.4.jar:3.11.4] at 
> 

[jira] [Comment Edited] (CASSANDRA-15529) AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 to 3.11.4

2020-02-03 Thread feroz shaik (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029481#comment-17029481
 ] 

feroz shaik edited comment on CASSANDRA-15529 at 2/4/20 2:35 AM:
-

Hi [~ifesdjeen] , I am a colleague of [~Pooja_nair] and would like to answer 
your questions. # We have all the nodes upgraded to 3.11.4 and also finished 
upgradesstables. There are no previous sstables present on any of the 
nodes/cluster.
 # We have no clue whether the affected node is cordinator or not as it is too 
random and the affected node behaves rogue potentially impacting whole cluster.
 # The query patterns are like below:

 static String INSERT_SCHEDULED_DELETION_QUERY = "INSERT INTO \"sal_purge\" 
   (key,column1,value) VALUES (?,?,?) USING TIMESTAMP 
 ?;"; static String SELECT_SCHEDULED_DELETION_QUERY = "SELECT column1, value 
FROM sal_purge where key=? AND column1>=? LIMIT
 ?;"; static String DELETE_SCHEDULED_DELETION_QUERY = "DELETE FROM 
\"sal_purge\" USING TIMESTAMP ? WHERE key=? AND column1=?;";
 I will try to upload the sstable dump soon to this, Appreciate your support 
[~ifesdjeen]


was (Author: ferozshaik...@gmail.com):
Hi [~ifesdjeen] , I am a colleague of [~Pooja_nair] and would like to answer 
your questions. # We have all the nodes upgraded to 3.11.4 and also finished 
upgradesstables. There are no previous sstables present on any of the 
nodes/cluster.
 # We have no clue whether the affected node is cordinator or not as it is too 
random and the affected node behaves rogue potentially bring impacting whole 
cluster.
 # The query patterns are like below:

 static String INSERT_SCHEDULED_DELETION_QUERY = "INSERT INTO \"sal_purge\" 
   (key,column1,value) VALUES (?,?,?) USING TIMESTAMP 
?;"; static String SELECT_SCHEDULED_DELETION_QUERY = "SELECT column1, value 
FROM sal_purge where key=? AND column1>=? LIMIT
?;"; static String DELETE_SCHEDULED_DELETION_QUERY = "DELETE FROM \"sal_purge\" 
USING TIMESTAMP ? WHERE key=? AND column1=?;";
I will try to upload the sstable dump soon to this, Appreciate your support 
[~ifesdjeen]

> AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 
> to 3.11.4
> -
>
> Key: CASSANDRA-15529
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15529
> Project: Cassandra
>  Issue Type: Bug
>  Components: Cluster/Schema
>Reporter: Pooja Nair
>Priority: Urgent
>  Labels: 2.1.16, 3.11.4
>
> Hello Team, 
> We have cluster running on cassandra 3.11.4
> Following is the table schema of the tables that is being used in our system.
> {code:java}
> cqlsh> desc KEYSPACE "SAL"
>   
>   CREATE KEYSPACE "SAL" WITH replication = {'class': 
> 'NetworkTopologyStrategy', 'DC_EAST': '3', 'DC_WEST': '3'}  AND 
> durable_writes = true;
>   
>   CREATE TABLE "SAL".sal_purge (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.1
>   AND caching = '{"keys":"NONE", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds items to be removed as 
> [shardid][salid][timestamp]. The table records SALIDs to be deleted along 
> with their deletion times (which may be modified)'
>   AND compaction = {'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.1
>   AND speculative_retry = '99.0PERCENTILE';
>   
>   CREATE TABLE "SAL".sal_ref (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.025
>   AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds owner references to content as [salid][lcid/opid]'
>   AND compaction = {'sstable_size_in_mb': '180', 'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
> 

[jira] [Commented] (CASSANDRA-15529) AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 to 3.11.4

2020-02-03 Thread feroz shaik (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029481#comment-17029481
 ] 

feroz shaik commented on CASSANDRA-15529:
-

Hi [~ifesdjeen] , I am a colleague of [~Pooja_nair] and would like to answer 
your questions. # We have all the nodes upgraded to 3.11.4 and also finished 
upgradesstables. There are no previous sstables present on any of the 
nodes/cluster.
 # We have no clue whether the affected node is cordinator or not as it is too 
random and the affected node behaves rogue potentially bring impacting whole 
cluster.
 # The query patterns are like below:

 static String INSERT_SCHEDULED_DELETION_QUERY = "INSERT INTO \"sal_purge\" 
   (key,column1,value) VALUES (?,?,?) USING TIMESTAMP 
?;"; static String SELECT_SCHEDULED_DELETION_QUERY = "SELECT column1, value 
FROM sal_purge where key=? AND column1>=? LIMIT
?;"; static String DELETE_SCHEDULED_DELETION_QUERY = "DELETE FROM \"sal_purge\" 
USING TIMESTAMP ? WHERE key=? AND column1=?;";
I will try to upload the sstable dump soon to this, Appreciate your support 
[~ifesdjeen]

> AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 
> to 3.11.4
> -
>
> Key: CASSANDRA-15529
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15529
> Project: Cassandra
>  Issue Type: Bug
>  Components: Cluster/Schema
>Reporter: Pooja Nair
>Priority: Urgent
>  Labels: 2.1.16, 3.11.4
>
> Hello Team, 
> We have cluster running on cassandra 3.11.4
> Following is the table schema of the tables that is being used in our system.
> {code:java}
> cqlsh> desc KEYSPACE "SAL"
>   
>   CREATE KEYSPACE "SAL" WITH replication = {'class': 
> 'NetworkTopologyStrategy', 'DC_EAST': '3', 'DC_WEST': '3'}  AND 
> durable_writes = true;
>   
>   CREATE TABLE "SAL".sal_purge (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.1
>   AND caching = '{"keys":"NONE", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds items to be removed as 
> [shardid][salid][timestamp]. The table records SALIDs to be deleted along 
> with their deletion times (which may be modified)'
>   AND compaction = {'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.1
>   AND speculative_retry = '99.0PERCENTILE';
>   
>   CREATE TABLE "SAL".sal_ref (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.025
>   AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds owner references to content as [salid][lcid/opid]'
>   AND compaction = {'sstable_size_in_mb': '180', 'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.0
>   AND speculative_retry = '99.0PERCENTILE';
> {code}
> Things to note:
>  # The column2 is always passed a null value during insertion 
>  # column2 is a part of primary key
>  #  Range select and Range delete is done through our app.    
> Iniatally the cluster was on casssandra version 2.1.16  and have been 
> recently upgraded to 3.11.4 post the upgrade, we see that the nodes are going 
> down, and log the below exceptions during startup and even after node is up. 
> This one node is causing the whole cluster to behave improperly.
> {code:java}
> WARN [Native-Transport-Requests-47] 2020-01-29 13:49:05,190 
> AbstractLocalAwareExecutorService.java:167 - Uncaught exception on thread 
> Thread[Native-Transport-Requests-47,5,main]: {} java.lang.RuntimeException: 
> java.lang.IllegalStateException: UnfilteredRowIterator for SAL.sal_purge has 
> an open RT bound as its last item at 

[jira] [Updated] (CASSANDRA-15446) Per-thread stack size is too small on aarch64 CentOS

2020-02-03 Thread Michael Shuler (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Shuler updated CASSANDRA-15446:
---
Fix Version/s: (was: 3.11.5)
   3.11.x

> Per-thread stack size is too small on aarch64 CentOS
> 
>
> Key: CASSANDRA-15446
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15446
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Config, Local/Startup and Shutdown
>Reporter: Heming Fu
>Assignee: Heming Fu
>Priority: Normal
> Fix For: 2.1.x, 2.2.x, 3.0.x, 3.11.x
>
>
> Hi all,
> I found an issue when I tried to start cassandra on my aarch64 CentOS7.6, 
> however no errors on Ubuntu. Of course I could increase -Xss in jvm.options 
> to fix it, but this issue also caused Cassandra's docker images from docker 
> hub could not run containers on this OS.
> The information of my current environment and root cause of this issue were 
> shown below.
> *Error*
> The stack size specified is too small, Specify at least 328k
> Error: Could not create the Java Virtual Machine.
> Error: A fatal exception has occurred. Program will exit.
> *Version*
> Cassandra 2.1.21 2.2.15 3.0.19 3.11.5 
> *Environment*
> $ lscpu
> Architecture: aarch64
> Byte Order: Little Endian
> $ uname -m
> aarch64
> $ java -version
> openjdk version "1.8.0_181"
> OpenJDK Runtime Environment (build 1.8.0_181-b13)
> OpenJDK 64-Bit Server VM (build 25.181-b13, mixed mode)
> [root@localhost apache-cassandra-3.11.5]# cat /etc/os-release
> $ cat /etc/os-release
> NAME="CentOS Linux"
> VERSION="7 (AltArch)"
> ID="centos"
> ID_LIKE="rhel fedora"
> VERSION_ID="7"
> PRETTY_NAME="CentOS Linux 7 (AltArch)"
> ANSI_COLOR="0;31"
> CPE_NAME="cpe:/o:centos:centos:7"
> HOME_URL="https://www.centos.org/;
> BUG_REPORT_URL="https://bugs.centos.org/;
> *Root Cause*
> Checked openjdk-1.8.0 source code, the min stack size is calculated by 
> StackYellowPage, StackRedPage, StackShadowPage, OS page size. Among those 
> parameters, *default OS page size of aarch64 CentOS 7.6 is 64K, however 
> aarch64 Ubuntu 18.04 and X86 CentOS are both 4K.*
> This difference causes JVM on aarch64 Ubuntu 18.04 needs 164K per-thread 
> stack size, but 328K required on aarch64 CentOS 7.6.
> The formula is 
> os::Linux::min_stack_allowed = MAX2(os::Linux::min_stack_allowed,
>  (size_t)(StackYellowPages+StackRedPages+StackShadowPages) * 
> Linux::page_size() +
>  (2*BytesPerWord COMPILER2_PRESENT(+1)) * Linux::vm_default_page_size());
> *Parameters on aarch64 CentOS7.6*
> intx StackRedPages = 1 
>  intx StackShadowPages = 1 
>  intx StackYellowPages = 1 
> pageSize 64K
> BytesPerWord 8
> vm_default_page_size 8K
> As a result, we have min_stack_allowed = (1 + 1 + 1) * 64K + (2 * 8 + 1) * 8K 
> = 328K
>  
> I could see some similar issues asked for specified achitecture, but no root 
> cause analyzed. I hope this could help you decide proper stack size for all 
> common OS.
> If you have any suggestion, pls let me know.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Commented] (CASSANDRA-15137) Switch http to https URLs in build.xml

2020-02-03 Thread Michael Shuler (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029172#comment-17029172
 ] 

Michael Shuler commented on CASSANDRA-15137:


Committed to 2.1 and merged up.
https://github.com/apache/cassandra/commit/b26927d863d5fc5d3251c4ce2aa0cdef9046dd3e

> Switch http to https URLs in build.xml
> --
>
> Key: CASSANDRA-15137
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15137
> Project: Cassandra
>  Issue Type: Task
>  Components: Build, Dependencies
>Reporter: Michael Shuler
>Assignee: Michael Shuler
>Priority: Normal
> Fix For: 2.2.15, 3.0.19, 3.11.5, 4.0
>
> Attachments: 0001-Switch-http-to-https-URLs-in-build.xml.patch, 
> 2.1_backport_Switch-http-to-https-URLs-in-build.xml.patch
>
>
> Switch to using https URLs wherever possible in build.xml.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[cassandra] branch cassandra-2.2 updated (b2f2c70 -> bd628ce)

2020-02-03 Thread mshuler
This is an automated email from the ASF dual-hosted git repository.

mshuler pushed a change to branch cassandra-2.2
in repository https://gitbox.apache.org/repos/asf/cassandra.git.


from b2f2c70  Add message interceptors to in-jvm dtests
 new b26927d  Switch http to https URLs in build.xml
 new bd628ce  Merge branch 'cassandra-2.1' into cassandra-2.2

The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:


-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[cassandra] branch trunk updated (d3b3dcb -> 7c088b9)

2020-02-03 Thread mshuler
This is an automated email from the ASF dual-hosted git repository.

mshuler pushed a change to branch trunk
in repository https://gitbox.apache.org/repos/asf/cassandra.git.


from d3b3dcb  Separate exceptions for CAS write timeout exceptions caused 
by contention and unkown result
 new b26927d  Switch http to https URLs in build.xml
 new bd628ce  Merge branch 'cassandra-2.1' into cassandra-2.2
 new 1710fa0  Merge branch 'cassandra-2.2' into cassandra-3.0
 new e4ca96a  Merge branch 'cassandra-3.0' into cassandra-3.11
 new 7c088b9  Merge branch 'cassandra-3.11' into trunk

The 5 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:


-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[cassandra] 01/01: Merge branch 'cassandra-2.2' into cassandra-3.0

2020-02-03 Thread mshuler
This is an automated email from the ASF dual-hosted git repository.

mshuler pushed a commit to branch cassandra-3.0
in repository https://gitbox.apache.org/repos/asf/cassandra.git

commit 1710fa0f3d03dacc3056f93b7366c8611e3bccb8
Merge: a270929 bd628ce
Author: Michael Shuler 
AuthorDate: Mon Feb 3 12:31:14 2020 -0600

Merge branch 'cassandra-2.2' into cassandra-3.0



-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[cassandra] branch cassandra-3.11 updated (f09e1be -> e4ca96a)

2020-02-03 Thread mshuler
This is an automated email from the ASF dual-hosted git repository.

mshuler pushed a change to branch cassandra-3.11
in repository https://gitbox.apache.org/repos/asf/cassandra.git.


from f09e1be  Merge branch 'cassandra-3.0' into cassandra-3.11
 new b26927d  Switch http to https URLs in build.xml
 new bd628ce  Merge branch 'cassandra-2.1' into cassandra-2.2
 new 1710fa0  Merge branch 'cassandra-2.2' into cassandra-3.0
 new e4ca96a  Merge branch 'cassandra-3.0' into cassandra-3.11

The 4 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:


-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[cassandra] 01/01: Merge branch 'cassandra-2.1' into cassandra-2.2

2020-02-03 Thread mshuler
This is an automated email from the ASF dual-hosted git repository.

mshuler pushed a commit to branch cassandra-2.2
in repository https://gitbox.apache.org/repos/asf/cassandra.git

commit bd628ce7a25b5b63bf7c0bd92aa655ee6977af4b
Merge: b2f2c70 b26927d
Author: Michael Shuler 
AuthorDate: Mon Feb 3 12:30:48 2020 -0600

Merge branch 'cassandra-2.1' into cassandra-2.2



-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[cassandra] 01/01: Merge branch 'cassandra-3.0' into cassandra-3.11

2020-02-03 Thread mshuler
This is an automated email from the ASF dual-hosted git repository.

mshuler pushed a commit to branch cassandra-3.11
in repository https://gitbox.apache.org/repos/asf/cassandra.git

commit e4ca96a28ec1f833f309a3be0e4748837345cde9
Merge: f09e1be 1710fa0
Author: Michael Shuler 
AuthorDate: Mon Feb 3 12:31:44 2020 -0600

Merge branch 'cassandra-3.0' into cassandra-3.11



-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[cassandra] branch cassandra-3.0 updated (a270929 -> 1710fa0)

2020-02-03 Thread mshuler
This is an automated email from the ASF dual-hosted git repository.

mshuler pushed a change to branch cassandra-3.0
in repository https://gitbox.apache.org/repos/asf/cassandra.git.


from a270929  Merge branch 'cassandra-2.2' into cassandra-3.0
 new b26927d  Switch http to https URLs in build.xml
 new bd628ce  Merge branch 'cassandra-2.1' into cassandra-2.2
 new 1710fa0  Merge branch 'cassandra-2.2' into cassandra-3.0

The 3 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:


-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[cassandra] 01/01: Merge branch 'cassandra-3.11' into trunk

2020-02-03 Thread mshuler
This is an automated email from the ASF dual-hosted git repository.

mshuler pushed a commit to branch trunk
in repository https://gitbox.apache.org/repos/asf/cassandra.git

commit 7c088b97115eb8fe982bac5ce9bf7bfbe6c0dc6e
Merge: d3b3dcb e4ca96a
Author: Michael Shuler 
AuthorDate: Mon Feb 3 12:32:01 2020 -0600

Merge branch 'cassandra-3.11' into trunk



-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[cassandra] branch cassandra-2.1 updated: Switch http to https URLs in build.xml

2020-02-03 Thread mshuler
This is an automated email from the ASF dual-hosted git repository.

mshuler pushed a commit to branch cassandra-2.1
in repository https://gitbox.apache.org/repos/asf/cassandra.git


The following commit(s) were added to refs/heads/cassandra-2.1 by this push:
 new b26927d  Switch http to https URLs in build.xml
b26927d is described below

commit b26927d863d5fc5d3251c4ce2aa0cdef9046dd3e
Author: Michael Shuler 
AuthorDate: Mon Feb 3 12:29:47 2020 -0600

Switch http to https URLs in build.xml

Patch by Michael Shuler; Reviewed by Joshua McKenzie for CASSANDRA-15137
(Backport to 2.1 to fix build.)
---
 build.properties.default |  4 ++--
 build.xml| 24 
 2 files changed, 14 insertions(+), 14 deletions(-)

diff --git a/build.properties.default b/build.properties.default
index 5291659..11da534 100644
--- a/build.properties.default
+++ b/build.properties.default
@@ -1,4 +1,4 @@
 # Maven2 Repository Locations (you can override these in "build.properties" to 
point to a local proxy, e.g. Nexus)
-artifact.remoteRepository.central: http://repo1.maven.org/maven2
-artifact.remoteRepository.apache:  http://repo.maven.apache.org/maven2
+artifact.remoteRepository.central: https://repo1.maven.org/maven2
+artifact.remoteRepository.apache:  https://repo.maven.apache.org/maven2
 
diff --git a/build.xml b/build.xml
index f6fc1f6..c34ec28 100644
--- a/build.xml
+++ b/build.xml
@@ -8,7 +8,7 @@
  ~ "License"); you may not use this file except in compliance
  ~ with the License.  You may obtain a copy of the License at
  ~
- ~http://www.apache.org/licenses/LICENSE-2.0
+ ~https://www.apache.org/licenses/LICENSE-2.0
  ~
  ~ Unless required by applicable law or agreed to in writing,
  ~ software distributed under the License is distributed on an
@@ -26,9 +26,9 @@
 
 
 
-
-
-http://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=tree"/>
+https://gitbox.apache.org/repos/asf/cassandra.git"/>
+https://gitbox.apache.org/repos/asf/cassandra.git"/>
+https://gitbox.apache.org/repos/asf?p=cassandra.git;a=tree"/>
 
 
 
@@ -79,7 +79,7 @@
 
 
 http://repo2.maven.org/maven2/org/apache/maven/maven-ant-tasks; />
+  
value="https://repo.maven.apache.org/maven2/org/apache/maven/maven-ant-tasks; />
 
 
 https://repository.apache.org/service/local/staging/deploy/maven2;>
@@ -98,7 +98,7 @@
 
 
 
-
+
 
 
 
@@ -343,11 +343,11 @@
 artifactId="cassandra-parent"
 packaging="pom"
 version="${version}"
-url="http://cassandra.apache.org;
+url="https://cassandra.apache.org;
 name="Apache Cassandra"
 inceptionYear="2009"
 description="The Apache Cassandra Project develops a 
highly scalable second-generation distributed database, bringing together 
Dynamo's fully distributed design and Bigtable's ColumnFamily-based data 
model.">
-http://www.apache.org/licenses/LICENSE-2.0.txt"/>
+https://www.apache.org/licenses/LICENSE-2.0.txt"/>
 
 
   
@@ -480,7 +480,7 @@
 
   http://cassandra.apache.org;
+url="https://cassandra.apache.org;
 name="Apache Cassandra">
 
   http://cassandra.apache.org;
+url="https://cassandra.apache.org;
 name="Apache Cassandra">
 
   http://cassandra.apache.org;
+url="https://cassandra.apache.org;
 name="Apache Cassandra">
 http://cassandra.apache.org;
+url="https://cassandra.apache.org;
 name="Apache Cassandra">
 

[jira] [Commented] (CASSANDRA-14773) Overflow of 32-bit integer during compaction.

2020-02-03 Thread Vladimir Bukhtoyarov (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-14773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029159#comment-17029159
 ] 

Vladimir Bukhtoyarov commented on CASSANDRA-14773:
--

Hi [~e.dimitrova] 

No, I am not working on this.

> Overflow of 32-bit integer during compaction.
> -
>
> Key: CASSANDRA-14773
> URL: https://issues.apache.org/jira/browse/CASSANDRA-14773
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Compaction
>Reporter: Vladimir Bukhtoyarov
>Assignee: Vladimir Bukhtoyarov
>Priority: Urgent
> Fix For: 4.0, 4.0-beta
>
>
> In scope of CASSANDRA-13444 the compaction was significantly improved from 
> CPU and memory perspective. Hovewer this improvement introduces the bug in 
> rounding. When rounding the expriration time which is close to  
> *Cell.MAX_DELETION_TIME*(it is just *Integer.MAX_VALUE*) the math overflow 
> happens(because in scope of -CASSANDRA-13444-) data type for point was 
> changed from Long to Integer in order to reduce memory footprint), as result 
> point became negative and acts as silent poison for internal structures of 
> StreamingTombstoneHistogramBuilder like *DistanceHolder* and *DataHolder*. 
> Then depending of point intervals:
>  * The TombstoneHistogram produces wrong values when interval of points is 
> less then binSize, it is not critical.
>  * Compaction crashes with ArrayIndexOutOfBoundsException if amount of point 
> intervals is great then  binSize, this case is very critical.
>  
> This is pull request [https://github.com/apache/cassandra/pull/273] that 
> reproduces the issue and provides the fix. 
>  
> The stacktrace when running(on codebase without fix) 
> *testMathOverflowDuringRoundingOfLargeTimestamp* without -ea JVM flag
> {noformat}
> java.lang.ArrayIndexOutOfBoundsException
> at java.lang.System.arraycopy(Native Method)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder$DistanceHolder.add(StreamingTombstoneHistogramBuilder.java:208)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder.flushValue(StreamingTombstoneHistogramBuilder.java:140)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder$$Lambda$1/1967205423.consume(Unknown
>  Source)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder$Spool.forEach(StreamingTombstoneHistogramBuilder.java:574)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder.flushHistogram(StreamingTombstoneHistogramBuilder.java:124)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder.build(StreamingTombstoneHistogramBuilder.java:184)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilderTest.testMathOverflowDuringRoundingOfLargeTimestamp(StreamingTombstoneHistogramBuilderTest.java:183)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:497)
> at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:44)
> at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:15)
> at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:41)
> at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:20)
> at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:28)
> at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31)
> at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
> at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:44)
> at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:180)
> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:41)
> at org.junit.runners.ParentRunner$1.evaluate(ParentRunner.java:173)
> at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:28)
> at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31)
> at org.junit.runners.ParentRunner.run(ParentRunner.java:220)
> at org.junit.runner.JUnitCore.run(JUnitCore.java:159)
> at 
> com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs(JUnit4IdeaTestRunner.java:68)
> at 
> com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.startRunnerWithArgs(IdeaTestRunner.java:47)
> at 
> com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsAndStart(JUnitStarter.java:242)
> at com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStarter.java:70)
> {noformat}
>  
> The stacktrace when 

[jira] [Commented] (CASSANDRA-15137) Switch http to https URLs in build.xml

2020-02-03 Thread Stefan Podkowinski (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029150#comment-17029150
 ] 

Stefan Podkowinski commented on CASSANDRA-15137:


Can you get this committed to 2.1 [~mshuler]? maven.central turned off 
non-https downloads, so you won't be able to download any artifacts for that 
branch anymore, if they are not cached yet.

> Switch http to https URLs in build.xml
> --
>
> Key: CASSANDRA-15137
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15137
> Project: Cassandra
>  Issue Type: Task
>  Components: Build, Dependencies
>Reporter: Michael Shuler
>Assignee: Michael Shuler
>Priority: Normal
> Fix For: 2.2.15, 3.0.19, 3.11.5, 4.0
>
> Attachments: 0001-Switch-http-to-https-URLs-in-build.xml.patch, 
> 2.1_backport_Switch-http-to-https-URLs-in-build.xml.patch
>
>
> Switch to using https URLs wherever possible in build.xml.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Commented] (CASSANDRA-12995) update hppc dependency to 0.7

2020-02-03 Thread Ekaterina Dimitrova (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-12995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029143#comment-17029143
 ] 

Ekaterina Dimitrova commented on CASSANDRA-12995:
-

Hi,

Thanks. Dawid Weiss is already on top of it. He said he will be working on it 
next week. I will keep you posted. 

> update hppc dependency to 0.7
> -
>
> Key: CASSANDRA-12995
> URL: https://issues.apache.org/jira/browse/CASSANDRA-12995
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Dependencies, Packaging
>Reporter: Tomas Repik
>Assignee: Ekaterina Dimitrova
>Priority: Normal
>  Labels: easyfix
> Fix For: 4.0
>
> Attachments: Screen Shot 2020-01-29 at 11.14.30 AM.png, Screen Shot 
> 2020-01-29 at 11.20.39 AM.png, Screen Shot 2020-01-29 at 11.20.47 AM.png, 
> cassandra-3.11.0-hppc.patch
>
>
> Cassandra 3.11.0 is about to be included in Fedora. There are some tweaks to 
> the sources we need to do in order to successfully build it. Cassandra 
> depends on hppc 0.5.4, but In Fedora we have the newer version 0.7.1 Upstream 
> released even newer version 0.7.2. I attached a patch updating cassandra 
> sources to depend on the 0.7.1 hppc sources. It should be also compatible 
> with the newest upstream version. The only actual changes are the removal of 
> Open infix in class names. The issue was discussed in here: 
> https://bugzilla.redhat.com/show_bug.cgi?id=1340876 Please consider updating.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-15535) Documentation gives the wrong instruction to activate remote jmx

2020-02-03 Thread Alex Petrov (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Petrov updated CASSANDRA-15535:

Change Category: Quality Assurance
 Complexity: Low Hanging Fruit
 Status: Open  (was: Triage Needed)

> Documentation gives the wrong instruction to activate remote jmx
> 
>
> Key: CASSANDRA-15535
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15535
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Documentation/Website
>Reporter: jean carlo rivera ura
>Assignee: jean carlo rivera ura
>Priority: Normal
>
> In this section [jmx acces 
> |https://cassandra.apache.org/doc/latest/operating/security.html?highlight=local_jmx#jmx-access],
>  in order to activate the remove jmx acces, it says to change the value of 
> LOCAL_JMX to *yes*. However the right configuration is *no*



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-15531) Improve docs on disk_access_mode, specifically post CASSANDRA-8464

2020-02-03 Thread Alex Petrov (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Petrov updated CASSANDRA-15531:

Change Category: Quality Assurance
 Complexity: Low Hanging Fruit
 Status: Open  (was: Triage Needed)

> Improve docs on disk_access_mode, specifically post CASSANDRA-8464
> --
>
> Key: CASSANDRA-15531
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15531
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Documentation/Website
>Reporter: Erick Ramirez
>Assignee: Erick Ramirez
>Priority: Normal
>  Labels: documentation
>
> h2.  Background
> [~tsteinmaurer] brought up [on 
> Slack|https://the-asf.slack.com/archives/CJZLTM05A/p1580298424288700] that 
> they've been hit by the `oom-killer` as a result of SSTables being mmaped.
> h2. Action
> Based on feedback from Thomas Steinmaurer, Jean Carlo & Sam Kearon, I am 
> going to push the info I've written in this KB article to the official docs – 
> [Increased memory use on nodes after upgrading to DSE 5.0 or DSE 
> 5.1|https://support.datastax.com/hc/en-us/articles/360027838911].
> I will revise it specifically for affected OSS versions.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Comment Edited] (CASSANDRA-15529) AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 to 3.11.4

2020-02-03 Thread Alex Petrov (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029059#comment-17029059
 ] 

Alex Petrov edited comment on CASSANDRA-15529 at 2/3/20 4:04 PM:
-

[~Pooja_nair] is it possible to get a snapshot (possibly, anonymized if it's 
produdciton data) of what sstable dump would look like? This looks like it 
might be of a concern. 

Does this happen on the upgraded SSTables or on the old ones? Do you have a 
mixed cluster (as in, some nodes are still running old version)? If yes, does 
it happen depending on which node is a coordinator or regardless?

It would be also good to know what kind of insert/update/delete query patterns 
you roughly have for that schema, so I could try to reproduce locally.


was (Author: ifesdjeen):
[~Pooja_nair] is it possible to get a snapshot (possibly, anonymized if it's 
produdciton data) of what sstable dump would look like? This looks like it 
might be of a concern.

It would be also good to know what kind of insert/update/delete query patterns 
you roughly have for that schema, so I could try to reproduce locally.

> AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 
> to 3.11.4
> -
>
> Key: CASSANDRA-15529
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15529
> Project: Cassandra
>  Issue Type: Bug
>  Components: Cluster/Schema
>Reporter: Pooja Nair
>Priority: Urgent
>  Labels: 2.1.16, 3.11.4
>
> Hello Team, 
> We have cluster running on cassandra 3.11.4
> Following is the table schema of the tables that is being used in our system.
> {code:java}
> cqlsh> desc KEYSPACE "SAL"
>   
>   CREATE KEYSPACE "SAL" WITH replication = {'class': 
> 'NetworkTopologyStrategy', 'DC_EAST': '3', 'DC_WEST': '3'}  AND 
> durable_writes = true;
>   
>   CREATE TABLE "SAL".sal_purge (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.1
>   AND caching = '{"keys":"NONE", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds items to be removed as 
> [shardid][salid][timestamp]. The table records SALIDs to be deleted along 
> with their deletion times (which may be modified)'
>   AND compaction = {'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.1
>   AND speculative_retry = '99.0PERCENTILE';
>   
>   CREATE TABLE "SAL".sal_ref (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.025
>   AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds owner references to content as [salid][lcid/opid]'
>   AND compaction = {'sstable_size_in_mb': '180', 'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.0
>   AND speculative_retry = '99.0PERCENTILE';
> {code}
> Things to note:
>  # The column2 is always passed a null value during insertion 
>  # column2 is a part of primary key
>  #  Range select and Range delete is done through our app.    
> Iniatally the cluster was on casssandra version 2.1.16  and have been 
> recently upgraded to 3.11.4 post the upgrade, we see that the nodes are going 
> down, and log the below exceptions during startup and even after node is up. 
> This one node is causing the whole cluster to behave improperly.
> {code:java}
> WARN [Native-Transport-Requests-47] 2020-01-29 13:49:05,190 
> AbstractLocalAwareExecutorService.java:167 - Uncaught exception on thread 
> Thread[Native-Transport-Requests-47,5,main]: {} java.lang.RuntimeException: 
> java.lang.IllegalStateException: UnfilteredRowIterator for SAL.sal_purge has 
> an open RT 

[jira] [Updated] (CASSANDRA-15529) AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 to 3.11.4

2020-02-03 Thread Alex Petrov (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Petrov updated CASSANDRA-15529:

Bug Category: Parent values: Correctness(12982)Level 1 values: Recoverable 
Corruption / Loss(12986)  (was: Parent values: Code(13163)Level 1 values: Bug - 
Unclear Impact(13164))
  Status: Open  (was: Triage Needed)

> AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 
> to 3.11.4
> -
>
> Key: CASSANDRA-15529
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15529
> Project: Cassandra
>  Issue Type: Bug
>  Components: Cluster/Schema
>Reporter: Pooja Nair
>Priority: Urgent
>  Labels: 2.1.16, 3.11.4
>
> Hello Team, 
> We have cluster running on cassandra 3.11.4
> Following is the table schema of the tables that is being used in our system.
> {code:java}
> cqlsh> desc KEYSPACE "SAL"
>   
>   CREATE KEYSPACE "SAL" WITH replication = {'class': 
> 'NetworkTopologyStrategy', 'DC_EAST': '3', 'DC_WEST': '3'}  AND 
> durable_writes = true;
>   
>   CREATE TABLE "SAL".sal_purge (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.1
>   AND caching = '{"keys":"NONE", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds items to be removed as 
> [shardid][salid][timestamp]. The table records SALIDs to be deleted along 
> with their deletion times (which may be modified)'
>   AND compaction = {'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.1
>   AND speculative_retry = '99.0PERCENTILE';
>   
>   CREATE TABLE "SAL".sal_ref (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.025
>   AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds owner references to content as [salid][lcid/opid]'
>   AND compaction = {'sstable_size_in_mb': '180', 'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.0
>   AND speculative_retry = '99.0PERCENTILE';
> {code}
> Things to note:
>  # The column2 is always passed a null value during insertion 
>  # column2 is a part of primary key
>  #  Range select and Range delete is done through our app.    
> Iniatally the cluster was on casssandra version 2.1.16  and have been 
> recently upgraded to 3.11.4 post the upgrade, we see that the nodes are going 
> down, and log the below exceptions during startup and even after node is up. 
> This one node is causing the whole cluster to behave improperly.
> {code:java}
> WARN [Native-Transport-Requests-47] 2020-01-29 13:49:05,190 
> AbstractLocalAwareExecutorService.java:167 - Uncaught exception on thread 
> Thread[Native-Transport-Requests-47,5,main]: {} java.lang.RuntimeException: 
> java.lang.IllegalStateException: UnfilteredRowIterator for SAL.sal_purge has 
> an open RT bound as its last item at 
> org.apache.cassandra.service.StorageProxy$DroppableRunnable.run(StorageProxy.java:2588)
>  ~[apache-cassandra-3.11.4.jar:3.11.4] at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) 
> [na:1.8.0-internal] at 
> org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$FutureTask.run(AbstractLocalAwareExecutorService.java:162)
>  [apache-cassandra-3.11.4.jar:3.11.4] at 
> org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$LocalSessionFutureTask.run(AbstractLocalAwareExecutorService.java:134)
>  [apache-cassandra-3.11.4.jar:3.11.4] at 
> org.apache.cassandra.concurrent.SEPExecutor.maybeExecuteImmediately(SEPExecutor.java:194)
>  [apache-cassandra-3.11.4.jar:3.11.4] at 
> 

[jira] [Commented] (CASSANDRA-15529) AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 to 3.11.4

2020-02-03 Thread Alex Petrov (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029059#comment-17029059
 ] 

Alex Petrov commented on CASSANDRA-15529:
-

[~Pooja_nair] is it possible to get a snapshot (possibly, anonymized if it's 
produdciton data) of what sstable dump would look like? This looks like it 
might be of a concern.

It would be also good to know what kind of insert/update/delete query patterns 
you roughly have for that schema, so I could try to reproduce locally.

> AbstractLocalAwareExecutorService.java exceptions after upgrade from 2.1.16 
> to 3.11.4
> -
>
> Key: CASSANDRA-15529
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15529
> Project: Cassandra
>  Issue Type: Bug
>  Components: Cluster/Schema
>Reporter: Pooja Nair
>Priority: Urgent
>  Labels: 2.1.16, 3.11.4
>
> Hello Team, 
> We have cluster running on cassandra 3.11.4
> Following is the table schema of the tables that is being used in our system.
> {code:java}
> cqlsh> desc KEYSPACE "SAL"
>   
>   CREATE KEYSPACE "SAL" WITH replication = {'class': 
> 'NetworkTopologyStrategy', 'DC_EAST': '3', 'DC_WEST': '3'}  AND 
> durable_writes = true;
>   
>   CREATE TABLE "SAL".sal_purge (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.1
>   AND caching = '{"keys":"NONE", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds items to be removed as 
> [shardid][salid][timestamp]. The table records SALIDs to be deleted along 
> with their deletion times (which may be modified)'
>   AND compaction = {'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.1
>   AND speculative_retry = '99.0PERCENTILE';
>   
>   CREATE TABLE "SAL".sal_ref (
>   key text,
>   column1 text,
>   column2 text,
>   value text,
>   PRIMARY KEY (key, column1, column2)
>   ) WITH COMPACT STORAGE
>   AND CLUSTERING ORDER BY (column1 ASC, column2 ASC)
>   AND bloom_filter_fp_chance = 0.025
>   AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
>   AND comment = 'Holds owner references to content as [salid][lcid/opid]'
>   AND compaction = {'sstable_size_in_mb': '180', 'class': 
> 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'}
>   AND compression = {'chunk_length_kb': '64', 'sstable_compression': 
> 'org.apache.cassandra.io.compress.SnappyCompressor'}
>   AND dclocal_read_repair_chance = 0.0
>   AND default_time_to_live = 0
>   AND gc_grace_seconds = 864000
>   AND max_index_interval = 2048
>   AND memtable_flush_period_in_ms = 0
>   AND min_index_interval = 128
>   AND read_repair_chance = 0.0
>   AND speculative_retry = '99.0PERCENTILE';
> {code}
> Things to note:
>  # The column2 is always passed a null value during insertion 
>  # column2 is a part of primary key
>  #  Range select and Range delete is done through our app.    
> Iniatally the cluster was on casssandra version 2.1.16  and have been 
> recently upgraded to 3.11.4 post the upgrade, we see that the nodes are going 
> down, and log the below exceptions during startup and even after node is up. 
> This one node is causing the whole cluster to behave improperly.
> {code:java}
> WARN [Native-Transport-Requests-47] 2020-01-29 13:49:05,190 
> AbstractLocalAwareExecutorService.java:167 - Uncaught exception on thread 
> Thread[Native-Transport-Requests-47,5,main]: {} java.lang.RuntimeException: 
> java.lang.IllegalStateException: UnfilteredRowIterator for SAL.sal_purge has 
> an open RT bound as its last item at 
> org.apache.cassandra.service.StorageProxy$DroppableRunnable.run(StorageProxy.java:2588)
>  ~[apache-cassandra-3.11.4.jar:3.11.4] at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) 
> [na:1.8.0-internal] at 
> org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$FutureTask.run(AbstractLocalAwareExecutorService.java:162)
>  [apache-cassandra-3.11.4.jar:3.11.4] at 
> org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$LocalSessionFutureTask.run(AbstractLocalAwareExecutorService.java:134)
>  [apache-cassandra-3.11.4.jar:3.11.4] at 
> 

[jira] [Commented] (CASSANDRA-14773) Overflow of 32-bit integer during compaction.

2020-02-03 Thread Ekaterina Dimitrova (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-14773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029050#comment-17029050
 ] 

Ekaterina Dimitrova commented on CASSANDRA-14773:
-

Hi [~vladimir.bukhtoyarov] and [~benedict],

Maybe I can contribute to this one? As far as I see from the latest 2019 
updates failing tests had to be fixed at first?

[~vladimir.bukhtoyarov] are you still working on this one?

 

 

> Overflow of 32-bit integer during compaction.
> -
>
> Key: CASSANDRA-14773
> URL: https://issues.apache.org/jira/browse/CASSANDRA-14773
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Compaction
>Reporter: Vladimir Bukhtoyarov
>Assignee: Vladimir Bukhtoyarov
>Priority: Urgent
> Fix For: 4.0, 4.0-beta
>
>
> In scope of CASSANDRA-13444 the compaction was significantly improved from 
> CPU and memory perspective. Hovewer this improvement introduces the bug in 
> rounding. When rounding the expriration time which is close to  
> *Cell.MAX_DELETION_TIME*(it is just *Integer.MAX_VALUE*) the math overflow 
> happens(because in scope of -CASSANDRA-13444-) data type for point was 
> changed from Long to Integer in order to reduce memory footprint), as result 
> point became negative and acts as silent poison for internal structures of 
> StreamingTombstoneHistogramBuilder like *DistanceHolder* and *DataHolder*. 
> Then depending of point intervals:
>  * The TombstoneHistogram produces wrong values when interval of points is 
> less then binSize, it is not critical.
>  * Compaction crashes with ArrayIndexOutOfBoundsException if amount of point 
> intervals is great then  binSize, this case is very critical.
>  
> This is pull request [https://github.com/apache/cassandra/pull/273] that 
> reproduces the issue and provides the fix. 
>  
> The stacktrace when running(on codebase without fix) 
> *testMathOverflowDuringRoundingOfLargeTimestamp* without -ea JVM flag
> {noformat}
> java.lang.ArrayIndexOutOfBoundsException
> at java.lang.System.arraycopy(Native Method)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder$DistanceHolder.add(StreamingTombstoneHistogramBuilder.java:208)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder.flushValue(StreamingTombstoneHistogramBuilder.java:140)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder$$Lambda$1/1967205423.consume(Unknown
>  Source)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder$Spool.forEach(StreamingTombstoneHistogramBuilder.java:574)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder.flushHistogram(StreamingTombstoneHistogramBuilder.java:124)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilder.build(StreamingTombstoneHistogramBuilder.java:184)
> at 
> org.apache.cassandra.utils.streamhist.StreamingTombstoneHistogramBuilderTest.testMathOverflowDuringRoundingOfLargeTimestamp(StreamingTombstoneHistogramBuilderTest.java:183)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:497)
> at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:44)
> at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:15)
> at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:41)
> at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:20)
> at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:28)
> at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31)
> at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
> at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:44)
> at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:180)
> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:41)
> at org.junit.runners.ParentRunner$1.evaluate(ParentRunner.java:173)
> at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:28)
> at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31)
> at org.junit.runners.ParentRunner.run(ParentRunner.java:220)
> at org.junit.runner.JUnitCore.run(JUnitCore.java:159)
> at 
> com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs(JUnit4IdeaTestRunner.java:68)
> at 
> com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.startRunnerWithArgs(IdeaTestRunner.java:47)
> at 
> 

[jira] [Updated] (CASSANDRA-15350) Separate exceptions for CAS write timeout exceptions caused by contention and unkown result

2020-02-03 Thread Alex Petrov (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Petrov updated CASSANDRA-15350:

  Fix Version/s: 4.0
Source Control Link: 
https://github.com/apache/cassandra/commit/d3b3dcbb353de97220a11f55391babf149410905
 Resolution: Fixed
 Status: Resolved  (was: Ready to Commit)

> Separate exceptions for CAS write timeout exceptions caused by contention and 
> unkown result
> ---
>
> Key: CASSANDRA-15350
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15350
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Feature/Lightweight Transactions
>Reporter: Alex Petrov
>Assignee: Yifan Cai
>Priority: Normal
>  Labels: protocolv5, pull-request-available
> Fix For: 4.0
>
> Attachments: Utf8StringEncodeBench.java
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Right now, CAS uncertainty introduced in 
> https://issues.apache.org/jira/browse/CASSANDRA-6013 is propagating as 
> WriteTimeout. One of this conditions it manifests is when there’s at least 
> one acceptor that has accepted the value, which means that this value _may_ 
> still get accepted during the later round, despite the proposer failure. 
> Similar problem happens with CAS contention, which is also indistinguishable 
> from the “regular” timeout, even though it is visible in metrics correctly.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Issue Comment Deleted] (CASSANDRA-15350) Separate exceptions for CAS write timeout exceptions caused by contention and unkown result

2020-02-03 Thread Alex Petrov (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Petrov updated CASSANDRA-15350:

Comment: was deleted

(was: 
https://github.com/apache/cassandra/commit/d3b3dcbb353de97220a11f55391babf149410905)

> Separate exceptions for CAS write timeout exceptions caused by contention and 
> unkown result
> ---
>
> Key: CASSANDRA-15350
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15350
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Feature/Lightweight Transactions
>Reporter: Alex Petrov
>Assignee: Yifan Cai
>Priority: Normal
>  Labels: protocolv5, pull-request-available
> Fix For: 4.0
>
> Attachments: Utf8StringEncodeBench.java
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Right now, CAS uncertainty introduced in 
> https://issues.apache.org/jira/browse/CASSANDRA-6013 is propagating as 
> WriteTimeout. One of this conditions it manifests is when there’s at least 
> one acceptor that has accepted the value, which means that this value _may_ 
> still get accepted during the later round, despite the proposer failure. 
> Similar problem happens with CAS contention, which is also indistinguishable 
> from the “regular” timeout, even though it is visible in metrics correctly.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-15350) Separate exceptions for CAS write timeout exceptions caused by contention and unkown result

2020-02-03 Thread Alex Petrov (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Petrov updated CASSANDRA-15350:

Status: Ready to Commit  (was: Review In Progress)

https://github.com/apache/cassandra/commit/d3b3dcbb353de97220a11f55391babf149410905

> Separate exceptions for CAS write timeout exceptions caused by contention and 
> unkown result
> ---
>
> Key: CASSANDRA-15350
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15350
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Feature/Lightweight Transactions
>Reporter: Alex Petrov
>Assignee: Yifan Cai
>Priority: Normal
>  Labels: protocolv5, pull-request-available
> Attachments: Utf8StringEncodeBench.java
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Right now, CAS uncertainty introduced in 
> https://issues.apache.org/jira/browse/CASSANDRA-6013 is propagating as 
> WriteTimeout. One of this conditions it manifests is when there’s at least 
> one acceptor that has accepted the value, which means that this value _may_ 
> still get accepted during the later round, despite the proposer failure. 
> Similar problem happens with CAS contention, which is also indistinguishable 
> from the “regular” timeout, even though it is visible in metrics correctly.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Commented] (CASSANDRA-15350) Separate exceptions for CAS write timeout exceptions caused by contention and unkown result

2020-02-03 Thread Alex Petrov (Jira)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-15350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17028964#comment-17028964
 ] 

Alex Petrov commented on CASSANDRA-15350:
-

One more test run before commit: 
[CI|https://circleci.com/gh/ifesdjeen/cassandra/tree/cas-exception-changes].

+1 on the patch, thank you for patience.

Committed now to trunk with 
[d3b3dcbb353de97220a11f55391babf149410905|https://github.com/apache/cassandra/commit/d3b3dcbb353de97220a11f55391babf149410905].

I was able to slightly simplify the write result unknown exception test as 
shown 
[here|https://github.com/apache/cassandra/commit/d3b3dcbb353de97220a11f55391babf149410905#diff-94b9f00830fe29802a8a05183d77d207R237-R262].
 

As a side note, a corresponding ticket probably should be created for tracking 
at least the two main drivers (python and Java one, branch 3.6.0, since 4.0 
branch breaks compatibility with most of the existing code).

> Separate exceptions for CAS write timeout exceptions caused by contention and 
> unkown result
> ---
>
> Key: CASSANDRA-15350
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15350
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Feature/Lightweight Transactions
>Reporter: Alex Petrov
>Assignee: Yifan Cai
>Priority: Normal
>  Labels: protocolv5, pull-request-available
> Attachments: Utf8StringEncodeBench.java
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Right now, CAS uncertainty introduced in 
> https://issues.apache.org/jira/browse/CASSANDRA-6013 is propagating as 
> WriteTimeout. One of this conditions it manifests is when there’s at least 
> one acceptor that has accepted the value, which means that this value _may_ 
> still get accepted during the later round, despite the proposer failure. 
> Similar problem happens with CAS contention, which is also indistinguishable 
> from the “regular” timeout, even though it is visible in metrics correctly.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[cassandra] branch trunk updated: Separate exceptions for CAS write timeout exceptions caused by contention and unkown result

2020-02-03 Thread ifesdjeen
This is an automated email from the ASF dual-hosted git repository.

ifesdjeen pushed a commit to branch trunk
in repository https://gitbox.apache.org/repos/asf/cassandra.git


The following commit(s) were added to refs/heads/trunk by this push:
 new d3b3dcb  Separate exceptions for CAS write timeout exceptions caused 
by contention and unkown result
d3b3dcb is described below

commit d3b3dcbb353de97220a11f55391babf149410905
Author: yifan-c 
AuthorDate: Sun Oct 27 21:01:31 2019 -0700

Separate exceptions for CAS write timeout exceptions caused by contention 
and unkown result

Patch by Yifan Cai; reviewed by Alex Petrov and Dinesh Joshi for 
CASSANDRA-15350
---
 doc/native_protocol_v5.spec|  40 ++-
 ...xception.java => CasWriteTimeoutException.java} |  10 +-
 ...on.java => CasWriteUnknownResultException.java} |   7 +-
 .../apache/cassandra/exceptions/ExceptionCode.java |   3 +-
 .../exceptions/RequestTimeoutException.java|   8 +
 .../exceptions/WriteTimeoutException.java  |   6 +
 .../cassandra/metrics/CASClientRequestMetrics.java |   7 +-
 .../org/apache/cassandra/service/StorageProxy.java |  32 ++-
 .../cassandra/transport/messages/ErrorMessage.java | 110 ++--
 .../cassandra/distributed/test/CasWriteTest.java   | 276 +
 .../distributed/test/DistributedTestBase.java  |   4 +-
 .../cassandra/transport/ErrorMessageTest.java  |  80 ++
 12 files changed, 525 insertions(+), 58 deletions(-)

diff --git a/doc/native_protocol_v5.spec b/doc/native_protocol_v5.spec
index d1b3915..d279453 100644
--- a/doc/native_protocol_v5.spec
+++ b/doc/native_protocol_v5.spec
@@ -1120,7 +1120,7 @@ Table of Contents
 0x1003Truncate_error: error during a truncation error.
 0x1100Write_timeout: Timeout exception during a write request. The rest
   of the ERROR message body will be
-
+
   where:
  is the [consistency] level of the query having triggered
  the exception.
@@ -1144,12 +1144,14 @@ Table of Contents
  - "BATCH_LOG": the timeout occurred during the
write to the batch log when a (logged) batch
write was requested.
-- "CAS": the timeout occured during the Compare 
And Set write/update.
-- "VIEW": the timeout occured when a write involves
-  VIEW update and failure to acqiure local view(MV)
-  lock for key within timeout
-- "CDC": the timeout occured when 
cdc_total_space_in_mb is
-  exceeded when doing a write to data tracked by 
cdc.
+ - "CAS": the timeout occured during the Compare 
And Set write/update.
+ - "VIEW": the timeout occured when a write 
involves
+   VIEW update and failure to acqiure local 
view(MV)
+   lock for key within timeout
+ - "CDC": the timeout occured when 
cdc_total_space_in_mb is
+   exceeded when doing a write to data tracked by 
cdc.
+ is a [short] that describes the number of 
contentions occured during the CAS operation.
+  The field only presents when the  is 
"CAS".
 0x1200Read_timeout: Timeout exception during a read request. The rest
   of the ERROR message body will be
 
@@ -1225,12 +1227,24 @@ Table of Contents
  - "BATCH_LOG": the failure occured during the
write to the batch log when a (logged) batch
write was requested.
-- "CAS": the failure occured during the Compare 
And Set write/update.
-- "VIEW": the failure occured when a write involves
-  VIEW update and failure to acqiure local view(MV)
-  lock for key within timeout
-- "CDC": the failure occured when 
cdc_total_space_in_mb is
-  exceeded when doing a write to data tracked by 
cdc.
+ - "CAS": the failure occured during the Compare 
And Set write/update.
+ - "VIEW": the failure occured when a write 
involves
+   VIEW update and failure to acqiure local 
view(MV)
+   lock for key within timeout
+ - "CDC": the failure occured when 
cdc_total_space_in_mb is
+   exceeded when doing a write to data tracked by 
cdc.
+0x1600CDC_WRITE_FAILURE: // todo
+0x1700CAS_WRITE_UNKNOWN: An exception occured due to 

[jira] [Updated] (CASSANDRA-15350) Separate exceptions for CAS write timeout exceptions caused by contention and unkown result

2020-02-03 Thread Alex Petrov (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Petrov updated CASSANDRA-15350:

Summary: Separate exceptions for CAS write timeout exceptions caused by 
contention and unkown result  (was: Separate exceptions for CAS write timeout 
exceptions due to contention and unkown result)

> Separate exceptions for CAS write timeout exceptions caused by contention and 
> unkown result
> ---
>
> Key: CASSANDRA-15350
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15350
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Feature/Lightweight Transactions
>Reporter: Alex Petrov
>Assignee: Yifan Cai
>Priority: Normal
>  Labels: protocolv5, pull-request-available
> Attachments: Utf8StringEncodeBench.java
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Right now, CAS uncertainty introduced in 
> https://issues.apache.org/jira/browse/CASSANDRA-6013 is propagating as 
> WriteTimeout. One of this conditions it manifests is when there’s at least 
> one acceptor that has accepted the value, which means that this value _may_ 
> still get accepted during the later round, despite the proposer failure. 
> Similar problem happens with CAS contention, which is also indistinguishable 
> from the “regular” timeout, even though it is visible in metrics correctly.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-15350) Separate exceptions for CAS write timeout due to contention and unkown result

2020-02-03 Thread Alex Petrov (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Petrov updated CASSANDRA-15350:

Summary: Separate exceptions for CAS write timeout due to contention and 
unkown result  (was: Add CAS “uncertainty” and “contention" messages that are 
currently propagated as a WriteTimeoutException.)

> Separate exceptions for CAS write timeout due to contention and unkown result
> -
>
> Key: CASSANDRA-15350
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15350
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Feature/Lightweight Transactions
>Reporter: Alex Petrov
>Assignee: Yifan Cai
>Priority: Normal
>  Labels: protocolv5, pull-request-available
> Attachments: Utf8StringEncodeBench.java
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Right now, CAS uncertainty introduced in 
> https://issues.apache.org/jira/browse/CASSANDRA-6013 is propagating as 
> WriteTimeout. One of this conditions it manifests is when there’s at least 
> one acceptor that has accepted the value, which means that this value _may_ 
> still get accepted during the later round, despite the proposer failure. 
> Similar problem happens with CAS contention, which is also indistinguishable 
> from the “regular” timeout, even though it is visible in metrics correctly.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-15350) Separate exceptions for CAS write timeout exceptions due to contention and unkown result

2020-02-03 Thread Alex Petrov (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Petrov updated CASSANDRA-15350:

Summary: Separate exceptions for CAS write timeout exceptions due to 
contention and unkown result  (was: Separate exceptions for CAS write timeout 
due to contention and unkown result)

> Separate exceptions for CAS write timeout exceptions due to contention and 
> unkown result
> 
>
> Key: CASSANDRA-15350
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15350
> Project: Cassandra
>  Issue Type: Improvement
>  Components: Feature/Lightweight Transactions
>Reporter: Alex Petrov
>Assignee: Yifan Cai
>Priority: Normal
>  Labels: protocolv5, pull-request-available
> Attachments: Utf8StringEncodeBench.java
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Right now, CAS uncertainty introduced in 
> https://issues.apache.org/jira/browse/CASSANDRA-6013 is propagating as 
> WriteTimeout. One of this conditions it manifests is when there’s at least 
> one acceptor that has accepted the value, which means that this value _may_ 
> still get accepted during the later round, despite the proposer failure. 
> Similar problem happens with CAS contention, which is also indistinguishable 
> from the “regular” timeout, even though it is visible in metrics correctly.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-15358) Cassandra alpha 4 testing - Nodes crashing due to bufferpool allocator issue

2020-02-03 Thread Alex Petrov (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-15358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Petrov updated CASSANDRA-15358:

Reviewers:   (was: Alex Petrov)

> Cassandra alpha 4 testing - Nodes crashing due to bufferpool allocator issue
> 
>
> Key: CASSANDRA-15358
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15358
> Project: Cassandra
>  Issue Type: Bug
>  Components: Test/benchmark
>Reporter: Santhosh Kumar Ramalingam
>Assignee: Benedict Elliott Smith
>Priority: Normal
>  Labels: 4.0, alpha
> Attachments: all_errors.txt, debug_logs_during_repair.txt, 
> repair_1_trace.txt, verbose_logs.diff, verbose_logs.txt
>
>
> Hitting a bug with cassandra 4 alpha version. The same bug is repeated with 
> difefrent version of Java(8,11 &12) [~benedict]
>  
> Stack trace:
> {code:java}
> INFO [main] 2019-10-11 16:07:12,024 Server.java:164 - Starting listening for 
> CQL clients on /1.3.0.6:9042 (unencrypted)...
> WARN [OptionalTasks:1] 2019-10-11 16:07:13,961 CassandraRoleManager.java:343 
> - CassandraRoleManager skipped default role setup: some nodes were not ready
> INFO [OptionalTasks:1] 2019-10-11 16:07:13,961 CassandraRoleManager.java:369 
> - Setup task failed with error, rescheduling
> WARN [Messaging-EventLoop-3-2] 2019-10-11 16:07:22,038 NoSpamLogger.java:94 - 
> 10.3x.4x.5x:7000->1.3.0.5:7000-LARGE_MESSAGES-[no-channel] dropping message 
> of type PING_REQ whose timeout expired before reaching the network
> WARN [OptionalTasks:1] 2019-10-11 16:07:23,963 CassandraRoleManager.java:343 
> - CassandraRoleManager skipped default role setup: some nodes were not ready
> INFO [OptionalTasks:1] 2019-10-11 16:07:23,963 CassandraRoleManager.java:369 
> - Setup task failed with error, rescheduling
> INFO [Messaging-EventLoop-3-6] 2019-10-11 16:07:32,759 NoSpamLogger.java:91 - 
> 10.3x.4x.5x:7000->1.3.0.2:7000-URGENT_MESSAGES-[no-channel] failed to connect
> io.netty.channel.AbstractChannel$AnnotatedConnectException: finishConnect(..) 
> failed: Connection refused: /1.3.0.2:7000
> Caused by: java.net.ConnectException: finishConnect(..) failed: Connection 
> refused
> at io.netty.channel.unix.Errors.throwConnectException(Errors.java:124)
> at io.netty.channel.unix.Socket.finishConnect(Socket.java:243)
> at 
> io.netty.channel.epoll.AbstractEpollChannel$AbstractEpollUnsafe.doFinishConnect(AbstractEpollChannel.java:667)
> at 
> io.netty.channel.epoll.AbstractEpollChannel$AbstractEpollUnsafe.finishConnect(AbstractEpollChannel.java:644)
> at 
> io.netty.channel.epoll.AbstractEpollChannel$AbstractEpollUnsafe.epollOutReady(AbstractEpollChannel.java:524)
> at io.netty.channel.epoll.EpollEventLoop.processReady(EpollEventLoop.java:414)
> at io.netty.channel.epoll.EpollEventLoop.run(EpollEventLoop.java:326)
> at 
> io.netty.util.concurrent.SingleThreadEventExecutor$5.run(SingleThreadEventExecutor.java:918)
> at io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74)
> at 
> io.netty.util.concurrent.FastThreadLocalRunnable.run(FastThreadLocalRunnable.java:30)
> at java.base/java.lang.Thread.run(Thread.java:834)
> WARN [Messaging-EventLoop-3-3] 2019-10-11 16:11:32,639 NoSpamLogger.java:94 - 
> 1.3.4.6:7000->1.3.4.5:7000-URGENT_MESSAGES-[no-channel] dropping message of 
> type GOSSIP_DIGEST_SYN whose timeout expired before reaching the network
> INFO [Messaging-EventLoop-3-18] 2019-10-11 16:11:33,077 NoSpamLogger.java:91 
> - 1.3.4.5:7000->1.3.4.4:7000-URGENT_MESSAGES-[no-channel] failed to connect
>  
> ERROR [Messaging-EventLoop-3-11] 2019-10-10 01:34:34,407 
> InboundMessageHandler.java:657 - 
> 1.3.4.5:7000->1.3.4.8:7000-LARGE_MESSAGES-0b7d09cd unexpected exception 
> caught while processing inbound messages; terminating connection
> java.lang.IllegalArgumentException: initialBuffer is not a direct buffer.
> at io.netty.buffer.UnpooledDirectByteBuf.(UnpooledDirectByteBuf.java:87)
> at 
> io.netty.buffer.UnpooledUnsafeDirectByteBuf.(UnpooledUnsafeDirectByteBuf.java:59)
> at 
> org.apache.cassandra.net.BufferPoolAllocator$Wrapped.(BufferPoolAllocator.java:95)
> at 
> org.apache.cassandra.net.BufferPoolAllocator.newDirectBuffer(BufferPoolAllocator.java:56)
> at 
> io.netty.buffer.AbstractByteBufAllocator.directBuffer(AbstractByteBufAllocator.java:187)
> at 
> io.netty.buffer.AbstractByteBufAllocator.directBuffer(AbstractByteBufAllocator.java:178)
> at 
> io.netty.channel.unix.PreferredDirectByteBufAllocator.ioBuffer(PreferredDirectByteBufAllocator.java:53)
> at 
> io.netty.channel.DefaultMaxMessagesRecvByteBufAllocator$MaxMessageHandle.allocate(DefaultMaxMessagesRecvByteBufAllocator.java:114)
> at 
> io.netty.channel.epoll.EpollRecvByteAllocatorHandle.allocate(EpollRecvByteAllocatorHandle.java:75)
> at 
>