[ 
https://issues.apache.org/jira/browse/CASSANDRA-17355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jai Bheemsen Rao Dhanwada updated CASSANDRA-17355:
--------------------------------------------------
    Description: 
Hello Everyone, 

I am seeing a large performance drop (spikes in latency and CPU utilization) 
on counter tables as the data size grows. To understand and simulate the 
problem, I ran the following perf test with `cassandra-stress` instead of my 
own use case, and I can reproduce the issue consistently: as the data size of 
the counter tables grows, read latency and CPU usage spike to very high values.

 

*Test Setup:*
 # Set up a cluster with 3 nodes.
 # Run a test with cassandra-stress; latency and CPU are okay, without much of 
a spike.
 # Send a lot of counter traffic using the `cassandra-stress` tool (Replication 
Factor = 3).
 # The data size on the cluster is now ~300G. 
 # Run another test with cassandra-stress, using a 3:1 read/write mixed 
workload.
 # At this point the CPU spikes to double (32 on a 16-core CPU) and the latency 
reaches ~1 second (earlier it was < 5ms).
 # Another interesting observation is that disk reads climb, and they keep 
climbing as the data size on disk increases. 
 # It looks very much like a disk bottleneck, but the same test shows very low 
disk reads, CPU, and latency with a smaller amount of data.
 # The configuration I used for this test is below.


{quote}C* Version: 3.11.9
CPU: 16
Memory: 64G
Heap: 16G
GC: G1GC
Disk: 500G GCP Persistent disk
{quote}
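
The exact `cassandra-stress` invocations are not included in this report. As a rough sketch only, the test steps above could be driven by something like the following Python helper, which assembles the command lines for a small seed load, a baseline mixed run, the large counter load, and the final 3:1 read/write mixed run. The node addresses, operation counts, and run duration are hypothetical placeholders, and the use of `counter_write` / `counter_read` inside `mixed ratio(...)` is an assumption about how the mixed counter workload was expressed, not something stated in the report.

```python
import shlex

# Hypothetical node addresses; replace with the real 3-node cluster.
NODES = "10.0.0.1,10.0.0.2,10.0.0.3"


def stress_cmd(command, *args, nodes=NODES):
    """Build one cassandra-stress argv with Replication Factor = 3."""
    return [
        "cassandra-stress", command, *args,
        "-schema", "replication(factor=3)",
        "-node", nodes,
    ]


def build_plan(seed_ops="1000000", load_ops="500000000", duration="30m"):
    """Return the four phases described in the test setup, in order.

    All sizes and durations are placeholders, not values from the report.
    """
    # 3 reads for every 1 write, using the counter variants (assumption).
    mixed = "ratio(counter_write=1,counter_read=3)"
    return [
        stress_cmd("counter_write", f"n={seed_ops}"),       # small seed load
        stress_cmd("mixed", mixed, f"duration={duration}"),  # baseline run
        stress_cmd("counter_write", f"n={load_ops}"),       # grow the data
        stress_cmd("mixed", mixed, f"duration={duration}"),  # run on large data
    ]


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    for argv in build_plan():
        print(shlex.join(argv))
```

Printing the commands rather than running them keeps the sketch safe to try anywhere; comparing latency, CPU, and disk reads between the second and fourth phases would correspond to the before/after observations described above.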
I understand that the number of lookups grows as the data on disk grows, but 
this looks like a big performance drop.

Please let me know if you need more details. Also, let me know whether this is 
a known limitation of the counter type and whether there is a workaround. 


> Performance degradation with Counter tables when the data size grows
> --------------------------------------------------------------------
>
>                 Key: CASSANDRA-17355
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-17355
>             Project: Cassandra
>          Issue Type: Bug
>            Reporter: Jai Bheemsen Rao Dhanwada
>            Priority: Normal


