[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated

2021-02-25 Thread Michael Semb Wever (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Semb Wever updated CASSANDRA-16318:
---
Fix Version/s: (was: 4.0-beta)

> Memtable heap size is severely underestimated
> -
>
> Key: CASSANDRA-16318
> URL: https://issues.apache.org/jira/browse/CASSANDRA-16318
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Memtable
>Reporter: Branimir Lambov
>Assignee: Ekaterina Dimitrova
>Priority: Normal
> Fix For: 4.0-beta5, 4.0
>
> Attachments: image-2020-12-09-10-57-21-994.png, 
> image-2020-12-09-11-01-31-273.png
>
>
> We seem to be estimating the size of the on-heap memtable metadata to be 
> around half of what it actually is. For example, during a [read benchmark 
> which writes 1 million single-long 
> rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java]
>  the memtable reports
> {code}
> 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB 
> (0%) off-heap
> {code}
> while a heap dump taken at this point:
>  !image-2020-12-09-10-57-21-994.png! 
> lists an usage of about 666MB altogether.
> Switching to {{offheap_objects}}, the reported numbers are
> {code}
> 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB 
> (3%) off-heap
> {code}
> while actual heap usage:
>  !image-2020-12-09-11-01-31-273.png! 
> is about 442MB.
> Looking at the code we definitely are not counting the 
> {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion 
> info objects associated with each partition, and most probably others.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated

2021-01-19 Thread Ekaterina Dimitrova (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ekaterina Dimitrova updated CASSANDRA-16318:

  Since Version: 4.0-beta4
Source Control Link: 
https://github.com/apache/cassandra/commit/661f1aab171dc3ef16075f69581e88ad4a133fae
 Resolution: Fixed
 Status: Resolved  (was: Ready to Commit)

> Memtable heap size is severely underestimated
> -
>
> Key: CASSANDRA-16318
> URL: https://issues.apache.org/jira/browse/CASSANDRA-16318
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Memtable
>Reporter: Branimir Lambov
>Assignee: Ekaterina Dimitrova
>Priority: Normal
> Fix For: 4.0-beta
>
> Attachments: image-2020-12-09-10-57-21-994.png, 
> image-2020-12-09-11-01-31-273.png
>
>
> We seem to be estimating the size of the on-heap memtable metadata to be 
> around half of what it actually is. For example, during a [read benchmark 
> which writes 1 million single-long 
> rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java]
>  the memtable reports
> {code}
> 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB 
> (0%) off-heap
> {code}
> while a heap dump taken at this point:
>  !image-2020-12-09-10-57-21-994.png! 
> lists an usage of about 666MB altogether.
> Switching to {{offheap_objects}}, the reported numbers are
> {code}
> 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB 
> (3%) off-heap
> {code}
> while actual heap usage:
>  !image-2020-12-09-11-01-31-273.png! 
> is about 442MB.
> Looking at the code we definitely are not counting the 
> {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion 
> info objects associated with each partition, and most probably others.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated

2021-01-19 Thread Ekaterina Dimitrova (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ekaterina Dimitrova updated CASSANDRA-16318:

Status: Ready to Commit  (was: Review In Progress)

> Memtable heap size is severely underestimated
> -
>
> Key: CASSANDRA-16318
> URL: https://issues.apache.org/jira/browse/CASSANDRA-16318
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Memtable
>Reporter: Branimir Lambov
>Assignee: Ekaterina Dimitrova
>Priority: Normal
> Fix For: 4.0-beta
>
> Attachments: image-2020-12-09-10-57-21-994.png, 
> image-2020-12-09-11-01-31-273.png
>
>
> We seem to be estimating the size of the on-heap memtable metadata to be 
> around half of what it actually is. For example, during a [read benchmark 
> which writes 1 million single-long 
> rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java]
>  the memtable reports
> {code}
> 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB 
> (0%) off-heap
> {code}
> while a heap dump taken at this point:
>  !image-2020-12-09-10-57-21-994.png! 
> lists an usage of about 666MB altogether.
> Switching to {{offheap_objects}}, the reported numbers are
> {code}
> 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB 
> (3%) off-heap
> {code}
> while actual heap usage:
>  !image-2020-12-09-11-01-31-273.png! 
> is about 442MB.
> Looking at the code we definitely are not counting the 
> {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion 
> info objects associated with each partition, and most probably others.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated

2021-01-15 Thread Ekaterina Dimitrova (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ekaterina Dimitrova updated CASSANDRA-16318:

Authors: Branimir Lambov, Ekaterina Dimitrova  (was: Ekaterina Dimitrova)

> Memtable heap size is severely underestimated
> -
>
> Key: CASSANDRA-16318
> URL: https://issues.apache.org/jira/browse/CASSANDRA-16318
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Memtable
>Reporter: Branimir Lambov
>Assignee: Ekaterina Dimitrova
>Priority: Normal
> Fix For: 4.0-beta
>
> Attachments: image-2020-12-09-10-57-21-994.png, 
> image-2020-12-09-11-01-31-273.png
>
>
> We seem to be estimating the size of the on-heap memtable metadata to be 
> around half of what it actually is. For example, during a [read benchmark 
> which writes 1 million single-long 
> rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java]
>  the memtable reports
> {code}
> 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB 
> (0%) off-heap
> {code}
> while a heap dump taken at this point:
>  !image-2020-12-09-10-57-21-994.png! 
> lists an usage of about 666MB altogether.
> Switching to {{offheap_objects}}, the reported numbers are
> {code}
> 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB 
> (3%) off-heap
> {code}
> while actual heap usage:
>  !image-2020-12-09-11-01-31-273.png! 
> is about 442MB.
> Looking at the code we definitely are not counting the 
> {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion 
> info objects associated with each partition, and most probably others.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated

2021-01-11 Thread Ekaterina Dimitrova (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ekaterina Dimitrova updated CASSANDRA-16318:

Reviewers: Benjamin Lerer, Branimir Lambov  (was: Benjamin Lerer)

> Memtable heap size is severely underestimated
> -
>
> Key: CASSANDRA-16318
> URL: https://issues.apache.org/jira/browse/CASSANDRA-16318
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Memtable
>Reporter: Branimir Lambov
>Assignee: Ekaterina Dimitrova
>Priority: Normal
> Fix For: 4.0-beta
>
> Attachments: image-2020-12-09-10-57-21-994.png, 
> image-2020-12-09-11-01-31-273.png
>
>
> We seem to be estimating the size of the on-heap memtable metadata to be 
> around half of what it actually is. For example, during a [read benchmark 
> which writes 1 million single-long 
> rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java]
>  the memtable reports
> {code}
> 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB 
> (0%) off-heap
> {code}
> while a heap dump taken at this point:
>  !image-2020-12-09-10-57-21-994.png! 
> lists an usage of about 666MB altogether.
> Switching to {{offheap_objects}}, the reported numbers are
> {code}
> 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB 
> (3%) off-heap
> {code}
> while actual heap usage:
>  !image-2020-12-09-11-01-31-273.png! 
> is about 442MB.
> Looking at the code we definitely are not counting the 
> {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion 
> info objects associated with each partition, and most probably others.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated

2021-01-04 Thread Benjamin Lerer (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Lerer updated CASSANDRA-16318:
---
Reviewers: Benjamin Lerer, Benjamin Lerer  (was: Benjamin Lerer)
   Benjamin Lerer, Benjamin Lerer
   Status: Review In Progress  (was: Patch Available)

> Memtable heap size is severely underestimated
> -
>
> Key: CASSANDRA-16318
> URL: https://issues.apache.org/jira/browse/CASSANDRA-16318
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Memtable
>Reporter: Branimir Lambov
>Assignee: Ekaterina Dimitrova
>Priority: Normal
> Fix For: 4.0-beta
>
> Attachments: image-2020-12-09-10-57-21-994.png, 
> image-2020-12-09-11-01-31-273.png
>
>
> We seem to be estimating the size of the on-heap memtable metadata to be 
> around half of what it actually is. For example, during a [read benchmark 
> which writes 1 million single-long 
> rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java]
>  the memtable reports
> {code}
> 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB 
> (0%) off-heap
> {code}
> while a heap dump taken at this point:
>  !image-2020-12-09-10-57-21-994.png! 
> lists an usage of about 666MB altogether.
> Switching to {{offheap_objects}}, the reported numbers are
> {code}
> 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB 
> (3%) off-heap
> {code}
> while actual heap usage:
>  !image-2020-12-09-11-01-31-273.png! 
> is about 442MB.
> Looking at the code we definitely are not counting the 
> {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion 
> info objects associated with each partition, and most probably others.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated

2020-12-21 Thread Ekaterina Dimitrova (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ekaterina Dimitrova updated CASSANDRA-16318:

Test and Documentation Plan: 
https://app.circleci.com/pipelines/github/ekaterinadimitrova2/cassandra/545/workflows/f797bf58-b572-4d8c-831e-a61936d23624
 Status: Patch Available  (was: In Progress)

> Memtable heap size is severely underestimated
> -
>
> Key: CASSANDRA-16318
> URL: https://issues.apache.org/jira/browse/CASSANDRA-16318
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Memtable
>Reporter: Branimir Lambov
>Assignee: Ekaterina Dimitrova
>Priority: Normal
> Fix For: 4.0-beta
>
> Attachments: image-2020-12-09-10-57-21-994.png, 
> image-2020-12-09-11-01-31-273.png
>
>
> We seem to be estimating the size of the on-heap memtable metadata to be 
> around half of what it actually is. For example, during a [read benchmark 
> which writes 1 million single-long 
> rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java]
>  the memtable reports
> {code}
> 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB 
> (0%) off-heap
> {code}
> while a heap dump taken at this point:
>  !image-2020-12-09-10-57-21-994.png! 
> lists an usage of about 666MB altogether.
> Switching to {{offheap_objects}}, the reported numbers are
> {code}
> 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB 
> (3%) off-heap
> {code}
> while actual heap usage:
>  !image-2020-12-09-11-01-31-273.png! 
> is about 442MB.
> Looking at the code we definitely are not counting the 
> {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion 
> info objects associated with each partition, and most probably others.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated

2020-12-14 Thread Branimir Lambov (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Branimir Lambov updated CASSANDRA-16318:

Fix Version/s: (was: 3.11.x)
   (was: 3.0.x)

> Memtable heap size is severely underestimated
> -
>
> Key: CASSANDRA-16318
> URL: https://issues.apache.org/jira/browse/CASSANDRA-16318
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Memtable
>Reporter: Branimir Lambov
>Assignee: Ekaterina Dimitrova
>Priority: Normal
> Fix For: 4.0-beta
>
> Attachments: image-2020-12-09-10-57-21-994.png, 
> image-2020-12-09-11-01-31-273.png
>
>
> We seem to be estimating the size of the on-heap memtable metadata to be 
> around half of what it actually is. For example, during a [read benchmark 
> which writes 1 million single-long 
> rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java]
>  the memtable reports
> {code}
> 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB 
> (0%) off-heap
> {code}
> while a heap dump taken at this point:
>  !image-2020-12-09-10-57-21-994.png! 
> lists an usage of about 666MB altogether.
> Switching to {{offheap_objects}}, the reported numbers are
> {code}
> 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB 
> (3%) off-heap
> {code}
> while actual heap usage:
>  !image-2020-12-09-11-01-31-273.png! 
> is about 442MB.
> Looking at the code we definitely are not counting the 
> {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion 
> info objects associated with each partition, and most probably others.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated

2020-12-09 Thread Benjamin Lerer (Jira)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Lerer updated CASSANDRA-16318:
---
 Bug Category: Parent values: Correctness(12982)
   Complexity: Normal
Discovered By: User Report
Fix Version/s: 4.0-beta
   3.11.x
   3.0.x
 Severity: Normal
   Status: Open  (was: Triage Needed)

> Memtable heap size is severely underestimated
> -
>
> Key: CASSANDRA-16318
> URL: https://issues.apache.org/jira/browse/CASSANDRA-16318
> Project: Cassandra
>  Issue Type: Bug
>  Components: Local/Memtable
>Reporter: Branimir Lambov
>Priority: Normal
> Fix For: 3.0.x, 3.11.x, 4.0-beta
>
> Attachments: image-2020-12-09-10-57-21-994.png, 
> image-2020-12-09-11-01-31-273.png
>
>
> We seem to be estimating the size of the on-heap memtable metadata to be 
> around half of what it actually is. For example, during a [read benchmark 
> which writes 1 million single-long 
> rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java]
>  the memtable reports
> {code}
> 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB 
> (0%) off-heap
> {code}
> while a heap dump taken at this point:
>  !image-2020-12-09-10-57-21-994.png! 
> lists an usage of about 666MB altogether.
> Switching to {{offheap_objects}}, the reported numbers are
> {code}
> 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB 
> (3%) off-heap
> {code}
> while actual heap usage:
>  !image-2020-12-09-11-01-31-273.png! 
> is about 442MB.
> Looking at the code we definitely are not counting the 
> {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion 
> info objects associated with each partition, and most probably others.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org