[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated
[ https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Semb Wever updated CASSANDRA-16318: --- Fix Version/s: (was: 4.0-beta) > Memtable heap size is severely underestimated > - > > Key: CASSANDRA-16318 > URL: https://issues.apache.org/jira/browse/CASSANDRA-16318 > Project: Cassandra > Issue Type: Bug > Components: Local/Memtable >Reporter: Branimir Lambov >Assignee: Ekaterina Dimitrova >Priority: Normal > Fix For: 4.0-beta5, 4.0 > > Attachments: image-2020-12-09-10-57-21-994.png, > image-2020-12-09-11-01-31-273.png > > > We seem to be estimating the size of the on-heap memtable metadata to be > around half of what it actually is. For example, during a [read benchmark > which writes 1 million single-long > rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java] > the memtable reports > {code} > 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB > (0%) off-heap > {code} > while a heap dump taken at this point: > !image-2020-12-09-10-57-21-994.png! > lists an usage of about 666MB altogether. > Switching to {{offheap_objects}}, the reported numbers are > {code} > 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB > (3%) off-heap > {code} > while actual heap usage: > !image-2020-12-09-11-01-31-273.png! > is about 442MB. > Looking at the code we definitely are not counting the > {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion > info objects associated with each partition, and most probably others. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated
[ https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ekaterina Dimitrova updated CASSANDRA-16318: Since Version: 4.0-beta4 Source Control Link: https://github.com/apache/cassandra/commit/661f1aab171dc3ef16075f69581e88ad4a133fae Resolution: Fixed Status: Resolved (was: Ready to Commit) > Memtable heap size is severely underestimated > - > > Key: CASSANDRA-16318 > URL: https://issues.apache.org/jira/browse/CASSANDRA-16318 > Project: Cassandra > Issue Type: Bug > Components: Local/Memtable >Reporter: Branimir Lambov >Assignee: Ekaterina Dimitrova >Priority: Normal > Fix For: 4.0-beta > > Attachments: image-2020-12-09-10-57-21-994.png, > image-2020-12-09-11-01-31-273.png > > > We seem to be estimating the size of the on-heap memtable metadata to be > around half of what it actually is. For example, during a [read benchmark > which writes 1 million single-long > rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java] > the memtable reports > {code} > 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB > (0%) off-heap > {code} > while a heap dump taken at this point: > !image-2020-12-09-10-57-21-994.png! > lists an usage of about 666MB altogether. > Switching to {{offheap_objects}}, the reported numbers are > {code} > 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB > (3%) off-heap > {code} > while actual heap usage: > !image-2020-12-09-11-01-31-273.png! > is about 442MB. > Looking at the code we definitely are not counting the > {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion > info objects associated with each partition, and most probably others. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated
[ https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ekaterina Dimitrova updated CASSANDRA-16318: Status: Ready to Commit (was: Review In Progress) > Memtable heap size is severely underestimated > - > > Key: CASSANDRA-16318 > URL: https://issues.apache.org/jira/browse/CASSANDRA-16318 > Project: Cassandra > Issue Type: Bug > Components: Local/Memtable >Reporter: Branimir Lambov >Assignee: Ekaterina Dimitrova >Priority: Normal > Fix For: 4.0-beta > > Attachments: image-2020-12-09-10-57-21-994.png, > image-2020-12-09-11-01-31-273.png > > > We seem to be estimating the size of the on-heap memtable metadata to be > around half of what it actually is. For example, during a [read benchmark > which writes 1 million single-long > rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java] > the memtable reports > {code} > 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB > (0%) off-heap > {code} > while a heap dump taken at this point: > !image-2020-12-09-10-57-21-994.png! > lists an usage of about 666MB altogether. > Switching to {{offheap_objects}}, the reported numbers are > {code} > 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB > (3%) off-heap > {code} > while actual heap usage: > !image-2020-12-09-11-01-31-273.png! > is about 442MB. > Looking at the code we definitely are not counting the > {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion > info objects associated with each partition, and most probably others. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated
[ https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ekaterina Dimitrova updated CASSANDRA-16318: Authors: Branimir Lambov, Ekaterina Dimitrova (was: Ekaterina Dimitrova) > Memtable heap size is severely underestimated > - > > Key: CASSANDRA-16318 > URL: https://issues.apache.org/jira/browse/CASSANDRA-16318 > Project: Cassandra > Issue Type: Bug > Components: Local/Memtable >Reporter: Branimir Lambov >Assignee: Ekaterina Dimitrova >Priority: Normal > Fix For: 4.0-beta > > Attachments: image-2020-12-09-10-57-21-994.png, > image-2020-12-09-11-01-31-273.png > > > We seem to be estimating the size of the on-heap memtable metadata to be > around half of what it actually is. For example, during a [read benchmark > which writes 1 million single-long > rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java] > the memtable reports > {code} > 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB > (0%) off-heap > {code} > while a heap dump taken at this point: > !image-2020-12-09-10-57-21-994.png! > lists an usage of about 666MB altogether. > Switching to {{offheap_objects}}, the reported numbers are > {code} > 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB > (3%) off-heap > {code} > while actual heap usage: > !image-2020-12-09-11-01-31-273.png! > is about 442MB. > Looking at the code we definitely are not counting the > {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion > info objects associated with each partition, and most probably others. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated
[ https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ekaterina Dimitrova updated CASSANDRA-16318: Reviewers: Benjamin Lerer, Branimir Lambov (was: Benjamin Lerer) > Memtable heap size is severely underestimated > - > > Key: CASSANDRA-16318 > URL: https://issues.apache.org/jira/browse/CASSANDRA-16318 > Project: Cassandra > Issue Type: Bug > Components: Local/Memtable >Reporter: Branimir Lambov >Assignee: Ekaterina Dimitrova >Priority: Normal > Fix For: 4.0-beta > > Attachments: image-2020-12-09-10-57-21-994.png, > image-2020-12-09-11-01-31-273.png > > > We seem to be estimating the size of the on-heap memtable metadata to be > around half of what it actually is. For example, during a [read benchmark > which writes 1 million single-long > rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java] > the memtable reports > {code} > 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB > (0%) off-heap > {code} > while a heap dump taken at this point: > !image-2020-12-09-10-57-21-994.png! > lists an usage of about 666MB altogether. > Switching to {{offheap_objects}}, the reported numbers are > {code} > 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB > (3%) off-heap > {code} > while actual heap usage: > !image-2020-12-09-11-01-31-273.png! > is about 442MB. > Looking at the code we definitely are not counting the > {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion > info objects associated with each partition, and most probably others. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated
[ https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Lerer updated CASSANDRA-16318: --- Reviewers: Benjamin Lerer, Benjamin Lerer (was: Benjamin Lerer) Benjamin Lerer, Benjamin Lerer Status: Review In Progress (was: Patch Available) > Memtable heap size is severely underestimated > - > > Key: CASSANDRA-16318 > URL: https://issues.apache.org/jira/browse/CASSANDRA-16318 > Project: Cassandra > Issue Type: Bug > Components: Local/Memtable >Reporter: Branimir Lambov >Assignee: Ekaterina Dimitrova >Priority: Normal > Fix For: 4.0-beta > > Attachments: image-2020-12-09-10-57-21-994.png, > image-2020-12-09-11-01-31-273.png > > > We seem to be estimating the size of the on-heap memtable metadata to be > around half of what it actually is. For example, during a [read benchmark > which writes 1 million single-long > rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java] > the memtable reports > {code} > 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB > (0%) off-heap > {code} > while a heap dump taken at this point: > !image-2020-12-09-10-57-21-994.png! > lists an usage of about 666MB altogether. > Switching to {{offheap_objects}}, the reported numbers are > {code} > 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB > (3%) off-heap > {code} > while actual heap usage: > !image-2020-12-09-11-01-31-273.png! > is about 442MB. > Looking at the code we definitely are not counting the > {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion > info objects associated with each partition, and most probably others. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated
[ https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ekaterina Dimitrova updated CASSANDRA-16318: Test and Documentation Plan: https://app.circleci.com/pipelines/github/ekaterinadimitrova2/cassandra/545/workflows/f797bf58-b572-4d8c-831e-a61936d23624 Status: Patch Available (was: In Progress) > Memtable heap size is severely underestimated > - > > Key: CASSANDRA-16318 > URL: https://issues.apache.org/jira/browse/CASSANDRA-16318 > Project: Cassandra > Issue Type: Bug > Components: Local/Memtable >Reporter: Branimir Lambov >Assignee: Ekaterina Dimitrova >Priority: Normal > Fix For: 4.0-beta > > Attachments: image-2020-12-09-10-57-21-994.png, > image-2020-12-09-11-01-31-273.png > > > We seem to be estimating the size of the on-heap memtable metadata to be > around half of what it actually is. For example, during a [read benchmark > which writes 1 million single-long > rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java] > the memtable reports > {code} > 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB > (0%) off-heap > {code} > while a heap dump taken at this point: > !image-2020-12-09-10-57-21-994.png! > lists an usage of about 666MB altogether. > Switching to {{offheap_objects}}, the reported numbers are > {code} > 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB > (3%) off-heap > {code} > while actual heap usage: > !image-2020-12-09-11-01-31-273.png! > is about 442MB. > Looking at the code we definitely are not counting the > {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion > info objects associated with each partition, and most probably others. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated
[ https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Branimir Lambov updated CASSANDRA-16318: Fix Version/s: (was: 3.11.x) (was: 3.0.x) > Memtable heap size is severely underestimated > - > > Key: CASSANDRA-16318 > URL: https://issues.apache.org/jira/browse/CASSANDRA-16318 > Project: Cassandra > Issue Type: Bug > Components: Local/Memtable >Reporter: Branimir Lambov >Assignee: Ekaterina Dimitrova >Priority: Normal > Fix For: 4.0-beta > > Attachments: image-2020-12-09-10-57-21-994.png, > image-2020-12-09-11-01-31-273.png > > > We seem to be estimating the size of the on-heap memtable metadata to be > around half of what it actually is. For example, during a [read benchmark > which writes 1 million single-long > rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java] > the memtable reports > {code} > 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB > (0%) off-heap > {code} > while a heap dump taken at this point: > !image-2020-12-09-10-57-21-994.png! > lists an usage of about 666MB altogether. > Switching to {{offheap_objects}}, the reported numbers are > {code} > 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB > (3%) off-heap > {code} > while actual heap usage: > !image-2020-12-09-11-01-31-273.png! > is about 442MB. > Looking at the code we definitely are not counting the > {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion > info objects associated with each partition, and most probably others. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-16318) Memtable heap size is severely underestimated
[ https://issues.apache.org/jira/browse/CASSANDRA-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Lerer updated CASSANDRA-16318: --- Bug Category: Parent values: Correctness(12982) Complexity: Normal Discovered By: User Report Fix Version/s: 4.0-beta 3.11.x 3.0.x Severity: Normal Status: Open (was: Triage Needed) > Memtable heap size is severely underestimated > - > > Key: CASSANDRA-16318 > URL: https://issues.apache.org/jira/browse/CASSANDRA-16318 > Project: Cassandra > Issue Type: Bug > Components: Local/Memtable >Reporter: Branimir Lambov >Priority: Normal > Fix For: 3.0.x, 3.11.x, 4.0-beta > > Attachments: image-2020-12-09-10-57-21-994.png, > image-2020-12-09-11-01-31-273.png > > > We seem to be estimating the size of the on-heap memtable metadata to be > around half of what it actually is. For example, during a [read benchmark > which writes 1 million single-long > rows|https://github.com/blambov/cassandra/blob/memtable-heap/test/microbench/org/apache/cassandra/test/microbench/instance/ReadTestSmallPartitions.java] > the memtable reports > {code} > 100 ops, 58.174MiB serialized bytes, 385.284MiB (19%) on heap, 0.000KiB > (0%) off-heap > {code} > while a heap dump taken at this point: > !image-2020-12-09-10-57-21-994.png! > lists an usage of about 666MB altogether. > Switching to {{offheap_objects}}, the reported numbers are > {code} > 100 ops, 58.174MiB serialized bytes, 233.650MiB (12%) on heap, 53.406MiB > (3%) off-heap > {code} > while actual heap usage: > !image-2020-12-09-11-01-31-273.png! > is about 442MB. > Looking at the code we definitely are not counting the > {{AtomicBTreePartition.Holder}}, {{EncodingStats}}, liveness and deletion > info objects associated with each partition, and most probably others. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org