[jira] [Comment Edited] (CASSANDRA-12228) Write performance regression in 3.x vs 3.0

2016-08-03 Thread Ariel Weisberg (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-12228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403197#comment-15403197
 ] 

Ariel Weisberg edited comment on CASSANDRA-12228 at 8/3/16 2:35 PM:


|[trunk|https://github.com/apache/cassandra/compare/trunk...aweisberg:CASSANDRA-12228-trunk?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-testall/3/]|[dtest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-dtest/3/]|

I set the default back to the calculation that existed before and updated the 
documentation. I would like to remove memtable_cleanup_threshold scoped as part 
of this ticket. I can't see why you would set that to anything other than the 
default calculation and removing it would help offset the additional lines of 
doc for memtable_flush_writers.

Somewhat orthogonal regarding my earlier comments about too much memory 
utilization. I created CASSANDRA-12358 for that.


was (Author: aweisberg):
|[trunk|https://github.com/apache/cassandra/compare/trunk...aweisberg:CASSANDRA-12228-trunk?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-testall/1/]|[dtest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-dtest/1/]|

I set the default back to the calculation that existed before and updated the 
documentation. I would like to remove memtable_cleanup_threshold scoped as part 
of this ticket. I can't see why you would set that to anything other than the 
default calculation and removing it would help offset the additional lines of 
doc for memtable_flush_writers.

Somewhat orthogonal regarding my earlier comments about too much memory 
utilization. I created CASSANDRA-12358 for that.

> Write performance regression in 3.x vs 3.0
> --
>
> Key: CASSANDRA-12228
> URL: https://issues.apache.org/jira/browse/CASSANDRA-12228
> Project: Cassandra
>  Issue Type: Bug
>Reporter: T Jake Luciani
>Assignee: Ariel Weisberg
>Priority: Minor
> Fix For: 3.10
>
>
> I've been tracking down a performance issue in trunk vs cassandra-3.0 branch.
> I think I've found it.  CASSANDRA-6696 changed the default memtable flush 
> default to 1 vs the min of 2 in cassandra-3.0.
> I don't see any technical reason for this and we should add back the min of 2 
> sstable flushers per disk.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-12228) Write performance regression in 3.x vs 3.0

2016-08-03 Thread Ariel Weisberg (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-12228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403197#comment-15403197
 ] 

Ariel Weisberg edited comment on CASSANDRA-12228 at 8/3/16 2:34 PM:


|[trunk|https://github.com/apache/cassandra/compare/trunk...aweisberg:CASSANDRA-12228-trunk?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-testall/1/]|[dtest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-dtest/1/]|

I set the default back to the calculation that existed before and updated the 
documentation. I would like to remove memtable_cleanup_threshold scoped as part 
of this ticket. I can't see why you would set that to anything other than the 
default calculation and removing it would help offset the additional lines of 
doc for memtable_flush_writers.

Somewhat orthogonal regarding my earlier comments about too much memory 
utilization. I created CASSANDRA-12358 for that.


was (Author: aweisberg):
|[3.9|https://github.com/apache/cassandra/compare/cassandra-3.9...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-testall/1/]|[dtest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-dtest/1/]|
|[trunk|https://github.com/apache/cassandra/compare/trunk...aweisberg:CASSANDRA-12228-trunk?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-testall/1/]|[dtest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-dtest/1/]|

I set the default back to the calculation that existed before and updated the 
documentation. I would like to remove memtable_cleanup_threshold scoped as part 
of this ticket. I can't see why you would set that to anything other than the 
default calculation and removing it would help offset the additional lines of 
doc for memtable_flush_writers.

Somewhat orthogonal regarding my earlier comments about too much memory 
utilization. I created CASSANDRA-12358 for that.

> Write performance regression in 3.x vs 3.0
> --
>
> Key: CASSANDRA-12228
> URL: https://issues.apache.org/jira/browse/CASSANDRA-12228
> Project: Cassandra
>  Issue Type: Bug
>Reporter: T Jake Luciani
>Assignee: Ariel Weisberg
>Priority: Minor
> Fix For: 3.10
>
>
> I've been tracking down a performance issue in trunk vs cassandra-3.0 branch.
> I think I've found it.  CASSANDRA-6696 changed the default memtable flush 
> default to 1 vs the min of 2 in cassandra-3.0.
> I don't see any technical reason for this and we should add back the min of 2 
> sstable flushers per disk.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-12228) Write performance regression in 3.x vs 3.0

2016-08-03 Thread Ariel Weisberg (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-12228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403197#comment-15403197
 ] 

Ariel Weisberg edited comment on CASSANDRA-12228 at 8/3/16 2:31 PM:


|[3.9|https://github.com/apache/cassandra/compare/cassandra-3.9...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-testall/1/]|[dtest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-dtest/1/]|
|[trunk|https://github.com/apache/cassandra/compare/trunk...aweisberg:CASSANDRA-12228-trunk?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-testall/1/]|[dtest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-dtest/1/]|

I set the default back to the calculation that existed before and updated the 
documentation. I would like to remove memtable_cleanup_threshold scoped as part 
of this ticket. I can't see why you would set that to anything other than the 
default calculation and removing it would help offset the additional lines of 
doc for memtable_flush_writers.

Somewhat orthogonal regarding my earlier comments about too much memory 
utilization. I created CASSANDRA-12358 for that.


was (Author: aweisberg):
|[3.9|https://github.com/apache/cassandra/compare/cassandra-3.9...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-testall/1/]|[dtest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-dtest/1/]|
|[trunk|https://github.com/apache/cassandra/compare/trunk...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-testall/1/]|[dtest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-dtest/1/]|

I set the default back to the calculation that existed before and updated the 
documentation. I would like to remove memtable_cleanup_threshold scoped as part 
of this ticket. I can't see why you would set that to anything other than the 
default calculation and removing it would help offset the additional lines of 
doc for memtable_flush_writers.

Somewhat orthogonal regarding my earlier comments about too much memory 
utilization. I created CASSANDRA-12358 for that.

> Write performance regression in 3.x vs 3.0
> --
>
> Key: CASSANDRA-12228
> URL: https://issues.apache.org/jira/browse/CASSANDRA-12228
> Project: Cassandra
>  Issue Type: Bug
>Reporter: T Jake Luciani
>Assignee: Ariel Weisberg
>Priority: Minor
> Fix For: 3.10
>
>
> I've been tracking down a performance issue in trunk vs cassandra-3.0 branch.
> I think I've found it.  CASSANDRA-6696 changed the default memtable flush 
> default to 1 vs the min of 2 in cassandra-3.0.
> I don't see any technical reason for this and we should add back the min of 2 
> sstable flushers per disk.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-12228) Write performance regression in 3.x vs 3.0

2016-08-02 Thread Ariel Weisberg (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-12228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403197#comment-15403197
 ] 

Ariel Weisberg edited comment on CASSANDRA-12228 at 8/2/16 5:30 PM:


|[3.9|https://github.com/apache/cassandra/compare/cassandra-3.9...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-testall/1/]|[dtest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-dtest/1/]|
|[trunk|https://github.com/apache/cassandra/compare/trunk...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-testall/1/]|[dtest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-dtest/1/]|

I set the default back to the calculation that existed before and updated the 
documentation. I would like to remove memtable_cleanup_threshold scoped as part 
of this ticket. I can't see why you would set that to anything other than the 
default calculation and removing it would help offset the additional lines of 
doc for memtable_flush_writers.

Somewhat orthogonal regarding my earlier comments about too much memory 
utilization. I created CASSANDRA-12358 for that.


was (Author: aweisberg):
|[3.9|https://github.com/apache/cassandra/compare/cassandra-3.9...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-testall/1/]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-dtest/1/]|
|[trunk|https://github.com/apache/cassandra/compare/trunk...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-testall/1/]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-dtest/1/]|

I set the default back to the calculation that existed before and updated the 
documentation. I would like to remove memtable_cleanup_threshold scoped as part 
of this ticket. I can't see why you would set that to anything other than the 
default calculation and removing it would help offset the additional lines of 
doc for memtable_flush_writers.

Somewhat orthogonal regarding my earlier comments about too much memory 
utilization. I created CASSANDRA-12358 for that.

> Write performance regression in 3.x vs 3.0
> --
>
> Key: CASSANDRA-12228
> URL: https://issues.apache.org/jira/browse/CASSANDRA-12228
> Project: Cassandra
>  Issue Type: Bug
>Reporter: T Jake Luciani
>Assignee: Ariel Weisberg
>Priority: Minor
> Fix For: 3.9
>
>
> I've been tracking down a performance issue in trunk vs cassandra-3.0 branch.
> I think I've found it.  CASSANDRA-6696 changed the default memtable flush 
> default to 1 vs the min of 2 in cassandra-3.0.
> I don't see any technical reason for this and we should add back the min of 2 
> sstable flushers per disk.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-12228) Write performance regression in 3.x vs 3.0

2016-08-02 Thread Marcus Eriksson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-12228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15404078#comment-15404078
 ] 

Marcus Eriksson edited comment on CASSANDRA-12228 at 8/2/16 2:43 PM:
-

Since 6696 the configuration sets how many flush writers there should be *per 
data directory* - so factoring in the number of data directories in the 
calculation is probably wrong (ie, with 10 data directories and 2 memtable 
flush writers you would get 20 threads actually doing writing to disk and 2 
threads waiting on the writing, meaning we can flush 2 memtables concurrently)

edit: re-read your yaml comment where you explain exactly what I wrote above :) 
- but why would we want more threads per data directory if we have more data 
directories?


was (Author: krummas):
Since 6696 the configuration sets how many flush writers there should be *per 
data directory* - so factoring in the number of data directories in the 
calculation is probably wrong (ie, with 10 data directories and 2 memtable 
flush writers you would get 20 threads actually doing writing to disk and 2 
threads waiting on the writing, meaning we can flush 2 memtables concurrently)



> Write performance regression in 3.x vs 3.0
> --
>
> Key: CASSANDRA-12228
> URL: https://issues.apache.org/jira/browse/CASSANDRA-12228
> Project: Cassandra
>  Issue Type: Bug
>Reporter: T Jake Luciani
>Assignee: Ariel Weisberg
>Priority: Minor
> Fix For: 3.9
>
>
> I've been tracking down a performance issue in trunk vs cassandra-3.0 branch.
> I think I've found it.  CASSANDRA-6696 changed the default memtable flush 
> default to 1 vs the min of 2 in cassandra-3.0.
> I don't see any technical reason for this and we should add back the min of 2 
> sstable flushers per disk.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-12228) Write performance regression in 3.x vs 3.0

2016-08-01 Thread Ariel Weisberg (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-12228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403197#comment-15403197
 ] 

Ariel Weisberg edited comment on CASSANDRA-12228 at 8/2/16 1:45 AM:


|[3.9|https://github.com/apache/cassandra/compare/cassandra-3.9...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-testall/1/]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-dtest/1/]|
|[trunk|https://github.com/apache/cassandra/compare/trunk...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-testall/1/]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-dtest/1/]|

I set the default back to the calculation that existed before and updated the 
documentation. I would like to remove memtable_cleanup_threshold scoped as part 
of this ticket. I can't see why you would set that to anything other than the 
default calculation and removing it would help offset the additional lines of 
doc for memtable_flush_writers.

Somewhat orthogonal regarding my earlier comments about too much memory 
utilization. I created CASSANDRA-12358 for that.


was (Author: aweisberg):
|[3.9|https://github.com/apache/cassandra/compare/cassandra-3.9...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-testall/1/]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-3.9-dtest/1/]|
|[trunk|https://github.com/apache/cassandra/compare/trunk...aweisberg:CASSANDRA-12228-3.9?expand=1]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-testall/1/]|[utest|https://cassci.datastax.com/view/Dev/view/aweisberg/job/aweisberg-CASSANDRA-12228-trunk-dtest/1/]|

> Write performance regression in 3.x vs 3.0
> --
>
> Key: CASSANDRA-12228
> URL: https://issues.apache.org/jira/browse/CASSANDRA-12228
> Project: Cassandra
>  Issue Type: Bug
>Reporter: T Jake Luciani
>Assignee: Ariel Weisberg
>Priority: Minor
> Fix For: 3.9
>
>
> I've been tracking down a performance issue in trunk vs cassandra-3.0 branch.
> I think I've found it.  CASSANDRA-6696 changed the default memtable flush 
> default to 1 vs the min of 2 in cassandra-3.0.
> I don't see any technical reason for this and we should add back the min of 2 
> sstable flushers per disk.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-12228) Write performance regression in 3.x vs 3.0

2016-07-29 Thread Ariel Weisberg (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-12228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15400133#comment-15400133
 ] 

Ariel Weisberg edited comment on CASSANDRA-12228 at 7/29/16 10:28 PM:
--

There are some remaining issues with thread pool sizes. See 
[CASSANDRA-12071|https://issues.apache.org/jira/browse/CASSANDRA-12071?focusedCommentId=15400086=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15400086].

You still can't get multiple threads if you have a single disk due to TPE not 
spinning up additional threads if you are using an unbounded queue.  Seems like 
this would be a good place to address the related issue. I also don't think 
this is minor it's pretty crippling for performance and you can't work around 
it by changing configuration values.


was (Author: aweisberg):
There are some remaining issues with thread pool sizes. See 
[CASSANDRA-12071|https://issues.apache.org/jira/browse/CASSANDRA-12071?focusedCommentId=15400086=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15400086].

You still can't get multiple threads if you have a single disk.  Seems like 
this would be a good place to address the related issue. I also don't think 
this is minor it's pretty crippling for performance and you can't work around 
it by changing configuration values.

> Write performance regression in 3.x vs 3.0
> --
>
> Key: CASSANDRA-12228
> URL: https://issues.apache.org/jira/browse/CASSANDRA-12228
> Project: Cassandra
>  Issue Type: Bug
>Reporter: T Jake Luciani
>Assignee: Marcus Eriksson
>Priority: Minor
> Fix For: 3.9
>
>
> I've been tracking down a performance issue in trunk vs cassandra-3.0 branch.
> I think I've found it.  CASSANDRA-6696 changed the default memtable flush 
> default to 1 vs the min of 2 in cassandra-3.0.
> I don't see any technical reason for this and we should add back the min of 2 
> sstable flushers per disk.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)