Re: Many pending compactions
Try repair -pr on all nodes. If after that you still have issues, you can try to rebuild the SSTables using nodetool upgradesstables or scrub. Regards, Roni Balthazar On 18/02/2015, at 14:13, Ja Sam ptrstp...@gmail.com wrote: ad 3) I did this already yesterday (setcompactionthroughput also). But SSTables are still increasing. ad 1) What do you think: should I use -pr or try incremental? On Wed, Feb 18, 2015 at 4:54 PM, Roni Balthazar ronibaltha...@gmail.com wrote: You are right... Repair makes the data consistent between nodes. I understand that you have 2 issues going on. You need to run repair periodically without errors and need to decrease the number of pending compactions. So I suggest: 1) Run repair -pr on all nodes. If you upgrade to the new 2.1.3, you can use incremental repairs. There were some bugs in 2.1.2. 2) Run cleanup on all nodes 3) Since you have too many cold SSTables, set cold_reads_to_omit to 0.0, and increase setcompactionthroughput for some time and see if the number of SSTables is going down. Let us know what errors you are getting when running repairs. Regards, Roni Balthazar On Wed, Feb 18, 2015 at 1:31 PM, Ja Sam ptrstp...@gmail.com wrote: Can you explain to me the correlation between growing SSTables and repair? I was sure, until your mail, that repair only makes data consistent between nodes. Regards On Wed, Feb 18, 2015 at 4:20 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Which error are you getting when running repairs? You need to run repair on your nodes within gc_grace_seconds (e.g. weekly). They have data that is not read frequently. You can run repair -pr on all nodes. Since you do not have deletes, you will not have trouble with that. If you have deletes, it's better to increase gc_grace_seconds before the repair. http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_repair_nodes_c.html After repair, try to run a nodetool cleanup. 
Check if the number of SSTables goes down after that... Pending compactions must decrease as well... Cheers, Roni Balthazar On Wed, Feb 18, 2015 at 12:39 PM, Ja Sam ptrstp...@gmail.com wrote: 1) We tried to run repairs but they usually do not succeed. But we had Leveled compaction before. Last week we ALTERed the tables to STCS, because the guys from DataStax suggested that we should not use Leveled and should alter the tables to STCS, because we don't have SSDs. After this change we did not run any repair. Anyway, I don't think it will change anything in the SSTable count - if I am wrong please let me know 2) I did this. My tables are 99% write-only. It is an audit system 3) Yes, I am using default values 4) In both operations I am using LOCAL_QUORUM. I am almost sure that the READ timeouts happen because of too many SSTables. Anyway, first I would like to fix the too many pending compactions. I still don't know how to speed them up. On Wed, Feb 18, 2015 at 2:49 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Are you running repairs within gc_grace_seconds? (default is 10 days) http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_repair_nodes_c.html Double-check that you set cold_reads_to_omit to 0.0 on tables with STCS that you do not read often. Are you using the default values for the properties min_compaction_threshold (4) and max_compaction_threshold (32)? Which Consistency Level are you using for read operations? Check that you are not reading from DC_B due to your Replication Factor and CL. http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/dml_config_consistency_c.html Cheers, Roni Balthazar On Wed, Feb 18, 2015 at 11:07 AM, Ja Sam ptrstp...@gmail.com wrote: I don't have problems with DC_B (the replica); only in DC_A (my system writes only to it) do I have read timeouts. 
I checked the SSTable count in OpsCenter and I have: 1) in DC_A the same +-10% for the last week, a small increase for the last 24h (it is more than 15000-2 SSTables depending on the node) 2) in DC_B the last 24h shows up to a 50% decrease, which gives a nice prognosis. Now I have less than 1000 SSTables What did you measure during system optimizations? Or do you have an idea what more I should check? 1) I look at CPU idle (one node is 50% idle, the rest 70% idle) 2) Disk queue - mostly it is near zero: avg 0.09. Sometimes there are spikes 3) system RAM usage is almost full 4) In Total Bytes Compacted most lines are below 3MB/s. For total DC_A it is less than 10MB/s; in DC_B it looks much better (avg is like 17MB/s) something else? On Wed, Feb 18, 2015 at 1:32 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, You can check if the number of SSTables is decreasing. Look for the SSTable count information of your tables using nodetool
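Roni's three-step suggestion above (repair -pr, cleanup, temporarily raised compaction throughput) can be sketched as a dry-run script. The hostnames are placeholders, and on a real cluster you would typically run the nodes one at a time:

```shell
#!/bin/sh
# Dry-run sketch of the suggested sequence: primary-range repair and cleanup
# on every node, then a raised compaction throughput cap while the backlog
# drains. Drop the echo prefixes to actually execute against a cluster.
HOSTS="node1 node2 node3"
for h in $HOSTS; do
  echo "nodetool -h $h repair -pr"
  echo "nodetool -h $h cleanup"
done
# Raise the throughput cap (MB/s); restore the default (16) afterwards and
# watch 'nodetool compactionstats' to see whether pending compactions drop.
echo "nodetool setcompactionthroughput 999"
```

Primary-range repair (`-pr`) avoids repairing each token range once per replica, which is why it must be run on every node to cover the whole ring.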
Re: Many pending compactions
As Al Tobey suggested, I upgraded my 2.1.0 to a snapshot version of 2.1.3. I have now installed exactly this build: https://cassci.datastax.com/job/cassandra-2.1/912/ I see many compactions which complete, but some of them are really slow. Maybe I should send some stats from OpsCenter or the servers? But it is difficult for me to choose what is important Regards On Wed, Feb 18, 2015 at 6:11 PM, Jake Luciani jak...@gmail.com wrote: Ja, Please upgrade to the official 2.1.3; we've fixed many things related to compaction. Are you seeing the compactions % complete progress at all?
Re: Many pending compactions
Ja, Please upgrade to official 2.1.3 we've fixed many things related to compaction. Are you seeing the compactions % complete progress at all?
Re: Cassandra install on JRE vs JDK
The "natural" dependency of Cassandra is the JRE (not the JDK) - e.g. in the Debian package. You should be safe using the JRE instead of the JDK. If you're asking whether to use a non-Oracle JVM - the answer would be: use the Oracle JVM. OpenJDK might work, but I'd not recommend it. On 18.02.2015 at 20:49, cass savy casss...@gmail.com wrote: Can we install the Oracle JDK instead of the JRE on Cassandra servers? We have a few clusters running the JDK since we upgraded to C* 2.0. Is there any known issue or impact with using the JDK vs the JRE? What is the reason not to use the Oracle JDK on C* servers? Is there any performance impact? Please advise. — Robert Stupp @snazy
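The JRE-vs-JDK distinction above can be checked mechanically: a JDK install ships bin/javac (plus troubleshooting tools like jstack and jmap) that a bare JRE does not. A small sketch, demonstrated against a scratch directory standing in for JAVA_HOME:

```shell
#!/bin/sh
# Tell a JDK from a plain JRE by the presence of the compiler binary.
check_java_home() {
  # JDK layouts include bin/javac; JRE-only layouts do not
  if [ -x "$1/bin/javac" ]; then echo "JDK"; else echo "JRE"; fi
}
# Demo against a scratch directory (a stand-in for a real JAVA_HOME)
demo="$(mktemp -d)"
mkdir -p "$demo/bin"
touch "$demo/bin/javac" && chmod +x "$demo/bin/javac"
check_java_home "$demo"   # prints: JDK
```

On a real host you would call `check_java_home "$JAVA_HOME"` instead of the scratch directory.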
Re: Deleting Statistics.db at startup
On Wed, Feb 18, 2015 at 4:02 AM, Tomer Pearl tomer.pe...@contextream.com wrote: My question is what are the consequences of deleting this file every time the node is starting up? Performance-wise or otherwise. You waste the time Cassandra spends regenerating it. I personally would not institute an operational practice whereby I regularly purged these files to avoid OOM. =Rob
C* 2.1.2 invokes oom-killer
Hi, A couple of times a day, 2 out of 4 nodes in the cluster are killed: root@db4:~# dmesg | grep -i oom [4811135.792657] [ pid ] uid tgid total_vm rss cpu oom_adj oom_score_adj name [6559049.307293] java invoked oom-killer: gfp_mask=0x201da, order=0, oom_adj=0, oom_score_adj=0 Nodes are using an 8GB heap (confirmed with *nodetool info*) and aren't using the row cache. I noticed that a couple of times a day the used RSS grows really fast within a couple of minutes, and I see CPU spikes at the same time - https://www.dropbox.com/s/khco2kdp4qdzjit/Screenshot%202015-02-18%2015.10.54.png?dl=0 . It could be related to compaction, but after compaction finishes the used RSS doesn't shrink. Output from pmap when the C* process uses 50GB RAM (out of 64GB) is available at http://paste.ofcode.org/ZjLUA2dYVuKvJHAk9T3Hjb. At the time the dump was made, heap usage was far below 8GB (~3GB) but total RSS was ~50GB. Any help will be appreciated. -- BR, Michał Łowicki
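A quick way to spot the pattern described above (heap well under -Xmx but RSS ballooning) is to compare the heap ceiling against the process RSS. The 2x allowance here for metaspace, thread stacks, and mmapped SSTables is a rough rule of thumb for illustration, not an official threshold:

```shell
#!/bin/sh
# Flag likely native/off-heap memory growth: RSS far beyond the JVM heap
# ceiling suggests the leak is outside the heap (so GC tuning won't help).
check_offheap() {
  heap_gb=$1  # -Xmx in GB
  rss_gb=$2   # process RSS in GB, e.g. from 'ps -o rss= -p <pid>'
  if [ "$rss_gb" -gt $((heap_gb * 2)) ]; then
    echo "WARN: RSS ${rss_gb}G far exceeds ${heap_gb}G heap - suspect off-heap growth"
  else
    echo "OK: RSS ${rss_gb}G plausible for ${heap_gb}G heap"
  fi
}
check_offheap 8 50   # the numbers reported above: 8 GB heap, ~50 GB RSS
```

With the reported numbers the check fires, which matches the observation that the ~3 GB heap usage cannot explain a ~50 GB RSS.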
Re: C* 2.1.2 invokes oom-killer
On Wed, Feb 18, 2015 at 10:28 AM, Michał Łowicki mlowi...@gmail.com wrote: Couple of times a day 2 out of 4 members cluster nodes are killed This sort of issue is usually best handled/debugged interactively on IRC. But briefly : - 2.1.2 is IMO broken for production. Downgrade (officially unsupported but fine between these versions) to 2.1.1 or upgrade to 2.1.3. - Beyond that, look at the steady state heap consumption. With 2.1.2, it would likely take at least 1TB of data to fill heap in steady state to near-failure. =Rob
Re: Cassandra install on JRE vs JDK
Yes, you can use the Oracle JDK if you prefer; I've been using the JDK with Cassandra in production for years without issue. Regards, Mark On 18 February 2015 at 19:49, cass savy casss...@gmail.com wrote: Can we install the Oracle JDK instead of the JRE on Cassandra servers? We have a few clusters running the JDK since we upgraded to C* 2.0. Is there any known issue or impact with using the JDK vs the JRE? What is the reason not to use the Oracle JDK on C* servers? Is there any performance impact? Please advise.
Re: Cassandra install on JRE vs JDK
Thanks Mark for the quick response. What version of Cassandra and the JDK are you using in prod? On Wed, Feb 18, 2015 at 11:58 AM, Mark Reddy mark.l.re...@gmail.com wrote: Yes, you can use the Oracle JDK if you prefer; I've been using the JDK with Cassandra in production for years without issue. Regards, Mark
Re: Cassandra install on JRE vs JDK
Thanks Robert for the quick response. I use the Oracle JDK, not OpenJDK. On Wed, Feb 18, 2015 at 11:54 AM, Robert Stupp sn...@snazy.de wrote: The "natural" dependency of Cassandra is the JRE (not the JDK) - e.g. in the Debian package. You should be safe using the JRE instead of the JDK.
Deleting Statistics.db at startup
Hello, I have received the following error:

ERROR [SSTableBatchOpen:2] 2015-01-19 13:55:28,478 CassandraDaemon.java (line 196) Exception in thread Thread[SSTableBatchOpen:2,5,main]
java.lang.OutOfMemoryError: Java heap space
    at org.apache.cassandra.utils.EstimatedHistogram$EstimatedHistogramSerializer.deserialize(EstimatedHistogram.java:335)
    at org.apache.cassandra.io.sstable.SSTableMetadata$SSTableMetadataSerializer.deserialize(SSTableMetadata.java:462)
    at org.apache.cassandra.io.sstable.SSTableMetadata$SSTableMetadataSerializer.deserialize(SSTableMetadata.java:448)
    at org.apache.cassandra.io.sstable.SSTableMetadata$SSTableMetadataSerializer.deserialize(SSTableMetadata.java:432)
    at org.apache.cassandra.io.sstable.SSTableReader.openMetadata(SSTableReader.java:225)
    at org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:194)
    at org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:184)
    at org.apache.cassandra.io.sstable.SSTableReader$1.run(SSTableReader.java:264)
    at java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source)
    at java.util.concurrent.FutureTask$Sync.innerRun(Unknown Source)
    at java.util.concurrent.FutureTask.run(Unknown Source)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.lang.Thread.run(Unknown Source)

I have found a solution here: http://www.mail-archive.com/user%40cassandra.apache.org/msg23682.html which advises deleting the Statistics.db file. My question is: what are the consequences of deleting this file every time the node starts up? Performance-wise or otherwise. Thanks, Tomer.
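The workaround from the linked thread can be sketched as follows. It is demonstrated here against a scratch directory standing in for the Cassandra data directory; on a real node you would point it at the keyspace directories while Cassandra is stopped, since Cassandra regenerates the Statistics.db component when it reopens each SSTable:

```shell
#!/bin/sh
# Delete only the Statistics.db component of each SSTable; the Data.db,
# Index.db etc. components are untouched and Cassandra rebuilds the
# statistics at startup (at the cost of the regeneration time).
DATA_DIR="$(mktemp -d)"   # scratch stand-in for /var/lib/cassandra/data
touch "$DATA_DIR/ks1-cf1-jb-1-Statistics.db" "$DATA_DIR/ks1-cf1-jb-1-Data.db"
# Audit first with -print if unsure, then delete:
find "$DATA_DIR" -name '*-Statistics.db' -delete
ls "$DATA_DIR"   # only the Data.db component remains
```

The SSTable file names here are made up for the demo; real components follow the same `*-Statistics.db` suffix pattern.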
Re: Many pending compactions
I don't have problems with DC_B (the replica); only in DC_A (my system writes only to it) do I have read timeouts. I checked the SSTable count in OpsCenter and I have: 1) in DC_A the same +-10% for the last week, a small increase for the last 24h (it is more than 15000-2 SSTables depending on the node) 2) in DC_B the last 24h shows up to a 50% decrease, which gives a nice prognosis. Now I have less than 1000 SSTables What did you measure during system optimizations? Or do you have an idea what more I should check? 1) I look at CPU idle (one node is 50% idle, the rest 70% idle) 2) Disk queue - mostly it is near zero: avg 0.09. Sometimes there are spikes 3) system RAM usage is almost full 4) In Total Bytes Compacted most lines are below 3MB/s. For total DC_A it is less than 10MB/s; in DC_B it looks much better (avg is like 17MB/s) something else? On Wed, Feb 18, 2015 at 1:32 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, You can check if the number of SSTables is decreasing. Look for the SSTable count information of your tables using nodetool cfstats. The compaction history can be viewed using nodetool compactionhistory. About the timeouts, check this out: http://www.datastax.com/dev/blog/how-cassandra-deals-with-replica-failure Also try running nodetool tpstats to see the thread statistics. It can help you find out whether you are having performance problems. If you are having too many pending tasks or dropped messages, maybe you will need to tune your system (e.g. driver timeouts, concurrent reads and so on) Regards, Roni Balthazar On Wed, Feb 18, 2015 at 9:51 AM, Ja Sam ptrstp...@gmail.com wrote: Hi, Thanks for your tip; it looks like something changed - I still don't know if it is ok. My nodes started to do more compaction, but it looks like some compactions are really slow. In IO we have idle, CPU is quite ok (30%-40%). We set compactionthroughput to 999, but I do not see a difference. Can we check something more? Or do you have any method to monitor progress with small files? 
Regards On Tue, Feb 17, 2015 at 2:43 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, Yes... I had the same issue and setting cold_reads_to_omit to 0.0 was the solution... The number of SSTables decreased from many thousands to a number below a hundred, and the SSTables are now much bigger, with several gigabytes (most of them). Cheers, Roni Balthazar On Tue, Feb 17, 2015 at 11:32 AM, Ja Sam ptrstp...@gmail.com wrote: After some diagnostics (we didn't set cold_reads_to_omit yet). Compactions are running but VERY slowly, with idle IO. We had a lot of Data files in Cassandra. In DC_A it is about ~12 (only xxx-Data.db); DC_B has only ~4000. I don't know if this changes anything, but: 1) in DC_A the avg size of a Data.db file is ~13 mb. I have a few really big ones, but most are really small (almost 1 files are less than 100mb). 2) in DC_B the avg size of a Data.db is much bigger, ~260mb. Do you think that the above flag will help us? On Tue, Feb 17, 2015 at 9:04 AM, Ja Sam ptrstp...@gmail.com wrote: I set setcompactionthroughput 999 permanently and it doesn't change anything. IO is still the same. CPU is idle. On Tue, Feb 17, 2015 at 1:15 AM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, You can run nodetool compactionstats to view statistics on compactions. Setting cold_reads_to_omit to 0.0 can help to reduce the number of SSTables when you use Size-Tiered compaction. You can also create a cron job to increase the value of setcompactionthroughput during the night or when your IO is not busy. From http://wiki.apache.org/cassandra/NodeTool: 0 0 * * * root nodetool -h `hostname` setcompactionthroughput 999 0 6 * * * root nodetool -h `hostname` setcompactionthroughput 16 Cheers, Roni Balthazar On Mon, Feb 16, 2015 at 7:47 PM, Ja Sam ptrstp...@gmail.com wrote: One thing I do not understand. In my case compaction is running permanently. Is there a way to check which compactions are pending? The only information is about the total count. 
On Monday, February 16, 2015, Ja Sam ptrstp...@gmail.com wrote: Of course I made a mistake. I am using 2.1.2. Anyway, a nightly build is available from http://cassci.datastax.com/job/cassandra-2.1/ I read about cold_reads_to_omit. It looks promising. Should I also set the compaction throughput? p.s. I am really sad that I didn't read this before: https://engineering.eventbrite.com/what-version-of-cassandra-should-i-run/ On Monday, February 16, 2015, Carlos Rolo r...@pythian.com wrote: Hi, 100% in agreement with Roland; the 2.1.x series is a pain! I would never recommend the current 2.1.x series for production. Clocks are a pain, and check your connectivity! Also check tpstats to see if your
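The nodetool checks suggested through this thread (compaction statistics and history, per-table SSTable counts, threadpool stats) can be collected in one pass. Printed here as a dry run; drop the echo prefix to execute on a live node:

```shell
#!/bin/sh
# One monitoring pass over the commands recommended in this thread:
#   compactionstats     - currently running/pending compactions
#   compactionhistory   - what has already been compacted
#   cfstats             - per-table SSTable counts
#   tpstats             - pending tasks and dropped messages
for cmd in compactionstats compactionhistory cfstats tpstats; do
  echo "nodetool $cmd"
done
```

Running this on a schedule (e.g. from cron) and diffing the SSTable counts over time shows directly whether pending compactions are draining.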
Re: Adding new node to cluster
Hello, Please note that DataStax has updated the documentation for replacing a seed node. The new docs outline a simplified process to help avoid the confusion on this topic. http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_replace_seed_node.html Jonathan Jonathan Lacefield Solution Architect | (404) 822 3487 | jlacefi...@datastax.com On Tue, Feb 17, 2015 at 8:04 PM, Robert Coli rc...@eventbrite.com wrote: On Tue, Feb 17, 2015 at 2:25 PM, sean_r_dur...@homedepot.com wrote: SimpleSnitch is not rack aware. You would want to choose seed nodes and then not change them. Seed nodes apparently don't bootstrap. No one seems to know what a seed node actually *is*, but seed nodes can in fact bootstrap. They just have to temporarily forget to tell themselves that they are a seed node while bootstrapping, and then other nodes will still gossip to it as a seed once it comes up, even though it doesn't consider itself a seed. https://issues.apache.org/jira/browse/CASSANDRA-5836?focusedCommentId=13727032page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13727032 Replacing a seed node is a very common operation, and this best practice is confusing/poorly documented. There are regular contacts to #cassandra/cassandra-user@ where people ask how to replace a seed node and are confused by the answer. The workaround also means that, if you do not restart your node after bootstrapping it (and changing the conf file back to indicate to itself that it is a seed), the node runs until the next restart without any understanding that it is a seed node. 
Being a seed node appears to mean two things : 1) I have myself as an entry in my own seed list, so I know that I am a seed. 2) Other nodes have me in their seed list, so they consider me a seed. The current code checks for 1) and refuses to bootstrap. The workaround is to remove the 1) state temporarily. But if it is unsafe to bootstrap a seed node because of either 1) or 2), the workaround is unsafe. Can you explicate the special cases here? I sincerely would like to understand why the code tries to prevent a seed from bootstrapping when one can clearly, and apparently safely, bootstrap a seed. Unfortunately, there has been no answer. =Rob
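The workaround Rob describes (temporarily removing state 1, the node's own entry in its seed list) amounts to an edit of the `- seeds:` line in cassandra.yaml before bootstrap, restored afterwards. A sketch with placeholder addresses, demonstrated against a scratch copy of the yaml; note `sed -i` as used here is the GNU form:

```shell
#!/bin/sh
# Strip this node's own address from its seed list so it will bootstrap,
# then (after bootstrap) the original line would be restored and the node
# restarted so it again knows it is a seed.
YAML="$(mktemp)"   # scratch stand-in for /etc/cassandra/cassandra.yaml
printf '          - seeds: "10.0.0.1,10.0.0.2,10.0.0.3"\n' > "$YAML"
SELF="10.0.0.2"    # this node's own address (placeholder)
# Handle the address in the middle or at the end of the list
sed -i "s/$SELF,//; s/,$SELF//" "$YAML"
grep -- '- seeds' "$YAML"
```

After the edit the seed list reads `"10.0.0.1,10.0.0.3"`, so the node no longer sees itself as a seed (state 1 removed) while the other nodes' lists (state 2) are untouched.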
Re: Many pending compactions
Hi, Thanks for your tip; it looks like something changed - I still don't know if it is ok. My nodes started to do more compaction, but it looks like some compactions are really slow. In IO we have idle, CPU is quite ok (30%-40%). We set compactionthroughput to 999, but I do not see a difference. Can we check something more? Or do you have any method to monitor progress with small files? Regards On Tue, Feb 17, 2015 at 2:43 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, Yes... I had the same issue and setting cold_reads_to_omit to 0.0 was the solution... The number of SSTables decreased from many thousands to a number below a hundred, and the SSTables are now much bigger, with several gigabytes (most of them). Cheers, Roni Balthazar On Tue, Feb 17, 2015 at 11:32 AM, Ja Sam ptrstp...@gmail.com wrote: After some diagnostics (we didn't set cold_reads_to_omit yet). Compactions are running but VERY slowly, with idle IO. We had a lot of Data files in Cassandra. In DC_A it is about ~12 (only xxx-Data.db); DC_B has only ~4000. I don't know if this changes anything, but: 1) in DC_A the avg size of a Data.db file is ~13 mb. I have a few really big ones, but most are really small (almost 1 files are less than 100mb). 2) in DC_B the avg size of a Data.db is much bigger, ~260mb. Do you think that the above flag will help us? On Tue, Feb 17, 2015 at 9:04 AM, Ja Sam ptrstp...@gmail.com wrote: I set setcompactionthroughput 999 permanently and it doesn't change anything. IO is still the same. CPU is idle. On Tue, Feb 17, 2015 at 1:15 AM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, You can run nodetool compactionstats to view statistics on compactions. Setting cold_reads_to_omit to 0.0 can help to reduce the number of SSTables when you use Size-Tiered compaction. You can also create a cron job to increase the value of setcompactionthroughput during the night or when your IO is not busy. 
From http://wiki.apache.org/cassandra/NodeTool: 0 0 * * * root nodetool -h `hostname` setcompactionthroughput 999 0 6 * * * root nodetool -h `hostname` setcompactionthroughput 16 Cheers, Roni Balthazar On Mon, Feb 16, 2015 at 7:47 PM, Ja Sam ptrstp...@gmail.com wrote: One thing I do not understand. In my case compaction is running permanently. Is there a way to check which compactions are pending? The only information is about the total count. On Monday, February 16, 2015, Ja Sam ptrstp...@gmail.com wrote: Of course I made a mistake. I am using 2.1.2. Anyway, a nightly build is available from http://cassci.datastax.com/job/cassandra-2.1/ I read about cold_reads_to_omit. It looks promising. Should I also set the compaction throughput? p.s. I am really sad that I didn't read this before: https://engineering.eventbrite.com/what-version-of-cassandra-should-i-run/ On Monday, February 16, 2015, Carlos Rolo r...@pythian.com wrote: Hi, 100% in agreement with Roland; the 2.1.x series is a pain! I would never recommend the current 2.1.x series for production. Clocks are a pain, and check your connectivity! Also check tpstats to see if your threadpools are being overrun. Regards, Carlos Juzarte Rolo Cassandra Consultant Pythian - Love your data rolo@pythian | Twitter: cjrolo | Linkedin: linkedin.com/in/carlosjuzarterolo Tel: 1649 www.pythian.com On Mon, Feb 16, 2015 at 8:12 PM, Roland Etzenhammer r.etzenham...@t-online.de wrote: Hi, 1) Actual Cassandra 2.1.3, it was upgraded from 2.1.0 (suggested by Al Tobey from DataStax) 7) minimal reads (usually none, sometimes a few) Those two points keep me repeating an answer I got. First, where did you get 2.1.3 from? Maybe I missed it; I will have a look. But if it is 2.1.2, which is the latest released version, that version has many bugs - most of them I got kicked by while testing 2.1.2. I got many problems with compactions not being triggered on column families not being read, and compactions and repairs not completing. 
See https://www.mail-archive.com/search?l=user@cassandra.apache.orgq=subject:%22Re%3A+Compaction+failing+to+trigger%22o=newestf=1 https://www.mail-archive.com/user%40cassandra.apache.org/msg40768.html Apart from that, how are those two datacenters connected? Maybe there is a bottleneck. Also, do you have ntp up and running on all nodes to keep all clocks in tight sync? Note: I'm no expert (yet) - just sharing my 2 cents. Cheers, Roland --
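Roland's clock-sync point matters because Cassandra resolves writes by timestamp. A small sketch for flagging skew across nodes; the hostnames are placeholders and the skew values are canned to keep the demo self-contained (on real nodes you would obtain them from your time source, e.g. ntp query output):

```shell
#!/bin/sh
# Flag clock skew beyond a tolerance. The 500 ms tolerance is an
# illustrative choice, not an official recommendation.
TOLERANCE_MS=500
report_skew() {  # args: host skew_ms (may be negative)
  abs="${2#-}"   # strip a leading minus to get the magnitude
  if [ "$abs" -gt "$TOLERANCE_MS" ]; then
    echo "$1: skew ${2}ms EXCEEDS ${TOLERANCE_MS}ms tolerance"
  else
    echo "$1: skew ${2}ms ok"
  fi
}
report_skew node1 12
report_skew node2 -730
```

Any node reported over tolerance should have its ntp daemon checked before trusting repair or timestamp-dependent behaviour on it.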
Re: Many pending compactions
1) We tried to run repairs but they usually do not succeed. But we had Leveled compaction before. Last week we ALTERed the tables to STCS, because the guys from DataStax suggested that we should not use Leveled and should alter the tables to STCS, because we don't have SSDs. After this change we did not run any repair. Anyway, I don't think it will change anything in the SSTable count - if I am wrong, please let me know. 2) I did this. My tables are 99% write-only. It is an audit system. 3) Yes, I am using default values. 4) In both operations I am using LOCAL_QUORUM. I am almost sure that the READ timeouts happen because of too many SSTables. Anyway, first I would like to fix the too many pending compactions. I still don't know how to speed them up.

On Wed, Feb 18, 2015 at 2:49 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Are you running repairs within gc_grace_seconds? (default is 10 days) http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_repair_nodes_c.html Double check if you set cold_reads_to_omit to 0.0 on tables with STCS that you do not read often. Are you using default values for the properties min_compaction_threshold (4) and max_compaction_threshold (32)? Which Consistency Level are you using for read operations? Check that you are not reading from DC_B due to your Replication Factor and CL. http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/dml_config_consistency_c.html Cheers, Roni Balthazar

On Wed, Feb 18, 2015 at 11:07 AM, Ja Sam ptrstp...@gmail.com wrote: I don't have problems with DC_B (replica); only in DC_A (my system writes only to it) do I have read timeouts. I checked the SSTable count in OpsCenter and I have: 1) in DC_A about the same +-10% for the last week, a small increase over the last 24h (it is more than 15000-2 SSTables depending on the node) 2) in DC_B the last 24h shows up to a 50% decrease, which is a good prognosis. Now I have fewer than 1000 SSTables. What did you measure during system optimizations? Or do you have an idea what more I should check?
1) I look at CPU idle (one node is 50% idle, the rest 70% idle) 2) Disk queue - mostly it is near zero: avg 0.09. Sometimes there are spikes. 3) System RAM usage is almost full. 4) In Total Bytes Compacted most lines are below 3MB/s. For DC_A the total is less than 10MB/s; in DC_B it looks much better (avg is about 17MB/s). Anything else?

On Wed, Feb 18, 2015 at 1:32 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, You can check if the number of SSTables is decreasing. Look for the SSTable count information for your tables using nodetool cfstats. The compaction history can be viewed using nodetool compactionhistory. About the timeouts, check this out: http://www.datastax.com/dev/blog/how-cassandra-deals-with-replica-failure Also try running nodetool tpstats to see the thread pool statistics. It can tell you whether you are having performance problems. If you have too many pending tasks or dropped messages, you may need to tune your system (eg: driver timeouts, concurrent reads and so on). Regards, Roni Balthazar

On Wed, Feb 18, 2015 at 9:51 AM, Ja Sam ptrstp...@gmail.com wrote: Hi, Thanks for your tip. It looks like something changed - I still don't know if it is ok. My nodes started to do more compaction, but it looks like some compactions are really slow. IO is idle, CPU is quite ok (30%-40%). We set compactionthroughput to 999, but I do not see a difference. Can we check something more? Or do you have any method to monitor progress with small files? Regards

On Tue, Feb 17, 2015 at 2:43 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, Yes... I had the same issue and setting cold_reads_to_omit to 0.0 was the solution... The number of SSTables decreased from many thousands to a number below a hundred, and the SSTables are now much bigger, several gigabytes each (most of them). Cheers, Roni Balthazar

On Tue, Feb 17, 2015 at 11:32 AM, Ja Sam ptrstp...@gmail.com wrote: After some diagnostics (we didn't set cold_reads_to_omit yet):
Compactions are running but VERY slowly, with idle IO. We have a lot of Data files in Cassandra. In DC_A it is about ~12 (only xxx-Data.db); DC_B has only ~4000. I don't know if this changes anything, but: 1) in DC_A the avg size of a Data.db file is ~13 MB. I have a few really big ones, but most are really small (almost 1 files are less than 100 MB). 2) in DC_B the avg size of a Data.db file is much bigger, ~260 MB. Do you think the above flag will help us?

On Tue, Feb 17, 2015 at 9:04 AM, Ja Sam ptrstp...@gmail.com wrote: I set setcompactionthroughput 999 permanently and it doesn't change anything. IO is still the same. CPU is idle.

On Tue, Feb 17, 2015 at 1:15 AM, Roni Balthazar
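As an aside for readers following this thread: the per-DC SSTable counts being compared above can be reproduced by counting the *-Data.db files on disk. A minimal sketch; on a real node you would point DATA_DIR at the node's data directory (commonly /var/lib/cassandra/data - an assumption about the install layout), but for a self-contained demo it fakes a few files in a temporary directory:

```shell
# Count *-Data.db files, as in the DC_A vs DC_B comparison in the thread.
# Assumption: SSTable data files end in "-Data.db"; we fake a few in a temp dir.
DATA_DIR=$(mktemp -d)
for i in 1 2 3 4; do
  printf 'x' > "$DATA_DIR/keyspace-table-ka-$i-Data.db"
done
count=$(find "$DATA_DIR" -name '*-Data.db' | wc -l | tr -d ' ')
echo "SSTable data files: $count"
rm -rf "$DATA_DIR"
```

Running this against the real data directory on each node gives the count that OpsCenter reports, which makes it easy to watch the number fall as compactions catch up.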
Re: Adding new node to cluster
Hello, I have decommissioned a node, deleted the data, commitlog and saved caches, changed the yaml file to not include its own IP, and started it. For some reason I do not fully understand, OpsCenter says that the node is in an unknown datacenter. Nodetool says UJ but shows ? in the Owns column. I started the node yesterday. I still see streams towards this node, so can I assume that once it finishes joining, OpsCenter will see it properly? I remember that I might have set up the entire initial cluster with the nodes' own IPs in the yaml file. To bring the cluster to a valid state, I assume that I have to decommission the nodes one by one, delete the data, and restart with correct yaml settings. Is this correct? Also a change of the cluster name would be nice. How can this be done with minimal impact?

On Wednesday, February 18, 2015 12:56 AM, Eric Stevens migh...@gmail.com wrote: "Seed nodes apparently don't bootstrap" That's right: if a node has itself in its own seeds list, it assumes it's a foundational member of the cluster, and it will join immediately with no bootstrap. If you've done this by accident, you should run nodetool decommission on that node, and when it has fully left the cluster, wipe its data directory, edit the yaml and remove it from the seeds list.

On Tue, Feb 17, 2015 at 3:25 PM, sean_r_dur...@homedepot.com wrote: SimpleSnitch is not rack aware. You would want to choose seed nodes and then not change them. Seed nodes apparently don't bootstrap. All nodes need the same seeds in the yaml file. Here is more info: http://www.datastax.com/documentation/cassandra/2.0/cassandra/initialize/initializeSingleDS.html Sean Durity - Cassandra Admin, Big Data Team To engage the team, create a request

From: Batranut Bogdan [mailto:batra...@yahoo.com] Sent: Tuesday, February 17, 2015 3:28 PM To: user@cassandra.apache.org; reynald.bourtembo...@esrf.fr Subject: Re: Adding new node to cluster Hello, I use SimpleSnitch. All the nodes are in the same datacenter.
Not sure if all are in the same rack.

On Tuesday, February 17, 2015 8:53 PM, sean_r_dur...@homedepot.com wrote: What snitch are you using? You may need to do some work on your topology file (or rackdc) to make sure you have the topology you want. Also, it is possible you may need to restart the OpsCenter agents and/or your browser to see the nodes represented properly in OpsCenter. Sean Durity - Cassandra Admin, Home Depot

From: Batranut Bogdan [mailto:batra...@yahoo.com] Sent: Tuesday, February 17, 2015 10:20 AM To: user@cassandra.apache.org; reynald.bourtembo...@esrf.fr Subject: Re: Adding new node to cluster Hello, I know that UN is good, but what troubles me is the addition of the node's own IP in its yaml seeds section.

On Tuesday, February 17, 2015 3:40 PM, Reynald Bourtembourg reynald.bourtembo...@esrf.fr wrote: Hi Bogdan, In nodetool status: - UJ means your node is Up and Joining - UN means your node is Up and in Normal state. UN in nodetool is good ;-)

On 17/02/2015 13:56, Batranut Bogdan wrote: Hello all, I have an existing cluster. When adding a new node, I saw that OpsCenter saw the node in an unknown cluster. In the yaml, the cluster name is the same. So I stopped the node and added its own IP address to the list of seeds. Now OpsCenter sees my node. But nodetool status now sees it as UN, instead of UJ as when it first started. One other mention: even if I stop the node and remove its IP from the list of seeds, OpsCenter sees the node in the known cluster but nodetool sees it as UN. I am not sure what the implications of adding a node's IP to its own seed list are, and I think I might have done the same for the existing nodes. E.g. started with its IP in the seed list, but after removing it and having to restart the nodes for whatever reason, I did not see any changes. Is my cluster ok, or what do I need to do to bring the cluster to a good state? Thank you.
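The recovery procedure Eric describes (decommission each node that bootstrapped with itself as a seed, wipe its data, fix the yaml, rejoin) can be sketched as a per-node checklist. This is a dry-run illustration only: the node names, ssh access, and Cassandra paths are placeholders for your environment, and the yaml edit is a manual step.

```shell
# Dry-run plan for fixing nodes that have themselves in their own seeds list.
# NODES and the Cassandra paths are placeholders; this only prints the plan.
NODES="node1 node2 node3"
planned=0
for host in $NODES; do
  echo "nodetool -h $host decommission        # wait until the node has fully left the ring"
  echo "ssh $host rm -rf /var/lib/cassandra/data /var/lib/cassandra/commitlog /var/lib/cassandra/saved_caches"
  echo "ssh $host vi /etc/cassandra/cassandra.yaml   # remove $host from its own seeds list"
  echo "ssh $host service cassandra start     # node now bootstraps normally"
  planned=$((planned + 1))
done
echo "nodes to fix: $planned"
```

Doing this one node at a time matters: each node must finish streaming and rejoin fully before the next one is decommissioned, or you risk being under-replicated.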
Re: Many pending compactions
Are you running repairs within gc_grace_seconds? (default is 10 days) http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_repair_nodes_c.html Double check if you set cold_reads_to_omit to 0.0 on tables with STCS that you do not read often. Are you using default values for the properties min_compaction_threshold (4) and max_compaction_threshold (32)? Which Consistency Level are you using for read operations? Check that you are not reading from DC_B due to your Replication Factor and CL. http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/dml_config_consistency_c.html Cheers, Roni Balthazar

On Wed, Feb 18, 2015 at 11:07 AM, Ja Sam ptrstp...@gmail.com wrote: I don't have problems with DC_B (replica); only in DC_A (my system writes only to it) do I have read timeouts. I checked the SSTable count in OpsCenter and I have: 1) in DC_A about the same +-10% for the last week, a small increase over the last 24h (it is more than 15000-2 SSTables depending on the node) 2) in DC_B the last 24h shows up to a 50% decrease, which is a good prognosis. Now I have fewer than 1000 SSTables. What did you measure during system optimizations? Or do you have an idea what more I should check? 1) I look at CPU idle (one node is 50% idle, the rest 70% idle) 2) Disk queue - mostly it is near zero: avg 0.09. Sometimes there are spikes. 3) System RAM usage is almost full. 4) In Total Bytes Compacted most lines are below 3MB/s. For DC_A the total is less than 10MB/s; in DC_B it looks much better (avg is about 17MB/s). Anything else?

On Wed, Feb 18, 2015 at 1:32 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, You can check if the number of SSTables is decreasing. Look for the SSTable count information for your tables using nodetool cfstats. The compaction history can be viewed using nodetool compactionhistory. About the timeouts, check this out: http://www.datastax.com/dev/blog/how-cassandra-deals-with-replica-failure Also try running nodetool tpstats to see the thread pool statistics.
It can tell you whether you are having performance problems. If you have too many pending tasks or dropped messages, you may need to tune your system (eg: driver timeouts, concurrent reads and so on). Regards, Roni Balthazar

On Wed, Feb 18, 2015 at 9:51 AM, Ja Sam ptrstp...@gmail.com wrote: Hi, Thanks for your tip. It looks like something changed - I still don't know if it is ok. My nodes started to do more compaction, but it looks like some compactions are really slow. IO is idle, CPU is quite ok (30%-40%). We set compactionthroughput to 999, but I do not see a difference. Can we check something more? Or do you have any method to monitor progress with small files? Regards

On Tue, Feb 17, 2015 at 2:43 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, Yes... I had the same issue and setting cold_reads_to_omit to 0.0 was the solution... The number of SSTables decreased from many thousands to a number below a hundred, and the SSTables are now much bigger, several gigabytes each (most of them). Cheers, Roni Balthazar

On Tue, Feb 17, 2015 at 11:32 AM, Ja Sam ptrstp...@gmail.com wrote: After some diagnostics (we didn't set cold_reads_to_omit yet): Compactions are running but VERY slowly, with idle IO. We have a lot of Data files in Cassandra. In DC_A it is about ~12 (only xxx-Data.db); DC_B has only ~4000. I don't know if this changes anything, but: 1) in DC_A the avg size of a Data.db file is ~13 MB. I have a few really big ones, but most are really small (almost 1 files are less than 100 MB). 2) in DC_B the avg size of a Data.db file is much bigger, ~260 MB. Do you think the above flag will help us?

On Tue, Feb 17, 2015 at 9:04 AM, Ja Sam ptrstp...@gmail.com wrote: I set setcompactionthroughput 999 permanently and it doesn't change anything. IO is still the same. CPU is idle.

On Tue, Feb 17, 2015 at 1:15 AM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, You can run nodetool compactionstats to view statistics on compactions.
Setting cold_reads_to_omit to 0.0 can help reduce the number of SSTables when you use Size-Tiered compaction. You can also create a cron job to increase the value of setcompactionthroughput during the night or when your IO is not busy. From http://wiki.apache.org/cassandra/NodeTool:

0 0 * * * root nodetool -h `hostname` setcompactionthroughput 999
0 6 * * * root nodetool -h `hostname` setcompactionthroughput 16

Cheers, Roni Balthazar

On Mon, Feb 16, 2015 at 7:47 PM, Ja Sam ptrstp...@gmail.com wrote: One thing I do not understand: in my case compaction is running permanently. Is there a way to check which compaction is pending? The only information available is the total count. On Monday, February 16, 2015, Ja Sam ptrstp...@gmail.com wrote:
Re: Many pending compactions
You are right... Repair makes the data consistent between nodes. I understand that you have 2 issues going on. You need to run repair periodically without errors, and you need to decrease the number of pending compactions. So I suggest: 1) Run repair -pr on all nodes. If you upgrade to the new 2.1.3, you can use incremental repairs. There were some bugs in 2.1.2. 2) Run cleanup on all nodes. 3) Since you have too many cold SSTables, set cold_reads_to_omit to 0.0, and increase setcompactionthroughput for some time and see if the number of SSTables goes down. Let us know what errors you are getting when running repairs. Regards, Roni Balthazar

On Wed, Feb 18, 2015 at 1:31 PM, Ja Sam ptrstp...@gmail.com wrote: Can you explain to me what the correlation between growing SSTables and repair is? I was sure, until your mail, that repair is only to make data consistent between nodes. Regards

On Wed, Feb 18, 2015 at 4:20 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Which error are you getting when running repairs? You need to run repair on your nodes within gc_grace_seconds (eg: weekly). They have data that is not read frequently. You can run repair -pr on all nodes. Since you do not have deletes, you will not have trouble with that. If you have deletes, it's better to increase gc_grace_seconds before the repair. http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_repair_nodes_c.html After the repair, try running a nodetool cleanup. Check if the number of SSTables goes down after that... Pending compactions must decrease as well... Cheers, Roni Balthazar

On Wed, Feb 18, 2015 at 12:39 PM, Ja Sam ptrstp...@gmail.com wrote: 1) We tried to run repairs but they usually do not succeed. But we had Leveled compaction before. Last week we ALTERed the tables to STCS, because the guys from DataStax suggested that we should not use Leveled and should alter the tables to STCS, because we don't have SSDs. After this change we did not run any repair.
Anyway, I don't think it will change anything in the SSTable count - if I am wrong, please let me know. 2) I did this. My tables are 99% write-only. It is an audit system. 3) Yes, I am using default values. 4) In both operations I am using LOCAL_QUORUM. I am almost sure that the READ timeouts happen because of too many SSTables. Anyway, first I would like to fix the too many pending compactions. I still don't know how to speed them up.

On Wed, Feb 18, 2015 at 2:49 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Are you running repairs within gc_grace_seconds? (default is 10 days) http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_repair_nodes_c.html Double check if you set cold_reads_to_omit to 0.0 on tables with STCS that you do not read often. Are you using default values for the properties min_compaction_threshold (4) and max_compaction_threshold (32)? Which Consistency Level are you using for read operations? Check that you are not reading from DC_B due to your Replication Factor and CL. http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/dml_config_consistency_c.html Cheers, Roni Balthazar

On Wed, Feb 18, 2015 at 11:07 AM, Ja Sam ptrstp...@gmail.com wrote: I don't have problems with DC_B (replica); only in DC_A (my system writes only to it) do I have read timeouts. I checked the SSTable count in OpsCenter and I have: 1) in DC_A about the same +-10% for the last week, a small increase over the last 24h (it is more than 15000-2 SSTables depending on the node) 2) in DC_B the last 24h shows up to a 50% decrease, which is a good prognosis. Now I have fewer than 1000 SSTables. What did you measure during system optimizations? Or do you have an idea what more I should check? 1) I look at CPU idle (one node is 50% idle, the rest 70% idle) 2) Disk queue - mostly it is near zero: avg 0.09. Sometimes there are spikes. 3) System RAM usage is almost full. 4) In Total Bytes Compacted most lines are below 3MB/s.
For DC_A the total is less than 10MB/s; in DC_B it looks much better (avg is about 17MB/s). Anything else?

On Wed, Feb 18, 2015 at 1:32 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, You can check if the number of SSTables is decreasing. Look for the SSTable count information for your tables using nodetool cfstats. The compaction history can be viewed using nodetool compactionhistory. About the timeouts, check this out: http://www.datastax.com/dev/blog/how-cassandra-deals-with-replica-failure Also try running nodetool tpstats to see the thread pool statistics. It can tell you whether you are having performance problems. If you have too many pending tasks or dropped messages, you may need to tune your system (eg: driver timeouts, concurrent reads and so on). Regards, Roni Balthazar
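The rolling "repair -pr" that Roni recommends across all nodes can be scripted. A minimal sketch with placeholder hostnames, written in dry-run mode so it only prints the commands; flip DRY_RUN to 0 to actually run them one node at a time:

```shell
# Rolling primary-range repair, one node at a time, as suggested in the thread.
# NODES is a placeholder list; set DRY_RUN=0 to actually run the commands.
DRY_RUN=1
NODES="node1 node2 node3"
for host in $NODES; do
  cmd="nodetool -h $host repair -pr"
  if [ "$DRY_RUN" -eq 1 ]; then
    echo "$cmd"
  else
    $cmd || { echo "repair failed on $host" >&2; exit 1; }
  fi
done
```

Because -pr repairs only each node's primary range, the loop must cover every node for the whole ring to be repaired; stopping on the first failure (as above) makes it easy to see which node's repair needs investigating.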
Re: Many pending compactions
Hi, You can check if the number of SSTables is decreasing. Look for the SSTable count information for your tables using nodetool cfstats. The compaction history can be viewed using nodetool compactionhistory. About the timeouts, check this out: http://www.datastax.com/dev/blog/how-cassandra-deals-with-replica-failure Also try running nodetool tpstats to see the thread pool statistics. It can tell you whether you are having performance problems. If you have too many pending tasks or dropped messages, you may need to tune your system (eg: driver timeouts, concurrent reads and so on). Regards, Roni Balthazar

On Wed, Feb 18, 2015 at 9:51 AM, Ja Sam ptrstp...@gmail.com wrote: Hi, Thanks for your tip. It looks like something changed - I still don't know if it is ok. My nodes started to do more compaction, but it looks like some compactions are really slow. IO is idle, CPU is quite ok (30%-40%). We set compactionthroughput to 999, but I do not see a difference. Can we check something more? Or do you have any method to monitor progress with small files? Regards

On Tue, Feb 17, 2015 at 2:43 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, Yes... I had the same issue and setting cold_reads_to_omit to 0.0 was the solution... The number of SSTables decreased from many thousands to a number below a hundred, and the SSTables are now much bigger, several gigabytes each (most of them). Cheers, Roni Balthazar

On Tue, Feb 17, 2015 at 11:32 AM, Ja Sam ptrstp...@gmail.com wrote: After some diagnostics (we didn't set cold_reads_to_omit yet): Compactions are running but VERY slowly, with idle IO. We have a lot of Data files in Cassandra. In DC_A it is about ~12 (only xxx-Data.db); DC_B has only ~4000. I don't know if this changes anything, but: 1) in DC_A the avg size of a Data.db file is ~13 MB. I have a few really big ones, but most are really small (almost 1 files are less than 100 MB). 2) in DC_B the avg size of a Data.db file is much bigger, ~260 MB.
Do you think the above flag will help us?

On Tue, Feb 17, 2015 at 9:04 AM, Ja Sam ptrstp...@gmail.com wrote: I set setcompactionthroughput 999 permanently and it doesn't change anything. IO is still the same. CPU is idle.

On Tue, Feb 17, 2015 at 1:15 AM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, You can run nodetool compactionstats to view statistics on compactions. Setting cold_reads_to_omit to 0.0 can help reduce the number of SSTables when you use Size-Tiered compaction. You can also create a cron job to increase the value of setcompactionthroughput during the night or when your IO is not busy. From http://wiki.apache.org/cassandra/NodeTool:

0 0 * * * root nodetool -h `hostname` setcompactionthroughput 999
0 6 * * * root nodetool -h `hostname` setcompactionthroughput 16

Cheers, Roni Balthazar

On Mon, Feb 16, 2015 at 7:47 PM, Ja Sam ptrstp...@gmail.com wrote: One thing I do not understand: in my case compaction is running permanently. Is there a way to check which compaction is pending? The only information available is the total count.

On Monday, February 16, 2015, Ja Sam ptrstp...@gmail.com wrote: Of course I made a mistake. I am using 2.1.2. Anyway, a nightly build is available from http://cassci.datastax.com/job/cassandra-2.1/ I read about cold_reads_to_omit. It looks promising. Should I also set compaction throughput? p.s. I am really sad that I didn't read this before: https://engineering.eventbrite.com/what-version-of-cassandra-should-i-run/

On Monday, February 16, 2015, Carlos Rolo r...@pythian.com wrote: Hi, 100% in agreement with Roland, the 2.1.x series is a pain! I would never recommend the current 2.1.x series for production. Clocks are a pain, and check your connectivity! Also check tpstats to see if your threadpools are being overrun.
Regards, Carlos Juzarte Rolo Cassandra Consultant Pythian - Love your data rolo@pythian | Twitter: cjrolo | Linkedin: linkedin.com/in/carlosjuzarterolo Tel: 1649 www.pythian.com

On Mon, Feb 16, 2015 at 8:12 PM, Roland Etzenhammer r.etzenham...@t-online.de wrote: Hi, "1) Actual Cassandra 2.1.3, it was upgraded from 2.1.0 (suggested by Al Tobey from DataStax)" "7) minimal reads (usually none, sometimes few)" Those two points keep me repeating an answer I got. First, where did you get 2.1.3 from? Maybe I missed it, I will have a look. But if it is 2.1.2, which is the latest released version, that version has many bugs - most of them I got kicked by while testing 2.1.2. I got many problems with compactions not being triggered on column families not being read, and compactions and repairs not being completed. See
Re: Many pending compactions
Which error are you getting when running repairs? You need to run repair on your nodes within gc_grace_seconds (eg: weekly). They have data that is not read frequently. You can run repair -pr on all nodes. Since you do not have deletes, you will not have trouble with that. If you have deletes, it's better to increase gc_grace_seconds before the repair. http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_repair_nodes_c.html After the repair, try running a nodetool cleanup. Check if the number of SSTables goes down after that... Pending compactions must decrease as well... Cheers, Roni Balthazar

On Wed, Feb 18, 2015 at 12:39 PM, Ja Sam ptrstp...@gmail.com wrote: 1) We tried to run repairs but they usually do not succeed. But we had Leveled compaction before. Last week we ALTERed the tables to STCS, because the guys from DataStax suggested that we should not use Leveled and should alter the tables to STCS, because we don't have SSDs. After this change we did not run any repair. Anyway, I don't think it will change anything in the SSTable count - if I am wrong, please let me know. 2) I did this. My tables are 99% write-only. It is an audit system. 3) Yes, I am using default values. 4) In both operations I am using LOCAL_QUORUM. I am almost sure that the READ timeouts happen because of too many SSTables. Anyway, first I would like to fix the too many pending compactions. I still don't know how to speed them up.

On Wed, Feb 18, 2015 at 2:49 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Are you running repairs within gc_grace_seconds? (default is 10 days) http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_repair_nodes_c.html Double check if you set cold_reads_to_omit to 0.0 on tables with STCS that you do not read often. Are you using default values for the properties min_compaction_threshold (4) and max_compaction_threshold (32)? Which Consistency Level are you using for read operations?
Check that you are not reading from DC_B due to your Replication Factor and CL. http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/dml_config_consistency_c.html Cheers, Roni Balthazar

On Wed, Feb 18, 2015 at 11:07 AM, Ja Sam ptrstp...@gmail.com wrote: I don't have problems with DC_B (replica); only in DC_A (my system writes only to it) do I have read timeouts. I checked the SSTable count in OpsCenter and I have: 1) in DC_A about the same +-10% for the last week, a small increase over the last 24h (it is more than 15000-2 SSTables depending on the node) 2) in DC_B the last 24h shows up to a 50% decrease, which is a good prognosis. Now I have fewer than 1000 SSTables. What did you measure during system optimizations? Or do you have an idea what more I should check? 1) I look at CPU idle (one node is 50% idle, the rest 70% idle) 2) Disk queue - mostly it is near zero: avg 0.09. Sometimes there are spikes. 3) System RAM usage is almost full. 4) In Total Bytes Compacted most lines are below 3MB/s. For DC_A the total is less than 10MB/s; in DC_B it looks much better (avg is about 17MB/s). Anything else?

On Wed, Feb 18, 2015 at 1:32 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, You can check if the number of SSTables is decreasing. Look for the SSTable count information for your tables using nodetool cfstats. The compaction history can be viewed using nodetool compactionhistory. About the timeouts, check this out: http://www.datastax.com/dev/blog/how-cassandra-deals-with-replica-failure Also try running nodetool tpstats to see the thread pool statistics. It can tell you whether you are having performance problems. If you have too many pending tasks or dropped messages, you may need to tune your system (eg: driver timeouts, concurrent reads and so on). Regards, Roni Balthazar

On Wed, Feb 18, 2015 at 9:51 AM, Ja Sam ptrstp...@gmail.com wrote: Hi, Thanks for your tip. It looks like something changed - I still don't know if it is ok.
My nodes started to do more compaction, but it looks like some compactions are really slow. IO is idle, CPU is quite ok (30%-40%). We set compactionthroughput to 999, but I do not see a difference. Can we check something more? Or do you have any method to monitor progress with small files? Regards

On Tue, Feb 17, 2015 at 2:43 PM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi, Yes... I had the same issue and setting cold_reads_to_omit to 0.0 was the solution... The number of SSTables decreased from many thousands to a number below a hundred, and the SSTables are now much bigger, several gigabytes each (most of them). Cheers, Roni Balthazar

On Tue, Feb 17, 2015 at 11:32 AM, Ja Sam ptrstp...@gmail.com wrote: After some diagnostics (we didn't set cold_reads_to_omit yet): Compactions are running but VERY slow
RE: Data tiered compaction and data model question
What is the maximum number of events that you expect in a day? What is the worst-case scenario? Mohammed

From: cass savy [mailto:casss...@gmail.com] Sent: Wednesday, February 18, 2015 4:21 PM To: user@cassandra.apache.org Subject: Data tiered compaction and data model question

We want to track events in a log CF/table and should be able to query for events that occurred in a range of minutes or hours for a given day. Multiple events can occur in a given minute. I listed 2 table designs and am leaning towards Table 1 to avoid a large wide row. Please advise.

Table 1: not a very wide row; still able to query for a range of minutes for a given day, and/or a given day and a range of hours:

Create table log_Event ( event_day text, event_hr int, event_time timeuuid, data text, PRIMARY KEY ( (event_day,event_hr), event_time ) )

Table 2: this will be a very wide row:

Create table log_Event ( event_day text, event_time timeuuid, data text, PRIMARY KEY ( event_day, event_time ) )

Date-tiered compaction: recommended for time series data as per the doc below. Our data will be kept only for 30 days, hence the thought of using this compaction strategy. http://www.datastax.com/dev/blog/datetieredcompactionstrategy I created table 1 listed above with this compaction strategy, added some rows and did a manual flush. I do not see any sstables created yet. Is that expected? compaction={'max_sstable_age_days': '1', 'class': 'DateTieredCompactionStrategy'}
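For reference, Table 1 written out as a single statement with the compaction options quoted at the end of the question (the option values are the ones from the original post, not a recommendation):

```cql
-- Table 1: partition key (event_day, event_hr) bounds partition width;
-- event_time clusters events within each hour.
CREATE TABLE log_event (
    event_day  text,
    event_hr   int,
    event_time timeuuid,
    data       text,
    PRIMARY KEY ((event_day, event_hr), event_time)
) WITH compaction = {
    'class': 'DateTieredCompactionStrategy',
    'max_sstable_age_days': '1'
};
```

With a fixed 30-day retention, the linked DateTieredCompactionStrategy blog post is worth reading before settling on max_sstable_age_days, since DTCS is designed to pair with time-ordered, TTL-style expiry.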
Logging client ID for YCSB workloads on Cassandra?
Hi, I'd like to log the client ID for every operation performed by YCSB on my Cassandra cluster. The purpose is to identify and analyze consistency measures other than eventual consistency. I wanted to know if people have done something similar in the past. Or am I missing something really basic here? Please let me know if you need more information. Thanks — Jatin Ganhotra
Data tiered compaction and data model question
We want to track events in a log CF/table and should be able to query for events that occurred in a range of minutes or hours for a given day. Multiple events can occur in a given minute. I listed 2 table designs and am leaning towards Table 1 to avoid a large wide row. Please advise.

Table 1: not a very wide row; still able to query for a range of minutes for a given day, and/or a given day and a range of hours:

Create table log_Event ( event_day text, event_hr int, event_time timeuuid, data text, PRIMARY KEY ( (event_day,event_hr), event_time ) )

Table 2: this will be a very wide row:

Create table log_Event ( event_day text, event_time timeuuid, data text, PRIMARY KEY ( event_day, event_time ) )

Date-tiered compaction: recommended for time series data as per the doc below. Our data will be kept only for 30 days, hence the thought of using this compaction strategy. http://www.datastax.com/dev/blog/datetieredcompactionstrategy I created table 1 listed above with this compaction strategy, added some rows and did a manual flush. I do not see any sstables created yet. Is that expected? compaction={'max_sstable_age_days': '1', 'class': 'DateTieredCompactionStrategy'}
Re: run cassandra on a small instance
2.1.2 is IMO broken and should not be used for any purpose. Use 2.1.1 or 2.1.3. https://engineering.eventbrite.com/what-version-of-cassandra-should-i-run/ =Rob

Cool man. Thanks for the info. I just upgraded to 2.1.3. We'll see how that goes. I can let you know more once it's been running for a while. Thanks, Tim

On Wed, Feb 18, 2015 at 8:16 PM, Robert Coli rc...@eventbrite.com wrote: On Wed, Feb 18, 2015 at 5:09 PM, Tim Dunphy bluethu...@gmail.com wrote: I'm attempting to run Cassandra 2.1.2 on a smallish 2 GB RAM instance over at Digital Ocean. It's a CentOS 7 host. 2.1.2 is IMO broken and should not be used for any purpose. Use 2.1.1 or 2.1.3. https://engineering.eventbrite.com/what-version-of-cassandra-should-i-run/ =Rob -- GPG me!! gpg --keyserver pool.sks-keyservers.net --recv-keys F186197B
Re: run cassandra on a small instance
Robert, Let me know if I’m off base about this—but I feel like I see a lot of posts that are like this (i.e., use this arbitrary version, not this other arbitrary version). Why are releases going out if they’re “broken”? This seems like a very confusing way for new (and existing) users to approach versions... Andrew On February 18, 2015 at 5:16:27 PM, Robert Coli (rc...@eventbrite.com) wrote: On Wed, Feb 18, 2015 at 5:09 PM, Tim Dunphy bluethu...@gmail.com wrote: I'm attempting to run Cassandra 2.1.2 on a smallish 2.GB ram instance over at Digital Ocean. It's a CentOS 7 host. 2.1.2 is IMO broken and should not be used for any purpose. Use 2.1.1 or 2.1.3. https://engineering.eventbrite.com/what-version-of-cassandra-should-i-run/ =Rob