That's basically the approach I want to take in
https://issues.apache.org/jira/browse/CASSANDRA-2427.

On Thu, May 19, 2011 at 12:00 PM, Mike Malone <m...@simplegeo.com> wrote:
> Just noticed this thread and figured I'd chime in since we've had similar
> issues with the commit log growing too large on our clusters. Tuning down
> the flush timeout wasn't really an acceptable solution for us since we
> didn't want to be constantly flushing and generating extra SSTables for no
> reason. So we wrote a small tool that we start in a static block in
> CassandraServer that periodically checks the commit log size and flushes all
> memtables if they're above some threshold.
> I've attached that code. Any feedback / improvements are more than welcome.
>
> Mike
>
> On Thu, May 12, 2011 at 11:30 AM, Sanjeev Kulkarni <sanj...@locomatix.com>
> wrote:
>>
>> Hey guys,
>> I have a ec2 debian cluster consisting of several nodes running 0.7.5 on
>> ephimeral disks.
>> These are fresh installs and not upgrades.
>> The commitlog is set to the smaller of the disks which is around 10G in
>> size and the datadir is set to the bigger disk.
>> The config file is basically the same as the one supplied by the default
>> installation.
>> Our applications write to the cluster. After about a day of writing we
>> started noticing the commitlog disk filling up. Soon we went over the disk
>> limit and writes started failing. At this point we stopped the cluster.
>> Over the course of the day we inserted around 25G of data. Our columns
>> values are pretty small.
>> I understand that cassandra periodically cleans up the commitlog
>> directories by generating sstables in datadir. Is there any way to speed up
>> this movement from commitog to datadir?
>> Thanks!
>
>



-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com

Reply via email to