Scrub a single SSTable only?

2018-09-10 Thread Steinmaurer, Thomas
Hello, is there a way to Online scrub a particular SSTable file only and not the entire column family? According to the Cassandra logs we have a corrupted SSTable smallish compared to the entire data volume of the column family in question. To my understanding, both, nodetool scrub and sstable

Re: Default Single DataCenter -> Multi DataCenter

2018-09-10 Thread Eunsu Kim
Hello Thank you for your responses. I’ll share my adding datacenter plan. If you see problems, please respond. The sentence may be a little awkward because I am so poor at English that I am being helped by a translator. I've been most frequently referred to.(https://medium.com/p/465e9bf28d99

Re: Using CDC Feature to Stream C* to Kafka (Design Proposal)

2018-09-10 Thread Jeff Jirsa
On Sun, Sep 9, 2018 at 6:09 AM Jonathan Haddad wrote: > I'll be honest, I'm having a hard time wrapping my head around an > architecture where you use CDC to push data into Kafka. I've worked on > plenty of systems that use Kafka as a means of communication, and one of > the consumers is a proce

Re: Using CDC Feature to Stream C* to Kafka (Design Proposal)

2018-09-10 Thread DuyHai Doan
Also using Calvin means having to implement a distributed monotonic sequence as a primitive, not trivial at all ... On Mon, Sep 10, 2018 at 3:08 PM, Rahul Singh wrote: > In response to mimicking Advanced replication in DSE. I understand the > goal. Although DSE advanced replication does one way,

Re: Drop TTLd rows: upgradesstables -a or scrub?

2018-09-10 Thread Jeff Jirsa
How much free space do you have, and how big is the table? Switching to LCS is another option. -- Jeff Jirsa > On Sep 10, 2018, at 12:09 PM, Oleksandr Shulgin > wrote: > >> On Mon, 10 Sep 2018, 19:40 Jeff Jirsa, wrote: >> I think it's important to describe exactly what's going on for peop

Re: Drop TTLd rows: upgradesstables -a or scrub?

2018-09-10 Thread Oleksandr Shulgin
On Mon, 10 Sep 2018, 19:40 Jeff Jirsa, wrote: > I think it's important to describe exactly what's going on for people who > just read the list but who don't have context. This blog does a really good > job: > http://thelastpickle.com/blog/2016/07/27/about-deletes-and-tombstones.html > , but brief

Re: Drop TTLd rows: upgradesstables -a or scrub?

2018-09-10 Thread Oleksandr Shulgin
On Mon, 10 Sep 2018, 19:29 Charulata Sharma (charshar), wrote: > Scrub takes a very long time and does not remove the tombstones. > Charu, Why is that if the documentation clearly says it does? > should do garbage cleaning. It immediately removes the tombstones. > If you mean 'nodetool garbagec

Re: Drop TTLd rows: upgradesstables -a or scrub?

2018-09-10 Thread Jeff Jirsa
I think it's important to describe exactly what's going on for people who just read the list but who don't have context. This blog does a really good job: http://thelastpickle.com/blog/2016/07/27/about-deletes-and-tombstones.html , but briefly: - When a TTL expires, we treat it as a tombstone, bec

Re: Drop TTLd rows: upgradesstables -a or scrub?

2018-09-10 Thread Charulata Sharma (charshar)
Scrub takes a very long time and does not remove the tombstones. You should do garbage cleaning. It immediately removes the tombstones. Thaks, Charu From: Oleksandr Shulgin Reply-To: "user@cassandra.apache.org" Date: Monday, September 10, 2018 at 6:53 AM To: "user@cassandra.apache.org" Subjec

Drop TTLd rows: upgradesstables -a or scrub?

2018-09-10 Thread Oleksandr Shulgin
Hello, We have some tables with significant amount of TTLd rows that have expired by now (and more gc_grace_seconds have passed since the TTL). We have stopped writing more data to these tables quite a while ago, so background compaction isn't running. The compaction strategy is the default Size

Re: Default Single DataCenter -> Multi DataCenter

2018-09-10 Thread Pradeep Chhetri
Hello Eunsu, I am going through the same exercise at my job. I was making notes as i was testing the steps in my preproduction environment. Although I haven't tested end to end but hopefully this might help you: https://medium.com/p/465e9bf28d99 Regards, Pradeep On Mon, Sep 10, 2018 at 5:59 PM,

Re: Regarding migrating data from Oracle to Cassandra.migrate data from Oracle to Cassandra.

2018-09-10 Thread Rahul Singh
Look into Kafka Connect. It does tracking internally in a topic. Works better going from relational to Cassandra. Still won’t fix your potential data model issue related to skew and wide partitions. Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Sui

Re: Using CDC Feature to Stream C* to Kafka (Design Proposal)

2018-09-10 Thread Rahul Singh
In response to mimicking Advanced replication in DSE. I understand the goal. Although DSE advanced replication does one way, those are use cases with limited value to me because ultimately it’s still a master slave design. I’m working on a prototype for this for two way replication between clust

Re: Using CDC Feature to Stream C* to Kafka (Design Proposal)

2018-09-10 Thread Rahul Singh
Not everyone has it their way like Frank Sinatra. Due to various reasons, folks need to get the changes in Cassandra to be duplicated to a topic for further processing - especially if the new system owner doesn’t own the whole platform. There are various ways to do this but you have to deal with

Re: node replacement failed

2018-09-10 Thread onmstester onmstester
Thanks Alain, First here it is more detail about my cluster: 10 racks + 3 nodes on each rack nodetool status: shows 27 nodes UN and 3 nodes all related to single rack as DN version 3.11.2 Option 1: (Change schema and) use replace method (preferred method) * Did you try to have the replace going,

Re: Default Single DataCenter -> Multi DataCenter

2018-09-10 Thread Alain RODRIGUEZ
Adding a data center for the first time is a bit tricky when you haven't been considering it from the start. I operate 5 nodes cluster (3.11.0) in a single data center with > SimpleSnitch, SimpleStrategy and all client policy RoundRobin. > You will need: - To change clients, make them 'DCAware'.

Re: node replacement failed

2018-09-10 Thread Alain RODRIGUEZ
Hello, I am sorry it took us (the community) more than a day to answer to this rather critical situation. That being said, my recommendation at this point would be for you to make sure about the impacts of whatever you would try. Working on a broken cluster, as an emergency might lead you to a sec

Default Single DataCenter -> Multi DataCenter

2018-09-10 Thread Eunsu Kim
Hello everyone I operate 5 nodes cluster (3.11.0) in a single data center with SimpleSnitch, SimpleStrategy and all client policy RoundRobin. At this point, I am going to create clusters of the same size in different data centers. I think these two documents are appropriate, but there is confu

Re: node replacement failed

2018-09-10 Thread onmstester onmstester
Any idea? Sent using Zoho Mail On Sun, 09 Sep 2018 11:23:17 +0430  onmstester onmstester wrote Hi, Cluster Spec: 30 nodes RF = 2 NetworkTopologyStrategy GossipingPropertyFileSnitch + rack aware Suddenly i lost all disks of cassandar-data on one of my racks, after replacing the disks,

Re: Using CDC Feature to Stream C* to Kafka (Design Proposal)

2018-09-10 Thread Dinesh Joshi
> On Sep 9, 2018, at 6:08 AM, Jonathan Haddad > wrote: > > There may be some use cases for it.. but I'm not sure what they are. It > might help if you shared the use cases where the extra complexity is > required? When does writing to Cassandra which then dedupes an