Re: Adding Options to Create Statements...

2016-04-01 Thread Robert Coli
On Fri, Apr 1, 2016 at 10:43 AM, James Carman wrote: > A, my bad. One might wonder why the heck the Java driver is "owned" > by an outside entity, eh? > FWIW, the status quo prior to the Datastax drivers was a wide assortment of non-compatible drivers in different languages, not one set of

Re: Consistency Level (QUORUM vs LOCAL_QUORUM)

2016-03-31 Thread Robert Coli
On Thu, Mar 31, 2016 at 4:35 AM, Alain RODRIGUEZ wrote: > My understanding is using RF 3 and LOCAL_QUORUM for both reads and writes > will provide a strong consistency and a high availability. One node can go > down and also without lowering the consistency. Or RF = 5, Quorum = 3, > allowing 2 no

Re: Speeding up "nodetool rebuild"

2016-03-30 Thread Robert Coli
On Wed, Mar 30, 2016 at 1:44 PM, Anubhav Kale wrote: > Will changing compactionthroughput and streamingthroughput help with > reducing the “rebuild” time on a brand new node ? We will do it both on the > new node, and the nodes in source DC from where data is streamed. > streamingthroughput yes

Re: How is the coordinator node in LOCAL_QUORUM chosen?

2016-03-25 Thread Robert Coli
On Fri, Mar 25, 2016 at 1:04 PM, X. F. Li wrote: > Suppose I have replication factor 3. If one of the node fails, will > queries with ALL consistency fail if the queried partition is on the failed > node? Or would they continue to work with 2 replicas during the time while > cassandra is replicat

Re: Counter values become under-counted when running repair.

2016-03-24 Thread Robert Coli
On Thu, Mar 24, 2016 at 7:17 AM, Jack Krupansky wrote: > Can you advise us on your thinking when you selected RF=2? > I figure he was probably thinking "I want to operate in a bunch of different regions and don't need to use QUORUM for my use cases, and want to save money by not storing 3 copies

Re: Rack aware question.

2016-03-23 Thread Robert Coli
4209. > > > > So, few questions because things don’t add up: > > > >1. How come 127.0.0.1 is shown as an endpoint holding the ID when its >token range doesn’t contain it ? Does “nodetool ring” shows all >token-ranges for a node or just the primary range ? I am

Re: Rack aware question.

2016-03-23 Thread Robert Coli
On Wed, Mar 23, 2016 at 8:07 AM, Anubhav Kale wrote: > Suppose we change the racks on VMs on a running cluster. (We need to do > this while running on Azure, because sometimes when the VM gets moved its > rack changes). > > In this situation, new writes will be laid out based on new rack info on

Re: Large number of tombstones without delete or update

2016-03-23 Thread Robert Coli
On Wed, Mar 23, 2016 at 9:50 AM, Ralf Steppacher wrote: > How come I end up with that large a number of tombstones? > Are you inserting NULLs? =Rob

Re: Compaction Filter in Cassandra

2016-03-20 Thread Robert Coli
On Fri, Mar 11, 2016 at 10:05 PM, Dikang Gu wrote: > RocksDB has the feature called "Compaction Filter" to allow application to > modify/delete a key-value during the background compaction. > https://github.com/facebook/rocksdb/blob/v4.1/include/rocksdb/options.h#L201-L226 > > I'm wondering is th

Re: Read consistency

2016-03-19 Thread Robert Coli
On Tue, Mar 15, 2016 at 6:43 PM, Arko Provo Mukherjee < arkoprovomukher...@gmail.com> wrote: > I am designing a system where for a situation, I need to have SERIAL > consistency during writes. > Be sure to understand the implications of : https://issues.apache.org/jira/browse/CASSANDRA-9328 =Ro

Re: Deploy latest cassandra on top of datastax-ddc ?

2016-03-19 Thread Robert Coli
On Thu, Mar 17, 2016 at 10:38 PM, Mohamed Lrhazi < mohamed.lrh...@georgetown.edu> wrote: > Would simply overriding this one jar file do it? else could you please > share a procedure? > This seems like an odd thing to want to do. Why do you believe it is likely to work? =Rob

Re: Deploy latest cassandra on top of datastax-ddc ?

2016-03-18 Thread Robert Coli
On Fri, Mar 18, 2016 at 5:00 PM, Mohamed Lrhazi < mohamed.lrh...@georgetown.edu> wrote: > FYI.. for the curious.. what I did resulted in a cluster where i tested > these two things: > ... > I have no inclination to keep using such a setup.. just reporting the > experiment :) > FWIW, given the gen

Re: Deploy latest cassandra on top of datastax-ddc ?

2016-03-18 Thread Robert Coli
On Fri, Mar 18, 2016 at 2:18 PM, Mohamed Lrhazi < mohamed.lrh...@georgetown.edu> wrote: > So, after doing an ant build from the latest source... how would one > "install" or deploy cassandra? Could not find a document on the install > from source part... any pointers? All I find makes use of yum

Re: Cassandra Upgrade 3.0.x vs 3.x (Tick-Tock Release)

2016-03-14 Thread Robert Coli
On Mon, Mar 14, 2016 at 12:40 PM, Kathiresan S wrote: > We are planning for Cassandra upgrade in our production environment. > Which version of Cassandra is stable and is advised to upgrade to, at the > moment? > https://www.eventbrite.com/engineering/what-version-of-cassandra-should-i-run/ (IO

Re: Regarding cassandra-stress results

2016-03-14 Thread Robert Coli
On Mon, Mar 14, 2016 at 11:38 AM, Rajath Subramanyam wrote: > When cassandra-stress tool dumps the output at the end of the benchmarking > run, what is the unit of latency statistics ? > This is becoming a FAQ. Perhaps the docs for the tool (and/or the tool itself) should be modified to specify

Re: JMX liveSSTableCount

2016-03-11 Thread Robert Coli
On Fri, Mar 11, 2016 at 10:04 AM, Hazel Bobins wrote: > Does anyone know if the removal of the liveSSTableCount JMX attribute > from the 'org.apache.cassandra.db:type=Tables,keyspaces=' mbean was > intentional in 3.x? I can not see reference to its removal in any Jira etc > Yukim@IRC sends this

Re: ntpd clock sync

2016-03-10 Thread Robert Coli
On Wed, Mar 9, 2016 at 9:03 AM, K F wrote: > the clock is about 30 to 40 seconds behind. > If you don't want to get ntp working there, why not just... manually... set the clocks? =Rob

Re: How can I make Cassandra stable in a 2GB RAM node environment ?

2016-03-10 Thread Robert Coli
On Thu, Mar 10, 2016 at 3:27 AM, Alain RODRIGUEZ wrote: > So, like Jack, I globally really not recommend it unless you know what you > are doing and don't care about facing those issues. > Certainly a spectrum of views here, but everyone (including OP) seems to agree with the above. :D =Rob

Re: Unexplainably large reported partition sizes

2016-03-09 Thread Robert Coli
On Mon, Mar 7, 2016 at 1:25 PM, Nate McCall wrote: > >> Rob, can you remember which bug/jira this was? I have not been able to >> find it. >> I'm using 2.1.9. >> > > https://issues.apache.org/jira/browse/CASSANDRA-7953 > > Rob may have a different one, but I've something similar from this issue.

Re: How can I make Cassandra stable in a 2GB RAM node environment ?

2016-03-07 Thread Robert Coli
On Fri, Mar 4, 2016 at 8:27 PM, Jack Krupansky wrote: > Please review the minimum hardware requirements as clearly documented: > > http://docs.datastax.com/en/cassandra/3.x/cassandra/planning/planPlanningHardware.html > That is a document for Datastax Cassandra, not Apache Cassandra. It's wonder

Re: Unexplainably large reported partition sizes

2016-03-07 Thread Robert Coli
On Sat, Mar 5, 2016 at 9:16 AM, Tom van den Berge wrote: > I don't think compression can be the cause of the difference, because of > two reasons: > Your two reasons seem legitimate. Though you say you do not frequently do DELETE and so it shouldn't be due to tombstones, there are semi-recent v

Re: moving keyspaces to another disk while Cassandra is running

2016-03-07 Thread Robert Coli
On Mon, Mar 7, 2016 at 2:57 AM, Krzysztof Księżyk wrote: > I see on lsof output that even if keyspace > is not queried, Cassandra keeps files opened, so I guess it's not safe to > hotswap, but I'd like to make sure. > It is not safe for exactly this reason. Just restart your nodes. Were I doing

Re: Unexplainably large reported partition sizes

2016-03-04 Thread Robert Coli
On Fri, Mar 4, 2016 at 5:56 AM, Tom van den Berge wrote: > Compacting large partition > drillster/subscriberstats:rqtPewK-1chi0JSO595u-Q (1,470,058,292 bytes) > > This means that this single partition is about 1.4GB large. This is much > larger that it can possibly be, because of two reasons: >

Re: Broken links in Apache Cassandra home page

2016-03-02 Thread Robert Coli
On Wed, Mar 2, 2016 at 7:00 AM, Eric Evans wrote: > On Tue, Mar 1, 2016 at 8:30 PM, ANG ANG wrote: > > "#cassandra channel": http://freenode.net/ > > The latter, while not presently useful, links to a "coming soon..." > for Freenode. It might be pedantic to insist it's not broken, but I > don't

Re: Snitch for AWS EC2 nondefaultVPC

2016-03-02 Thread Robert Coli
On Wed, Mar 2, 2016 at 7:21 AM, Arun Sandu wrote: > > All the nodes in both datacenters are in DSE Search Mode(Solr). We may > have analytics datacenter as well in future. Will this have any impact in > using Ec2MultiRegionSnitch? > This list does not support DSE, but as I understand it, they cre

Re: Removing Node causes bunch of HostUnavailableException

2016-03-02 Thread Robert Coli
On Wed, Mar 2, 2016 at 8:10 AM, Peddi, Praveen wrote: > We have few dead nodes in the cluster (Amazon ASG removed those thinking > there is an issue with health). Now we are trying to remove those dead > nodes from the cluster so that other nodes can take over. As soon as I > execute nodetool rem

Re: Snitch for AWS EC2 nondefaultVPC

2016-03-01 Thread Robert Coli
On Tue, Mar 1, 2016 at 12:12 PM, Arun Sandu wrote: > All our nodes are launched in AWS EC2 VPC (private). We have 2 > datacenters(1 us-east , 1- asiapacific) and all communication is through > private IP's and don't have any public IPs. What is the recommended snitch > to be used? We currently ha

Re: List of List

2016-03-01 Thread Robert Coli
On Tue, Mar 1, 2016 at 3:23 PM, Jonathan Haddad wrote: > Thrift is deprecated, and will be removed in Cassandra 4.0 Don't do any > new development with it. > +infinity this. =Rob

Re: Handling uncommitted paxos state

2016-02-25 Thread Robert Coli
On Thu, Feb 25, 2016 at 1:23 AM, Nicholas Wilson < nicholas.wil...@realvnc.com> wrote: > If a WriteTimeoutException with WriteType.SIMPLE is thrown for a CAS > write, that means that the paxos phase was successful, but the data > couldn't be committed during the final 'commit/reset' phase. On the

Re: copy and rename sstable files as keyspace migration approach

2016-02-23 Thread Robert Coli
On Tue, Feb 23, 2016 at 6:44 AM, Jarod Guertin wrote: > Being fairly new to Cassandra, I'd like to run the following with the > experts to make sure it's an ok thing to do. > > We have a particular case where we have multiple keyspaces with multiple > tables each and we want to migrate to a new u

Re: Cassandra nodes reduce disks per node

2016-02-17 Thread Robert Coli
On Tue, Feb 16, 2016 at 11:29 PM, Anishek Agarwal wrote: > > To accomplish this can I just copy the data from disk1 to disk2 with in > the relevant cassandra home location folders, change the cassanda.yaml > configuration and restart the node. before starting i will shutdown the > cluster. > Yes.

Re: Sudden disk usage

2016-02-16 Thread Robert Coli
On Sat, Feb 13, 2016 at 4:30 PM, Branton Davis wrote: > We use SizeTieredCompaction. The nodes were about 67% full and we were > planning on adding new nodes (doubling the cluster to 6) soon. > Be sure to add those new nodes one at a time. Have you checked for, and cleared, old snapshots? Snap

Re: Restart Cassandra automatically

2016-02-09 Thread Robert Coli
On Tue, Feb 9, 2016 at 6:20 AM, wrote: > Call me naïve, but we do use an in-house built program for keeping nodes > started (based on a flag-check). The program is something that was written > for all kinds of daemon processes here, not Cassandra specifically. The > basic idea is that is runs a s

Re: Cassandra Collections performance issue

2016-02-08 Thread Robert Coli
On Mon, Feb 8, 2016 at 2:10 PM, Agrawal, Pratik wrote: > Recently we added one of the table fields from as Map in > *Cassandra > 2.1.11*. Currently we read every field from Map and overwrite map values. > Map is of size 3. We saw that writes are 30-40% slower while reads are > 70-80% slower. Ple

Re: Restart Cassandra automatically

2016-02-05 Thread Robert Coli
On Thu, Feb 4, 2016 at 8:26 PM, Debraj Manna wrote: > What is the best way to keep cassandra running? My requirement is if for > some reason cassandra stops then it should get started automatically. > I recommend against this mode of operation. When automatically restarting, you have no idea how

Re: "Not enough replicas available for query" after reboot

2016-02-04 Thread Robert Coli
On Thu, Feb 4, 2016 at 12:53 PM, Flavien Charlon wrote: > My cluster was running fine. I rebooted all three nodes (one by one), and > now all nodes are back up and running. "nodetool status" shows UP for all > three nodes on all three nodes: > > -- AddressLoad Tokens OwnsHost

Re: Want inputs about super column family vs map/list

2016-02-04 Thread Robert Coli
On Thu, Feb 4, 2016 at 2:37 AM, Bhuvan Rawal wrote: > 1. Using super column family: > Super columns have been not-recommended for use for about five years now. =Rob

Re: Re : Possibility of using 2 different snitches in the Multi_DC cluster

2016-02-02 Thread Robert Coli
On Tue, Feb 2, 2016 at 1:23 PM, sai krishnam raju potturi < pskraj...@gmail.com> wrote: > What is the possibility of using GossipingPropertFileSnitch on datacenters > in our private cloud, and Ec2MultiRegionSnitch in AWS? > You should just use GPFS everywhere. This is also the reason why you sho

Re: Session timeout

2016-01-29 Thread Robert Coli
On Fri, Jan 29, 2016 at 3:12 PM, Jack Krupansky wrote: > One last time, I'll simply renew my objection to the way you are abusing > this list. > FWIW, while I appreciate that OP (Oleg) is attempting to do a service for the community, I agree that the flood of single topic, context-lacking posts

Re: Rename Keyspace offline

2016-01-27 Thread Robert Coli
On Wed, Jan 27, 2016 at 6:49 AM, Jean Tremblay < jean.tremb...@zen-innovations.com> wrote: > Since it takes me 2 days to load my data, I was planning to load the new > set on a new keyspace (KS-Y), and when loaded drop KS-X and rename KS-Y to > KS-X. > Why bother with the rename? Just have two ke

Re: Strategy / order for upgradesstables during rolling upgrade.

2016-01-21 Thread Robert Coli
On Thu, Jan 21, 2016 at 11:37 AM, Kevin Burton wrote: > I think there are two strategies to upgradesstables after a release. > > We're doing a 2.0 to 2.1 upgrade (been procrastinating here). > > I think we can go with B below... Would you agree? > > Strategy A: > > - foreach server >

Re: Using cassandra a BLOB store / web cache.

2016-01-19 Thread Robert Coli
On Tue, Jan 19, 2016 at 2:07 PM, Richard L. Burton III wrote: > I would ask why do this over say HDFS, S3, etc. seems like this problem > has been solved with other solutions that are specifically designed for > blob storage? > HDFS's default block size is 64mb. If you are storing objects smalle

Re: Using cassandra a BLOB store / web cache.

2016-01-19 Thread Robert Coli
On Mon, Jan 18, 2016 at 6:52 PM, Kevin Burton wrote: > Internally we have the need for a blob store for web content. It's MOSTLY > key, ,value based but we'd like to have lookups by coarse grained tags. > I know you know how to operate and scale MySQL, so I suggest MogileFS for the actual blob

Re: max connection per user

2016-01-13 Thread Robert Coli
On Wed, Jan 13, 2016 at 1:41 PM, oleg yusim wrote: > Quick question, here: does Cassandra have a configuration switch to limit > number of connections per user (protection of DoS attack, security)? > Quick answer : no. =Rob

Re: Help debugging a very slow query

2016-01-13 Thread Robert Coli
On Wed, Jan 13, 2016 at 12:40 PM, Bryan Cheng wrote: > 1) What's up with the megapartition? What's the best way to debug this? > Our data model is largely write once, we don't do any updates. We do > DELETE, but the partitions that are giving us issues haven't been removed. > We had some suspicio

Re: Repair with "-pr" and vnodes

2016-01-12 Thread Robert Coli
On Tue, Jan 12, 2016 at 3:46 PM, Roman Tkachenko wrote: > The documentation for the "-pr" repair option says it repairs only the > first range returned by the partitioner. However, with vnodes a node owns a > lot of small ranges. > > Does that mean that if I run rolling "nodetool repair -pr" on t

Re: Cassandra 1.2.19 and Java 8

2016-01-12 Thread Robert Coli
On Tue, Jan 12, 2016 at 2:31 PM, Tim Heckman wrote: > We still have an installation of Cassandra on the 1.2.19 release, > running on Java 7. We do plan on upgrading to a newer version, but in > the mean time there has been some questions internally about running > 1.2 on Java 8 until the upgrade

Re: Sorting & pagination in apache cassandra 2.1

2016-01-12 Thread Robert Coli
On Mon, Jan 11, 2016 at 11:30 PM, anuja jain wrote: > 1 more question, what does it mean by "cassandra inherently sorts data"? > SSTable = Sorted Strings Table. It doesn't contain "Strings" anymore, really, but that's a hint.. :) =Rob

Re: Too many compactions, maybe keyspace system?

2016-01-12 Thread Robert Coli
On Mon, Jan 11, 2016 at 9:12 PM, Shuo Chen wrote: > I have a assumption that, lots of pending compaction tasks jam the memory > and raise full gc. The full chokes the process and slows down compaction. > And this causes more pending compaction tasks and more pressure on memory. > The question is

Re: Too many compactions, maybe keyspace system?

2016-01-11 Thread Robert Coli
The lines you are looking for look like this : INFO [CompactionExecutor:48] 2016-01-12 09:07:59,995 CompactionTask.java (line 120) Compacting [SSTableReader(path='/usr/local/cassandra/data/system/sstable_activity/system-sstable_activity-jb-4959-Data.db'), SSTableReader(path='/usr/local/cassandra/d

Re: Too many compactions, maybe keyspace system?

2016-01-11 Thread Robert Coli
On Sat, Jan 9, 2016 at 8:23 AM, Shuo Chen wrote: > I don't know what exactly compaction logs is like in system.log. But I see > logs like this in system.log, I think maybe this is the compaction log > grep -i compact /path/to/system.log =Rob

Re: Too many compactions, maybe keyspace system?

2016-01-08 Thread Robert Coli
On Thu, Jan 7, 2016 at 6:30 PM, Shuo Chen wrote:, > > I am using Cassandra 2.0.16 with 4 nodes and found too many compactions > for this cluster. This caused too much full gc and choked the system. I > have discussed the high gc in previous mails but didnot get the satisfied > answers. > > To clar

Re: Data rebalancing algorithm

2016-01-08 Thread Robert Coli
On Thu, Jan 7, 2016 at 11:42 PM, Jonathan Haddad wrote: > num_tokens is the number of tokens per node, not per cluster. > MySQL docs are enumerated in the manual along with their scope. I have seen the enumeration in the Cassandra docs, but I don't remember seeing scope there. Perhaps a useful s

Re: Revisit Cassandra EOL Policy

2016-01-08 Thread Robert Coli
On Fri, Jan 8, 2016 at 9:45 AM, Anuj Wadehra wrote: > *"Unfortunately how to get firm agreement on what criteria should be used > to judge "Production Ready" is unclear."* > Only you know how comfortable you are with the various types of risk that are involved in a particular version choice giv

Re: Revisit Cassandra EOL Policy

2016-01-07 Thread Robert Coli
On Wed, Jan 6, 2016 at 5:26 PM, Anuj Wadehra wrote: > I would like to understand how various users are dealing with the > situation. Are you upgrading Cassandra every 3-6 mths? How do you cut short > your planning,test and release cycles for Cassandra upgrades in your > application/products? > I

Re: CQL Composite Key Seen After Table Creation

2016-01-06 Thread Robert Coli
On Wed, Jan 6, 2016 at 12:54 PM, Chris Burroughs wrote: > I work with Amir and further experimentation I can shed a little more > light on what exactly is going on under the hood. For background our goal > is to take data that is currently being read and written to via thrift, > switch reads to

Re: Slow performance after upgrading from 2.0.9 to 2.1.11

2016-01-06 Thread Robert Coli
On Wed, Jan 6, 2016 at 11:49 AM, Peddi, Praveen wrote: > 2nd column is replication factor (RF). I have 2 rows for reads and 2 for > writes. First row is RF=1 and 2nd row is RF=3. So when I said increasing RF > , I meant from 1 to 3. Sorry the table is probably not clear. > Ah, I see now, I was m

Re: Slow performance after upgrading from 2.0.9 to 2.1.11

2016-01-06 Thread Robert Coli
On Wed, Jan 6, 2016 at 11:41 AM, Peddi, Praveen wrote: > We have upgraded Cassandra from 2.0.9 to 2.1.11 in our loadtest > environment with pretty much same yaml settings in both (removed unused > yaml settings and renamed few others) and we have noticed performance on > 2.1.11 is worse compared

Re: Node stuck when joining a Cassandra 2.2.0 cluster

2016-01-05 Thread Robert Coli
On Tue, Jan 5, 2016 at 3:01 AM, Herbert Fischer < herbert.fisc...@crossengage.io> wrote: > We run a small Cassandra 2.2.0 cluster, with 5 nodes, on bare-metal > servers and we are going to replace those nodes with other nodes. I planned > to add all the new nodes first, one-by-one, and later remov

Re: compaction_throughput_mb_per_sec

2016-01-05 Thread Robert Coli
On Tue, Jan 5, 2016 at 6:50 AM, Ken Hancock wrote: > As to why I think it's cluster-wide, here's what the documentation says: > Do you see "system" used in place of "cluster" anywhere else in the docs? I think you are correct that the docs should standardize on "system" instead of "node", becau

Re: cassandra full gc too often

2016-01-04 Thread Robert Coli
On Sun, Jan 3, 2016 at 5:54 PM, Shuo Chen wrote: > There are client operation in these days. Besides most columnfamily in the > cluster are supercolumnfamily created by cassandra-cli. Most rows have > average 30 sub-rows and each sub-row has 20 columns. > Supercolumns, especially pre-CQL impleme

Re: Upgrade from 2.0.9 to 2.2.4

2015-12-29 Thread Robert Coli
On Tue, Dec 29, 2015 at 2:31 PM, Mir Tanvir Hossain < mir.tanvir.hoss...@gmail.com> wrote: > Hello, is it possible to do a rolling upgrade from 2.0.9 to 2.2.4? > It is never supported to upgrade more than one major version of Cassandra. =Rob

Re: cassandra full gc too long

2015-12-28 Thread Robert Coli
On Mon, Dec 28, 2015 at 5:57 PM, xutom wrote: > I have 5 nodes in my C* cluster, and each node has the same > configuration file(Cassandra-env.sh: MAX_HEAP_SIZE="32G" and > HEAP_NEWSIZE="8G"), and My Cassandra version is 2.1.1. Now I want to > export all data of one table, i am using select

Re: CQL Composite Key Seen After Table Creation

2015-12-28 Thread Robert Coli
On Tue, Dec 22, 2015 at 6:55 AM, Amir Hassani wrote: > Defining the “column_metadata” after table creation vs during table > creation seems to affect the cql schema. We are running a large > high-throughput Cassandra 2.1.10 cluster. During table creation, the client > Cassandra-cli was used with

Re: Would data be lost by nodetool removenode force

2015-12-17 Thread Robert Coli
On Thu, Dec 17, 2015 at 4:44 PM, Shuo Chen wrote: >I have a 4 node cluster with status 3 UN and 1 DN. I am trying to add a > new node into the cluster but it is also dead. So the cluster is now 3 UN > and 2 DN. However I didnot run nodetool cleanup on any nodes. And just > several KBs of data

Re: can't make any permissions change in 2.2.4

2015-12-17 Thread Robert Coli
On Thu, Dec 17, 2015 at 9:10 AM, Kai Wang wrote: > I used to able to add/drop users and modify permissions in 2.1.1. After > upgrading to 2.2.4, I can't modify any of those. "List all permissions" > returns me all the permissions I setup before the upgrade. But I can't add > new permission or add

Re: Unable to start one Cassandra node: OutOfMemoryError

2015-12-15 Thread Robert Coli
On Tue, Dec 15, 2015 at 4:41 PM, Jack Krupansky wrote: > Can a core Cassandra committer verify if removing the compactions_in_progress > folder is indeed to desired and recommended solution to this problem, or > whether it might in fact be a bug that this workaround is needed at all? > Thanks! >

Re: Better setup to start using in production on one server

2015-12-15 Thread Robert Coli
On Tue, Dec 15, 2015 at 11:15 AM, Jonathan Haddad wrote: > If I had to choose between running 3x docker instances and 1x instance on > a single server, I'd choose the single one. Instead of dealing with RF > changing nonsense I'd just set up a 2nd data center w/ 3 nodes and move to > that when y

Re: Better setup to start using in production on one server

2015-12-15 Thread Robert Coli
On Mon, Dec 14, 2015 at 10:53 PM, Vladimir Prudnikov wrote: > Is it hard to start with 3 nodes on one server running in docker and then > just move 2 nodes to the separate servers? > FWIW, if you *absolutely knew* that you were going to need the scale and for some reason could not convince the m

Re: Better setup to start using in production on one server

2015-12-15 Thread Robert Coli
On Mon, Dec 14, 2015 at 10:53 PM, Vladimir Prudnikov wrote: > Save money. I don’t have huge enterprise behind me nor investor’s money on > my bank account. I just created an app and want to launch it and see if it > is what users will use and pay for. Once I get users using it I can scale > my ha

Re: Usage volume of older versions of Cassandra

2015-12-15 Thread Robert Coli
On Tue, Dec 15, 2015 at 6:28 AM, Andy Kruth wrote: > We are trying to decide how to proceed with development and support of > YCSB bindings for older versions of Cassandra, namely Cassandra 7, 8, and > 10. > > We would like to continue dev and support on these if the use of those > versions of Ca

Re: Fatal Issues after upgrading to Cassandra 3.0.0/3.0.1 and using latest java driver 3.0.0-beta1

2015-12-14 Thread Robert Coli
On Mon, Dec 14, 2015 at 1:17 PM, Carlos A wrote: > I had logged a bug on JIRA as I think it might be a better way of > reporting this: https://issues.apache.org/jira/browse/CASSANDRA-10861 > Thanks for keeping the list looped in on the JIRA URL. =Rob

Re: Fatal Issues after upgrading to Cassandra 3.0.0/3.0.1 and using latest java driver 3.0.0-beta1

2015-12-14 Thread Robert Coli
On Mon, Dec 14, 2015 at 1:02 PM, Carlos A wrote: > Indeed you are correct, thanks for that. But It is not production it is > DEV server. We need some features of Cassandra 3 hence the need to upgrade > it. > Sure, consider my response "response to a nick I don't recognize who may be a relatively

Re: Fatal Issues after upgrading to Cassandra 3.0.0/3.0.1 and using latest java driver 3.0.0-beta1

2015-12-14 Thread Robert Coli
On Mon, Dec 14, 2015 at 12:52 PM, Carlos A wrote: > We had upgrade to Cassandra 3.0.0 and later to 3.0.1. That went well. > System seems stable. > https://www.eventbrite.com/engineering/what-version-of-cassandra-should-i-run/ Running cutting edge versions of server and client in production *wil

Re: Better setup to start using in production on one server

2015-12-14 Thread Robert Coli
On Mon, Dec 14, 2015 at 3:46 AM, Vladimir Prudnikov wrote: > [I want to run Cassandra on a single server] I struggle to imagine the purpose of doing this. You are going to sign yourself up for repeated painful changes of RF, as well as downtimes. As Michael says, you will get most of the cost

Re: read time coprocessor?

2015-12-11 Thread Robert Coli
On Fri, Dec 11, 2015 at 8:34 AM, DuyHai Doan wrote: > The new UDF (User Defined Function) and UDA (User Defined Aggregate) > introduced since Cassandra 2.2 is the feature to closest HBase co-processor. > Aren't "Prototype Triggers" (which probably no one should use) closer? http://www.datastax.

Re: Want to run repair on a node without it taking traffic

2015-12-04 Thread Robert Coli
On Fri, Dec 4, 2015 at 11:44 AM, Anuj Wadehra wrote: > Did u say "longer than gc_grace_seconds" ? > Wont deletes pop back during repair? > Unfortunately, you are correct. Since CASSANDRA-4905 [1], such tombstones will not be propagated. The actual way to fully repair a node that has been down g

Re: Rebuilding a new Cassandra node at 100Mb/s

2015-12-03 Thread Robert Coli
On Thu, Dec 3, 2015 at 7:51 AM, Jonathan Ballet wrote: > I noticed it's not really fast and my monitoring system shows that the > traffic incoming on this node is exactly at 100Mb/s (12.6MB/s). I know it > can be much more than that (I just tested sending a file through SSH > between the two mach

Re: Restoring a snapshot into a new cluster - thoughts on replica placement

2015-12-02 Thread Robert Coli
On Wed, Dec 2, 2015 at 5:06 AM, Peer, Oded wrote: > It seems it is not enough to restore the token ranges on an equal-size > cluster since you also need to restore the rack information. > Yep, if you're using a rack-aware snitch, that is correct. Because in that case, rack determines replica pla

Re: Want to run repair on a node without it taking traffic

2015-12-02 Thread Robert Coli
On Wed, Dec 2, 2015 at 8:54 AM, K F wrote: > How can I run repair on a node without it taking any coordinator/client > traffic. So, I can complete the repair on the node without it taking any > traffic, except the streams from other nodes. Is that possible? > In general you should probably just

Re: Running sstableloader from every node when migrating?

2015-11-30 Thread Robert Coli
On Thu, Nov 19, 2015 at 7:01 AM, George Sigletos wrote: > We would like to migrate one keyspace from a 6-node cluster to a 3-node > one. > http://www.pythian.com/blog/bulk-loading-options-for-cassandra/ =Rob

Re: Does the rebuild tools rebuild all each time it start Or rebuild the rest?

2015-11-30 Thread Robert Coli
On Fri, Nov 20, 2015 at 8:54 AM, wateray wrote: > *is it rebuild all rang of tokens which belong to the node or just rebuild > the rest rang of tokens from last rebuild.(since last rebuild we get some > data).* > There is no resume in versions before 2.2.x, it will duplicate-rebuild anything tha

Re: Cassandra 3.0.0 connection problem

2015-11-30 Thread Robert Coli
On Wed, Nov 18, 2015 at 11:13 PM, Enrico Sola wrote: > Hi, I'm new to Cassandra and I've recently upgraded to 3.0.0 on Ubuntu > Linux 14.04 LTS > https://www.eventbrite.com/engineering/what-version-of-cassandra-should-i-run/ =Rob

Re: Strategy tools for taking snapshots to load in another cluster instance

2015-11-30 Thread Robert Coli
On Wed, Nov 18, 2015 at 2:23 AM, Anishek Agarwal wrote: > We have 5 node prod cluster and 3 node test cluster. Is there a way i can > take snapshot of a table in prod and load it test cluster. The cassandra > versions are same. > http://www.pythian.com/blog/bulk-loading-options-for-cassandra/ =

Re: Cassandra Cleanup and disk space

2015-11-30 Thread Robert Coli
On Thu, Nov 26, 2015 at 12:55 AM, Luigi Tagliamonte wrote: > I'd like to understand what cleanup does on a running cluster when there > is no cluster topology change, i did a test and i saw the cluster disk > space shrink of 200GB. > "writes out files 1:1 with their input files" IIRC it does no

Re: Huge ReadStage Pending tasks during startup

2015-11-30 Thread Robert Coli
On Fri, Nov 27, 2015 at 2:52 AM, Vasiliy I Ozerov wrote: > We have some strange troubles with cassandra startup. Cluster consists of > 4 nodes. 32 Gb RAM per node, each node has about 30Gb of data, 8 CPU. > > So, just after start it has 2753202 pending readstage tasks. And it takes > about 11 hou

Re: Moving SSTables from one disk to another

2015-11-30 Thread Robert Coli
On Mon, Nov 30, 2015 at 11:29 AM, S C wrote: > It is inevitable that the repairs are needed to keep consistency > guarantees. Is it worthwhile to consider RAID-0 as we get more storage? One > can treat loss of disk as loss of node and rebuild the node and repair. Any > other suggestions are most

Re: handling down node cassandra 2.0.15

2015-11-30 Thread Robert Coli
On Wed, Nov 18, 2015 at 6:16 AM, Anuj Wadehra wrote: > Suppose, gc grace seconds=10days, max hinted handoff period=3 hrs, 3 nodes > are there A,B & C,RF =3 and my client is reading at CL ONE. C remains down > for 5 hours and misses many updates including those which happened after > max hinted ha

Re: Nodetool rebuild on vnodes enabled

2015-11-17 Thread Robert Coli
On Tue, Nov 17, 2015 at 3:24 PM, cass savy wrote: > I am exploring vnodes on DSE spark enabled DC. I added new nodes with 64 > vnodes, stream thruput 100mb instead of default 200mb, sokcet_timeout set > to 1hr. > 1) what version of Cassandra (please the version of Apache Cassandra, not DSE)? 2

Re: handling down node cassandra 2.0.15

2015-11-17 Thread Robert Coli
On Tue, Nov 17, 2015 at 4:33 AM, Anuj Wadehra wrote: > Only if gc_grace_seconds havent passed since the failure. If your machine > is down for more than gc_grace_seconds you need to delete the data > directory and go with auto bootstrap = true . > Since CASSANDRA-6961 you can : 1) bring up the

Re: Ingesting Large Number of files

2015-11-17 Thread Robert Coli
On Tue, Nov 17, 2015 at 6:32 AM, Tushar Agrawal wrote: > We get periodic bulk load (twice a month) in form of delimited data files. > We get about 10K files with average size of 50 MB. Each record is a row in > Cassandra table. > http://www.pythian.com/blog/bulk-loading-options-for-cassandra/ =

Re: Help diagnosing performance issue

2015-11-17 Thread Robert Coli
On Tue, Nov 17, 2015 at 11:08 AM, Sebastian Estevez < sebastian.este...@datastax.com> wrote: > You're sstables are probably falling out of page cache on the smaller > nodes and your slow disks are killing your latencies. > +1 most likely. Are the heaps the same size on both machines? =Rob

Re: Deletes Reappeared even when nodes are not down

2015-11-16 Thread Robert Coli
On Sat, Nov 14, 2015 at 9:58 AM, Peddi, Praveen wrote: > I checked tpstats and there are no dropped mutations (though I checked it > after restating the affected nodes). If the problem occurs again, I will > check tpstats again. Is there any stat that shows failed hints? The only > abnormality I

Re: Deletes Reappeared even when nodes are not down

2015-11-13 Thread Robert Coli
On Fri, Nov 13, 2015 at 1:47 PM, Peddi, Praveen wrote: > We do not currently run repairs because we know our deployment time for > each cassandra node is very short. I do understand we have to run repairs > but would repair be in the picture here when no nodes in the cluster were > down for last

Re: too many full gc in one node of the cluster

2015-11-13 Thread Robert Coli
On Thu, Nov 12, 2015 at 10:35 PM, Shuo Chen wrote: > We have a small cassandra cluster with 4 nodes for production. All the > nodes have similar hardware configuration and similar data load. The C* > version is 1.0.7 (prretty old) > > One of the node has much higher cpu usage than others and high

Re: Deletes Reappeared even when nodes are not down

2015-11-13 Thread Robert Coli
On Fri, Nov 13, 2015 at 1:09 PM, Peddi, Praveen wrote: > We are seeing a scenario where some of the rows in the table reappears > even after they are deleted. We have seen this in Prod 3 times in last 1 > week and *coincidentally all 3 times on the same partition*. We have > confirmed that nodes

Re: UnknownColumnFamily exception / schema inconsistencies

2015-11-13 Thread Robert Coli
On Fri, Nov 13, 2015 at 12:31 PM, Maciek Sakrejda wrote: > On Fri, Nov 13, 2015 at 9:56 AM, Sebastian Estevez < > sebastian.este...@datastax.com> wrote: > >> I think you're just missing the steps in *Bold*: >> >> Thanks, but I wasn't clear on what to do if the "new" directory does not > exist at

Re: Data.db too large and after sstableloader still large

2015-11-12 Thread Robert Coli
On Thu, Nov 12, 2015 at 6:44 AM, qihuang.zheng wrote: > question is : why sstableloader can’t balance data file size? > Because it streams ranges from the source SStable to a distributed set of ranges, especially if you are using vnodes. It is a general property of Cassandra's streaming that i

Re: Unable to bootstrap another DC in my cluster

2015-11-09 Thread Robert Coli
On Mon, Nov 9, 2015 at 12:08 PM, K F wrote: > As I am trying to bring up a new DC in my cluster, my first seed node that > I bring-up in the new DC that I am adding to the existing cluster. It's not > able to receive reply back for the GossipDigestSyn request sent to other > seeds in the cluster.

Re: Cassandra compaction stuck? Should I disable?

2015-11-09 Thread Robert Coli
On Mon, Nov 9, 2015 at 1:29 PM, PenguinWhispererThe . < th3penguinwhispe...@gmail.com> wrote: > > In Opscenter I see one of the nodes is orange. It seems like it's working > on compaction. I used nodetool compactionstats and whenever I did this the > Completed nad percentage stays the same (even wi

  1   2   3   4   5   6   7   8   9   10   >