Re: Gossiper question

2011-05-18 Thread Peter Schuller
  I have 9 node cluster with RF-3 and using Cassandra0.70/Hector26. Recently we are seeing lot of UnavailableException at the client side. Whenever this happens, I found following pattern in Cassandra node's log file at that given time, UnavailableException is the expected error if an

Re: Knowing when there is a *real* need to add nodes

2011-05-18 Thread Tomer B
As for static disk usage i would add this: test: df -kh description: run test after compaction (check GCGraceSeconds in storage-conf.xml) as only then data is expunged permanently, run on data disk, assuming here commitlog disk is separated from data dir. green gauge: used_space 30% of disk

[RELEASE] Apache Cassandra 0.7.6 released

2011-05-18 Thread Sylvain Lebresne
The Cassandra team is pleased to announce the release of Apache Cassandra version 0.7.6. Cassandra is a highly scalable second-generation distributed database, bringing together Dynamo's fully distributed design and Bigtable's ColumnFamily-based data model. You can read more here:

Berlin Buzzword Hackathon

2011-05-18 Thread Daniel Doubleday
Hi all was wondering if there's anybody here planning to go to the Berlin Buzzwords and attend the cassandra hackathon. I'm still indecisive but it might be good to have the chance to talk about experiences in more detail. Cheers, Daniel

RE: AssertionError

2011-05-18 Thread Desimpel, Ignace
Hi Sylvain, I did the upgrade from 0.7.4 to 0.7.5 and the exception does not occur anymore (on Windows ...). Thanks for pointing me to the bug fix. From the 0.7.5 version I upgraded to the 0.7.6 version, and this is also OK, without any code changes and by still keeping the same data files

Re: Berlin Buzzword Hackathon

2011-05-18 Thread Christoph Rueger
Oh damn, I would love to go, but I'll just be at Berlin Buzzwords for the 6th and 7th. Next time :) On Wed, May 18, 2011 at 2:31 PM, Daniel Doubleday daniel.double...@gmx.netwrote: Hi all was wondering if there's anybody here planning to go to the Berlin Buzzwords and attend the cassandra

Re: AssertionError

2011-05-18 Thread Sylvain Lebresne
The compose() and decompose() methods of AbstractType are used only by the PIG driver (in 0.7 at least, in 0.8 I think CQL uses them too). If you're not using PIG, you safe with making those function simple pass-through, i.e, to have something along the line of: class CustomComparator extends

RE: AssertionError

2011-05-18 Thread Desimpel, Ignace
Great! I'm not using PIG. Thanks. -Original Message- From: Sylvain Lebresne [mailto:sylv...@datastax.com] Sent: Wednesday, May 18, 2011 3:07 PM To: user@cassandra.apache.org Subject: Re: AssertionError The compose() and decompose() methods of AbstractType are used only by the PIG

Re: Questions about using MD5 encryption with SimpleAuthenticator

2011-05-18 Thread Ted Zlatanov
On Tue, 17 May 2011 15:52:22 -0700 Sameer Farooqui cassandral...@gmail.com wrote: SF Would still be nice though to use the bcrypt hash over MD5 for stronger SF security. I used MD5 when I proposed SimpleAuthenticator for two reasons: 1) SimpleAuthenticator is supposed to be a demo of the

Re: Berlin Buzzword Hackathon

2011-05-18 Thread Eric Evans
On Wed, 2011-05-18 at 14:31 +0200, Daniel Doubleday wrote: was wondering if there's anybody here planning to go to the Berlin Buzzwords and attend the cassandra hackathon. I'll be there. I'm still indecisive but it might be good to have the chance to talk about experiences in more detail.

Re: How to configure internode encryption in 0.8.0?

2011-05-18 Thread Jeremy Hanna
I'll CC Nirmal Ranganathan who implemented the internode encryption who might be able to give you some advice on this. On May 17, 2011, at 7:47 PM, Sameer Farooqui wrote: Thanks for the link, Jeremy. I generated the keystore and truststore for inter-node communication using the link in

Design for 'Most viewed Discussions' in a forum

2011-05-18 Thread Aditya Narayan
* For a discussions forum, I need to show a page of most viewed discussions. For implementing this, I maintain a count of views of a discussion when this views count of a discussion passes a certain threshold limit, the discussion Id is added to a row of most viewed discussions.

Re: Design for 'Most viewed Discussions' in a forum

2011-05-18 Thread Aditya Narayan
I would arrange for memtable flush period in such a manner that the time period for which these most viewed discussions are generated equals the memtable flush timeperiod, so that the entire row of most viewed discussion on a topic is in one or maximum two memtables/ SST tables. This would also

Recommandation on how to organize CF

2011-05-18 Thread openvictor Open
Hello all, I know organization is a broad topic and everybody may have an idea on how to do it, but I really want to have some advices and opinions and I think it could be interesting to discuss this matter. Here is my problem: I am designing a messaging system internal to a website. There are 3

Re: Design for 'Most viewed Discussions' in a forum

2011-05-18 Thread openvictor Open
Have you thought about user another kind of Database, which supports volative content for example ? I am currently thinking about doing something similar. The best and simplest option at the moment that I can think of is Redis. In redis you have the option of querying keys with wildcards. Your

Re: Design for 'Most viewed Discussions' in a forum

2011-05-18 Thread Aditya Narayan
Thanks victor! Aren't there any good ways by using Cassandra alone ? On Wed, May 18, 2011 at 11:41 PM, openvictor Open openvic...@gmail.comwrote: Have you thought about user another kind of Database, which supports volative content for example ? I am currently thinking about doing something

Re: Design for 'Most viewed Discussions' in a forum

2011-05-18 Thread openvictor Open
Sorry I made a mistake in topics-seen ! When you insert it should be : topics-seen[topic:TopicX:timestampN]={TimeUUID3:whatever} Sorry about that, Victor 2011/5/18 openvictor Open openvic...@gmail.com I guess you can use the same system, you need two CF for that and I think it's better to

Re: [RELEASE] Apache Cassandra 0.7.6 released

2011-05-18 Thread Sylvain Lebresne
A small error in the debian setup script made it's way into the debian package of 0.7.6 (more details here: https://issues.apache.org/jira/browse/CASSANDRA-2641). We are working on fixing the problem but we must follow the apache process and as a result this may take a little longer than we would

Re: Questions about using MD5 encryption with SimpleAuthenticator

2011-05-18 Thread Aaron Morton
Also if you were wearing an aluminium foil hat you may also be concerned about how the password is sent to the server. Again though, see previous I am not a security guy comment and helpful link from Jonathan confirming that statement :) Cheers - Aaron Morton Freelance

Re: Native heap leaks?

2011-05-18 Thread Hannes Schmidt
One last word on the effect of memory mapped IO on the VIRT, RES and SHR columns in the output of the top utility. With mmap enabled, VIRT can be big, as much as the sum of the size of all index and data files and the sizes of shared libraries. RES is the sum of the sizes of 1) the Java heap, 2)

Re: Questions about using MD5 encryption with SimpleAuthenticator

2011-05-18 Thread Sameer Farooqui
I am wearing said hat and am freaking out right now :-) Just kidding and good point. I guess it would be nice if clients like Hector had an option to use TLS/SSL to encapsulate the application protocol. But even SSL/TLS is subject to attacks from tools like SSLSNIFF:

Snapshotting to a different volume?

2011-05-18 Thread Sameer Farooqui
As of 0.8.0, is it possible to take a Cassandra snapshot to a different volume (like a EBS volume dedicated for backups)? About a year ago, Jonathan Ellis said that this won't be implemented b/c snapshots are basically hard links:

Re: Snapshotting to a different volume?

2011-05-18 Thread Watanabe Maki
SSTables are immutable. Those won't changed once written to disk. From iPhone On 2011/05/19, at 9:37, Sameer Farooqui cassandral...@gmail.com wrote: As of 0.8.0, is it possible to take a Cassandra snapshot to a different volume (like a EBS volume dedicated for backups)? About a year ago,

Re: Snapshotting to a different volume?

2011-05-18 Thread Sameer Farooqui
Ahh.. yeah. And during a compaction a new SSTable is created with the merged data. So, if I take a snapshot before compaction, the old SSTables won't be deleted (b/c the snapshot hard links still have a reference to the files). But if I hadn't taken a snapshot before compaction, does compaction

Re: Snapshotting to a different volume?

2011-05-18 Thread Watanabe Maki
Please note that all files on unix file system are basically hard links referring specific inode. If you make a hard link to a file, it means the inode has two referring names. When the SSTable is compacted and GCed, Cassandra delete the old SSTable but keep snapshot. Now the reference count to

Using Toad to access Cassandra

2011-05-18 Thread Sameer Farooqui
Has anybody heard of or used Toad to access Cassandra? http://www.quest.com/toad-for-cloud-databases/ They claim to: Toad® for Cloud Databases provides a SQL-based interface that makes it simple for you to generate queries, migrate, browse, and edit data, as well as create reports and tables in

Using counters in 0.8

2011-05-18 Thread Ertio Lew
I am using Hector for a project wanted to try out using counters with latest 0.8 v Cassandra. How do we work with counters in 0.8 version ? Any web-links to such examples are appreciated. Has Hector started to provide API for that ?