Re: Hive + Cassandra
There are actually several storage handlers floating around in the wild for Mongo, Cassandra, HyperTable, etc. Your best bet is to reference the Cassandra ticket and build or use the brisk/dse code. Eventually the code will make its way into Cassandra. It is a shame that the code has been in production at numerous sites for over a year now and is the most-voted-for Hive issue, yet has not landed in a release, which confuses everyone. Look here for the latest code: https://issues.apache.org/jira/browse/CASSANDRA-4131

On Tue, May 22, 2012 at 4:19 AM, Szymon Dąbrowski szymon.dabrow...@gmail.com wrote:

I've been trying to find out whether it is possible to combine Hive with Cassandra. I've noticed some issues about the integration, but it seems to me there is a mix-up about who is supposed to patch things up. There are two issues: one in the Hive project [1] and one in Cassandra [2]. The first is open, but it ends with a comment saying the patch has been sent to the second issue. The second issue is closed, because it is marked as a duplicate of the first (the Hive issue). So I guess no one is going to look at the closed issue in the Cassandra project (and no one will apply the patch), and no one is going to do anything about it on the Hive side (because the patch has been submitted to Cassandra). Anyway, is there a chance this feature will be available soon?

[1] - https://issues.apache.org/jira/browse/HIVE-1434
[2] - https://issues.apache.org/jira/browse/CASSANDRA-913

By the way, I am hoping to achieve lower Hive latency by using Cassandra, so that I get an online processing tool. Is there a chance that would be possible?

-- Szymon
Re: Hive + Cassandra?
On Fri, Sep 2, 2011 at 12:58 AM, Ayon Sinha ayonsi...@yahoo.com wrote:

Hi, I'm looking for the status of the open-source Apache project that is integrating Hive and Cassandra. I was under the impression that Datastax' Brisk is a commercial version of that, but I'm looking for the original. BTW, the Brisk Beta 2 release was pain-free to install and run, but it doesn't return correct results: it either returns the entire table or returns an empty set. But that's not relevant in this forum. Just looking for the Apache project.

-Ayon
See My Photos on Flickr http://www.flickr.com/photos/ayonsinha/
Also check out my Blog for answers to commonly asked questions. http://dailyadvisor.blogspot.com

Ayon,

The original issue to follow was https://issues.apache.org/jira/browse/HIVE-1434. Hive-Cassandra never made it into hive-trunk due to many constraints in the testing environment that made the process hard to evolve. The DataStax crew was hungry to hack at it, so that code lives with them now. In the future it would be great if we could bring all the cool things Brisk has back into Hive mainline. You can get some help here or ask me on #hive IRC, but your best bet is #datastax-brisk on IRC or DataStax support.

For reference, it is possible to take the Cassandra handler jars from Brisk and drop them into a Hive release. This allows you to use the Cassandra handler without using the other parts of Brisk.

https://github.com/riptano/hive/wiki/Cassandra-Handler-usage-in-Hive-0.7-with-Cassandra-0.7

Edward
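For anyone trying the jar drop-in described above, here is a minimal sketch of the copy step in Python. It is only an illustration: the thread does not name the jar files, so the `*cassandra*.jar` pattern, the function name `copy_handler_jars`, and both directory arguments are assumptions. The wiki page linked above is the place to check which jars are actually required.

```python
import shutil
from pathlib import Path
from typing import List


def copy_handler_jars(brisk_lib: Path, hive_lib: Path,
                      pattern: str = "*cassandra*.jar") -> List[Path]:
    """Copy jars matching `pattern` from brisk_lib into hive_lib.

    The default pattern is a guess at how the handler jars are named;
    adjust it to match the files in your Brisk build.
    Returns the paths of the copies created under hive_lib.
    """
    if not brisk_lib.is_dir():
        raise FileNotFoundError(f"not a directory: {brisk_lib}")

    # Create the destination if it is missing (a real Hive release
    # should already ship a lib directory).
    hive_lib.mkdir(parents=True, exist_ok=True)

    copied = []
    for jar in sorted(brisk_lib.glob(pattern)):
        target = hive_lib / jar.name
        shutil.copy2(jar, target)  # copy2 also preserves timestamps
        copied.append(target)
    return copied
```

Calling `copy_handler_jars(Path("/path/to/brisk/lib"), Path("/path/to/hive/lib"))` with your own (here hypothetical) paths returns the list of jars that were copied, which makes it easy to see whether the pattern matched anything at all.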
Re: Hive + Cassandra?
@Ayon Are you certain that Brisk is a commercial version? Their datasheet states that Datastax' Brisk is an enhanced open-source distribution. While I have no idea what "enhanced" means (perhaps they offer expertise for a fee), their public GitHub repository for the project proudly displays an Apache License.

SOURCES:
https://github.com/riptano/brisk/blob/beta2/LICENSE.txt
http://www.datastax.com/wp-content/uploads/2011/03/WP-Brisk.pdf

On Fri, Sep 2, 2011 at 7:37 AM, Edward Capriolo edlinuxg...@gmail.com wrote:

[snip: full quote of the preceding exchange between Ayon and Edward]

-- Joe Andrew Key (Andy)
Hive + Cassandra?
Hi, I'm looking for the status of the open-source Apache project that is integrating Hive and Cassandra. I was under the impression that Datastax' Brisk is a commercial version of that, but I'm looking for the original. BTW, the Brisk Beta 2 release was pain-free to install and run, but it doesn't return correct results: it either returns the entire table or returns an empty set. But that's not relevant in this forum. Just looking for the Apache project.

-Ayon