Re: Hive + Cassandra?

2011-09-02 Thread Edward Capriolo
On Fri, Sep 2, 2011 at 12:58 AM, Ayon Sinha ayonsi...@yahoo.com wrote: Hi, I'm looking for the status of the Open source Apache project that is integrating Hive Cassandra. I was under the impression that Datastax' Brisk is a commercial version of that but I'm looking for the original. BTW,

Re: Google Protocol Buffers and Hive

2011-09-02 Thread valentina kroshilina
I use MR to generate tables using Elephant-Bird's OutputFormat. Hive can read from EXTERNAL tables using ProtobufHiveSerde and ProtobufBlockInputFormat generated by Elephant-Bird. Create table statement looks like the following: CREATE EXTERNAL TABLE IF NOT EXISTS TABLE_NAME ( ... ) ROW FORMAT

Re: Google Protocol Buffers and Hive

2011-09-02 Thread valentina kroshilina
You can still partition the data. You'll have to run queries to add partitions to the table, otherwise your table won't see a new partition, but you'll have to do it regardless on what type of table you use. We have a big cluster so I don't really see any change in performance, Hive for this type

Re: Hive + Cassandra?

2011-09-02 Thread Joe Key
@Ayon Are you certain that Brisk is a commercial version? Their datasheet states that Datastax' Brisk is an enhanced open-source distribution. While I have no idea what enhanced means (perhaps they offer expertise at a fee), their public github for the project proudly displays an Apache License.

Hive Reduce error

2011-09-02 Thread Ayon Sinha
Hi, I'm pretty sure I've seen this error before on a regular hadoop job but I don't know how to fix this. Can anyone hint to what might be causing this? I'm runnign Brisk hive, but I think this is a more generic Hadoop error caused by some setting I have wrong. java.lang.RuntimeException: Hive