On Fri, Sep 2, 2011 at 12:58 AM, Ayon Sinha ayonsi...@yahoo.com wrote:
Hi,
I'm looking for the status of the Open source Apache project that is
integrating Hive Cassandra. I was under the impression that Datastax'
Brisk is a commercial version of that but I'm looking for the original. BTW,
I use MR to generate tables using Elephant-Bird's OutputFormat. Hive
can read from EXTERNAL tables using ProtobufHiveSerde and
ProtobufBlockInputFormat generated by Elephant-Bird. Create table
statement looks like the following:
CREATE EXTERNAL TABLE IF NOT EXISTS TABLE_NAME
(
...
)
ROW FORMAT
You can still partition the data. You'll have to run queries to add
partitions to the table, otherwise your table won't see a new partition, but
you'll have to do it regardless on what type of table you use.
We have a big cluster so I don't really see any change in performance, Hive
for this type
@Ayon
Are you certain that Brisk is a commercial version? Their datasheet states
that Datastax' Brisk is an enhanced open-source distribution. While I
have no idea what enhanced means (perhaps they offer expertise at a fee),
their public github for the project proudly displays an Apache License.
Hi,
I'm pretty sure I've seen this error before on a regular hadoop job but I don't
know how to fix this. Can anyone hint to what might be causing this? I'm
runnign Brisk hive, but I think this is a more generic Hadoop error caused by
some setting I have wrong.
java.lang.RuntimeException: Hive