[ 
https://issues.apache.org/jira/browse/CASSANDRA-1368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12902681#action_12902681
 ] 

Stu Hood commented on CASSANDRA-1368:
-------------------------------------

> but we're okay if we change the avro client api in backwards-compatible ways, 
> right?
Sort of... Avro needs both the reader's and the writer's schema to resolve 
differences between them; if it is given only one, it assumes the two are 
the same.

The attached example code doesn't actually create a package: it just always 
works with whatever the current {{cassandra.avpr}} is, so it will continue to 
work as long as things are changed backwards compatibly. If somebody bundled 
a copy of the {{avpr}} in their application, or generated code from it, 
they'd see a runtime failure once the schema changed, unless none of the 
objects they were writing were affected by the change.
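
To make the failure mode concrete, here is a rough sketch of why the reader 
needs the writer's schema. It is a toy model in plain Python, not the Avro 
library: the encode/decode helpers and the V1/V2 schemas are hypothetical and 
are not taken from {{cassandra.avpr}}. Like Avro's binary encoding, the toy 
payload carries values in field order with no field names, so it can only be 
interpreted alongside the schema it was written with.

```python
import json

# Hypothetical schemas: V2 adds a field with a default, which is a
# backwards-compatible change.
V1 = {"fields": [{"name": "key"}, {"name": "value"}]}
V2 = {"fields": [{"name": "key"}, {"name": "value"},
                 {"name": "ttl", "default": 0}]}

def encode(record, writer_schema):
    # Values only, in the writer's field order; no field names in the payload.
    return json.dumps([record[f["name"]] for f in writer_schema["fields"]])

def decode(payload, writer_schema, reader_schema=None):
    # With no separate reader schema, assume it is the same as the writer's.
    reader_schema = reader_schema or writer_schema
    values = json.loads(payload)
    if len(values) != len(writer_schema["fields"]):
        raise ValueError("payload does not match the assumed writer schema")
    written = {f["name"]: v for f, v in zip(writer_schema["fields"], values)}
    out = {}
    for field in reader_schema["fields"]:
        if field["name"] in written:
            out[field["name"]] = written[field["name"]]
        elif "default" in field:
            # Field is new in the reader's schema: fall back to its default.
            out[field["name"]] = field["default"]
        else:
            raise ValueError("no value or default for %r" % field["name"])
    return out

# A client that bundled the old schema writes a record.
payload = encode({"key": "k1", "value": "v1"}, V1)

# Knowing both schemas, the reader resolves the difference.
resolved = decode(payload, writer_schema=V1, reader_schema=V2)

# Assuming the writer used the current schema fails at runtime.
try:
    decode(payload, writer_schema=V2)
    failed = False
except ValueError:
    failed = True
```

In this sketch the stale client only breaks when the reader has to guess the 
writer's schema, and only for records whose layout actually changed.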

> i'd say adding OUTPUT_SCHEMA_KEY is worth putting in the if/when it's 
> actually a problem category
Fine by me. It's already (accidentally) labeled as a FIXME.

> Add output support for Hadoop Streaming
> ---------------------------------------
>
>                 Key: CASSANDRA-1368
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1368
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Hadoop
>            Reporter: Stu Hood
>             Fix For: 0.7 beta 2
>
>         Attachments: 0001-Switch-to-Cloudera-s-Distribution-of-Hadoop.patch, 
> 0002-Add-an-Avro-OutputReader-and-Resolver-for-Hadoop-Str.patch, 
> 0003-Apply-the-deprecated-OutputFormat-interface-to-allow.patch, 
> 0004-Add-Streaming-example-shell-scripts.patch
>
>
> Hadoop Streaming is a framework that allows mapreduce jobs to be written in 
> languages other than Java, by performing simple IPC on stdin/stdout.
> Adding output support for Hadoop Streaming to Cassandra would mean that users 
> could write very simple scripts in dynamic languages to load data into 
> Cassandra. Once our Hadoop OutputFormat has stabilized a bit, we might also 
> be able to use this code to provide scalable bulk loading.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
