[ https://issues.apache.org/jira/browse/CASSANDRA-3045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13133717#comment-13133717 ]
T Jake Luciani commented on CASSANDRA-3045: ------------------------------------------- We could write an alternate CFOF like BulkColumnFamilyOutputFormat that can be used when the TT is running on the same node as Cassandra. The reducer would write files to hadoop.local.dir then when the reducer is closed it will contact the local cassandra instance via JMX with the output dir to be loaded into via streaming. > Update ColumnFamilyOutputFormat to use new bulkload API > ------------------------------------------------------- > > Key: CASSANDRA-3045 > URL: https://issues.apache.org/jira/browse/CASSANDRA-3045 > Project: Cassandra > Issue Type: Improvement > Components: Hadoop > Reporter: Jonathan Ellis > Assignee: Brandon Williams > Priority: Minor > Fix For: 1.1 > > > The bulk loading interface added in CASSANDRA-1278 is a great fit for Hadoop > jobs. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira