Thank you for your reference. I have looked at Brisk. In our situation both are disconnected clusters for various reasons and using different distributions (i.e cloudera). Is there any other/similar way to inject data to HDFS
R On Fri, Dec 23, 2011 at 7:34 AM, Sanjeev Verma <sanjeev.x.ve...@gmail.com>wrote: > Hey Ravi: > > Hadoop newbie here, so pardon me if I am pointing out the obvious - have > you taken a look at this link - > http://wiki.apache.org/cassandra/HadoopSupport > > Looks like Cassandra 0.6 onwards supports output to mapreduce. > > Regards > Sanjeev > > On Fri, 2011-12-23 at 07:13 -0800, ravikumar visweswara wrote: > > Hello All, > > > > I have a situation to dump cassandra data to hadoop cluster for further > > analytics. Lot of other relevant data which is not present in cassandra > is > > already available in hdfs for analysis. Both are independent clusters > right > > now. > > Is there a suggested way to get the data periodically or continuously to > > HDFS from cassandra? Any ideas or references will be very helpful for me. > > > > Thanks and Regards > > R > > >