There is a Spark<->HBase library that does this. I used it once in a prototype (never tried it in production, though): http://blog.cloudera.com/blog/2015/08/apache-spark-comes-to-apache-hbase-with-hbase-spark-module/
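
On the performance worry about a DAO: the cost usually comes from issuing one write per record. As far as I understand it, the bulk operations in that module avoid this by working a whole partition at a time. Here is a rough, pure-Python sketch of the difference. Note that `FakeTable`, `put_batch`, `write_per_record` and `write_batched` are names I made up for illustration; none of this is the HBase or hbase-spark API, and the "round trips" are only simulated.

```python
from typing import Dict, Iterable, List, Tuple

Row = Tuple[str, Dict[str, str]]  # (row key, {column: value})


class FakeTable:
    """Hypothetical in-memory stand-in for an HBase table client (not a real API)."""

    def __init__(self) -> None:
        self.rows: Dict[str, Dict[str, str]] = {}
        self.round_trips = 0  # counts simulated network calls

    def put_batch(self, batch: List[Row]) -> None:
        # One simulated round trip per call, however many rows it carries.
        self.round_trips += 1
        for key, columns in batch:
            self.rows.setdefault(key, {}).update(columns)


def write_per_record(table: FakeTable, partition: Iterable[Row]) -> None:
    """DAO-style write: one call per record."""
    for row in partition:
        table.put_batch([row])


def write_batched(table: FakeTable, partition: Iterable[Row],
                  batch_size: int = 100) -> None:
    """Partition-level write: buffer rows and flush them in batches."""
    buffer: List[Row] = []
    for row in partition:
        buffer.append(row)
        if len(buffer) >= batch_size:
            table.put_batch(buffer)
            buffer = []
    if buffer:  # flush whatever is left over
        table.put_batch(buffer)


# One "partition" of 250 result rows.
partition = [(f"row-{i:04d}", {"cf:count": str(i)}) for i in range(250)]

dao_table, batch_table = FakeTable(), FakeTable()
write_per_record(dao_table, partition)
write_batched(batch_table, partition, batch_size=100)

print(dao_table.round_trips, batch_table.round_trips)  # prints: 250 3
```

Both tables end up with the same 250 rows; only the number of calls differs. In a real job, something shaped like `write_batched` is what you would run inside `foreachPartition`, with an actual HBase connection opened once per partition instead of the stand-in.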

> On Jul 19, 2016, at 9:34 AM, Yu Wei <yu20...@hotmail.com> wrote:
>
> Hi guys,
>
> I write spark application and want to store results generated by spark
> application to hbase.
> Do I need to access hbase via java api directly?
> Or is it better choice to use DAO similar as traditional RDBMS? I suspect
> that there is major performance downgrade and other negative impacts using
> DAO. However, I have little knowledge in this field.
>
> Any advice?
>
> Thanks,
> Jared