Re: Writing/Importing large number of records into HBase

2017-01-28 Thread jeff saremi
No, I had not. I will take a look. Thanks, Ted. From: Ted Yu Sent: Friday, January 27, 2017 7:41 PM To: user@hbase.apache.org Subject: Re: Writing/Importing large number of records into HBase Have you looked at hbase-spark module (currently in master branch

Re: Writing/Importing large number of records into HBase

2017-01-28 Thread jeff saremi
Thank you Chetan From: Chetan Khatri Sent: Friday, January 27, 2017 8:15 PM To: user@hbase.apache.org Subject: Re: Writing/Importing large number of records into HBase Oh. Sorry. https://github.com/apache/hbase/blob/master/hbase-spark/src/main/java/org/apache

Re: Writing/Importing large number of records into HBase

2017-01-27 Thread Chetan Khatri
Oh. Sorry. https://github.com/apache/hbase/blob/master/hbase-spark/src/main/java/org/apache/hadoop/hbase/spark/example/hbasecontext/JavaHBaseBulkPutExample.java On Sat, Jan 28, 2017 at 9:27 AM, Ted Yu wrote: > Chetan: > The link you posted was from personal repo. > > There hasn't been commit for

Re: Writing/Importing large number of records into HBase

2017-01-27 Thread Ted Yu
Chetan: The link you posted was from a personal repo. There hasn't been a commit for at least a year. Meanwhile, the hbase-spark module in the hbase repo is being actively maintained. FYI > On Jan 27, 2017, at 7:47 PM, Chetan Khatri > wrote: > > Adding to @Ted Check Bulk Put Example - > https://

Re: Writing/Importing large number of records into HBase

2017-01-27 Thread Chetan Khatri
Adding to @Ted: check the Bulk Put example - https://github.com/tmalaska/SparkOnHBase/blob/master/src/main/scala/org/apache/hadoop/hbase/spark/example/hbasecontext/HBaseBulkPutExampleFromFile.scala On Sat, Jan 28, 2017 at 9:11 AM, Ted Yu wrote: > Have you looked at hbase-spark module (currently in ma

Re: Writing/Importing large number of records into HBase

2017-01-27 Thread Chetan Khatri
https://github.com/tmalaska/SparkOnHBase/blob/master/src/main/scala/org/apache/hadoop/hbase/spark/example/hbasecontext/HBaseBulkPutExample.scala On Sat, Jan 28, 2017 at 9:17 AM, Chetan Khatri wrote: > Adding to @Ted Check Bulk Put Example - https://github.com/tmalaska/ > SparkOnHBase/blob/master

Re: Writing/Importing large number of records into HBase

2017-01-27 Thread Ted Yu
Have you looked at the hbase-spark module (currently in the master branch)? See hbase-spark/src/main/scala/org/apache/hadoop/hbase/spark/example/datasources/AvroSource.scala and hbase-spark/src/test/scala/org/apache/hadoop/hbase/spark/DefaultSourceSuite.scala for examples. There may be other options.

Writing/Importing large number of records into HBase

2017-01-27 Thread jeff saremi
Hi, I'm seeking some pointers/guidance on what we could do to insert billions of records that we already have in Avro files in Hadoop into HBase. I read some articles online and one of them recommended using the HFile format. I took a cursory look at the documentation for that. Given the complexity o