Hbase scan - How to group by and aggregate specific column in hbase table

2018-08-23 Thread neo0731
Hi everyone, 
When i am trying  for group by and aggregate using spark/scala.  I am not
able to figure it out how to do it using hbase rdd api for scala. 

Here is my input and output dataset looks like in hbase 

I attached the file for input/output data should look like 


  Customer_TABLE_DATA_HBASE.docx

  




--
Sent from: http://apache-hbase.679495.n3.nabble.com/HBase-Developer-f679493.html


Does Hbase backup process support encryption while transporting the data from one cluster to other cluster

2018-10-09 Thread neo0731


Question arises when migrating the data from one hbase table to another.

Input

To sync the production cluster data with dev cluster. Additionaly, while
copying we need to re-hash the following fields: hashed_email, lexer_id,
foo_imsi, foo_msn, signal_uid, bar_imsi.

Question is : Does copyTable support hashing of data while copying? Same for
distcp utility ? Is it possible to supply some example code in scala as well

Any help on it would be much appreciated?



--
Sent from: http://apache-hbase.679495.n3.nabble.com/HBase-Developer-f679493.html