Hi

I have a requirement to transfer data from RDBMS mysql to partitioned hive
table
Partitioned on Year and month.
Each record in mysql data contains timestamp of user activity.

What is the best tool for that.

1.Shall I go with sqoop?

2.How to compute dynamic partition from RDBMS data .

Shall I bucketised my fetched data on User Key.
Shall I use day also in partition?
My requirement is to analyse user activity per day basis.

Thanks
Shushant

Reply via email to