For Scala API on map/reduce (hadoop engine) there's a library called
Scalding. It's built on top of Cascading. If you have a huge dataset or
if you consider using map/reduce engine for your job, for any reason, you
can try Scalding.
However, Spark vs Impala doesn't make sense to me. It should've
unsubscribe
these exercises...
https://drive.google.com/a/mobipulse.in/uc?id=0B0Q4Le4DZj5iNUdSZXpFTUJEU0Eexport=download
You will love it...
Regards,
Arpit Tak
On Tue, Apr 15, 2014 at 4:28 AM, Nabeel Memon nm3...@gmail.com wrote:
Hi. I found AmpCamp exercises as a nice way to get started with spark.
However
Hi. I found AmpCamp exercises as a nice way to get started with spark.
However they require amazon ec2 access. Has anyone put together any VM or
docker scripts to have the same environment locally to work out those labs?
It'll be really helpful. Thanks.