Re: Jr. to Mid Level Big Data jobs in Bay Area

2015-05-17 Thread Juan Suero
He's a human asking for human advice. It's OK, methinks. We should live in a more tolerant world. Thanks. On Sun, May 17, 2015 at 8:10 PM, Stephen Boesch java...@gmail.com wrote: Hi, This is not a job board. Thanks. 2015-05-17 16:00 GMT-07:00 Adam Pritchard apritchard...@gmail.com: Hi

Re: Best practice for EC2 deployment

2013-10-26 Thread Juan Suero
Everything fails. On Saturday, October 26, 2013, Trev Smith wrote: Hi all, Could anyone direct me to a resource (or perhaps give me their own thoughts) on best practice for deploying a robust, resilient Hadoop (specifically CDH in this case) cluster to AWS? The data is important to us and we

Re: Project ideas

2013-05-21 Thread Juan Suero
I'm a newbie, but maybe this will also add some value... It is my understanding that MapReduce is like a distributed GROUP BY statement. When you run a statement like this against your petabytes of data, it can take a long time, first and foremost because the first thing you have to do before you
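
To make the "distributed group by" comparison concrete, here is a minimal single-process sketch in Python. It is not Hadoop code: the function names and the (department, amount) record shape are made up for illustration, and a real cluster would run the map and reduce steps on many machines with the shuffle happening over the network.

```python
from collections import defaultdict


def map_phase(records):
    """Emit one (key, value) pair per input record.

    The (department, amount) record shape is a hypothetical example.
    """
    for department, amount in records:
        yield department, amount


def shuffle(pairs):
    """Collect all values that share a key, like the GROUP BY step."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Aggregate each key's values, like SUM(amount) in SQL."""
    return {key: sum(values) for key, values in groups.items()}


def group_by_sum(records):
    """Roughly: SELECT department, SUM(amount) ... GROUP BY department."""
    return reduce_phase(shuffle(map_phase(records)))


if __name__ == "__main__":
    sample = [("sales", 10), ("ops", 5), ("sales", 7)]
    print(group_by_sum(sample))
```

The map step decides the grouping key, the shuffle brings equal keys together, and the reduce step plays the role of the aggregate function.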