Thank you All. I have updated it to a little better version.
Regards, Sandeep Giri, +1 347 781 4573 (US) +91-953-899-8962 (IN) www.KnowBigData.com. <http://KnowBigData.com.> Phone: +1-253-397-1945 (Office) [image: linkedin icon] <https://linkedin.com/company/knowbigdata> [image: other site icon] <http://knowbigdata.com> [image: facebook icon] <https://facebook.com/knowbigdata> [image: twitter icon] <https://twitter.com/IKnowBigData> <https://twitter.com/IKnowBigData> On Mon, Aug 17, 2015 at 7:10 PM, Sandeep Giri <sand...@knowbigdata.com> wrote: > This statement is from the Spark's website itself. > > > Regards, > Sandeep Giri, > +1 347 781 4573 (US) > +91-953-899-8962 (IN) > > www.KnowBigData.com. <http://KnowBigData.com.> > Phone: +1-253-397-1945 (Office) > > [image: linkedin icon] <https://linkedin.com/company/knowbigdata> [image: > other site icon] <http://knowbigdata.com> [image: facebook icon] > <https://facebook.com/knowbigdata> [image: twitter icon] > <https://twitter.com/IKnowBigData> <https://twitter.com/IKnowBigData> > > > On Wed, Aug 12, 2015 at 10:42 PM, Peyman Mohajerian <mohaj...@gmail.com> > wrote: > >> I think this statement is inaccurate: >> Q7: What are Actions? A: An action brings back the data from the RDD to >> the local machine - >> >> Also I wouldn't say Spark is 100x faster than Hadoop and it is memory >> based. This is the kind of statement that will not get you the job. When it >> comes to shuffle it has to write to disk, it is a faster in many cases but >> 100x is just some marketing statement in a very narrow use cases. >> >> >> >> >> >> >> On Thu, Jul 30, 2015 at 4:55 AM, Sandeep Giri <sand...@knowbigdata.com> >> wrote: >> >>> i have prepared some interview questions: >>> http://www.knowbigdata.com/blog/interview-questions-apache-spark-part-1 >>> http://www.knowbigdata.com/blog/interview-questions-apache-spark-part-2 >>> >>> please provide your feedback. >>> >>> On Wed, Jul 29, 2015, 23:43 Pedro Rodriguez <ski.rodrig...@gmail.com> >>> wrote: >>> >>>> You might look at the edx course on Apache Spark or ML with Spark. >>>> There are probably some homework problems or quiz questions that might be >>>> relevant. I haven't looked at the course myself, but thats where I would go >>>> first. >>>> >>>> >>>> https://www.edx.org/course/introduction-big-data-apache-spark-uc-berkeleyx-cs100-1x >>>> >>>> https://www.edx.org/course/scalable-machine-learning-uc-berkeleyx-cs190-1x >>>> >>>> -- >>>> Pedro Rodriguez >>>> PhD Student in Distributed Machine Learning | CU Boulder >>>> UC Berkeley AMPLab Alumni >>>> >>>> ski.rodrig...@gmail.com | pedrorodriguez.io | 208-340-1703 >>>> Github: github.com/EntilZha | LinkedIn: >>>> https://www.linkedin.com/in/pedrorodriguezscience >>>> >>>> >> >