Re: Spark Interview Questions

Sandeep Giri Tue, 18 Aug 2015 23:34:09 -0700

Thank you All. I have updated it to a little better version.


Regards,
Sandeep Giri,
+1 347 781 4573 (US)
+91-953-899-8962 (IN)

www.KnowBigData.com. <http://KnowBigData.com.>
Phone: +1-253-397-1945 (Office)

[image: linkedin icon] <https://linkedin.com/company/knowbigdata> [image:
other site icon] <http://knowbigdata.com>  [image: facebook icon]
<https://facebook.com/knowbigdata> [image: twitter icon]
<https://twitter.com/IKnowBigData> <https://twitter.com/IKnowBigData>


On Mon, Aug 17, 2015 at 7:10 PM, Sandeep Giri <sand...@knowbigdata.com>
wrote:

> This statement is from the Spark's website itself.
>
>
> Regards,
> Sandeep Giri,
> +1 347 781 4573 (US)
> +91-953-899-8962 (IN)
>
> www.KnowBigData.com. <http://KnowBigData.com.>
> Phone: +1-253-397-1945 (Office)
>
> [image: linkedin icon] <https://linkedin.com/company/knowbigdata> [image:
> other site icon] <http://knowbigdata.com>  [image: facebook icon]
> <https://facebook.com/knowbigdata> [image: twitter icon]
> <https://twitter.com/IKnowBigData> <https://twitter.com/IKnowBigData>
>
>
> On Wed, Aug 12, 2015 at 10:42 PM, Peyman Mohajerian <mohaj...@gmail.com>
> wrote:
>
>> I think this statement is inaccurate:
>> Q7: What are Actions? A: An action brings back the data from the RDD to
>> the local machine -
>>
>> Also I wouldn't say Spark is 100x faster than Hadoop and it is memory
>> based. This is the kind of statement that will not get you the job. When it
>> comes to shuffle it has to write to disk, it is a faster in many cases but
>> 100x is just some marketing statement in a very narrow use cases.
>>
>>
>>
>>
>>
>>
>> On Thu, Jul 30, 2015 at 4:55 AM, Sandeep Giri <sand...@knowbigdata.com>
>> wrote:
>>
>>> i have prepared some interview questions:
>>> http://www.knowbigdata.com/blog/interview-questions-apache-spark-part-1
>>> http://www.knowbigdata.com/blog/interview-questions-apache-spark-part-2
>>>
>>> please provide your feedback.
>>>
>>> On Wed, Jul 29, 2015, 23:43 Pedro Rodriguez <ski.rodrig...@gmail.com>
>>> wrote:
>>>
>>>> You might look at the edx course on Apache Spark or ML with Spark.
>>>> There are probably some homework problems or quiz questions that might be
>>>> relevant. I haven't looked at the course myself, but thats where I would go
>>>> first.
>>>>
>>>>
>>>> https://www.edx.org/course/introduction-big-data-apache-spark-uc-berkeleyx-cs100-1x
>>>>
>>>> https://www.edx.org/course/scalable-machine-learning-uc-berkeleyx-cs190-1x
>>>>
>>>> --
>>>> Pedro Rodriguez
>>>> PhD Student in Distributed Machine Learning | CU Boulder
>>>> UC Berkeley AMPLab Alumni
>>>>
>>>> ski.rodrig...@gmail.com | pedrorodriguez.io | 208-340-1703
>>>> Github: github.com/EntilZha | LinkedIn:
>>>> https://www.linkedin.com/in/pedrorodriguezscience
>>>>
>>>>
>>
>

Re: Spark Interview Questions

Reply via email to