Re: Learning Spark

2019-07-05 Thread Alex A. Reda
Hello, I also second Gourav's point regarding "Spark the definitive guide" book. This is great for learning both Scala and python based SPARK. But as others mentioned, you will need to continuously read the documentation as SPARK is still undergoing a lot of improvements. I list additional

Re: Learning Spark

2019-07-05 Thread Gourav Sengupta
okay this is all something which I would disagree with. Dr. Matei Zaharia created SPARK Then he and Bill Chambers wrote a book on SPARK recently He is still the main thinking power behind SPARK (look at his research in Stanford) The name of the book is "SPARK the definitive guide", its the best

unsubscribe

2019-07-05 Thread Paras Bansal

Re: Learning Spark

2019-07-05 Thread Vikas Garg
Thanks!!! On Fri, 5 Jul 2019 at 15:38, Chris Teoh wrote: > Scala is better suited to data engineering work. It also has better > integration with other components like HBase, Kafka, etc. > > Python is great for data scientists as there are more data science > libraries available in Python. > >

Re: Learning Spark

2019-07-05 Thread Chris Teoh
Scala is better suited to data engineering work. It also has better integration with other components like HBase, Kafka, etc. Python is great for data scientists as there are more data science libraries available in Python. On Fri., 5 Jul. 2019, 7:40 pm Vikas Garg, wrote: > Is there any

Re: Learning Spark

2019-07-05 Thread Vikas Garg
Is there any disadvantage of using Python? I have gone through multiple articles which says that Python has advantages over Scala. Scala is super fast in comparison but Python has more pre-built libraries and options for analytics. Still should I go with Scala? On Fri, 5 Jul 2019 at 13:07, Kurt

Re: Learning Spark

2019-07-05 Thread Kurt Fehlhauer
Since you are a data engineer I would start by learning Scala. The parts of Scala you would need to learn are pretty basic. Start with the examples on the Spark website, which gives examples in multiple languages. Think of Scala as a typed version of Python. You will find that the error messages