Re: real world spark code

2017-07-25 Thread Matei Zaharia

Re: real world spark code

2017-07-25 Thread Frank Austin Nothaft

Re: real world spark code

2017-07-25 Thread Jörn Franke

RE: real world spark code

2017-07-25 Thread Adaryl Wakefield

Re: real world spark code

2017-07-25 Thread Jörn Franke
Look for the ones that have unit and integration tests, as well as CI and reporting on code quality. All the others are just toy examples. Well, should be :)

Re: real world spark code

2017-07-25 Thread Xiayun Sun
Usually I look in the GitHub repos of big-name companies that I know are actively doing machine learning. For example, here are two Spark-related repos from SoundCloud:
- https://github.com/soundcloud/spark-pagerank
- https://github.com/soundcloud/cosine-lsh-join-spark

real world spark code

2017-07-24 Thread Adaryl Wakefield
Does anybody know of publicly available GitHub repos of real-world Spark applications written in Scala?

Adaryl "Bob" Wakefield, MBA
Principal
Mass Street Analytics, LLC
913.938.6685
www.massstreet.net