Re: Spark vs Tez

2014-10-24 Thread Adaryl "Bob" Wakefield, MBA
: Re: Spark vs Tez scala is not an interpreted language, from my non authoritative view it seems to have 2-3 (thousand) more compile phases than java and as a result some of the things you are doing that look like they are "interpreted" are actually macro's that get converte

Re: Spark vs Tez

2014-10-21 Thread Brian O'Neill
use of, or taking any action in reliance upon, this information by persons or entities other than the intended recipient is strictly prohibited. From: Edward Capriolo Reply-To: Date: Tuesday, October 21, 2014 at 12:06 PM To: "user@hadoop.apache.org" Subject: Re: Spark vs

Re: Spark vs Tez

2014-10-21 Thread Edward Capriolo
e with something that is billing >> itself as being fast doesn’t sound like the best idea... >> B. >> *From:* Russell Jurney <mailto:russell.jur...@gmail.com> >> *Sent:* Saturday, October 18, 2014 7:38 AM >> *To:* user@hadoop.apache.org <mailto:user@hadoop.apac

Re: Spark vs Tez

2014-10-21 Thread Tim Randles
o:russell.jur...@gmail.com> *Sent:* Saturday, October 18, 2014 7:38 AM *To:* user@hadoop.apache.org <mailto:user@hadoop.apache.org> *Subject:* Re: Spark vs Tez Check out PySpark. No Scala required. On Friday, October 17, 2014, Adaryl "Bob" Wakefield, MBA mailto:adaryl.wakefi...@hot

Re: Spark vs Tez

2014-10-20 Thread Adaryl "Bob" Wakefield, MBA
Using an interpreted scripting language with something that is billing itself as being fast doesn’t sound like the best idea... B. From: Russell Jurney Sent: Saturday, October 18, 2014 7:38 AM To: user@hadoop.apache.org Subject: Re: Spark vs Tez Check out PySpark. No Scala required. On

Re: Spark vs Tez

2014-10-19 Thread Mohan Radhakrishnan
>>>> >>>> >>>> On Fri, Oct 17, 2014 at 11:23 AM, Adaryl "Bob" Wakefield, MBA < >>>> adaryl.wakefi...@hotmail.com> wrote: >>>> >>>>> It was my understanding that Spark is faster batch processing. Tez >&

Re: Spark vs Tez

2014-10-19 Thread Niels Basjes
>> It was my understanding that Spark is faster batch processing. Tez >>>> is the new execution engine that replaces MapReduce and is also supposed to >>>> speed up batch processing. Is that not correct? >>>> B. >>>> >>>> >>

Re: Spark vs Tez

2014-10-18 Thread Jeff Zhang
ing that Spark is faster batch processing. Tez is >>> the new execution engine that replaces MapReduce and is also supposed to >>> speed up batch processing. Is that not correct? >>> B. >>> >>> >>> >>> *From:* Shahab Yunus >&g

Re: Spark vs Tez

2014-10-18 Thread Russell Jurney
oiled > down to if you need to master Java or Scala go with Java. Three months into > Java I don’t want to stop that and start learning Scala. > > B. > *From:* kartik saxena > > *Sent:* Friday, October 17, 2014 1:12 PM > *To:* user@hadoop.apache.org > > *Subject:* R

Re: Spark vs Tez

2014-10-18 Thread Mohan Radhakrishnan
that replaces MapReduce and is also supposed to >>> speed up batch processing. Is that not correct? >>> B. >>> >>> >>> >>> *From:* Shahab Yunus >>> *Sent:* Friday, October 17, 2014 1:12 PM >>> *To:* user@hadoop.apache.or

Re: Spark vs Tez

2014-10-18 Thread Niels Basjes
places MapReduce and is also supposed to >> speed up batch processing. Is that not correct? >> B. >> >> >> >> *From:* Shahab Yunus >> *Sent:* Friday, October 17, 2014 1:12 PM >> *To:* user@hadoop.apache.org >> *Subject:* Re: Spark vs Tez >>

Re: Spark vs Tez

2014-10-17 Thread Gavin Yue
; *To:* user@hadoop.apache.org > *Subject:* Re: Spark vs Tez > > What aspects of Tez and Spark are you comparing? They have different > purposes and thus not directly comparable, as far as I understand. > > Regards, > Shahab > > On Fri, Oct 17, 2014 at 2:06 PM, Adaryl "Bo

Re: Spark vs Tez

2014-10-17 Thread Adaryl "Bob" Wakefield, MBA
, October 17, 2014 1:12 PM To: user@hadoop.apache.org Subject: Re: Spark vs Tez I did a performance benchmark during my summer internship . I am currently a grad student. Can't reveal much about the specific project but Spark is still faster than around 4-5th iteration of Tez of the same query/da

Re: Spark vs Tez

2014-10-17 Thread Alexander Pivovarov
is faster batch processing. Tez is > the new execution engine that replaces MapReduce and is also supposed to > speed up batch processing. Is that not correct? > B. > > > > *From:* Shahab Yunus > *Sent:* Friday, October 17, 2014 1:12 PM > *To:* user@hadoop.apache.org > *S

Re: Spark vs Tez

2014-10-17 Thread Adaryl "Bob" Wakefield, MBA
: Spark vs Tez What aspects of Tez and Spark are you comparing? They have different purposes and thus not directly comparable, as far as I understand. Regards, Shahab On Fri, Oct 17, 2014 at 2:06 PM, Adaryl "Bob" Wakefield, MBA wrote: Does anybody have any performance figures on

Re: Spark vs Tez

2014-10-17 Thread kartik saxena
I did a performance benchmark during my summer internship . I am currently a grad student. Can't reveal much about the specific project but Spark is still faster than around 4-5th iteration of Tez of the same query/dataset. By Iteration I mean utilizing the "hot-container" property of Apache Tez .

Re: Spark vs Tez

2014-10-17 Thread Alexander Pivovarov
Spark creator Amplab did some benchmarks. https://amplab.cs.berkeley.edu/benchmark/ On Fri, Oct 17, 2014 at 11:06 AM, Adaryl "Bob" Wakefield, MBA < adaryl.wakefi...@hotmail.com> wrote: > Does anybody have any performance figures on how Spark stacks up > against Tez? If you don’t have figures, d

Re: Spark vs Tez

2014-10-17 Thread Shahab Yunus
What aspects of Tez and Spark are you comparing? They have different purposes and thus not directly comparable, as far as I understand. Regards, Shahab On Fri, Oct 17, 2014 at 2:06 PM, Adaryl "Bob" Wakefield, MBA < adaryl.wakefi...@hotmail.com> wrote: > Does anybody have any performance figure

Spark vs Tez

2014-10-17 Thread Adaryl "Bob" Wakefield, MBA
Does anybody have any performance figures on how Spark stacks up against Tez? If you don’t have figures, does anybody have an opinion? Spark seems so popular but I’m not really seeing why. B.