Re: Spark vs Tez
Scala is not an interpreted language. From my non-authoritative view, it seems
to have 2-3 (thousand) more compile phases than Java, and as a result some of
the things you are doing that look like they are "interpreted" are actually
macros that get converte
From: Edward Capriolo
Reply-To:
Date: Tuesday, October 21, 2014 at 12:06 PM
To: "user@hadoop.apache.org"
Subject: Re: Spark vs
*From:* Russell Jurney <mailto:russell.jur...@gmail.com>
*Sent:* Saturday, October 18, 2014 7:38 AM
*To:* user@hadoop.apache.org <mailto:user@hadoop.apache.org>
*Subject:* Re: Spark vs Tez
Check out PySpark. No Scala required.
On Friday, October 17, 2014, Adaryl "Bob" Wakefield, MBA
mailto:adaryl.wakefi...@hot
Using an interpreted scripting language with something that is billing itself
as being fast doesn’t sound like the best idea...
B.
From: Russell Jurney
Sent: Saturday, October 18, 2014 7:38 AM
To: user@hadoop.apache.org
Subject: Re: Spark vs Tez
Check out PySpark. No Scala required.
On
>>>>
>>>>
>>>> On Fri, Oct 17, 2014 at 11:23 AM, Adaryl "Bob" Wakefield, MBA <
>>>> adaryl.wakefi...@hotmail.com> wrote:
>>>>
>>>>> It was my understanding that Spark is faster batch processing. Tez
>>>> is the new execution engine that replaces MapReduce and is also supposed to
>>>> speed up batch processing. Is that not correct?
>>>> B.
>>>>
>>>>
>>> *From:* Shahab Yunus
> boiled
> down to: if you need to master Java or Scala, go with Java. Three months into
> Java, I don’t want to stop that and start learning Scala.
>
> B.
> *From:* kartik saxena
>
> *Sent:* Friday, October 17, 2014 1:12 PM
> *To:* user@hadoop.apache.org
>
> *Subject:* R
> *To:* user@hadoop.apache.org
> *Subject:* Re: Spark vs Tez
>
> What aspects of Tez and Spark are you comparing? They have different
> purposes and thus are not directly comparable, as far as I understand.
>
> Regards,
> Shahab
>
> On Fri, Oct 17, 2014 at 2:06 PM, Adaryl "Bo
> *From:* Shahab Yunus
> *Sent:* Friday, October 17, 2014 1:12 PM
> *To:* user@hadoop.apache.org
> *Subject:* Re: Spark vs Tez
What aspects of Tez and Spark are you comparing? They have different purposes
and thus are not directly comparable, as far as I understand.
Regards,
Shahab
On Fri, Oct 17, 2014 at 2:06 PM, Adaryl "Bob" Wakefield, MBA
wrote:
Does anybody have any performance figures on
I did a performance benchmark during my summer internship. I am currently
a grad student. I can't reveal much about the specific project, but Spark is
still faster than around the 4th-5th iteration of Tez on the same query/dataset.
By iteration I mean utilizing the "hot-container" property of Apache Tez.
Spark's creators at AMPLab did some benchmarks:
https://amplab.cs.berkeley.edu/benchmark/
On Fri, Oct 17, 2014 at 11:06 AM, Adaryl "Bob" Wakefield, MBA <
adaryl.wakefi...@hotmail.com> wrote:
> Does anybody have any performance figures on how Spark stacks up
> against Tez? If you don’t have figures, d
Does anybody have any performance figures on how Spark stacks up against Tez?
If you don’t have figures, does anybody have an opinion? Spark seems so popular
but I’m not really seeing why.
B.