RE: [EXT] handling skewness issues

email Tue, 30 Apr 2019 19:26:01 -0700

Please share the links if they are publicly available. Otherwise please share 
the name of the talks.  Thank you

From: Jules Damji <dmat...@comcast.net> 
Sent: Monday, April 29, 2019 8:04 PM
To: Michael Mansour <michael_mans...@symantec.com>
Cc: rajat kumar <kumar.rajat20...@gmail.com>; user@spark.apache.org
Subject: Re: [EXT] handling skewness issues

Yes, indeed! A few talks in the developer and deep dives address the data skews 
issue and how to address them. 

I shall let the group know when the talk sessions are available.

Cheers 

Jules

Sent from my iPhone

Pardon the dumb thumb typos :)

On Apr 29, 2019, at 2:13 PM, Michael Mansour <michael_mans...@symantec.com 
<mailto:michael_mans...@symantec.com> > wrote:

There were recently some fantastic talks about this at the SparkSummit 
conference in San Francisco.  I suggest you check out the SparkSummit YouTube 
channel after May 9th for a deep dive into this topic. 

From: rajat kumar <kumar.rajat20...@gmail.com 
<mailto:kumar.rajat20...@gmail.com> >
Date: Monday, April 29, 2019 at 9:34 AM
To: "user@spark.apache.org <mailto:user@spark.apache.org> " 
<user@spark.apache.org <mailto:user@spark.apache.org> >
Subject: [EXT] handling skewness issues

Hi All, 

How to overcome skewness issues in spark ?

I read that we can add some randomness to key column before join and remove 
that random part after join.

is there any better way ? Above method seems to be a workaround.

thanks 

rajat

RE: [EXT] handling skewness issues

Reply via email to