/docs/latest/submitting-applications.html#bundling-your-applications-dependencies
>
> I hope that helps.
>
> On Tue, 28 Jan 2020, 9:46 am Tharindu Mathew wrote:
>
>> Hi,
>>
>> Newbie to pyspark/spark here.
>>
>> I'm trying to submit a job to pyspark
--
Regards,
Tharindu Mathew
http://tharindumathew.com
Hi,
I wanted to get your input on how to avoid RDD shuffling in a join after a
distributed matrix operation in Spark.
The following is what my app would look like:
1. Created a dense matrix as an input to calculate the cosine distance between
columns
val rowMarixIn = sc.textFile("input.csv").map{ line
d be much appreciated
Thanks,
Tharindu