How many jobs are left to calculate estimated time

2017-10-26 Thread Abdullah Bashir
So my understanding is that the number of .toLocalIterator jobs will be equal to the number of cores in my system? 3. Also, why is it so slow? Best Regards, *Abdullah Bashir* *Senior Software Engineer,* *Foretheta, LLC.*
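For the estimated-time part of the question, here is a minimal sketch in plain Python. It assumes that toLocalIterator pulls the RDD to the driver one partition at a time, so the work is counted in partitions rather than cores. Under that assumption the total could be taken from rdd.getNumPartitions(). The helper name estimate_remaining_seconds and the sample numbers are hypothetical and are not part of Spark.

```python
def estimate_remaining_seconds(done, total, elapsed_seconds):
    """Linear estimate of the time left, in seconds.

    done            -- units of work finished so far (assumed: partitions)
    total           -- total units of work (assumed: rdd.getNumPartitions())
    elapsed_seconds -- wall-clock time spent on the finished units
    """
    if total < 0 or done < 0 or done > total:
        raise ValueError("need 0 <= done <= total")
    if done == 0:
        return None  # nothing finished yet, so there is no rate to extrapolate
    seconds_per_unit = elapsed_seconds / done
    return seconds_per_unit * (total - done)


if __name__ == "__main__":
    # Hypothetical numbers: 2 of 8 partitions consumed after 10 seconds.
    print(estimate_remaining_seconds(2, 8, 10.0))
```

The estimate is only as good as the assumption that partitions take roughly equal time; skewed partitions would make it drift.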

Fwd: [MLlib] RowMatrix computeSVD Native ARPACK support not detecting.

2017-10-09 Thread Abdullah Bashir
Hi, I am getting the following warning when I run the PySpark job: My code is:

    mat = RowMatrix(tf_rdd_vec.cache())  # RDD is cached
    svd = mat.computeSVD(num_topics, computeU=False)

I am using an Ubuntu 16.04 EC2 instance, and I have installed all of the following libraries on my system: sudo apt insta