spark git commit: [SPARK-4477] [PySpark] remove numpy from RDDSampler

2014-11-20 Thread meng
Repository: spark Updated Branches: refs/heads/master ad5f1f3ca - d39f2e9c6 [SPARK-4477] [PySpark] remove numpy from RDDSampler In RDDSampler, it tries to use numpy to gain better performance for poisson(), but the number of calls to random() is only (1+fraction) * N in the pure python

spark git commit: [SPARK-4477] [PySpark] remove numpy from RDDSampler

2014-11-20 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.2 69e28046b - 5153aa041 [SPARK-4477] [PySpark] remove numpy from RDDSampler In RDDSampler, it tries to use numpy to gain better performance for poisson(), but the number of calls to random() is only (1+fraction) * N in the pure python
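
The call count quoted in the commit message can be checked with a small sketch. It assumes the pure-Python path draws each Poisson variate with Knuth's multiplication method, which consumes one uniform draw plus one more per unit of the returned value, so about (1 + fraction) draws per element on average. The names `poisson_knuth` and `count_random_calls` are hypothetical helpers for illustration, not the code in the commit.

```python
import math
import random


def poisson_knuth(mean, rng, counter):
    """Draw one Poisson(mean) variate with Knuth's multiplication method.

    `counter` is a one-element list that tallies calls to rng.random().
    """
    limit = math.exp(-mean)
    count = 0
    product = rng.random()
    counter[0] += 1
    # Keep multiplying uniform draws until the product falls to exp(-mean).
    while product > limit:
        count += 1
        product *= rng.random()
        counter[0] += 1
    return count


def count_random_calls(n, fraction, seed=42):
    """Sample n elements with replacement; return (random() calls, items drawn)."""
    rng = random.Random(seed)
    counter = [0]
    total_drawn = 0
    for _ in range(n):
        total_drawn += poisson_knuth(fraction, rng, counter)
    return counter[0], total_drawn


if __name__ == "__main__":
    n, fraction = 100000, 0.5
    calls, drawn = count_random_calls(n, fraction)
    # The calls-per-element ratio should be close to 1 + fraction.
    print(calls / n, 1 + fraction)
```

Under this assumption the number of calls is exactly N plus the total number of sampled items, which averages out to (1 + fraction) * N.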