Repository: spark
Updated Branches:
refs/heads/master ad5f1f3ca - d39f2e9c6
[SPARK-4477] [PySpark] remove numpy from RDDSampler
In RDDSampler, it try use numpy to gain better performance for possion(), but
the number of call of random() is only (1+faction) * N in the pure python
Repository: spark
Updated Branches:
refs/heads/branch-1.2 69e28046b - 5153aa041
[SPARK-4477] [PySpark] remove numpy from RDDSampler
In RDDSampler, it try use numpy to gain better performance for possion(), but
the number of call of random() is only (1+faction) * N in the pure python