I'm looking to put together some representative tests for Spark. Where can I find such data and code? There must be some already existing. Some tests (logistic regression, k-means, PageRank) are mentioned in the RDD paper, for example.
- sample data & code for performance tests Mike
- Re: sample data & code for performance tests Matei Zaharia
