Hi,

Apart from the AMPLab benchmark, you might also find [1] and [2] interesting. The first is a survey of existing benchmarks, while the second proposes a new one. However, both are also limited to SQL-like queries.
Regarding graph processing benchmarks, I recently came across Graphalytics [3]. The benchmark currently supports Giraph, GraphLab, GraphX, MapReduce and Neo4j. I hope we can add Gelly to this list soon!

Unfortunately, I'm not aware of any large-scale ML or streaming benchmarks.

Cheers,
Vasia.

[1]: http://arxiv.org/pdf/1402.5194.pdf
[2]: http://msrg.utoronto.ca/publications/pdf_files/2013/Ghazal13-BigBench:_Towards_an_Industry_Standa.pdf
[3]: http://event.cwi.nl/grades2015/07-capota.pdf

On 6 July 2015 at 19:03, Slim Baltagi <sbalt...@gmail.com> wrote:

> Hi Hawin
>
> What you shared is not 'the Spark benchmark'.
> This benchmark measures response time on a handful of relational queries of
> different tools including Shark.
> Shark development was ended a year ago on July 1, 2014 in favor of Spark SQL
> which graduated from an alpha project on March 13, 2015.
> I am not aware of any published benchmark between Spark and Flink by a third
> party except the one that I shared from a conference paper:
> http://goo.gl/WocQci
> I hope this helps.
>
> Slim Baltagi
> Apache Flink Knowledge Base ( Now with over 300 categorized web resources!)
> http://sparkbigdata.com/component/tags/tag/27-flink
>
> --
> View this message in context:
> http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Benchmark-results-between-Flink-and-Spark-tp1940p1961.html
> Sent from the Apache Flink User Mailing List archive at Nabble.com.