Benchmark results between Flink and Spark
Hi Apache Flink outperforms Apache Spark in processing machine learning graph algorithms and relational queries but not in batch processing! The results were published in the proceedings of the 18th International Conference, Business Information Systems 2015, Poznań, Poland, June 24-26, 2015. Thanks to our friend Google, Chapter 3: 'Evaluating New Approaches of Big Data Analytics Frameworks' by Norman Spangenberg, Martin Roth and Bogdan Franczyk is available for preview at http://goo.gl/WocQci on pages 28-37. Enjoy! Slim Baltagi http://www.SparkBigData.com -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Benchmark-results-between-Flink-and-Spark-tp23626.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org
Re: Spark-ImageAnalysis
Hi Prakhar, How about you check the following related web resources to image processing using Spark. They are all listed in my Big Data Knowledge Base: http://www.SparkBigData.com: 1. Scaling Up Fast: Real-time Image Processing and Analytics using Spark - Kevin Mader (ETH Zurich) [VIDEO] https://www.youtube.com/watch?v=I6qmEcGNgDo 1. Scaling Up: Image Processing and Analytics using Spark https://rawgit.com/4Quant/spark-summit-2014-presentation/master/ssPresentation.html [SLIDES] 1. Scaling Up: Image Processing and Analytics using Spark http://4quant.com/xrm-big-data/ [MORE RESOURCES] Good luck! Slim Baltagi -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Spark-ImageAnalysis-tp23618p23631.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org
Big Data Day LA: FREE Big Data Conference in Los Angeles on June 27, 2015
Please register for the 3rd annual full day ‘Big Data Day LA’ here: - http://bigdatadayla.org • Location: Los Angeles • Date: June 27, 2015 • Completely FREE: Attendance, Food (Breakfast, Lunch Coffee Breaks) and Networking Reception • Vendor neutral • Great lineup of presentations, workshops, panels, and keynotes. • Over 45 talks organized in 5 tracks: Hadoop/Spark, Big Data, Business Use Cases, NoSQL, Data Science • Led by over 40+ volunteers • 800+ expected attendees Register before it sells out! http://bigdatadayla.org Thanks Slim Baltagi -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Big-Data-Day-LA-FREE-Big-Data-Conference-in-Los-Angeles-on-June-27-2015-tp22921.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org
Re: How to learn Spark ?
Hi I maintain an Apache Spark Knowledge Base at http://www.SparkBigData.com with over 4,000 related web resources. You can check the ‘Quick Start’ section at http://sparkbigdata.com/tutorials There is a plenty of tutorials and examples to start with after you decide what you would like to use: Programming Language: Scala Java Python Development mode: Spark Shell Notebook: ISpark, Spark Notebook, Zeppelin IDE: IntelliJ, Eclipse Deployment mode: Single machine: Windows, Ubuntu, CentOS, Mac OS, Hadoop Sandbox [Cloudera, MapR, Hortonworks], ... Cluster: Standalone, YARN, Mesos, ... Cloud: Amazon EC2, Amazon EMR, Google Cloud Platform, Rackspace, Microsoft Azure,... Thanks Slim Baltagi http://www.SparkBigData.com On Apr 2, 2015, at 3:19 AM, Star Guo st...@ceph.me wrote: Hi, all I am new to here. Could you give me some suggestion to learn Spark ? Thanks. Best Regards, Star Guo
Benchmarks of 'Hive on Tez' vs 'Hive on Spark' vs Spark SQL
Hi I would like to share with you my comments on Hortonworks' benchmarks of 'Hive on Tez' vs 'Hive on Spark' vs 'Spark SQL'. Please check them in my related blog entry at http://goo.gl/K5mk0U Thanks Slim Baltagi Chicago, IL http://www.SparkBigData.com -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Benchmarks-of-Hive-on-Tez-vs-Hive-on-Spark-vs-Spark-SQL-tp22060.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org
Re: Slides of my talk in LA: 'Spark or Hadoop: is it an either-or proposition?'
Hi The video recording of this talk titled Spark or Hadoop: is it an either-or proposition? at the Los Angeles Spark Users Group on March 12, 2015 is now available on youtube at this link: http://goo.gl/0iJZ4n Thanks Slim Baltagi http://www.SparkBigData.com -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Slides-of-my-talk-in-LA-Spark-or-Hadoop-is-it-an-either-or-proposition-tp22061p22069.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org
Re: Help vote for Spark talks at the Hadoop Summit
Hi all Here is another Spark talk (a vendor-independent one!) that you might have missed: 'The Future of Apache Hadoop' track: How Spark and Flink are shaping the future of Hadoop? https://hadoopsummit.uservoice.com/forums/283266-the-future-of-apache-hadoop/suggestions/7074410 Regards, Slim Baltagi http://www.SparkBigData.com On Feb 25, 2015, at 12:33 PM, Xiangrui Meng men...@gmail.com wrote: Made 3 votes to each of the talks. Looking forward to see them in Hadoop Summit:) -Xiangrui On Tue, Feb 24, 2015 at 9:54 PM, Reynold Xin r...@databricks.com wrote: Hi all, The Hadoop Summit uses community choice voting to decide which talks to feature. It would be great if the community could help vote for Spark talks so that Spark has a good showing at this event. You can make three votes on each track. Below I've listed 3 talks that are important to Spark's roadmap. Please give 3 votes to each of the following talks. Committer Track: Lessons from Running Ultra Large Scale Spark Workloads on Hadoop https://hadoopsummit.uservoice.com/forums/283260-committer-track/suggestions/7074016 Data Science track: DataFrames: large-scale data science on Hadoop data with Spark https://hadoopsummit.uservoice.com/forums/283261-data-science-and-hadoop/suggestions/7074147 Future of Hadoop track: Online Approximate OLAP in SparkSQL https://hadoopsummit.uservoice.com/forums/283266-the-future-of-apache-hadoop/suggestions/7074424 Thanks! - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org
How Spark and Flink are shaping the future of Hadoop?
Hi 1.*To get a taste* of my talk at the 2015 Hadoop Summit, please find below a few links to a similar talk that I gave at the Chicago Hadoop Users Group on ‘ *Transitioning Compute Models: Apache MapReduce to Spark*’ on February 12, 2015 in front of 185 attendees: - Video Recording: http://goo.gl/f30eEn - Slides: http://goo.gl/Ikx4Ud - Blog Entry: http://goo.gl/Pc6qiz 2. To *vote* for my proposal at the 2015 Hadoop Summit on '*How Spark and Flink are shaping the future of Hadoop*?' Simply visit http://goo.gl/qfqSR9 , click on 'Vote', pick 3 votes and enter your name and email. Done in less than 30 seconds! Thanks in advance for your help Slim Baltagi Sr. Big Data Architect http://ww.SparkBigData.com -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/How-Spark-and-Flink-are-shaping-the-future-of-Hadoop-tp21743.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org
SparkBigData.com: The Apache Spark Knowledge Base
Hello all I'm very pleased to announce the launch of http://www.SparkBigData.com: The Apache Spark Knowledge Base. As your one-stop information resource dedicated to Apache Spark. SparkBigData.com, provides free, easy and fast access to hundreds of Apache Spark resources organized in several categories and sub-categories. Based on your feedback and the feedback of visitors from around the world, SparkBigData.com capabilities will evolve to better serve the fast growing Spark community. Warm regards, Slim Baltagi Chicago, USA http://www.SparkBigData.com -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/SparkBigData-com-The-Apache-Spark-Knowledge-Base-tp19571.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org