Benchmark results between Flink and Spark

2015-07-05 Thread Slim Baltagi
Hi

Apache Flink outperforms Apache Spark in processing machine learning and graph
algorithms and relational queries, but not in batch processing!

The results were published in the proceedings of the 18th International
Conference, Business Information Systems 2015, Poznań, Poland, June 24-26,
2015. 

Thanks to our friend Google, Chapter 3: 'Evaluating New Approaches of Big
Data Analytics Frameworks' by Norman Spangenberg, Martin Roth and Bogdan
Franczyk is available for preview at http://goo.gl/WocQci on pages 28-37. 

Enjoy!

Slim Baltagi
http://www.SparkBigData.com 




--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Benchmark-results-between-Flink-and-Spark-tp23626.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org



Re: Spark-ImageAnalysis

2015-07-05 Thread Slim Baltagi
Hi Prakhar,

You may want to check the following web resources on image processing
using Spark. They are all listed in my Big Data Knowledge Base at
http://www.SparkBigData.com:
1. Scaling Up Fast: Real-time Image Processing and Analytics using Spark -
Kevin Mader (ETH Zurich) [VIDEO]
https://www.youtube.com/watch?v=I6qmEcGNgDo

2. Scaling Up: Image Processing and Analytics using Spark [SLIDES]
https://rawgit.com/4Quant/spark-summit-2014-presentation/master/ssPresentation.html

3. Scaling Up: Image Processing and Analytics using Spark [MORE RESOURCES]
http://4quant.com/xrm-big-data/

Good luck!

Slim Baltagi









Big Data Day LA: FREE Big Data Conference in Los Angeles on June 27, 2015

2015-05-17 Thread Slim Baltagi
Please register for the 3rd annual full-day ‘Big Data Day LA’ here:
http://bigdatadayla.org
•   Location: Los Angeles
•   Date: June 27, 2015 
•   Completely FREE: Attendance, Food (Breakfast, Lunch and Coffee Breaks) and
Networking Reception
•   Vendor neutral
•   Great lineup of presentations, workshops, panels, and keynotes.
•   Over 45 talks organized in 5 tracks: Hadoop/Spark, Big Data, Business Use
Cases, NoSQL, Data Science
•   Led by 40+ volunteers
•   800+ expected attendees
Register before it sells out! http://bigdatadayla.org

Thanks

Slim Baltagi







Re: How to learn Spark ?

2015-04-02 Thread Slim Baltagi
Hi 

I maintain an Apache Spark Knowledge Base at http://www.SparkBigData.com with 
over 4,000 related web resources.
You can check the ‘Quick Start’ section at http://sparkbigdata.com/tutorials
There are plenty of tutorials and examples to start with once you decide what
you would like to use:
•   Programming language: Scala, Java, Python
•   Development mode:
    - Spark Shell
    - Notebook: ISpark, Spark Notebook, Zeppelin
    - IDE: IntelliJ, Eclipse
•   Deployment mode:
    - Single machine: Windows, Ubuntu, CentOS, Mac OS, Hadoop Sandbox
      [Cloudera, MapR, Hortonworks], ...
    - Cluster: Standalone, YARN, Mesos, ...
    - Cloud: Amazon EC2, Amazon EMR, Google Cloud Platform, Rackspace,
      Microsoft Azure, ...

Thanks

Slim Baltagi
http://www.SparkBigData.com 

On Apr 2, 2015, at 3:19 AM, Star Guo st...@ceph.me wrote:

 Hi, all
  
 I am new here. Could you give me some suggestions for learning Spark? Thanks.
  
 Best Regards,
 Star Guo



Benchmarks of 'Hive on Tez' vs 'Hive on Spark' vs Spark SQL

2015-03-15 Thread Slim Baltagi
Hi

I would like to share with you my comments on Hortonworks' benchmarks of
'Hive on Tez' vs 'Hive on Spark' vs 'Spark SQL'.
Please check them in my related blog entry at http://goo.gl/K5mk0U

Thanks

Slim Baltagi
Chicago, IL
http://www.SparkBigData.com 







Re: Slides of my talk in LA: 'Spark or Hadoop: is it an either-or proposition?'

2015-03-15 Thread Slim Baltagi
Hi

The video recording of this talk, titled 'Spark or Hadoop: is it an either-or
proposition?', given at the Los Angeles Spark Users Group on March 12, 2015, is
now available on YouTube at this link: http://goo.gl/0iJZ4n

Thanks

Slim Baltagi
http://www.SparkBigData.com 







Re: Help vote for Spark talks at the Hadoop Summit

2015-02-25 Thread Slim Baltagi
Hi all

Here is another Spark talk (a vendor-independent one!) that you might have 
missed:

'The Future of Apache Hadoop' track: How Spark and Flink are shaping the future 
of Hadoop?
https://hadoopsummit.uservoice.com/forums/283266-the-future-of-apache-hadoop/suggestions/7074410

Regards,

Slim Baltagi
http://www.SparkBigData.com 

On Feb 25, 2015, at 12:33 PM, Xiangrui Meng men...@gmail.com wrote:

 Made 3 votes to each of the talks. Looking forward to see them in
 Hadoop Summit:) -Xiangrui
 
 On Tue, Feb 24, 2015 at 9:54 PM, Reynold Xin r...@databricks.com wrote:
 Hi all,
 
 The Hadoop Summit uses community choice voting to decide which talks to
 feature. It would be great if the community could help vote for Spark talks
 so that Spark has a good showing at this event. You can make three votes on
 each track. Below I've listed 3 talks that are important to Spark's
 roadmap. Please give 3 votes to each of the following talks.
 
 Committer Track: Lessons from Running Ultra Large Scale Spark Workloads on
 Hadoop
 https://hadoopsummit.uservoice.com/forums/283260-committer-track/suggestions/7074016
 
 Data Science track: DataFrames: large-scale data science on Hadoop data
 with Spark
 https://hadoopsummit.uservoice.com/forums/283261-data-science-and-hadoop/suggestions/7074147
 
 Future of Hadoop track: Online Approximate OLAP in SparkSQL
 https://hadoopsummit.uservoice.com/forums/283266-the-future-of-apache-hadoop/suggestions/7074424
 
 
 Thanks!
 
 



How Spark and Flink are shaping the future of Hadoop?

2015-02-20 Thread Slim Baltagi
Hi

1. *To get a taste* of my talk at the 2015 Hadoop Summit, please find below a
few links to a similar talk that I gave at the Chicago Hadoop Users Group on
‘*Transitioning Compute Models: Apache MapReduce to Spark*’ on February 12,
2015 in front of 185 attendees:

- Video Recording: http://goo.gl/f30eEn
- Slides: http://goo.gl/Ikx4Ud
- Blog Entry: http://goo.gl/Pc6qiz 

2. To *vote* for my proposal at the 2015 Hadoop Summit on '*How Spark and
Flink are shaping the future of Hadoop?*', simply visit http://goo.gl/qfqSR9,
click on 'Vote', pick 3 votes and enter your name and email. Done in less
than 30 seconds!

Thanks in advance for your help

Slim Baltagi
Sr. Big Data Architect
http://www.SparkBigData.com







SparkBigData.com: The Apache Spark Knowledge Base

2014-11-22 Thread Slim Baltagi
Hello all

I'm very pleased to announce the launch of http://www.SparkBigData.com: The
Apache Spark Knowledge Base.
As your one-stop information resource dedicated to Apache Spark,
SparkBigData.com provides free, easy and fast access to hundreds of Apache
Spark resources organized in several categories and sub-categories.
Based on your feedback and the feedback of visitors from around the world,
SparkBigData.com's capabilities will evolve to better serve the fast-growing
Spark community.

Warm regards,

Slim Baltagi
Chicago, USA
http://www.SparkBigData.com



