[GitHub] spark pull request #15659: [WIP][SPARK-1267][SPARK-18129] Allow PySpark to b...

nchammas Thu, 27 Oct 2016 08:08:51 -0700

Github user nchammas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/15659#discussion_r85354868
  
    --- Diff: python/README.md ---
    @@ -0,0 +1,32 @@
    +# Apache Spark
    +
    +Spark is a fast and general cluster computing system for Big Data. It provides
    +high-level APIs in Scala, Java, Python, and R, and an optimized engine that
    +supports general computation graphs for data analysis. It also supports a
    +rich set of higher-level tools including Spark SQL for SQL and DataFrames,
    +MLlib for machine learning, GraphX for graph processing,
    +and Spark Streaming for stream processing.
    +
    +<http://spark.apache.org/>
    +
    +## Online Documentation
    +
    +You can find the latest Spark documentation, including a programming
    +guide, on the [project web page](http://spark.apache.org/documentation.html)
    +
    +
    +## Python Packaging
    +
    +This README file only contains basic information related to pip installed PySpark.
    +This packaging is currently experimental and may change in future versions (although we will do our best to keep compatibility).
    +Using PySpark requires the Spark JARs, and if you are building this from source please see the builder instructions at
    +["Building Spark"](http://spark.apache.org/docs/latest/building-spark.html).
    +
    +The Python packaging for Spark is not intended to replace all of the other use cases. This Python packaged version of Spark is suitable for interacting with an existing cluster (be it Spark standalone, YARN, or Mesos) - but does not contain the tools required to setup your own standalone Spark cluster. You can download the full version of Spark from the [Apache Spark downloads page](http://spark.apache.org/downloads.html).
    --- End diff --
    
    If I am doing local development on my Mac, for example, what does pip installing Spark get me?
    
    From this line, it sounds like even if I pip install Spark, I will still need to separately `brew install apache-spark` or something similar to be able to run Spark programs. Is that correct?
    
    How does my workflow change or improve if I can pip install Spark?



