[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67774982 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,129 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67775004 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,129 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67774733 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,129 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/13592 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67773057 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67763740 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,129 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67763685 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,129 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67750092 --- Diff: docs/sql-programming-guide.md --- @@ -170,34 +175,37 @@ df.show() -{% highlight r %} -sqlContext <- SQLContext(sc)

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67749954 --- Diff: docs/sql-programming-guide.md --- @@ -419,35 +423,39 @@ In addition to simple column references and expressions, DataFrames also have a

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67749843 --- Diff: docs/sql-programming-guide.md --- @@ -1740,17 +1759,14 @@ results = spark.sql("FROM src SELECT key, value").collect()

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67746855 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,129 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67747099 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,129 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-20 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67746893 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,129 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-17 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67543099 --- Diff: docs/sql-programming-guide.md --- @@ -517,24 +517,26 @@ types such as Sequences or Arrays. This RDD can be implicitly converted to a Dat

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-15 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67204480 --- Diff: docs/sql-programming-guide.md --- @@ -1326,7 +1325,7 @@ write.df(df1, "data/test_table/key=1", "parquet", "overwrite") write.df(df2,

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-15 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67204146 --- Diff: docs/sql-programming-guide.md --- @@ -1142,11 +1141,11 @@ write.parquet(schemaPeople, "people.parquet") # Read in the Parquet file

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-15 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67203777 --- Diff: docs/sql-programming-guide.md --- @@ -1142,11 +1141,11 @@ write.parquet(schemaPeople, "people.parquet") # Read in the Parquet file

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-15 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67203431 --- Diff: docs/sql-programming-guide.md --- @@ -1142,11 +1141,11 @@ write.parquet(schemaPeople, "people.parquet") # Read in the Parquet file

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-15 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67203021 --- Diff: docs/sql-programming-guide.md --- @@ -956,30 +954,30 @@ file directly with SQL. {% highlight scala %} -val df =

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-15 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67202934 --- Diff: docs/sql-programming-guide.md --- @@ -939,7 +937,7 @@ df.select("name", "age").write.save("namesAndAges.parquet", format="parquet")

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-15 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67202830 --- Diff: docs/sql-programming-guide.md --- @@ -889,7 +887,7 @@ df.select("name", "favorite_color").write.save("namesAndFavColors.parquet")

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-15 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67202611 --- Diff: docs/sql-programming-guide.md --- @@ -419,35 +419,35 @@ In addition to simple column references and expressions, DataFrames also have a

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-15 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67202506 --- Diff: docs/sql-programming-guide.md --- @@ -363,10 +363,10 @@ In addition to simple column references and expressions, DataFrames also have a

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-15 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r67202307 --- Diff: docs/sql-programming-guide.md --- @@ -171,9 +171,9 @@ df.show() {% highlight r %} -sqlContext <- SQLContext(sc)

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread clockfly
Github user clockfly commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66894492 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread clockfly
Github user clockfly commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66894292 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread clockfly
Github user clockfly commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66894132 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread clockfly
Github user clockfly commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66893943 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread clockfly
Github user clockfly commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66893918 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread clockfly
Github user clockfly commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66893224 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread clockfly
Github user clockfly commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66892238 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread clockfly
Github user clockfly commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66891847 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread clockfly
Github user clockfly commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66891502 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66886012 --- Diff: docs/sql-programming-guide.md --- @@ -587,7 +590,7 @@ for the JavaBean. {% highlight java %} // sc is an existing

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66884213 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66884198 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66884076 --- Diff: docs/sql-programming-guide.md --- @@ -517,24 +517,26 @@ types such as Sequences or Arrays. This RDD can be implicitly converted to a Dat

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66878387 --- Diff: docs/sql-programming-guide.md --- @@ -1650,14 +1646,15 @@ SELECT * FROM jsonTable ## Hive Tables Spark SQL also supports reading

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66877689 --- Diff: docs/sql-programming-guide.md --- @@ -604,49 +607,47 @@ JavaRDD people = sc.textFile("examples/src/main/resources/people.txt").m });

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66877316 --- Diff: docs/sql-programming-guide.md --- @@ -587,7 +590,7 @@ for the JavaBean. {% highlight java %} // sc is an existing

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66877088 --- Diff: docs/sql-programming-guide.md --- @@ -517,24 +517,26 @@ types such as Sequences or Arrays. This RDD can be implicitly converted to a Dat

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66875336 --- Diff: docs/sql-programming-guide.md --- @@ -171,9 +171,9 @@ df.show() {% highlight r %} -sqlContext <- SQLContext(sc) +spark

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66875208 --- Diff: docs/sql-programming-guide.md --- @@ -145,10 +145,10 @@ df.show() {% highlight java %} -JavaSparkContext sc = ...; // An

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66874954 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66873712 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-13 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66872913 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,130 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-10 Thread WeichenXu123
Github user WeichenXu123 commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66700281 --- Diff: docs/sql-programming-guide.md --- @@ -517,24 +517,26 @@ types such as Sequences or Arrays. This RDD can be implicitly converted to a Dat

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-10 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66700277 --- Diff: docs/sql-programming-guide.md --- @@ -1607,13 +1600,13 @@ a regular multi-line JSON file will most often fail. {% highlight r %}

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-10 Thread WeichenXu123
Github user WeichenXu123 commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66699400 --- Diff: docs/sql-programming-guide.md --- @@ -1607,13 +1600,13 @@ a regular multi-line JSON file will most often fail. {% highlight r %}

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-10 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r3884 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,121 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-10 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r3638 --- Diff: docs/sql-programming-guide.md --- @@ -184,20 +175,20 @@ showDF(df) -## DataFrame Operations +## Untyped Dataset

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-10 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66566175 --- Diff: docs/sql-programming-guide.md --- @@ -184,20 +175,20 @@ showDF(df) -## DataFrame Operations +## Untyped Dataset Operations

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-10 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66566143 --- Diff: docs/sql-programming-guide.md --- @@ -184,20 +175,20 @@ showDF(df) -## DataFrame Operations +## Untyped Dataset Operations

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-10 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66566102 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,121 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-10 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66566055 --- Diff: docs/sql-programming-guide.md --- @@ -12,130 +12,121 @@ title: Spark SQL and DataFrames Spark SQL is a Spark module for structured data

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-10 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/13592#discussion_r66566019 --- Diff: docs/sql-programming-guide.md --- @@ -1,7 +1,7 @@ --- layout: global -displayTitle: Spark SQL, DataFrames and Datasets Guide -title:

[GitHub] spark pull request #13592: [SPARK-15863][SQL][DOC] Initial SQL programming g...

2016-06-10 Thread liancheng
GitHub user liancheng opened a pull request: https://github.com/apache/spark/pull/13592 [SPARK-15863][SQL][DOC] Initial SQL programming guide update for Spark 2.0 ## What changes were proposed in this pull request? Initial SQL programming guide update for Spark 2.0.