spark git commit: [SPARK-15932][SQL][DOC] document the contract of encoder serializer expressions

2016-06-13 Thread rxin
Repository: spark Updated Branches: refs/heads/master 1842cdd4e -> 688b6ef9d [SPARK-15932][SQL][DOC] document the contract of encoder serializer expressions ## What changes were proposed in this pull request? In our encoder framework, we imply that serializer expressions should use

spark git commit: [SPARK-15932][SQL][DOC] document the contract of encoder serializer expressions

2016-06-13 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 2841bbac4 -> 974be6241 [SPARK-15932][SQL][DOC] document the contract of encoder serializer expressions ## What changes were proposed in this pull request? In our encoder framework, we imply that serializer expressions should use

spark git commit: [SPARK-15663][SQL] SparkSession.catalog.listFunctions shouldn't include the list of built-in functions

2016-06-13 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 1a57bf0f4 -> 2841bbac4 [SPARK-15663][SQL] SparkSession.catalog.listFunctions shouldn't include the list of built-in functions ## What changes were proposed in this pull request? SparkSession.catalog.listFunctions currently returns all

spark git commit: [SPARK-15364][ML][PYSPARK] Implement PySpark picklers for ml.Vector and ml.Matrix under spark.ml.python

2016-06-13 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 55c1fac21 -> 1a57bf0f4 [SPARK-15364][ML][PYSPARK] Implement PySpark picklers for ml.Vector and ml.Matrix under spark.ml.python ## What changes were proposed in this pull request? Now we have PySpark picklers for new and old

spark git commit: [SPARK-15364][ML][PYSPARK] Implement PySpark picklers for ml.Vector and ml.Matrix under spark.ml.python

2016-06-13 Thread meng
Repository: spark Updated Branches: refs/heads/master 5827b65e2 -> baa3e633e [SPARK-15364][ML][PYSPARK] Implement PySpark picklers for ml.Vector and ml.Matrix under spark.ml.python ## What changes were proposed in this pull request? Now we have PySpark picklers for new and old

spark git commit: [SPARK-15808][SQL] File Format Checking When Appending Data

2016-06-13 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 774014250 -> 55c1fac21 [SPARK-15808][SQL] File Format Checking When Appending Data What changes were proposed in this pull request? **Issue:** Got wrong results or strange errors when appending data to a table with mismatched file

spark git commit: [SPARK-15808][SQL] File Format Checking When Appending Data

2016-06-13 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 7b9071eea -> 5827b65e2 [SPARK-15808][SQL] File Format Checking When Appending Data What changes were proposed in this pull request? **Issue:** Got wrong results or strange errors when appending data to a table with mismatched file

spark git commit: [SPARK-15910][SQL] Check schema consistency when using Kryo encoder to convert DataFrame to Dataset

2016-06-13 Thread wenchen
Repository: spark Updated Branches: refs/heads/master a6babca1b -> 7b9071eea [SPARK-15910][SQL] Check schema consistency when using Kryo encoder to convert DataFrame to Dataset ## What changes were proposed in this pull request? This PR enforces schema check when converting DataFrame to

spark git commit: [SPARK-15910][SQL] Check schema consistency when using Kryo encoder to convert DataFrame to Dataset

2016-06-13 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-2.0 ab9a304a7 -> 774014250 [SPARK-15910][SQL] Check schema consistency when using Kryo encoder to convert DataFrame to Dataset ## What changes were proposed in this pull request? This PR enforces schema check when converting DataFrame to

spark git commit: [SPARK-15929] Fix portability of DataFrameSuite path globbing tests

2016-06-13 Thread lian
Repository: spark Updated Branches: refs/heads/master ced8d669b -> a6babca1b [SPARK-15929] Fix portability of DataFrameSuite path globbing tests The DataFrameSuite regression tests for SPARK-13774 fail in my environment because they attempt to glob over all of `/mnt` and some of the

spark git commit: [SPARK-15929] Fix portability of DataFrameSuite path globbing tests

2016-06-13 Thread lian
Repository: spark Updated Branches: refs/heads/branch-2.0 1f3b5a5ac -> ab9a304a7 [SPARK-15929] Fix portability of DataFrameSuite path globbing tests The DataFrameSuite regression tests for SPARK-13774 fail in my environment because they attempt to glob over all of `/mnt` and some of the

spark git commit: [SPARK-15925][SQL][SPARKR] Replaces registerTempTable with createOrReplaceTempView

2016-06-13 Thread shivaram
Repository: spark Updated Branches: refs/heads/branch-2.0 b148b0364 -> 1f3b5a5ac [SPARK-15925][SQL][SPARKR] Replaces registerTempTable with createOrReplaceTempView ## What changes were proposed in this pull request? This PR replaces `registerTempTable` with `createOrReplaceTempView` as a

spark git commit: [SPARK-15925][SQL][SPARKR] Replaces registerTempTable with createOrReplaceTempView

2016-06-13 Thread shivaram
Repository: spark Updated Branches: refs/heads/master c4b1ad020 -> ced8d669b [SPARK-15925][SQL][SPARKR] Replaces registerTempTable with createOrReplaceTempView ## What changes were proposed in this pull request? This PR replaces `registerTempTable` with `createOrReplaceTempView` as a

spark git commit: [SPARK-15887][SQL] Bring back the hive-site.xml support for Spark 2.0

2016-06-13 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 97fe1d8ee -> b148b0364 [SPARK-15887][SQL] Bring back the hive-site.xml support for Spark 2.0 ## What changes were proposed in this pull request? Right now, Spark 2.0 does not load hive-site.xml. Based on users' feedback, it seems

spark git commit: [SPARK-15887][SQL] Bring back the hive-site.xml support for Spark 2.0

2016-06-13 Thread yhuai
Repository: spark Updated Branches: refs/heads/master c654ae214 -> c4b1ad020 [SPARK-15887][SQL] Bring back the hive-site.xml support for Spark 2.0 ## What changes were proposed in this pull request? Right now, Spark 2.0 does not load hive-site.xml. Based on users' feedback, it seems make

spark git commit: [SPARK-15889][SQL][STREAMING] Add a unique id to ContinuousQuery

2016-06-13 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 5ad4e32d4 -> c654ae214 [SPARK-15889][SQL][STREAMING] Add a unique id to ContinuousQuery ## What changes were proposed in this pull request? ContinuousQueries have names that are unique across all the active ones. However, when queries

spark git commit: [SPARK-15889][SQL][STREAMING] Add a unique id to ContinuousQuery

2016-06-13 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-2.0 d9db8a9c8 -> 97fe1d8ee [SPARK-15889][SQL][STREAMING] Add a unique id to ContinuousQuery ## What changes were proposed in this pull request? ContinuousQueries have names that are unique across all the active ones. However, when

spark git commit: [SPARK-15530][SQL] Set #parallelism for file listing in listLeafFilesInParallel

2016-06-13 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 8c4050a5a -> d9db8a9c8 [SPARK-15530][SQL] Set #parallelism for file listing in listLeafFilesInParallel ## What changes were proposed in this pull request? This PR sets the degree of parallelism to prevent file listing in

spark git commit: [SPARK-15530][SQL] Set #parallelism for file listing in listLeafFilesInParallel

2016-06-13 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 3b7fb84cf -> 5ad4e32d4 [SPARK-15530][SQL] Set #parallelism for file listing in listLeafFilesInParallel ## What changes were proposed in this pull request? This PR sets the degree of parallelism to prevent file listing in

spark git commit: [SPARK-15676][SQL] Disallow Column Names as Partition Columns For Hive Tables

2016-06-13 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 2a0da84dc -> 8c4050a5a [SPARK-15676][SQL] Disallow Column Names as Partition Columns For Hive Tables What changes were proposed in this pull request? When creating a Hive Table (not data source tables), a common error users might

spark git commit: [SPARK-15676][SQL] Disallow Column Names as Partition Columns For Hive Tables

2016-06-13 Thread yhuai
Repository: spark Updated Branches: refs/heads/master a6a18a457 -> 3b7fb84cf [SPARK-15676][SQL] Disallow Column Names as Partition Columns For Hive Tables What changes were proposed in this pull request? When creating a Hive Table (not data source tables), a common error users might

spark git commit: [HOTFIX][MINOR][SQL] Revert " Standardize 'continuous queries' to 'streaming D…

2016-06-13 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-2.0 bd27dc489 -> 2a0da84dc [HOTFIX][MINOR][SQL] Revert " Standardize 'continuous queries' to 'streaming D… This reverts commit d32e227787338a08741d8064f5dd2db1d60ddc63. Broke build -

spark git commit: [MINOR][SQL] Standardize 'continuous queries' to 'streaming Datasets/DataFrames'

2016-06-13 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-2.0 413826d40 -> bd27dc489 [MINOR][SQL] Standardize 'continuous queries' to 'streaming Datasets/DataFrames' ## What changes were proposed in this pull request? This patch does some replacing (as `streaming Datasets/DataFrames` is the term

spark git commit: [MINOR][SQL] Standardize 'continuous queries' to 'streaming Datasets/DataFrames'

2016-06-13 Thread tdas
Repository: spark Updated Branches: refs/heads/master 4134653e5 -> d32e22778 [MINOR][SQL] Standardize 'continuous queries' to 'streaming Datasets/DataFrames' ## What changes were proposed in this pull request? This patch does some replacing (as `streaming Datasets/DataFrames` is the term

spark git commit: [SPARK-15697][REPL] Unblock some of the useful repl commands.

2016-06-13 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-2.0 c01dc815d -> 413826d40 [SPARK-15697][REPL] Unblock some of the useful repl commands. ## What changes were proposed in this pull request? Unblock some of the useful REPL commands, such as "implicits", "javap", "power", "type", and "kind".

spark git commit: [SPARK-15697][REPL] Unblock some of the useful repl commands.

2016-06-13 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 938434dc7 -> 4134653e5 [SPARK-15697][REPL] Unblock some of the useful repl commands. ## What changes were proposed in this pull request? Unblock some of the useful REPL commands, such as "implicits", "javap", "power", "type", and "kind". As

spark git commit: [SPARK-15913][CORE] Dispatcher.stopped should be enclosed by synchronized block.

2016-06-13 Thread vanzin
Repository: spark Updated Branches: refs/heads/branch-2.0 d64cb1994 -> c01dc815d [SPARK-15913][CORE] Dispatcher.stopped should be enclosed by synchronized block. ## What changes were proposed in this pull request? `Dispatcher.stopped` is guarded by `this`, but it is used without

spark git commit: [SPARK-15913][CORE] Dispatcher.stopped should be enclosed by synchronized block.

2016-06-13 Thread vanzin
Repository: spark Updated Branches: refs/heads/master cd47e2337 -> 938434dc7 [SPARK-15913][CORE] Dispatcher.stopped should be enclosed by synchronized block. ## What changes were proposed in this pull request? `Dispatcher.stopped` is guarded by `this`, but it is used without synchronization

spark git commit: [SPARK-15814][SQL] Aggregator can return null result

2016-06-13 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/branch-2.0 41f309bfb -> d64cb1994 [SPARK-15814][SQL] Aggregator can return null result ## What changes were proposed in this pull request? Similar to the bug fixed in https://github.com/apache/spark/pull/13425, we should consider null

spark git commit: [SPARK-15814][SQL] Aggregator can return null result

2016-06-13 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/master d681742b2 -> cd47e2337 [SPARK-15814][SQL] Aggregator can return null result ## What changes were proposed in this pull request? Similar to the bug fixed in https://github.com/apache/spark/pull/13425, we should consider null object

spark git commit: [SPARK-15813] Improve Canceling log message to make it less ambiguous

2016-06-13 Thread srowen
Repository: spark Updated Branches: refs/heads/master e2ab79d5e -> d681742b2 [SPARK-15813] Improve Canceling log message to make it less ambiguous ## What changes were proposed in this pull request? Add new desired executor number to make the log message less ambiguous. ## How was this patch

spark git commit: [SPARK-15813] Improve Canceling log message to make it less ambiguous

2016-06-13 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 b96e7f6aa -> 41f309bfb [SPARK-15813] Improve Canceling log message to make it less ambiguous ## What changes were proposed in this pull request? Add new desired executor number to make the log message less ambiguous. ## How was this