spark git commit: [SPARK-16162] Remove dead code OrcTableScan.

2016-06-22 Thread rxin
Repository: spark Updated Branches: refs/heads/master f34b5c62b -> 4374a46bf [SPARK-16162] Remove dead code OrcTableScan. ## What changes were proposed in this pull request? SPARK-14535 removed all calls to class OrcTableScan. This removes the dead code. ## How was this patch tested?

spark git commit: [SQL][MINOR] Fix minor formatting issues in SHOW CREATE TABLE output

2016-06-22 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 5b4a9a4c3 -> 4ad731ed6 [SQL][MINOR] Fix minor formatting issues in SHOW CREATE TABLE output ## What changes were proposed in this pull request? This PR fixes two minor formatting issues appearing in `SHOW CREATE TABLE` output.

spark git commit: [SQL][MINOR] Fix minor formatting issues in SHOW CREATE TABLE output

2016-06-22 Thread rxin
Repository: spark Updated Branches: refs/heads/master 925884a61 -> f34b5c62b [SQL][MINOR] Fix minor formatting issues in SHOW CREATE TABLE output ## What changes were proposed in this pull request? This PR fixes two minor formatting issues appearing in `SHOW CREATE TABLE` output. Before:

spark git commit: [SPARK-15230][SQL] distinct() does not handle column name with dot properly

2016-06-22 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 37f3be5d2 -> 925884a61 [SPARK-15230][SQL] distinct() does not handle column name with dot properly ## What changes were proposed in this pull request? When a table is created with a column name containing a dot, distinct() will fail to run.

spark git commit: [SPARK-15230][SQL] distinct() does not handle column name with dot properly

2016-06-22 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-2.0 e0a43235d -> 5b4a9a4c3 [SPARK-15230][SQL] distinct() does not handle column name with dot properly ## What changes were proposed in this pull request? When a table is created with a column name containing a dot, distinct() will fail to

spark git commit: [SPARK-16159][SQL] Move RDD creation logic from FileSourceStrategy.apply

2016-06-22 Thread rxin
Repository: spark Updated Branches: refs/heads/master 9f990fa3f -> 37f3be5d2 [SPARK-16159][SQL] Move RDD creation logic from FileSourceStrategy.apply ## What changes were proposed in this pull request? We embed partitioning logic in FileSourceStrategy.apply, making the function very long.

spark git commit: [SPARK-16024][SQL][TEST] Verify Column Comment for Data Source Tables

2016-06-22 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-2.0 e2eb8e002 -> e0a43235d [SPARK-16024][SQL][TEST] Verify Column Comment for Data Source Tables What changes were proposed in this pull request? This PR is to improve test coverage. It verifies whether `Comment` of `Column` can be

spark git commit: [SPARK-16024][SQL][TEST] Verify Column Comment for Data Source Tables

2016-06-22 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 4f869f88e -> 9f990fa3f [SPARK-16024][SQL][TEST] Verify Column Comment for Data Source Tables What changes were proposed in this pull request? This PR is to improve test coverage. It verifies whether `Comment` of `Column` can be

spark git commit: [SPARK-15956][SQL] When unwrapping ORC avoid pattern matching at runtime

2016-06-22 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/master 044971eca -> 4f869f88e [SPARK-15956][SQL] When unwrapping ORC avoid pattern matching at runtime ## What changes were proposed in this pull request? Extend the returning of unwrapper functions from primitive types to all types. This PR is

spark git commit: [SPARK-16131] initialize internal logger lazily in Scala preferred way

2016-06-22 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-2.0 1d3c56e77 -> e2eb8e002 [SPARK-16131] initialize internal logger lazily in Scala preferred way ## What changes were proposed in this pull request? Initialize logger instance lazily in Scala preferred way ## How was this patch tested?

spark git commit: [SPARK-16131] initialize internal logger lazily in Scala preferred way

2016-06-22 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 857ecff1d -> 044971eca [SPARK-16131] initialize internal logger lazily in Scala preferred way ## What changes were proposed in this pull request? Initialize logger instance lazily in Scala preferred way ## How was this patch tested? By

spark git commit: [SPARK-16155][DOC] remove package grouping in Java docs

2016-06-22 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 02435acf3 -> 1d3c56e77 [SPARK-16155][DOC] remove package grouping in Java docs ## What changes were proposed in this pull request? In 1.4 and earlier releases, we have package grouping in the generated Java API docs. See

spark git commit: [SPARK-16155][DOC] remove package grouping in Java docs

2016-06-22 Thread meng
Repository: spark Updated Branches: refs/heads/master 00cc5cca4 -> 857ecff1d [SPARK-16155][DOC] remove package grouping in Java docs ## What changes were proposed in this pull request? In 1.4 and earlier releases, we have package grouping in the generated Java API docs. See

spark git commit: [SPARK-16153][MLLIB] switch to multi-line doc to avoid a genjavadoc bug

2016-06-22 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 282a3cd02 -> 02435acf3 [SPARK-16153][MLLIB] switch to multi-line doc to avoid a genjavadoc bug ## What changes were proposed in this pull request? We recently deprecated setLabelCol in ChiSqSelectorModel (#13823): ~~~scala /**

spark git commit: [SPARK-16153][MLLIB] switch to multi-line doc to avoid a genjavadoc bug

2016-06-22 Thread meng
Repository: spark Updated Branches: refs/heads/master 20d411bc5 -> 00cc5cca4 [SPARK-16153][MLLIB] switch to multi-line doc to avoid a genjavadoc bug ## What changes were proposed in this pull request? We recently deprecated setLabelCol in ChiSqSelectorModel (#13823): ~~~scala /** group

spark git commit: [SPARK-16078][SQL] from_utc_timestamp/to_utc_timestamp should not depends on local timezone

2016-06-22 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/branch-2.0 299f427b7 -> 282a3cd02 [SPARK-16078][SQL] from_utc_timestamp/to_utc_timestamp should not depends on local timezone ## What changes were proposed in this pull request? Currently, we use local timezone to parse or format a timestamp

spark git commit: [SPARK-16078][SQL] from_utc_timestamp/to_utc_timestamp should not depends on local timezone

2016-06-22 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/master 43b04b7ec -> 20d411bc5 [SPARK-16078][SQL] from_utc_timestamp/to_utc_timestamp should not depends on local timezone ## What changes were proposed in this pull request? Currently, we use local timezone to parse or format a timestamp
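
The idea behind SPARK-16078 can be illustrated outside Spark. The following is a minimal Java sketch, not Spark's implementation: the class `UtcShiftSketch` and its methods `fromUtc` and `toUtc` are hypothetical names, and the sketch assumes the intended semantics are "treat the input wall-clock time as UTC and re-express it in the named zone" (and the reverse). Because every conversion names its zones explicitly via `java.time`, the result does not depend on the JVM default time zone.

```java
import java.time.LocalDateTime;
import java.time.ZoneId;
import java.time.ZoneOffset;

// Hypothetical illustration only; none of these names come from Spark.
class UtcShiftSketch {
    // Interpret the wall-clock value as UTC and re-express it in the target zone.
    static LocalDateTime fromUtc(LocalDateTime utcWallClock, String zone) {
        return utcWallClock.atZone(ZoneOffset.UTC)
                .withZoneSameInstant(ZoneId.of(zone))
                .toLocalDateTime();
    }

    // Interpret the wall-clock value in the given zone and re-express it in UTC.
    static LocalDateTime toUtc(LocalDateTime zoneWallClock, String zone) {
        return zoneWallClock.atZone(ZoneId.of(zone))
                .withZoneSameInstant(ZoneOffset.UTC)
                .toLocalDateTime();
    }

    public static void main(String[] args) {
        LocalDateTime noonUtc = LocalDateTime.of(2016, 6, 22, 12, 0);
        // The JVM default zone is never consulted, so this prints the same value everywhere.
        System.out.println(fromUtc(noonUtc, "Asia/Seoul"));
    }
}
```

A round trip through `fromUtc` and then `toUtc` with the same zone returns the original value for any input that does not fall into a daylight-saving gap or overlap in that zone.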

spark git commit: [SPARK-15672][R][DOC] R programming guide update

2016-06-22 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-2.0 e043c02d0 -> 299f427b7 [SPARK-15672][R][DOC] R programming guide update ## What changes were proposed in this pull request? Guide for - UDFs with dapply, dapplyCollect - spark.lapply for running parallel R functions ## How was this

spark git commit: [SPARK-15672][R][DOC] R programming guide update

2016-06-22 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 6f915c9ec -> 43b04b7ec [SPARK-15672][R][DOC] R programming guide update ## What changes were proposed in this pull request? Guide for - UDFs with dapply, dapplyCollect - spark.lapply for running parallel R functions ## How was this patch

spark git commit: [SPARK-16003] SerializationDebugger runs into infinite loop

2016-06-22 Thread davies
Repository: spark Updated Branches: refs/heads/master 472d611a7 -> 6f915c9ec [SPARK-16003] SerializationDebugger runs into infinite loop ## What changes were proposed in this pull request? This fixes SerializationDebugger to not recurse forever when `writeReplace` returns an object of the

spark git commit: [SPARK-16003] SerializationDebugger runs into infinite loop

2016-06-22 Thread davies
Repository: spark Updated Branches: refs/heads/branch-2.0 520828c90 -> e043c02d0 [SPARK-16003] SerializationDebugger runs into infinite loop ## What changes were proposed in this pull request? This fixes SerializationDebugger to not recurse forever when `writeReplace` returns an object of
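
The snippet above is cut off, so the exact condition is not stated here. As a rough illustration, assuming the problematic case is a `writeReplace` that returns a new object of the same class, the Java sketch below shows why naively following replacements never terminates and how a class check stops it. All names (`WriteReplaceSketch`, `SelfReplacing`, `Original`, `resolve`) are hypothetical, and this is not the code of Spark's SerializationDebugger.

```java
import java.io.Serializable;
import java.lang.reflect.Method;

// Hypothetical illustration only; not taken from Spark.
class WriteReplaceSketch {
    // writeReplace returns a fresh object of the same class every time.
    static class SelfReplacing implements Serializable {
        final int generation;
        SelfReplacing(int generation) { this.generation = generation; }
        private Object writeReplace() { return new SelfReplacing(generation + 1); }
    }

    // writeReplace returns an object of a different class.
    static class Original implements Serializable {
        private Object writeReplace() { return "proxy"; }
    }

    // Invoke a declared no-arg writeReplace if the class has one; otherwise return null.
    static Object callWriteReplace(Object obj) {
        try {
            Method m = obj.getClass().getDeclaredMethod("writeReplace");
            m.setAccessible(true);
            return m.invoke(obj);
        } catch (NoSuchMethodException e) {
            return null;
        } catch (ReflectiveOperationException e) {
            throw new IllegalStateException(e);
        }
    }

    // Follow replacements, but stop once a replacement has the same class as the
    // object it replaces; without that check SelfReplacing would loop forever.
    static Object resolve(Object obj) {
        Object current = obj;
        while (true) {
            Object replaced = callWriteReplace(current);
            if (replaced == null) {
                return current;
            }
            if (replaced.getClass() == current.getClass()) {
                return replaced;
            }
            current = replaced;
        }
    }

    public static void main(String[] args) {
        Object r = resolve(new SelfReplacing(0));
        System.out.println(((SelfReplacing) r).generation);
        System.out.println(resolve(new Original()));
    }
}
```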

spark git commit: [SPARK-15956][SQL] Revert "[] When unwrapping ORC avoid pattern matching…

2016-06-22 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/master c2cebdb7d -> 472d611a7 [SPARK-15956][SQL] Revert "[] When unwrapping ORC avoid pattern matching… This reverts commit 0a9c02759515c41de37db6381750bc3a316c860c. It breaks the 2.10 build, I'll fix this in a different PR. Author: Herman

spark git commit: [SPARK-16120][STREAMING] getCurrentLogFiles in ReceiverSuite WAL generating and cleaning case uses external variable instead of the passed parameter

2016-06-22 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-2.0 76d0ef34e -> 520828c90 [SPARK-16120][STREAMING] getCurrentLogFiles in ReceiverSuite WAL generating and cleaning case uses external variable instead of the passed parameter ## What changes were proposed in this pull request? In

spark git commit: [SPARK-16120][STREAMING] getCurrentLogFiles in ReceiverSuite WAL generating and cleaning case uses external variable instead of the passed parameter

2016-06-22 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 0a9c02759 -> c2cebdb7d [SPARK-16120][STREAMING] getCurrentLogFiles in ReceiverSuite WAL generating and cleaning case uses external variable instead of the passed parameter ## What changes were proposed in this pull request? In

spark git commit: [MINOR][MLLIB] DefaultParamsReadable/Writable should be DeveloperApi

2016-06-22 Thread meng
Repository: spark Updated Branches: refs/heads/master 18faa588c -> 6a6010f00 [MINOR][MLLIB] DefaultParamsReadable/Writable should be DeveloperApi ## What changes were proposed in this pull request? `DefaultParamsReadable/Writable` are not user-facing. Only developers who implement

spark git commit: [MINOR][MLLIB] DefaultParamsReadable/Writable should be DeveloperApi

2016-06-22 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 0cde3ad6d -> 76d0ef34e [MINOR][MLLIB] DefaultParamsReadable/Writable should be DeveloperApi ## What changes were proposed in this pull request? `DefaultParamsReadable/Writable` are not user-facing. Only developers who implement

spark git commit: [SPARK-16127][ML][PYPSARK] Audit @Since annotations related to ml.linalg

2016-06-22 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 1cfdd25fd -> 0cde3ad6d [SPARK-16127][ML][PYPSARK] Audit @Since annotations related to ml.linalg [SPARK-14615](https://issues.apache.org/jira/browse/SPARK-14615) and #12627 changed `spark.ml` pipelines to use the new `ml.linalg`

spark git commit: [SPARK-16127][ML][PYPSARK] Audit @Since annotations related to ml.linalg

2016-06-22 Thread meng
Repository: spark Updated Branches: refs/heads/master ea3a12b01 -> 18faa588c [SPARK-16127][ML][PYPSARK] Audit @Since annotations related to ml.linalg [SPARK-14615](https://issues.apache.org/jira/browse/SPARK-14615) and #12627 changed `spark.ml` pipelines to use the new `ml.linalg` classes

spark git commit: [SPARK-16107][R] group glm methods in documentation

2016-06-22 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 503eb882c -> 1cfdd25fd [SPARK-16107][R] group glm methods in documentation ## What changes were proposed in this pull request? This groups GLM methods (spark.glm, summary, print, predict and write.ml) in the documentation. The

spark git commit: [SPARK-15783][CORE] Fix Flakiness in BlacklistIntegrationSuite

2016-06-22 Thread irashid
Repository: spark Updated Branches: refs/heads/master 01277d4b2 -> cf1995a97 [SPARK-15783][CORE] Fix Flakiness in BlacklistIntegrationSuite ## What changes were proposed in this pull request? Three changes here -- first two were causing failures w/ BlacklistIntegrationSuite 1. The testing

spark git commit: [SPARK-6005][TESTS] Fix flaky test: o.a.s.streaming.kafka.DirectKafkaStreamSuite.offset recovery

2016-06-22 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 d98fb19c1 -> 4fdac3c27 [SPARK-6005][TESTS] Fix flaky test: o.a.s.streaming.kafka.DirectKafkaStreamSuite.offset recovery ## What changes were proposed in this pull request? Because this test extracts data from `DStream.generatedRDDs`

spark git commit: [SPARK-16097][SQL] Encoders.tuple should handle null object correctly

2016-06-22 Thread lian
Repository: spark Updated Branches: refs/heads/branch-2.0 60bd704b5 -> 503eb882c [SPARK-16097][SQL] Encoders.tuple should handle null object correctly ## What changes were proposed in this pull request? Although the top-level input object cannot be null, when we use `Encoders.tuple` to

spark git commit: [SPARK-16097][SQL] Encoders.tuple should handle null object correctly

2016-06-22 Thread lian
Repository: spark Updated Branches: refs/heads/master 39ad53f7f -> 01277d4b2 [SPARK-16097][SQL] Encoders.tuple should handle null object correctly ## What changes were proposed in this pull request? Although the top-level input object cannot be null, when we use `Encoders.tuple` to

spark git commit: [SPARK-16121] ListingFileCatalog does not list in parallel anymore

2016-06-22 Thread lian
Repository: spark Updated Branches: refs/heads/branch-2.0 838143a2a -> 60bd704b5 [SPARK-16121] ListingFileCatalog does not list in parallel anymore ## What changes were proposed in this pull request? Seems the fix of SPARK-14959 breaks the parallel partitioning discovery. This PR fixes the

spark git commit: [SPARK-16121] ListingFileCatalog does not list in parallel anymore

2016-06-22 Thread lian
Repository: spark Updated Branches: refs/heads/master d281b0baf -> 39ad53f7f [SPARK-16121] ListingFileCatalog does not list in parallel anymore ## What changes were proposed in this pull request? Seems the fix of SPARK-14959 breaks the parallel partitioning discovery. This PR fixes the

spark git commit: [SPARK-15162][SPARK-15164][PYSPARK][DOCS][ML] update some pydocs

2016-06-22 Thread mlnick
Repository: spark Updated Branches: refs/heads/branch-2.0 e7a489c7f -> 838143a2a [SPARK-15162][SPARK-15164][PYSPARK][DOCS][ML] update some pydocs ## What changes were proposed in this pull request? Mark ml.classification algorithms as experimental to match Scala algorithms, update PyDoc for

spark git commit: [SPARK-15162][SPARK-15164][PYSPARK][DOCS][ML] update some pydocs

2016-06-22 Thread mlnick
Repository: spark Updated Branches: refs/heads/master 0e3ce7533 -> d281b0baf [SPARK-15162][SPARK-15164][PYSPARK][DOCS][ML] update some pydocs ## What changes were proposed in this pull request? Mark ml.classification algorithms as experimental to match Scala algorithms, update PyDoc for

spark git commit: [SPARK-15644][MLLIB][SQL] Replace SQLContext with SparkSession in MLlib

2016-06-22 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 7580f3041 -> 0e3ce7533 [SPARK-15644][MLLIB][SQL] Replace SQLContext with SparkSession in MLlib What changes were proposed in this pull request? This PR is to use the latest `SparkSession` to replace the existing `SQLContext` in

spark git commit: [SPARK-15644][MLLIB][SQL] Replace SQLContext with SparkSession in MLlib

2016-06-22 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-2.0 f3a2ebe0b -> e7a489c7f [SPARK-15644][MLLIB][SQL] Replace SQLContext with SparkSession in MLlib What changes were proposed in this pull request? This PR is to use the latest `SparkSession` to replace the existing `SQLContext` in