spark git commit: [SPARK-12349][ML] Make spark.ml PCAModel load backwards compatible

2015-12-21 Thread srowen
en <so...@cloudera.com> Closes #10327 from srowen/SPARK-12349. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/d0f69508 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/d0f69508 Diff: http://git-wip-us.apache.o

spark git commit: [SPARK-12311][CORE] Restore previous value of "os.arch" property in test suites after forcing to set specific value to "os.arch" property

2015-12-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9e85bb71a -> 392046611 [SPARK-12311][CORE] Restore previous value of "os.arch" property in test suites after forcing to set specific value to "os.arch" property Restore the original value of os.arch property after each test Since some of

spark git commit: [SPARK-12010][SQL] Spark JDBC requires support for column-name-free INSERT syntax

2015-12-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master 392046611 -> 502476e45 [SPARK-12010][SQL] Spark JDBC requires support for column-name-free INSERT syntax In the past Spark JDBC write only worked with technologies which support the following INSERT statement syntax (JdbcUtils.scala:

spark git commit: [SPARK-12010][SQL] Spark JDBC requires support for column-name-free INSERT syntax

2015-12-24 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 4dd8712c1 -> 865dd8bcc [SPARK-12010][SQL] Spark JDBC requires support for column-name-free INSERT syntax In the past Spark JDBC write only worked with technologies which support the following INSERT statement syntax (JdbcUtils.scala:

spark git commit: [SPARK-12440][CORE] Avoid setCheckpoint warning when directory is not local

2015-12-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master 502476e45 -> ea4aab7e8 [SPARK-12440][CORE] Avoid setCheckpoint warning when directory is not local In SparkContext method `setCheckpointDir`, a warning is issued when spark master is not local and the passed directory for the checkpoint

spark git commit: [SPARK-12263][DOCS] IllegalStateException: Memory can't be 0 for SPARK_WORKER_MEMORY without unit

2015-12-30 Thread srowen
Repository: spark Updated Branches: refs/heads/master 27af6157f -> 932cf4424 [SPARK-12263][DOCS] IllegalStateException: Memory can't be 0 for SPARK_WORKER_MEMORY without unit Updated the Worker Unit IllegalStateException message to indicate no values less than 1MB instead of 0 to help solve

spark git commit: [SPARK-11860][PYSAPRK][DOCUMENTATION] Invalid argument specification …

2015-11-25 Thread srowen
Repository: spark Updated Branches: refs/heads/master 638500265 -> b9b6fbe89 [SPARK-11860][PYSAPRK][DOCUMENTATION] Invalid argument specification … …for registerFunction [Python] Straightforward change on the python doc Author: Jeff Zhang Closes #9901 from

spark git commit: [SPARK-11860][PYSAPRK][DOCUMENTATION] Invalid argument specification …

2015-11-25 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 a986a3bde -> 4971eaaa5 [SPARK-11860][PYSAPRK][DOCUMENTATION] Invalid argument specification … …for registerFunction [Python] Straightforward change on the python doc Author: Jeff Zhang Closes #9901 from

spark git commit: [SPARK-11686][CORE] Issue WARN when dynamic allocation is disabled due to spark.dynamicAllocation.enabled and spark.executor.instances both set

2015-11-25 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 997896643 -> a986a3bde [SPARK-11686][CORE] Issue WARN when dynamic allocation is disabled due to spark.dynamicAllocation.enabled and spark.executor.instances both set Changed the log type to a 'warning' instead of 'info' as required.

spark git commit: [SPARK-11686][CORE] Issue WARN when dynamic allocation is disabled due to spark.dynamicAllocation.enabled and spark.executor.instances both set

2015-11-25 Thread srowen
Repository: spark Updated Branches: refs/heads/master a0f1a1183 -> 638500265 [SPARK-11686][CORE] Issue WARN when dynamic allocation is disabled due to spark.dynamicAllocation.enabled and spark.executor.instances both set Changed the log type to a 'warning' instead of 'info' as required.

spark git commit: [SPARK-11859][MESOS] SparkContext accepts invalid Master URLs in the form zk://host:port for a multi-master Mesos cluster using ZooKeeper

2015-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/master 0ddfe7868 -> e07494420 [SPARK-11859][MESOS] SparkContext accepts invalid Master URLs in the form zk://host:port for a multi-master Mesos cluster using ZooKeeper * According to below doc and validation logic in

spark git commit: [SPARK-11859][MESOS] SparkContext accepts invalid Master URLs in the form zk://host:port for a multi-master Mesos cluster using ZooKeeper

2015-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 a4a2a7deb -> 12d97b0c5 [SPARK-11859][MESOS] SparkContext accepts invalid Master URLs in the form zk://host:port for a multi-master Mesos cluster using ZooKeeper * According to below doc and validation logic in

spark git commit: [SPARK-12023][BUILD] Fix warnings while packaging spark with maven.

2015-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 aaf835f1d -> 33cd171b2 [SPARK-12023][BUILD] Fix warnings while packaging spark with maven. this is a trivial fix, discussed

spark git commit: [SPARK-12023][BUILD] Fix warnings while packaging spark with maven.

2015-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/master 26c3581f1 -> bf0e85a70 [SPARK-12023][BUILD] Fix warnings while packaging spark with maven. this is a trivial fix, discussed

spark git commit: [SPARK-11898][MLLIB] Use broadcast for the global tables in Word2Vec

2015-12-01 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9693b0d5a -> a0af0e351 [SPARK-11898][MLLIB] Use broadcast for the global tables in Word2Vec jira: https://issues.apache.org/jira/browse/SPARK-11898 syn0Global and sync1Global in word2vec are quite large objects with size (vocab *

spark git commit: [DOC] Explicitly state that top maintains the order of elements

2015-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 e9653921e -> aaf835f1d [DOC] Explicitly state that top maintains the order of elements Top is implemented in terms of takeOrdered, which already maintains the order, so top should, too. Author: Wieland Hoffmann

spark git commit: [DOC] Explicitly state that top maintains the order of elements

2015-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/master 953e8e6dc -> 26c3581f1 [DOC] Explicitly state that top maintains the order of elements Top is implemented in terms of takeOrdered, which already maintains the order, so top should, too. Author: Wieland Hoffmann

spark git commit: [MINOR][BUILD] Changed the comment to reflect the plugin project is there to support SBT pom reader only.

2015-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/master e07494420 -> 953e8e6dc [MINOR][BUILD] Changed the comment to reflect the plugin project is there to support SBT pom reader only. Author: Prashant Sharma Closes #10012 from ScrapCodes/minor-build-comment. Project:

spark git commit: [MINOR][BUILD] Changed the comment to reflect the plugin project is there to support SBT pom reader only.

2015-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 12d97b0c5 -> e9653921e [MINOR][BUILD] Changed the comment to reflect the plugin project is there to support SBT pom reader only. Author: Prashant Sharma Closes #10012 from ScrapCodes/minor-build-comment.

spark git commit: rmse was wrongly calculated

2015-11-18 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 34ded83ed -> 48bfe3f89 rmse was wrongly calculated It was multiplying with U instaed of dividing by U Author: Viveka Kulharia Closes #9771 from vivkul/patch-1. (cherry picked from commit

spark git commit: rmse was wrongly calculated

2015-11-18 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9631ca352 -> 1429e0a2b rmse was wrongly calculated It was multiplying with U instaed of dividing by U Author: Viveka Kulharia Closes #9771 from vivkul/patch-1. Project:

spark git commit: rmse was wrongly calculated

2015-11-18 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 0ed6d9cf3 -> 4b8dc2556 rmse was wrongly calculated It was multiplying with U instaed of dividing by U Author: Viveka Kulharia Closes #9771 from vivkul/patch-1. (cherry picked from commit

spark git commit: rmse was wrongly calculated

2015-11-18 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.4 073c89f06 -> e12fbd80c rmse was wrongly calculated It was multiplying with U instaed of dividing by U Author: Viveka Kulharia Closes #9771 from vivkul/patch-1. (cherry picked from commit

spark git commit: [SPARK-10946][SQL] JDBC - Use Statement.executeUpdate instead of PreparedStatement.executeUpdate for DDLs

2015-11-18 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 0eb82133f -> 5da7d4130 [SPARK-10946][SQL] JDBC - Use Statement.executeUpdate instead of PreparedStatement.executeUpdate for DDLs New changes with JDBCRDD Author: somideshmukh Closes #9733 from

spark git commit: [SPARK-10946][SQL] JDBC - Use Statement.executeUpdate instead of PreparedStatement.executeUpdate for DDLs

2015-11-18 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1714350bd -> b8f4379ba [SPARK-10946][SQL] JDBC - Use Statement.executeUpdate instead of PreparedStatement.executeUpdate for DDLs New changes with JDBCRDD Author: somideshmukh Closes #9733 from

spark git commit: [SPARK-6541] Sort executors by ID (numeric)

2015-11-18 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 5da7d4130 -> 04938d929 [SPARK-6541] Sort executors by ID (numeric) "Force" the executor ID sort with Int. Author: Jean-Baptiste Onofré Closes #9165 from jbonofre/SPARK-6541. (cherry picked from commit

spark git commit: [SPARK-11652][CORE] Remote code execution with InvokerTransformer

2015-11-18 Thread srowen
; Closes #9731 from srowen/SPARK-11652. (cherry picked from commit 9631ca35275b0ce8a5219f975907ac36ed11f528) Signed-off-by: Sean Owen <so...@cloudera.com> Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/34ded83e Tree:

spark git commit: [SPARK-11652][CORE] Remote code execution with InvokerTransformer

2015-11-18 Thread srowen
ses #9731 from srowen/SPARK-11652. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/9631ca35 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/9631ca35 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/9631ca35 Bra

spark git commit: [SPARK-11652][CORE] Remote code execution with InvokerTransformer

2015-11-18 Thread srowen
; Closes #9731 from srowen/SPARK-11652. (cherry picked from commit 9631ca35275b0ce8a5219f975907ac36ed11f528) Signed-off-by: Sean Owen <so...@cloudera.com> Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/0ed6d9cf Tree:

spark git commit: [SPARK-11906][WEB UI] Speculation Tasks Cause ProgressBar UI Overflow

2015-11-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master 12eea834d -> 800bd799a [SPARK-11906][WEB UI] Speculation Tasks Cause ProgressBar UI Overflow When there are speculative tasks in the stage, running progress bar could overflow and goes hidden on a new line:

spark git commit: [SPARK-11906][WEB UI] Speculation Tasks Cause ProgressBar UI Overflow

2015-11-24 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 e1b0a2376 -> 17ea95133 [SPARK-11906][WEB UI] Speculation Tasks Cause ProgressBar UI Overflow When there are speculative tasks in the stage, running progress bar could overflow and goes hidden on a new line:

spark git commit: [SPARK-12269][STREAMING][KINESIS] Update aws-java-sdk version

2016-01-11 Thread srowen
Repository: spark Updated Branches: refs/heads/master bd723bd53 -> 8fe928b4f [SPARK-12269][STREAMING][KINESIS] Update aws-java-sdk version The current Spark Streaming kinesis connector references a quite old version 1.9.40 of the AWS Java SDK (1.10.40 is current). Numerous AWS features

spark git commit: removed lambda from sortByKey()

2016-01-11 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 43b72d83e -> d4cfd2acd removed lambda from sortByKey() According to the documentation the sortByKey method does not take a lambda as an argument, thus the example is flawed. Removed the argument completely as this will default to

spark git commit: removed lambda from sortByKey()

2016-01-11 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 665aa47f8 -> 0e2aa4198 removed lambda from sortByKey() According to the documentation the sortByKey method does not take a lambda as an argument, thus the example is flawed. Removed the argument completely as this will default to

spark git commit: removed lambda from sortByKey()

2016-01-11 Thread srowen
Repository: spark Updated Branches: refs/heads/master f253feff6 -> bd723bd53 removed lambda from sortByKey() According to the documentation the sortByKey method does not take a lambda as an argument, thus the example is flawed. Removed the argument completely as this will default to

spark git commit: [SPARK-7615][MLLIB] MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero

2016-01-12 Thread srowen
<so...@cloudera.com> Closes #10696 from srowen/SPARK-7615. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/c48f2a3a Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/c48f2a3a Diff: http://git-wip-us.apache.org/repos/

spark git commit: [SPARK-5273][MLLIB][DOCS] Improve documentation examples for LinearRegression

2016-01-12 Thread srowen
hit this exact same issue when concocting an example and had the same solution. Author: Sean Owen <so...@cloudera.com> Closes #10675 from srowen/SPARK-5273. (cherry picked from commit 9c7f34af37ef328149c1d66b4689d80a1589e1cc) Signed-off-by: Sean Owen <so...@cloudera.com> Project:

spark git commit: [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate"

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 a6c9c68d8 -> 46fc7a12a [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate" Currently, RDD function aggregate's parameter doesn't explain well, especially parameter "zeroValue". It's helpful to

spark git commit: [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate"

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9c7f34af3 -> 9f0995bb0 [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate" Currently, RDD function aggregate's parameter doesn't explain well, especially parameter "zeroValue". It's helpful to let

spark git commit: [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 46fc7a12a -> 3221a7d91 [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows [SPARK-12582][Test] IndexShuffleBlockResolverSuite fails in windows * IndexShuffleBlockResolverSuite fails in windows due to file is not

spark git commit: [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9f0995bb0 -> 7e15044d9 [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows [SPARK-12582][Test] IndexShuffleBlockResolverSuite fails in windows * IndexShuffleBlockResolverSuite fails in windows due to file is not closed. *

spark git commit: [SPARK-7615][MLLIB] MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero

2016-01-12 Thread srowen
<so...@cloudera.com> Closes #10696 from srowen/SPARK-7615. (cherry picked from commit c48f2a3a5fd714ad2ff19b29337e55583988431e) Signed-off-by: Sean Owen <so...@cloudera.com> Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/

spark git commit: [SPARK-5273][MLLIB][DOCS] Improve documentation examples for LinearRegression

2016-01-12 Thread srowen
act same issue when concocting an example and had the same solution. Author: Sean Owen <so...@cloudera.com> Closes #10675 from srowen/SPARK-5273. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/9c7f34af Tree: http:

spark git commit: [SPARK-15707][SQL] Make Code Neat - Use map instead of if check.

2016-06-04 Thread srowen
Repository: spark Updated Branches: refs/heads/master 091f81e1f -> 0f307db5e [SPARK-15707][SQL] Make Code Neat - Use map instead of if check. ## What changes were proposed in this pull request? In forType function of object RandomDataGenerator, the code following: if

spark git commit: [SPARK-15707][SQL] Make Code Neat - Use map instead of if check.

2016-06-04 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 7e4c9dd55 -> 32a64d8fc [SPARK-15707][SQL] Make Code Neat - Use map instead of if check. ## What changes were proposed in this pull request? In forType function of object RandomDataGenerator, the code following: if

spark git commit: [MINOR][BUILD] Add modernizr MIT license; specify "2014 and onwards" in license copyright

2016-06-04 Thread srowen
ense. Specify "2014 and onwards" in copyright statement. ## How was this patch tested? (none required) Author: Sean Owen <so...@cloudera.com> Closes #13510 from srowen/ModernizrLicense. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apac

spark git commit: [MINOR][BUILD] Add modernizr MIT license; specify "2014 and onwards" in license copyright

2016-06-04 Thread srowen
ense. Specify "2014 and onwards" in copyright statement. ## How was this patch tested? (none required) Author: Sean Owen <so...@cloudera.com> Closes #13510 from srowen/ModernizrLicense. (cherry picked from commit 681387b2dc9a094cfba84188a1dd1ac9192bb99c) Signed-off-by:

spark git commit: [SPARK-15723] Fixed local-timezone-brittle test where short-timezone form "EST" is …

2016-06-05 Thread srowen
Repository: spark Updated Branches: refs/heads/master 0f307db5e -> 4e767d0f9 [SPARK-15723] Fixed local-timezone-brittle test where short-timezone form "EST" is … ## What changes were proposed in this pull request? Stop using the abbreviated and ambiguous timezone "EST" in a test, since it

spark git commit: [SPARK-15723] Fixed local-timezone-brittle test where short-timezone form "EST" is …

2016-06-05 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 32a64d8fc -> 8c0ec85e6 [SPARK-15723] Fixed local-timezone-brittle test where short-timezone form "EST" is … ## What changes were proposed in this pull request? Stop using the abbreviated and ambiguous timezone "EST" in a test,

spark git commit: [SPARK-15723] Fixed local-timezone-brittle test where short-timezone form "EST" is …

2016-06-05 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 a0cf7d0b2 -> 6a9f19dd5 [SPARK-15723] Fixed local-timezone-brittle test where short-timezone form "EST" is … ## What changes were proposed in this pull request? Stop using the abbreviated and ambiguous timezone "EST" in a test,

spark git commit: [DOCUMENTATION] Fixed target JAR path

2016-06-08 Thread srowen
Repository: spark Updated Branches: refs/heads/master f958c1c3e -> ca70ab27c [DOCUMENTATION] Fixed target JAR path ## What changes were proposed in this pull request? Mentioned Scala version in the sbt configuration file is 2.11, so the path of the target JAR should be

spark git commit: [DOCUMENTATION] Fixed target JAR path

2016-06-08 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 5e9a8e715 -> b2778c8bb [DOCUMENTATION] Fixed target JAR path ## What changes were proposed in this pull request? Mentioned Scala version in the sbt configuration file is 2.11, so the path of the target JAR should be

spark git commit: [SPARK-15818][BUILD] Upgrade to Hadoop 2.7.2

2016-06-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master 921fa40b1 -> 147c02082 [SPARK-15818][BUILD] Upgrade to Hadoop 2.7.2 ## What changes were proposed in this pull request? Updating the Hadoop version from 2.7.0 to 2.7.2 if we use the Hadoop-2.7 build profile ## How was this patch tested?

spark git commit: [SPARK-15818][BUILD] Upgrade to Hadoop 2.7.2

2016-06-09 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 8ee93eed9 -> 77c08d224 [SPARK-15818][BUILD] Upgrade to Hadoop 2.7.2 ## What changes were proposed in this pull request? Updating the Hadoop version from 2.7.0 to 2.7.2 if we use the Hadoop-2.7 build profile ## How was this patch

spark git commit: [SPARK-12655][GRAPHX] GraphX does not unpersist RDDs

2016-06-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 6a9f19dd5 -> 5830828ef [SPARK-12655][GRAPHX] GraphX does not unpersist RDDs Some VertexRDD and EdgeRDD are created during the intermediate step of g.connectedComponents() but unnecessarily left cached after the method is done. The

spark git commit: [MINOR] Fix Typos 'an -> a'

2016-06-06 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 7d10e4bdd -> 90e94b826 [MINOR] Fix Typos 'an -> a' ## What changes were proposed in this pull request? `an -> a` Use cmds like `find . -name '*.R' | xargs -i sh -c "grep -in ' an [^aeiou]' {} && echo {}"` to generate candidates, and

spark git commit: [MINOR] Fix Typos 'an -> a'

2016-06-06 Thread srowen
Repository: spark Updated Branches: refs/heads/master 32f2f95db -> fd8af3971 [MINOR] Fix Typos 'an -> a' ## What changes were proposed in this pull request? `an -> a` Use cmds like `find . -name '*.R' | xargs -i sh -c "grep -in ' an [^aeiou]' {} && echo {}"` to generate candidates, and

spark git commit: [SPARK-15771][ML][EXAMPLES] Use 'accuracy' rather than 'precision' in many ML examples

2016-06-06 Thread srowen
Repository: spark Updated Branches: refs/heads/master fd8af3971 -> a95252823 [SPARK-15771][ML][EXAMPLES] Use 'accuracy' rather than 'precision' in many ML examples ## What changes were proposed in this pull request? Since [SPARK-15617](https://issues.apache.org/jira/browse/SPARK-15617)

spark git commit: [SPARK-15771][ML][EXAMPLES] Use 'accuracy' rather than 'precision' in many ML examples

2016-06-06 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 90e94b826 -> 86a35a229 [SPARK-15771][ML][EXAMPLES] Use 'accuracy' rather than 'precision' in many ML examples ## What changes were proposed in this pull request? Since [SPARK-15617](https://issues.apache.org/jira/browse/SPARK-15617)

spark git commit: [SPARK-15793][ML] Add maxSentenceLength for ml.Word2Vec

2016-06-08 Thread srowen
Repository: spark Updated Branches: refs/heads/master 91fbc880b -> 87706eb66 [SPARK-15793][ML] Add maxSentenceLength for ml.Word2Vec ## What changes were proposed in this pull request? https://issues.apache.org/jira/browse/SPARK-15793 Word2vec in ML package should have maxSentenceLength

svn commit: r1747385 - in /spark: ./ site/ site/docs/ site/docs/2.0.0-preview/ site/docs/2.0.0-preview/api/ site/docs/2.0.0-preview/api/R/ site/docs/2.0.0-preview/api/java/ site/docs/2.0.0-preview/api

2016-06-08 Thread srowen
Author: srowen Date: Wed Jun 8 12:04:28 2016 New Revision: 1747385 URL: http://svn.apache.org/viewvc?rev=1747385=rev Log: Uploaded Spark 2.0.0 preview docs and added preview docs section on site [This commit notification would consist of 1214 parts, which exceeds the limit of 50 ones, so

spark git commit: [MINOR] fix typo in documents

2016-06-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 57dd4efcd -> a7e9e60df [MINOR] fix typo in documents ## What changes were proposed in this pull request? I use spell check tools checks typo in spark documents and fix them. ## How was this patch tested? N/A Author: WeichenXu

spark git commit: [MINOR] fix typo in documents

2016-06-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master 5f731d685 -> 1e2c93118 [MINOR] fix typo in documents ## What changes were proposed in this pull request? I use spell check tools checks typo in spark documents and fix them. ## How was this patch tested? N/A Author: WeichenXu

spark git commit: [SPARK-14900][ML][PYSPARK] Add accuracy and deprecate precison, recall, f1

2016-06-06 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 86a35a229 -> e38ff70e6 [SPARK-14900][ML][PYSPARK] Add accuracy and deprecate precison,recall,f1 ## What changes were proposed in this pull request? 1, add accuracy for MulticlassMetrics 2, deprecate overall precision,recall,f1 and

spark git commit: [SPARK-14900][ML][PYSPARK] Add accuracy and deprecate precison, recall, f1

2016-06-06 Thread srowen
Repository: spark Updated Branches: refs/heads/master a95252823 -> 00ad4f054 [SPARK-14900][ML][PYSPARK] Add accuracy and deprecate precison,recall,f1 ## What changes were proposed in this pull request? 1, add accuracy for MulticlassMetrics 2, deprecate overall precision,recall,f1 and

svn commit: r1747061 - in /spark: downloads.md js/downloads.js site/downloads.html site/js/downloads.js

2016-06-06 Thread srowen
Author: srowen Date: Mon Jun 6 19:56:07 2016 New Revision: 1747061 URL: http://svn.apache.org/viewvc?rev=1747061=rev Log: SPARK-15778 add spark-2.0.0-preview release to options and other minor related updates Modified: spark/downloads.md spark/js/downloads.js spark/site

svn commit: r1747076 - in /spark: js/downloads.js site/js/downloads.js

2016-06-06 Thread srowen
Author: srowen Date: Mon Jun 6 20:59:54 2016 New Revision: 1747076 URL: http://svn.apache.org/viewvc?rev=1747076=rev Log: SPARK-15778 part 2: group preview/stable releases in download version dropdown Modified: spark/js/downloads.js spark/site/js/downloads.js Modified: spark/js

spark git commit: [SPARK-15878][CORE][TEST] fix cleanup in EventLoggingListenerSuite and ReplayListenerSuite

2016-06-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 d494a483a -> 879e8fd09 [SPARK-15878][CORE][TEST] fix cleanup in EventLoggingListenerSuite and ReplayListenerSuite ## What changes were proposed in this pull request? These tests weren't properly using `LocalSparkContext` so weren't

spark git commit: [SPARK-15806][DOCUMENTATION] update doc for SPARK_MASTER_IP

2016-06-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 3fd3ee038 -> 50248dcff [SPARK-15806][DOCUMENTATION] update doc for SPARK_MASTER_IP ## What changes were proposed in this pull request? SPARK_MASTER_IP is a deprecated environment variable. It is replaced by SPARK_MASTER_HOST according to

spark git commit: [SPARK-15806][DOCUMENTATION] update doc for SPARK_MASTER_IP

2016-06-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 8c294f4ad -> b75d1c201 [SPARK-15806][DOCUMENTATION] update doc for SPARK_MASTER_IP ## What changes were proposed in this pull request? SPARK_MASTER_IP is a deprecated environment variable. It is replaced by SPARK_MASTER_HOST

spark git commit: [SPARK-15781][DOCUMENTATION] remove deprecated environment variable doc

2016-06-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 8cc22b008 -> 3fd3ee038 [SPARK-15781][DOCUMENTATION] remove deprecated environment variable doc ## What changes were proposed in this pull request? Like `SPARK_JAVA_OPTS` and `SPARK_CLASSPATH`, we will remove the document for

spark git commit: [SPARK-15781][DOCUMENTATION] remove deprecated environment variable doc

2016-06-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 879e8fd09 -> 8c294f4ad [SPARK-15781][DOCUMENTATION] remove deprecated environment variable doc ## What changes were proposed in this pull request? Like `SPARK_JAVA_OPTS` and `SPARK_CLASSPATH`, we will remove the document for

spark git commit: [SPARK-15878][CORE][TEST] fix cleanup in EventLoggingListenerSuite and ReplayListenerSuite

2016-06-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9e204c62c -> 8cc22b008 [SPARK-15878][CORE][TEST] fix cleanup in EventLoggingListenerSuite and ReplayListenerSuite ## What changes were proposed in this pull request? These tests weren't properly using `LocalSparkContext` so weren't

spark git commit: [SPARK-15837][ML][PYSPARK] Word2vec python add maxsentence parameter

2016-06-10 Thread srowen
Repository: spark Updated Branches: refs/heads/master 16ca32eac -> cdd7f5a57 [SPARK-15837][ML][PYSPARK] Word2vec python add maxsentence parameter ## What changes were proposed in this pull request? Word2vec python add maxsentence parameter. ## How was this patch tested? Existing test.

spark git commit: [SPARK-15837][ML][PYSPARK] Word2vec python add maxsentence parameter

2016-06-10 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 6709ce1ae -> 54b4763d2 [SPARK-15837][ML][PYSPARK] Word2vec python add maxsentence parameter ## What changes were proposed in this pull request? Word2vec python add maxsentence parameter. ## How was this patch tested? Existing test.

spark git commit: [SPARK-15823][PYSPARK][ML] Add @property for 'accuracy' in MulticlassMetrics

2016-06-10 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 84a8421e5 -> 6709ce1ae [SPARK-15823][PYSPARK][ML] Add @property for 'accuracy' in MulticlassMetrics ## What changes were proposed in this pull request? `accuracy` should be decorated with `property` to keep step with other methods in

spark git commit: [SPARK-15823][PYSPARK][ML] Add @property for 'accuracy' in MulticlassMetrics

2016-06-10 Thread srowen
Repository: spark Updated Branches: refs/heads/master 675a73715 -> 16ca32eac [SPARK-15823][PYSPARK][ML] Add @property for 'accuracy' in MulticlassMetrics ## What changes were proposed in this pull request? `accuracy` should be decorated with `property` to keep step with other methods in

spark git commit: Log warnings for numIterations * miniBatchFraction < 1.0

2016-05-25 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9c297df3d -> 589cce93c Log warnings for numIterations * miniBatchFraction < 1.0 ## What changes were proposed in this pull request? Add a warning log for the case that `numIterations * miniBatchFraction <1.0` during gradient descent. If

spark git commit: Log warnings for numIterations * miniBatchFraction < 1.0

2016-05-25 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 f7158c482 -> 0064a4dcb Log warnings for numIterations * miniBatchFraction < 1.0 ## What changes were proposed in this pull request? Add a warning log for the case that `numIterations * miniBatchFraction <1.0` during gradient descent.

spark git commit: [SPARK-15610][ML] update error message for k in pca

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 88c9c467a -> 9893dc975 [SPARK-15610][ML] update error message for k in pca ## What changes were proposed in this pull request? Fix the wrong bound of `k` in `PCA` `require(k <= sources.first().size, ...` -> `require(k <

spark git commit: [SPARK-15610][ML] update error message for k in pca

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 6d82e0c1b -> 3801fb4f3 [SPARK-15610][ML] update error message for k in pca ## What changes were proposed in this pull request? Fix the wrong bound of `k` in `PCA` `require(k <= sources.first().size, ...` -> `require(k <

spark git commit: [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4a2fb8b87 -> 5d4dafe8f [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) In the MLLib naivebayes example, scala and

spark git commit: [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 8467e2102 -> 6d82e0c1b [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample ## What changes were proposed in this pull request? Temp directory used to save records is not deleted after program exit in

spark git commit: [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 5d4dafe8f -> 88c9c467a [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample ## What changes were proposed in this pull request? Temp directory used to save records is not deleted after program exit in

spark git commit: [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 80a40e8e2 -> 8467e2102 [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) In the MLLib naivebayes example, scala and

spark git commit: [SPARK-15813] Improve Canceling log message to make it less ambiguous

2016-06-13 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 b96e7f6aa -> 41f309bfb [SPARK-15813] Improve Canceling log message to make it less ambiguous ## What changes were proposed in this pull request? Add new desired executor number to make the log message less ambiguous. ## How was this

spark git commit: [SPARK-15813] Improve Canceling log message to make it less ambiguous

2016-06-13 Thread srowen
Repository: spark Updated Branches: refs/heads/master e2ab79d5e -> d681742b2 [SPARK-15813] Improve Canceling log message to make it less ambiguous ## What changes were proposed in this pull request? Add new desired executor number to make the log message less ambiguous. ## How was this patch

spark git commit: [SPARK-15883][MLLIB][DOCS] Fix broken links in mllib documents

2016-06-11 Thread srowen
Repository: spark Updated Branches: refs/heads/master 3761330dd -> ad102af16 [SPARK-15883][MLLIB][DOCS] Fix broken links in mllib documents ## What changes were proposed in this pull request? This issue fixes all broken links on Spark 2.0 preview MLLib documents. Also, this contains some

spark git commit: [SPARK-15879][DOCS][UI] Update logo in UI and docs to add "Apache"

2016-06-11 Thread srowen
emove old unreferenced logo files. ## How was this patch tested? Manual check of generated HTML site and Spark UI. I searched for references to the deleted files to make sure they were not used. Author: Sean Owen <so...@cloudera.com> Closes #13609 from srowen/SPARK-15879

spark git commit: [SPARK-15879][DOCS][UI] Update logo in UI and docs to add "Apache"

2016-06-11 Thread srowen
old unreferenced logo files. ## How was this patch tested? Manual check of generated HTML site and Spark UI. I searched for references to the deleted files to make sure they were not used. Author: Sean Owen <so...@cloudera.com> Closes #13609 from srowen/SPARK-15879. Project: http://gi

spark git commit: [SPARK-15883][MLLIB][DOCS] Fix broken links in mllib documents

2016-06-11 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 4c29c55f2 -> 8cf33fb8a [SPARK-15883][MLLIB][DOCS] Fix broken links in mllib documents ## What changes were proposed in this pull request? This issue fixes all broken links on Spark 2.0 preview MLLib documents. Also, this contains

spark git commit: [DOCUMENTATION] fixed typos in python programming guide

2016-06-14 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 974be6241 -> cf52375b9 [DOCUMENTATION] fixed typos in python programming guide ## What changes were proposed in this pull request? minor typo ## How was this patch tested? minor typo in the doc, should be self explanatory Author:

spark git commit: [DOCUMENTATION] fixed typos in python programming guide

2016-06-14 Thread srowen
Repository: spark Updated Branches: refs/heads/master 688b6ef9d -> a87a56f5c [DOCUMENTATION] fixed typos in python programming guide ## What changes were proposed in this pull request? minor typo ## How was this patch tested? minor typo in the doc, should be self explanatory Author:

spark git commit: [SPARK-15821][DOCS] Include parallel build info

2016-06-14 Thread srowen
Repository: spark Updated Branches: refs/heads/master 96c3500c6 -> a431e3f1f [SPARK-15821][DOCS] Include parallel build info ## What changes were proposed in this pull request? We should mention that users can build Spark using multiple threads to decrease build times; either here or in

spark git commit: doc fix of HiveThriftServer

2016-06-14 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 0d80bc291 -> e90ba2287 doc fix of HiveThriftServer ## What changes were proposed in this pull request? Just minor doc fix. \cc yhuai Author: Jeff Zhang Closes #13659 from zjffdu/doc_fix. (cherry picked from

spark git commit: doc fix of HiveThriftServer

2016-06-14 Thread srowen
Repository: spark Updated Branches: refs/heads/master a431e3f1f -> 53bb03084 doc fix of HiveThriftServer ## What changes were proposed in this pull request? Just minor doc fix. \cc yhuai Author: Jeff Zhang Closes #13659 from zjffdu/doc_fix. Project:

spark git commit: [MINOR] Clean up several build warnings, mostly due to internal use of old accumulators

2016-06-14 Thread srowen
API usage in tests that are straightforward to update. I think this qualifies as "minor". Jenkins Author: Sean Owen <so...@cloudera.com> Closes #13642 from srowen/BuildWarnings. (cherry picked from commit 6151d2641f91c8e3ec0c324e78afb46cdb2ef111) Signed-off-by: Sean Owen <so...@c

spark git commit: [SPARK-15617][ML][DOC] Clarify that fMeasure in MulticlassMetrics is "micro" f1_score

2016-06-04 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 cf8782116 -> 729730159 [SPARK-15617][ML][DOC] Clarify that fMeasure in MulticlassMetrics is "micro" f1_score ## What changes were proposed in this pull request? 1, del precision,recall in `ml.MulticlassClassificationEvaluator` 2,

spark git commit: [SPARK-15617][ML][DOC] Clarify that fMeasure in MulticlassMetrics is "micro" f1_score

2016-06-04 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2ca563cc4 -> 2099e05f9 [SPARK-15617][ML][DOC] Clarify that fMeasure in MulticlassMetrics is "micro" f1_score ## What changes were proposed in this pull request? 1, del precision,recall in `ml.MulticlassClassificationEvaluator` 2, update

spark git commit: [MINOR] Fix Java Lint errors introduced by #13286 and #13280

2016-06-08 Thread srowen
Repository: spark Updated Branches: refs/heads/master 87706eb66 -> f958c1c3e [MINOR] Fix Java Lint errors introduced by #13286 and #13280 ## What changes were proposed in this pull request? revived #13464 Fix Java Lint errors introduced by #13286 and #13280 Before: ``` Using `mvn` from

<    3   4   5   6   7   8   9   10   11   12   >