git commit: [SPARK-1022][Streaming][HOTFIX] Fixed zookeeper dependency of Kafka

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.1 aec217a58 -> 5b4bc84ed [SPARK-1022][Streaming][HOTFIX] Fixed zookeeper dependency of Kafka https://github.com/apache/spark/pull/1751 caused maven builds to fail. ``` ~/Apache/spark(branch-1.1|✔) ➤ mvn -U -DskipTests clean install .

git commit: [SPARK-1022][Streaming][HOTFIX] Fixed zookeeper dependency of Kafka

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master c7b52010d -> ee7f30856 [SPARK-1022][Streaming][HOTFIX] Fixed zookeeper dependency of Kafka https://github.com/apache/spark/pull/1751 caused maven builds to fail. ``` ~/Apache/spark(branch-1.1|✔) ➤ mvn -U -DskipTests clean install . . .

git commit: [MLlib] Use this.type as return type in k-means' builder pattern

2014-08-05 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.1 1da2fdfab -> aec217a58 [MLlib] Use this.type as return type in k-means' builder pattern to ensure that the return object is itself. Author: DB Tsai Closes #1796 from dbtsai/dbtsai-kmeans and squashes the following commits: 658989e [

git commit: [MLlib] Use this.type as return type in k-means' builder pattern

2014-08-05 Thread meng
Repository: spark Updated Branches: refs/heads/master 63bdb1f41 -> c7b52010d [MLlib] Use this.type as return type in k-means' builder pattern to ensure that the return object is itself. Author: DB Tsai Closes #1796 from dbtsai/dbtsai-kmeans and squashes the following commits: 658989e [DB T
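
The idea of a builder whose setters return the receiver itself can be sketched outside Scala as well. The following is only a rough Python analogue, not the MLlib code: the class names `KMeansParams` and `StreamingKMeansParams` and all setter names are hypothetical, and a bound `TypeVar` on `self` stands in for Scala's `this.type`.

```python
from typing import TypeVar

# Bound type variable: each setter is typed as returning the receiver's own type.
T = TypeVar("T", bound="KMeansParams")


class KMeansParams:
    """Hypothetical stand-in for a k-means configuration builder."""

    def __init__(self) -> None:
        self.k = 2
        self.max_iterations = 20

    def set_k(self: T, k: int) -> T:
        self.k = k
        return self  # return the receiver itself, not a copy

    def set_max_iterations(self: T, n: int) -> T:
        self.max_iterations = n
        return self


class StreamingKMeansParams(KMeansParams):
    """Hypothetical subclass that adds one more setter."""

    def __init__(self) -> None:
        super().__init__()
        self.decay = 1.0

    def set_decay(self: T, decay: float) -> T:
        self.decay = decay
        return self


# Chaining through an inherited setter still yields the subclass instance.
params = StreamingKMeansParams().set_k(5).set_decay(0.5)
```

Because every setter returns `self`, a chain that starts on a subclass instance can keep calling subclass setters after an inherited one.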

git commit: SPARK-2294: fix locality inversion bug in TaskManager

2014-08-05 Thread matei
Repository: spark Updated Branches: refs/heads/branch-1.1 048205549 -> 1da2fdfab SPARK-2294: fix locality inversion bug in TaskManager copied from original JIRA (https://issues.apache.org/jira/browse/SPARK-2294): If an executor E is free, a task may be speculatively assigned to E when there

git commit: SPARK-2294: fix locality inversion bug in TaskManager

2014-08-05 Thread matei
Repository: spark Updated Branches: refs/heads/master 5a826c00c -> 63bdb1f41 SPARK-2294: fix locality inversion bug in TaskManager copied from original JIRA (https://issues.apache.org/jira/browse/SPARK-2294): If an executor E is free, a task may be speculatively assigned to E when there are

git commit: [SQL] Fix logging warn -> debug

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.1 4f0b4f4cc -> 048205549 [SQL] Fix logging warn -> debug Author: Michael Armbrust Closes #1800 from marmbrus/warning and squashes the following commits: 8ea9cf1 [Michael Armbrust] [SQL] Fix logging warn -> debug. (cherry picked from c

git commit: [SQL] Fix logging warn -> debug

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master b70bae40e -> 5a826c00c [SQL] Fix logging warn -> debug Author: Michael Armbrust Closes #1800 from marmbrus/warning and squashes the following commits: 8ea9cf1 [Michael Armbrust] [SQL] Fix logging warn -> debug. Project: http://git-wip-

git commit: [SQL] Tighten the visibility of various SQLConf methods and renamed setter/getters

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.1 6da8f41fc -> 4f0b4f4cc [SQL] Tighten the visibility of various SQLConf methods and renamed setter/getters Author: Reynold Xin Closes #1794 from rxin/sql-conf and squashes the following commits: 3ac11ef [Reynold Xin] getAllConfs retu

git commit: [SQL] Tighten the visibility of various SQLConf methods and renamed setter/getters

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 82624e2cf -> b70bae40e [SQL] Tighten the visibility of various SQLConf methods and renamed setter/getters Author: Reynold Xin Closes #1794 from rxin/sql-conf and squashes the following commits: 3ac11ef [Reynold Xin] getAllConfs return a

git commit: [SPARK-2806] core - upgrade to json4s-jackson 3.2.10

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.1 936f61ef0 -> 6da8f41fc [SPARK-2806] core - upgrade to json4s-jackson 3.2.10 Scala 2.11 packages not available for the current version (3.2.6) Signed-off-by: Anand Avati Author: Anand Avati Closes #1702 from avati/SPARK-1812-json4s-

git commit: [SPARK-2806] core - upgrade to json4s-jackson 3.2.10

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 1d70c4f66 -> 82624e2cf [SPARK-2806] core - upgrade to json4s-jackson 3.2.10 Scala 2.11 packages not available for the current version (3.2.6) Signed-off-by: Anand Avati Author: Anand Avati Closes #1702 from avati/SPARK-1812-json4s-jack

git commit: [SPARK-2866][SQL] Support attributes in ORDER BY that aren't in SELECT

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.1 a10e1b073 -> 936f61ef0 [SPARK-2866][SQL] Support attributes in ORDER BY that aren't in SELECT Minor refactoring to allow resolution either using a node's input or output. Author: Michael Armbrust Closes #1795 from marmbrus/ordering an

git commit: [SPARK-2866][SQL] Support attributes in ORDER BY that aren't in SELECT

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 69ec678d3 -> 1d70c4f66 [SPARK-2866][SQL] Support attributes in ORDER BY that aren't in SELECT Minor refactoring to allow resolution either using a node's input or output. Author: Michael Armbrust Closes #1795 from marmbrus/ordering and sq

git commit: [SPARK-2854][SQL] Finalize _acceptable_types in pyspark.sql

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.1 4233b0214 -> a10e1b073 [SPARK-2854][SQL] Finalize _acceptable_types in pyspark.sql This PR aims to finalize accepted data value types in Python RDDs provided to Python `applySchema`. JIRA: https://issues.apache.org/jira/browse/SPARK-2

git commit: [SPARK-2854][SQL] Finalize _acceptable_types in pyspark.sql

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d0ae3f391 -> 69ec678d3 [SPARK-2854][SQL] Finalize _acceptable_types in pyspark.sql This PR aims to finalize accepted data value types in Python RDDs provided to Python `applySchema`. JIRA: https://issues.apache.org/jira/browse/SPARK-2854

git commit: [SPARK-2650][SQL] Try to partially fix SPARK-2650 by adjusting initial buffer size and reducing memory allocation

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.1 152e36c8d -> 4233b0214 [SPARK-2650][SQL] Try to partially fix SPARK-2650 by adjusting initial buffer size and reducing memory allocation JIRA issue: [SPARK-2650](https://issues.apache.org/jira/browse/SPARK-2650) Please refer to [comm

git commit: [SPARK-2650][SQL] Try to partially fix SPARK-2650 by adjusting initial buffer size and reducing memory allocation

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d94f5990e -> d0ae3f391 [SPARK-2650][SQL] Try to partially fix SPARK-2650 by adjusting initial buffer size and reducing memory allocation JIRA issue: [SPARK-2650](https://issues.apache.org/jira/browse/SPARK-2650) Please refer to [comments

git commit: [sql] rename project name in pom.xml of hive-thriftserver module

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.1 58247a8a0 -> 152e36c8d [sql] rename project name in pom.xml of hive-thriftserver module module spark-hive-thriftserver_2.10 and spark-hive_2.10 both named "Spark Project Hive" in pom.xml, so rename spark-hive-thriftserver_2.10 project

git commit: [sql] rename project name in pom.xml of hive-thriftserver module

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 2643e6600 -> d94f5990e [sql] rename project name in pom.xml of hive-thriftserver module module spark-hive-thriftserver_2.10 and spark-hive_2.10 both named "Spark Project Hive" in pom.xml, so rename spark-hive-thriftserver_2.10 project name

git commit: SPARK-2869 - Fix tiny bug in JdbcRdd for closing jdbc connection

2014-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.0 d415f883e -> b18046791 SPARK-2869 - Fix tiny bug in JdbcRdd for closing jdbc connection I inquired on dev mailing list about the motivation for checking the jdbc statement instead of the connection in the close() logic of JdbcRDD. Ted

git commit: SPARK-2869 - Fix tiny bug in JdbcRdd for closing jdbc connection

2014-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.1 672904eed -> 58247a8a0 SPARK-2869 - Fix tiny bug in JdbcRdd for closing jdbc connection I inquired on dev mailing list about the motivation for checking the jdbc statement instead of the connection in the close() logic of JdbcRDD. Ted

git commit: SPARK-2869 - Fix tiny bug in JdbcRdd for closing jdbc connection

2014-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master 1aad9114c -> 2643e6600 SPARK-2869 - Fix tiny bug in JdbcRdd for closing jdbc connection I inquired on dev mailing list about the motivation for checking the jdbc statement instead of the connection in the close() logic of JdbcRDD. Ted Yu

svn commit: r1616050 - in /spark: news/_posts/2014-08-05-spark-1-0-2-released.md site/news/index.html site/news/spark-1-0-2-released.html

2014-08-05 Thread tdas
Author: tdas Date: Tue Aug 5 23:43:56 2014 New Revision: 1616050 URL: http://svn.apache.org/r1616050 Log: Small update to Spark 1.0.2. Modified: spark/news/_posts/2014-08-05-spark-1-0-2-released.md spark/site/news/index.html spark/site/news/spark-1-0-2-released.html Modified: spark/

svn commit: r1616049 - in /spark: ./ _layouts/ news/_posts/ releases/_posts/ site/ site/docs/ site/docs/1.0.2/ site/docs/1.0.2/api/ site/docs/1.0.2/api/java/ site/docs/1.0.2/api/java/org/ site/docs/1.

2014-08-05 Thread tdas
Author: tdas Date: Tue Aug 5 23:40:54 2014 New Revision: 1616049 URL: http://svn.apache.org/r1616049 Log: Adding Spark 1.0.2 [This commit notification would consist of 386 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.] -

git commit: [SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pyspark's linear methods

2014-08-05 Thread meng
Repository: spark Updated Branches: refs/heads/master acff9a7f1 -> 1aad9114c [SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pyspark's linear methods Related to Jira Issue: [SPARK-2550](https://issues.apache.org/jira/browse/SPARK-2550?jql=project%20%3D%20SPARK%20AND

git commit: [SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pyspark's linear methods

2014-08-05 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.1 0172277d2 -> 672904eed [SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pyspark's linear methods Related to Jira Issue: [SPARK-2550](https://issues.apache.org/jira/browse/SPARK-2550?jql=project%20%3D%20SPARK%2

git commit: [SPARK-2503] Lower shuffle output buffer (spark.shuffle.file.buffer.kb) to 32KB.

2014-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.1 5081b0da0 -> 0172277d2 [SPARK-2503] Lower shuffle output buffer (spark.shuffle.file.buffer.kb) to 32KB. This can substantially reduce memory usage during shuffle. Author: Reynold Xin Closes #1781 from rxin/SPARK-2503-spark.shuffle.fi

git commit: [SPARK-2503] Lower shuffle output buffer (spark.shuffle.file.buffer.kb) to 32KB.

2014-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master cc491f69c -> acff9a7f1 [SPARK-2503] Lower shuffle output buffer (spark.shuffle.file.buffer.kb) to 32KB. This can substantially reduce memory usage during shuffle. Author: Reynold Xin Closes #1781 from rxin/SPARK-2503-spark.shuffle.file.b

git commit: [SPARK-2856] Decrease initial buffer size for Kryo to 64KB.

2014-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.1 e77fa81a2 -> 5081b0da0 [SPARK-2856] Decrease initial buffer size for Kryo to 64KB. Author: Reynold Xin Closes #1780 from rxin/kryo-init-size and squashes the following commits: 551b935 [Reynold Xin] [SPARK-2856] Decrease initial buff

git commit: [SPARK-2864][MLLIB] fix random seed in word2vec; move model to local

2014-08-05 Thread meng
Repository: spark Updated Branches: refs/heads/master 41e0a21b2 -> cc491f69c [SPARK-2864][MLLIB] fix random seed in word2vec; move model to local It also moves the model to local in order to map `RDD[String]` to `RDD[Vector]`. Ishiihara Author: Xiangrui Meng Closes #1790 from mengxr/word2v

git commit: [SPARK-2864][MLLIB] fix random seed in word2vec; move model to local

2014-08-05 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.1 7b798e10e -> e77fa81a2 [SPARK-2864][MLLIB] fix random seed in word2vec; move model to local It also moves the model to local in order to map `RDD[String]` to `RDD[Vector]`. Ishiihara Author: Xiangrui Meng Closes #1790 from mengxr/wo

git commit: SPARK-1680: use configs for specifying environment variables on YARN

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.1 46b698307 -> 7b798e10e SPARK-1680: use configs for specifying environment variables on YARN Note that this also documents spark.executorEnv.* which to me means it's public. If we don't want that please speak up. Author: Thomas Graves

git commit: SPARK-1680: use configs for specifying environment variables on YARN

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 74f82c71b -> 41e0a21b2 SPARK-1680: use configs for specifying environment variables on YARN Note that this also documents spark.executorEnv.* which to me means it's public. If we don't want that please speak up. Author: Thomas Graves C

git commit: SPARK-2380: Support displaying accumulator values in the web UI

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.1 0f541abe7 -> 46b698307 SPARK-2380: Support displaying accumulator values in the web UI This patch adds support for giving accumulators user-visible names and displaying accumulator values in the web UI. This allows users to create cust

git commit: SPARK-2380: Support displaying accumulator values in the web UI

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master ac3440f4f -> 74f82c71b SPARK-2380: Support displaying accumulator values in the web UI This patch adds support for giving accumulators user-visible names and displaying accumulator values in the web UI. This allows users to create custom

git commit: [SPARK-2859] Update url of Kryo project in related docs

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 f536caa1b -> d415f883e [SPARK-2859] Update url of Kryo project in related docs JIRA Issue: https://issues.apache.org/jira/browse/SPARK-2859 Kryo project has been migrated from googlecode to github, hence we need to update its URL in r

git commit: [SPARK-2859] Update url of Kryo project in related docs

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.1 388ab534b -> 0f541abe7 [SPARK-2859] Update url of Kryo project in related docs JIRA Issue: https://issues.apache.org/jira/browse/SPARK-2859 Kryo project has been migrated from googlecode to github, hence we need to update its URL in r

git commit: [SPARK-2859] Update url of Kryo project in related docs

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 6e821e3d1 -> ac3440f4f [SPARK-2859] Update url of Kryo project in related docs JIRA Issue: https://issues.apache.org/jira/browse/SPARK-2859 Kryo project has been migrated from googlecode to github, hence we need to update its URL in relat

git commit: [SPARK-2860][SQL] Fix coercion of CASE WHEN.

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 1ca23 -> 6e821e3d1 [SPARK-2860][SQL] Fix coercion of CASE WHEN. Author: Michael Armbrust Closes #1785 from marmbrus/caseNull and squashes the following commits: 126006d [Michael Armbrust] better error message 2fe357f [Michael Armbrus

git commit: [SPARK-2860][SQL] Fix coercion of CASE WHEN.

2014-08-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.1 e3fe6571d -> 388ab534b [SPARK-2860][SQL] Fix coercion of CASE WHEN. Author: Michael Armbrust Closes #1785 from marmbrus/caseNull and squashes the following commits: 126006d [Michael Armbrust] better error message 2fe357f [Michael Arm

git commit: SPARK-1890 and SPARK-1891- add admin and modify acls

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.1 6c0c65fc8 -> e3fe6571d SPARK-1890 and SPARK-1891- add admin and modify acls It was easier to combine these two JIRAs since they touch many of the same places. This PR adds the following: - adds modify acls - adds admin acls (list of adm

git commit: SPARK-1890 and SPARK-1891- add admin and modify acls

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 2c0f705e2 -> 1ca23 SPARK-1890 and SPARK-1891- add admin and modify acls It was easier to combine these two JIRAs since they touch many of the same places. This PR adds the following: - adds modify acls - adds admin acls (list of admins/

git commit: SPARK-1528 - spark on yarn, add support for accessing remote HDFS

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.1 b92a45058 -> 6c0c65fc8 SPARK-1528 - spark on yarn, add support for accessing remote HDFS Add a config (spark.yarn.access.namenodes) to allow applications running on YARN to access other secure HDFS clusters. User just specifies the nam

git commit: SPARK-1528 - spark on yarn, add support for accessing remote HDFS

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/master e87075df9 -> 2c0f705e2 SPARK-1528 - spark on yarn, add support for accessing remote HDFS Add a config (spark.yarn.access.namenodes) to allow applications running on YARN to access other secure HDFS clusters. User just specifies the namenod

git commit: [SPARK-1022][Streaming] Add Kafka real unit test

2014-08-05 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-1.1 075ba6781 -> b92a45058 [SPARK-1022][Streaming] Add Kafka real unit test This PR is an updated version of (https://github.com/apache/spark/pull/557) to actually test sending and receiving data through Kafka, and fix previous flaky issue

git commit: [SPARK-1022][Streaming] Add Kafka real unit test

2014-08-05 Thread tdas
Repository: spark Updated Branches: refs/heads/master 184048f80 -> e87075df9 [SPARK-1022][Streaming] Add Kafka real unit test This PR is an updated version of (https://github.com/apache/spark/pull/557) to actually test sending and receiving data through Kafka, and fix previous flaky issues.

git commit: [SPARK-2856] Decrease initial buffer size for Kryo to 64KB.

2014-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master 9862c614c -> 184048f80 [SPARK-2856] Decrease initial buffer size for Kryo to 64KB. Author: Reynold Xin Closes #1780 from rxin/kryo-init-size and squashes the following commits: 551b935 [Reynold Xin] [SPARK-2856] Decrease initial buffer s

git commit: [SPARK-1779] Throw an exception if memory fractions are not between 0 and 1

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.1 12f99cf5f -> 075ba6781 [SPARK-1779] Throw an exception if memory fractions are not between 0 and 1 Author: wangfei Author: wangfei Closes #714 from scwf/memoryFraction and squashes the following commits: 6e385b9 [wangfei] Update Spa

git commit: [SPARK-1779] Throw an exception if memory fractions are not between 0 and 1

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master a646a365e -> 9862c614c [SPARK-1779] Throw an exception if memory fractions are not between 0 and 1 Author: wangfei Author: wangfei Closes #714 from scwf/memoryFraction and squashes the following commits: 6e385b9 [wangfei] Update SparkCo
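
The check this commit describes (reject memory fractions outside the range 0 to 1) can be illustrated with a small standalone sketch. This is not the Spark implementation: the function name `validate_memory_fractions` is hypothetical, the two keys are used only as illustrative examples, and treating the bounds as inclusive is an assumption, since the snippet above does not say.

```python
# Illustrative keys only; the commit's actual list of checked settings is not shown above.
FRACTION_KEYS = (
    "spark.storage.memoryFraction",
    "spark.shuffle.memoryFraction",
)


def validate_memory_fractions(conf, keys=FRACTION_KEYS):
    """Raise ValueError if any listed fraction in `conf` is outside [0, 1].

    `conf` is a plain dict of string settings, standing in for a real
    configuration object. Keys that are absent are skipped.
    """
    for key in keys:
        if key not in conf:
            continue
        value = float(conf[key])  # a non-numeric string also raises ValueError
        # Assumption: both bounds are inclusive. NaN fails this comparison too.
        if not 0.0 <= value <= 1.0:
            raise ValueError(
                "%s should be between 0 and 1 (was %r)" % (key, conf[key])
            )


# A valid setting passes silently.
validate_memory_fractions({"spark.storage.memoryFraction": "0.6"})
```

Failing fast at configuration time surfaces a bad value next to its key name, rather than as an obscure memory problem later in a job.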

git commit: [SPARK-2857] Correct properties to set Master / Worker ports

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 4fde28c20 -> a646a365e [SPARK-2857] Correct properties to set Master / Worker ports `master.ui.port` and `worker.ui.port` were never picked up by SparkConf, simply because they are not prefixed with "spark." Unfortunately, this is also cu

git commit: [SPARK-2857] Correct properties to set Master / Worker ports

2014-08-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.1 d13d253fe -> 12f99cf5f [SPARK-2857] Correct properties to set Master / Worker ports `master.ui.port` and `worker.ui.port` were never picked up by SparkConf, simply because they are not prefixed with "spark." Unfortunately, this is also
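
The failure mode described here, where properties without the "spark." prefix are silently ignored, can be reproduced with a minimal prefix filter. This is a sketch of the behavior the commit message describes, not SparkConf itself: `load_spark_properties` is a hypothetical helper, and `spark.master.ui.port` is used only as an example of a prefixed name, not as a statement of what the fix renamed the property to.

```python
def load_spark_properties(system_properties):
    """Keep only the entries whose key starts with "spark.".

    `system_properties` is a plain dict standing in for JVM system
    properties. Anything without the prefix is dropped silently.
    """
    return {
        key: value
        for key, value in system_properties.items()
        if key.startswith("spark.")
    }


props = {
    "master.ui.port": "8081",        # unprefixed: filtered out
    "spark.master.ui.port": "8082",  # hypothetical prefixed name: kept
}
loaded = load_spark_properties(props)
```

With a filter like this, a setting supplied under the unprefixed name never reaches the configuration, and no error is raised to explain why it had no effect.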