spark git commit: [SPARK-8687] [YARN] Fix bug: Executor can't fetch the new set configuration in yarn-client

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 3697232b7 - 1b0c8e610 [SPARK-8687] [YARN] Fix bug: Executor can't fetch the new set configuration in yarn-client Spark initi the properties CoarseGrainedSchedulerBackend.start ```scala // TODO (prashant) send conf instead of

spark git commit: [SPARK-3071] Increase default driver memory

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 377ff4c9e - 3697232b7 [SPARK-3071] Increase default driver memory I've updated default values in comments, documentation, and in the command line builder to be 1g based on comments in the JIRA. I've also updated most usages to point at a

spark git commit: [SPARK-8740] [PROJECT INFRA] Support GitHub OAuth tokens in dev/merge_spark_pr.py

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 15d41cc50 - 377ff4c9e [SPARK-8740] [PROJECT INFRA] Support GitHub OAuth tokens in dev/merge_spark_pr.py This commit allows `dev/merge_spark_pr.py` to use personal GitHub OAuth tokens in order to make authenticated requests. This is

spark git commit: [SPARK-8688] [YARN] Bug fix: disable the cache fs to gain the HDFS connection.

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 792fcd802 - 646366b5d [SPARK-8688] [YARN] Bug fix: disable the cache fs to gain the HDFS connection. If `fs.hdfs.impl.disable.cache` was `false`(default), `FileSystem` will use the cached `DFSClient` which use old token.

spark git commit: [SPARK-8687] [YARN] Fix bug: Executor can't fetch the new set configuration in yarn-client

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 e33c0f0a4 - 7cbfef23a [SPARK-8687] [YARN] Fix bug: Executor can't fetch the new set configuration in yarn-client Spark initi the properties CoarseGrainedSchedulerBackend.start ```scala // TODO (prashant) send conf instead of

spark git commit: [SPARK-8769] [TRIVIAL] [DOCS] toLocalIterator should mention it results in many jobs

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master d14338eaf - 15d41cc50 [SPARK-8769] [TRIVIAL] [DOCS] toLocalIterator should mention it results in many jobs Author: Holden Karau hol...@pigscanfly.ca Closes #7171 from holdenk/SPARK-8769-toLocalIterator-documentation-improvement and

spark git commit: [SPARK-8769] [TRIVIAL] [DOCS] toLocalIterator should mention it results in many jobs

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 5b468cf0c - e33c0f0a4 [SPARK-8769] [TRIVIAL] [DOCS] toLocalIterator should mention it results in many jobs Author: Holden Karau hol...@pigscanfly.ca Closes #7171 from holdenk/SPARK-8769-toLocalIterator-documentation-improvement and

spark git commit: [SPARK-8771] [TRIVIAL] Add a version to the deprecated annotation for the actorSystem

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 646366b5d - d14338eaf [SPARK-8771] [TRIVIAL] Add a version to the deprecated annotation for the actorSystem Author: Holden Karau hol...@pigscanfly.ca Closes #7172 from

spark git commit: [DOCS] Fix minor wrong lambda expression example.

2015-07-02 Thread sarutak
Repository: spark Updated Branches: refs/heads/branch-1.4 7cbfef23a - f5c9296a6 [DOCS] Fix minor wrong lambda expression example. It's a really minor issue but there is an example with wrong lambda-expression usage in `SQLContext.scala` like as follows. ``` sqlContext.udf().register(myUDF,

spark git commit: [DOCS] Fix minor wrong lambda expression example.

2015-07-02 Thread sarutak
Repository: spark Updated Branches: refs/heads/master 1b0c8e610 - 41588365a [DOCS] Fix minor wrong lambda expression example. It's a really minor issue but there is an example with wrong lambda-expression usage in `SQLContext.scala` like as follows. ``` sqlContext.udf().register(myUDF,

spark git commit: [SPARK-8787] [SQL] Changed parameter order of @deprecated in package object sql

2015-07-02 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.4 f5c9296a6 - eb0dd45de [SPARK-8787] [SQL] Changed parameter order of @deprecated in package object sql Parameter order of deprecated annotation in package object sql is wrong deprecated(1.3.0, use DataFrame) . This has to be changed to

spark git commit: [SPARK-8787] [SQL] Changed parameter order of @deprecated in package object sql

2015-07-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master 41588365a - c572e2561 [SPARK-8787] [SQL] Changed parameter order of @deprecated in package object sql Parameter order of deprecated annotation in package object sql is wrong deprecated(1.3.0, use DataFrame) . This has to be changed to

spark git commit: [SPARK-8746] [SQL] update download link for Hive 0.13.1

2015-07-02 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.4 eb0dd45de - 4720a8d83 [SPARK-8746] [SQL] update download link for Hive 0.13.1 updated the [Hive 0.13.1](https://archive.apache.org/dist/hive/hive-0.13.1) download link in `sql/README.md` Author: Christian Kadner ckad...@us.ibm.com

spark git commit: [SPARK-8746] [SQL] update download link for Hive 0.13.1

2015-07-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master c572e2561 - 1bbdf9ead [SPARK-8746] [SQL] update download link for Hive 0.13.1 updated the [Hive 0.13.1](https://archive.apache.org/dist/hive/hive-0.13.1) download link in `sql/README.md` Author: Christian Kadner ckad...@us.ibm.com

spark git commit: [SPARK-8781] Fix variables in published pom.xml are not resolved

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 34d448dbe - 82cf3315e [SPARK-8781] Fix variables in published pom.xml are not resolved The issue is summarized in the JIRA and is caused by this commit: 984ad60147c933f2d5a2040c87ae687c14eb1724. This patch reverts that commit and fixes

spark git commit: [SPARK-8781] Fix variables in published pom.xml are not resolved

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 4720a8d83 - bc51bcaea [SPARK-8781] Fix variables in published pom.xml are not resolved The issue is summarized in the JIRA and is caused by this commit: 984ad60147c933f2d5a2040c87ae687c14eb1724. This patch reverts that commit and

spark git commit: [SPARK-8781] Fix variables in published pom.xml are not resolved

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.3 3a71cf946 - 502e1fd68 [SPARK-8781] Fix variables in published pom.xml are not resolved The issue is summarized in the JIRA and is caused by this commit: 984ad60147c933f2d5a2040c87ae687c14eb1724. This patch reverts that commit and

spark git commit: [SPARK-1564] [DOCS] Added Javascript to Javadocs to create badges for tags like :: Experimental ::

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 82cf3315e - fcbcba66c [SPARK-1564] [DOCS] Added Javascript to Javadocs to create badges for tags like :: Experimental :: Modified copy_api_dirs.rb and created api-javadocs.js and api-javadocs.css files in order to add badges to javadoc

spark git commit: [SPARK-7835] Refactor HeartbeatReceiverSuite for coverage + cleanup

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master fcbcba66c - cd2035507 [SPARK-7835] Refactor HeartbeatReceiverSuite for coverage + cleanup The existing test suite has a lot of duplicate code and doesn't even cover the most fundamental feature of the HeartbeatReceiver, which is expiring

spark git commit: Revert [SPARK-8784] [SQL] Add Python API for hex and unhex

2015-07-02 Thread rxin
Repository: spark Updated Branches: refs/heads/master 488bad319 - e589e71a2 Revert [SPARK-8784] [SQL] Add Python API for hex and unhex This reverts commit fc7aebd94a3c09657fc4dbded0997ed068304e0a. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: [SPARK-8782] [SQL] Fix code generation for ORDER BY NULL

2015-07-02 Thread rxin
Repository: spark Updated Branches: refs/heads/master e589e71a2 - d9838196f [SPARK-8782] [SQL] Fix code generation for ORDER BY NULL This fixes code generation for queries containing `ORDER BY NULL`. Previously, the generated code would fail to compile. Author: Josh Rosen

spark git commit: fix string order for non-ascii character

2015-07-02 Thread davies
Repository: spark Updated Branches: refs/heads/branch-1.4 bc51bcaea - 3f1e4efbd fix string order for non-ascii character Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/3f1e4efb Tree:

spark git commit: [SPARK-8801][SQL] Support TypeCollection in ExpectsInputTypes

2015-07-02 Thread rxin
Repository: spark Updated Branches: refs/heads/master 20a4d7dbd - a59d14f62 [SPARK-8801][SQL] Support TypeCollection in ExpectsInputTypes This patch adds a new TypeCollection AbstractDataType that can be used by expressions to specify more than one expected input types. Author: Reynold Xin

spark git commit: [SPARK-8803] handle special characters in elements in crosstab

2015-07-02 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.4 f142867ec - ff76b33b6 [SPARK-8803] handle special characters in elements in crosstab cc rxin Having back ticks or null as elements causes problems. Since elements become column names, we have to drop them from the element as back

spark git commit: [SPARK-7104] [MLLIB] Support model save/load in Python's Word2Vec

2015-07-02 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master fc7aebd94 - 488bad319 [SPARK-7104] [MLLIB] Support model save/load in Python's Word2Vec Author: Yu ISHIKAWA yuu.ishik...@gmail.com Closes #6821 from yu-iskw/SPARK-7104 and squashes the following commits: 975136b [Yu ISHIKAWA] Organize

spark git commit: [SPARK-6980] [CORE] Akka timeout exceptions indicate which conf controls them (RPC Layer)

2015-07-02 Thread irashid
Repository: spark Updated Branches: refs/heads/master d9838196f - aa7bbc143 [SPARK-6980] [CORE] Akka timeout exceptions indicate which conf controls them (RPC Layer) Latest changes after refactoring to the RPC layer. I rebased against trunk to make sure to get any recent changes since it

spark git commit: [SPARK-8501] [SQL] Avoids reading schema from empty ORC files

2015-07-02 Thread lian
Repository: spark Updated Branches: refs/heads/master dfd8bac8f - 20a4d7dbd [SPARK-8501] [SQL] Avoids reading schema from empty ORC files ORC writes empty schema (`struct`) to ORC files containing zero rows. This is OK for Hive since the table schema is managed by the metastore. But it

spark git commit: [SPARK-8501] [SQL] Avoids reading schema from empty ORC files (backport to 1.4)

2015-07-02 Thread lian
Repository: spark Updated Branches: refs/heads/branch-1.4 3f1e4efbd - de0802499 [SPARK-8501] [SQL] Avoids reading schema from empty ORC files (backport to 1.4) This PR backports #7199 to branch-1.4 Author: Cheng Lian l...@databricks.com Closes #7200 from liancheng/spark-8501-for-1.4 and

spark git commit: [SPARK-8803] handle special characters in elements in crosstab

2015-07-02 Thread rxin
Repository: spark Updated Branches: refs/heads/master f743c79ab - 9b23e92c7 [SPARK-8803] handle special characters in elements in crosstab cc rxin Having back ticks or null as elements causes problems. Since elements become column names, we have to drop them from the element as back ticks

spark git commit: [SPARK-8776] Increase the default MaxPermSize

2015-07-02 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.4 de0802499 - f142867ec [SPARK-8776] Increase the default MaxPermSize I am increasing the perm gen size to 256m. https://issues.apache.org/jira/browse/SPARK-8776 Author: Yin Huai yh...@databricks.com Closes #7196 from yhuai/SPARK-8776

spark git commit: [SPARK-8776] Increase the default MaxPermSize

2015-07-02 Thread yhuai
Repository: spark Updated Branches: refs/heads/master a59d14f62 - f743c79ab [SPARK-8776] Increase the default MaxPermSize I am increasing the perm gen size to 256m. https://issues.apache.org/jira/browse/SPARK-8776 Author: Yin Huai yh...@databricks.com Closes #7196 from yhuai/SPARK-8776 and

spark git commit: [SPARK-8690] [SQL] Add a setting to disable SparkSQL parquet schema merge by using datasource API

2015-07-02 Thread lian
Repository: spark Updated Branches: refs/heads/master 1bbdf9ead - 246265f2b [SPARK-8690] [SQL] Add a setting to disable SparkSQL parquet schema merge by using datasource API The detail problem story is in https://issues.apache.org/jira/browse/SPARK-8690 General speaking, I add a config

spark git commit: [SPARK-8647] [MLLIB] Potential issue with constant hashCode

2015-07-02 Thread meng
Repository: spark Updated Branches: refs/heads/master 246265f2b - 99c40cd0d [SPARK-8647] [MLLIB] Potential issue with constant hashCode I added the code, // see [SPARK-8647], this achieves the needed constant hash code without constant no. override def hashCode(): Int =

spark git commit: [SPARK-8581] [SPARK-8584] Simplify checkpointing code + better error message

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 0e553a3e9 - 2e2f32603 [SPARK-8581] [SPARK-8584] Simplify checkpointing code + better error message This patch rewrites the old checkpointing code in a way that is easier to understand. It also adds a guard against an invalid specification

spark git commit: [SPARK-8758] [MLLIB] Add Python user guide for PowerIterationClustering

2015-07-02 Thread meng
Repository: spark Updated Branches: refs/heads/master 99c40cd0d - 0a468a46b [SPARK-8758] [MLLIB] Add Python user guide for PowerIterationClustering Add Python user guide for PowerIterationClustering Author: Yanbo Liang yblia...@gmail.com Closes #7155 from yanboliang/spark-8758 and squashes

spark git commit: [SPARK-8223] [SPARK-8224] [SQL] shift left and shift right

2015-07-02 Thread davies
Repository: spark Updated Branches: refs/heads/master 0a468a46b - 5b3338130 [SPARK-8223] [SPARK-8224] [SQL] shift left and shift right Jira: https://issues.apache.org/jira/browse/SPARK-8223 https://issues.apache.org/jira/browse/SPARK-8224 ~~I am aware of #7174 and will update this pr, if

spark git commit: [SPARK-8708] [MLLIB] Paritition ALS ratings based on both users and products

2015-07-02 Thread meng
Repository: spark Updated Branches: refs/heads/master 52302a803 - 0e553a3e9 [SPARK-8708] [MLLIB] Paritition ALS ratings based on both users and products JIRA: https://issues.apache.org/jira/browse/SPARK-8708 Previously the partitions of ratings are only based on the given products. So if

spark git commit: [SPARK-8407] [SQL] complex type constructors: struct and named_struct

2015-07-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master afa021e03 - 52302a803 [SPARK-8407] [SQL] complex type constructors: struct and named_struct This is a follow up of [SPARK-8283](https://issues.apache.org/jira/browse/SPARK-8283) ([PR-6828](https://github.com/apache/spark/pull/6828)), to

spark git commit: [SPARK-8747] [SQL] fix EqualNullSafe for binary type

2015-07-02 Thread davies
Repository: spark Updated Branches: refs/heads/master 5b3338130 - afa021e03 [SPARK-8747] [SQL] fix EqualNullSafe for binary type also improve tests for binary comparison. Author: Wenchen Fan cloud0...@outlook.com Closes #7143 from cloud-fan/binary and squashes the following commits:

spark git commit: [SPARK-8479] [MLLIB] Add numNonzeros and numActives to linalg.Matrices

2015-07-02 Thread meng
Repository: spark Updated Branches: refs/heads/master 2e2f32603 - 34d448dbe [SPARK-8479] [MLLIB] Add numNonzeros and numActives to linalg.Matrices Matrices allow zeros to be stored in values. Sometimes a method is handy to check if the numNonZeros are same as number of Active values.

spark git commit: [SPARK-8772][SQL] Implement implicit type cast for expressions that define input types.

2015-07-02 Thread rxin
Repository: spark Updated Branches: refs/heads/master cd2035507 - 52508beb6 [SPARK-8772][SQL] Implement implicit type cast for expressions that define input types. Author: Reynold Xin r...@databricks.com Closes #7175 from rxin/implicitCast and squashes the following commits: 88080a2

spark git commit: [SPARK-3382] [MLLIB] GradientDescent convergence tolerance

2015-07-02 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 52508beb6 - 7d9cc9673 [SPARK-3382] [MLLIB] GradientDescent convergence tolerance GrandientDescent can receive convergence tolerance value. Default value is 0.0. When loss value becomes less than the tolerance which is set by user,

spark git commit: [SPARK-8784] [SQL] Add Python API for hex and unhex

2015-07-02 Thread rxin
Repository: spark Updated Branches: refs/heads/master 7d9cc9673 - fc7aebd94 [SPARK-8784] [SQL] Add Python API for hex and unhex Also improve the performance of hex/unhex Author: Davies Liu dav...@databricks.com Closes #7181 from davies/hex and squashes the following commits: f032fbb