git commit: Updated programming-guide.md

2014-07-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master 50561f439 -> 0128905ee Updated programming-guide.md Made sure that readers know the random number generator seed argument, within the 'takeSample' method, is optional. Author: Rishi Verma Closes #1324 from riverma/patch-1 and squashes t

git commit: Resolve sbt warnings during build Ⅱ

2014-07-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master 0128905ee -> 3cd5029be Resolve sbt warnings during build Ⅱ Author: witgo Closes #1153 from witgo/expectResult and squashes the following commits: 97541d8 [witgo] merge master ead26e7 [witgo] Resolve sbt warnings during build Project:

git commit: Resolve sbt warnings during build Ⅱ

2014-07-08 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.0 9dce7beff -> 23b01a303 Resolve sbt warnings during build Ⅱ Author: witgo Closes #1153 from witgo/expectResult and squashes the following commits: 97541d8 [witgo] merge master ead26e7 [witgo] Resolve sbt warnings during build (cher

git commit: [SPARK-2391][SQL] Custom take() for LIMIT queries.

2014-07-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master 3cd5029be -> 5a4063645 [SPARK-2391][SQL] Custom take() for LIMIT queries. Using Spark's take can result in an entire in-memory partition to be shipped in order to retrieve a single row. Author: Michael Armbrust Closes #1318 from marmbru

git commit: [SPARK-2391][SQL] Custom take() for LIMIT queries.

2014-07-08 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.0 23b01a303 -> faa0e9f56 [SPARK-2391][SQL] Custom take() for LIMIT queries. Using Spark's take can result in an entire in-memory partition to be shipped in order to retrieve a single row. Author: Michael Armbrust Closes #1318 from mar

git commit: [EC2] Add default history server port to ec2 script

2014-07-08 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 5a4063645 -> 56e009d4f [EC2] Add default history server port to ec2 script Right now I have to open it manually Author: Andrew Or Closes #1296 from andrewor14/hist-serv-port and squashes the following commits: 8895a1f [Andrew Or] Add de

git commit: [EC2] Add default history server port to ec2 script

2014-07-08 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 faa0e9f56 -> 3e95225c7 [EC2] Add default history server port to ec2 script Right now I have to open it manually Author: Andrew Or Closes #1296 from andrewor14/hist-serv-port and squashes the following commits: 8895a1f [Andrew Or] Ad

git commit: [SPARK-2395][SQL] Optimize common LIKE patterns.

2014-07-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 56e009d4f -> cc3e0a14d [SPARK-2395][SQL] Optimize common LIKE patterns. Author: Michael Armbrust Closes #1325 from marmbrus/slowLike and squashes the following commits: 023c3eb [Michael Armbrust] add comment. 8b421c2 [Michael Armbrust] H

git commit: [SPARK-2395][SQL] Optimize common LIKE patterns.

2014-07-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 3e95225c7 -> 4bf8ddaee [SPARK-2395][SQL] Optimize common LIKE patterns. Author: Michael Armbrust Closes #1325 from marmbrus/slowLike and squashes the following commits: 023c3eb [Michael Armbrust] add comment. 8b421c2 [Michael Armbrus

git commit: [SPARK-2403] Catch all errors during serialization in DAGScheduler

2014-07-08 Thread adav
Repository: spark Updated Branches: refs/heads/branch-1.0 4bf8ddaee -> 3bd32f023 [SPARK-2403] Catch all errors during serialization in DAGScheduler https://issues.apache.org/jira/browse/SPARK-2403 Spark hangs for us whenever we forget to register a class with Kryo. This should be a simple fi

git commit: [SPARK-2403] Catch all errors during serialization in DAGScheduler

2014-07-08 Thread adav
Repository: spark Updated Branches: refs/heads/master cc3e0a14d -> c8a2313cd [SPARK-2403] Catch all errors during serialization in DAGScheduler https://issues.apache.org/jira/browse/SPARK-2403 Spark hangs for us whenever we forget to register a class with Kryo. This should be a simple fix fo

git commit: SPARK-2400 : fix spark.yarn.max.executor.failures explaination

2014-07-08 Thread tgraves
Repository: spark Updated Branches: refs/heads/master c8a2313cd -> b520b6453 SPARK-2400 : fix spark.yarn.max.executor.failures explaination According to ```scala private val maxNumExecutorFailures = sparkConf.getInt("spark.yarn.max.executor.failures", sparkConf.getInt("spark.yarn.max.wo

git commit: [SPARK-2409] Make SQLConf thread safe.

2014-07-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master b520b6453 -> 32516f866 [SPARK-2409] Make SQLConf thread safe. Author: Reynold Xin Closes #1334 from rxin/sqlConfThreadSafetuy and squashes the following commits: c1e0a5a [Reynold Xin] Fixed the duplicate comment. 7614372 [Reynold Xin] [S

git commit: [SPARK-2409] Make SQLConf thread safe.

2014-07-08 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.0 3bd32f023 -> 1c12b0b5c [SPARK-2409] Make SQLConf thread safe. Author: Reynold Xin Closes #1334 from rxin/sqlConfThreadSafetuy and squashes the following commits: c1e0a5a [Reynold Xin] Fixed the duplicate comment. 7614372 [Reynold Xin

git commit: [SPARK-2362] Fix for newFilesOnly logic in file DStream

2014-07-08 Thread tdas
Repository: spark Updated Branches: refs/heads/master 32516f866 -> e6f7bfcfb [SPARK-2362] Fix for newFilesOnly logic in file DStream The newFilesOnly logic should be inverted: the logic should be that if the flag newFilesOnly==true then only start reading files older than current time. As th

git commit: [SPARK-2362] Fix for newFilesOnly logic in file DStream

2014-07-08 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-1.0 1c12b0b5c -> 885489112 [SPARK-2362] Fix for newFilesOnly logic in file DStream The newFilesOnly logic should be inverted: the logic should be that if the flag newFilesOnly==true then only start reading files older than current time. As

git commit: [SPARK-2362] Fix for newFilesOnly logic in file DStream

2014-07-08 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-0.9 57873ef69 -> 8e5604b22 [SPARK-2362] Fix for newFilesOnly logic in file DStream The newFilesOnly logic should be inverted: the logic should be that if the flag newFilesOnly==true then only start reading files older than current time. As

git commit: [SPARK-2392] Executors should not start their own HTTP servers

2014-07-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master e6f7bfcfb -> bf04a390e [SPARK-2392] Executors should not start their own HTTP servers Executors currently start their own unused HTTP file servers. This is because we use the same SparkEnv class for both executors and drivers, and we do no

git commit: [SPARK-2413] Upgrade junit_xml_listener to 0.5.1

2014-07-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master bf04a390e -> ac9cdc116 [SPARK-2413] Upgrade junit_xml_listener to 0.5.1 which fixes the following issues 1) fix the class name to be fully qualified classpath 2) make sure the the reporting time is in second not in miliseond, which causin

git commit: [SPARK-2152][MLlib] fix bin offset in DecisionTree node aggregations (also resolves SPARK-2160)

2014-07-08 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.0 885489112 -> d569838bc [SPARK-2152][MLlib] fix bin offset in DecisionTree node aggregations (also resolves SPARK-2160) Hi, this pull fixes (what I believe to be) a bug in DecisionTree.scala. In the extractLeftRightNodeAggregates funct

git commit: [SPARK-2152][MLlib] fix bin offset in DecisionTree node aggregations (also resolves SPARK-2160)

2014-07-08 Thread meng
Repository: spark Updated Branches: refs/heads/master ac9cdc116 -> 1114207cc [SPARK-2152][MLlib] fix bin offset in DecisionTree node aggregations (also resolves SPARK-2160) Hi, this pull fixes (what I believe to be) a bug in DecisionTree.scala. In the extractLeftRightNodeAggregates function,

[3/3] git commit: [SPARK-2384] Add tooltips to UI.

2014-07-08 Thread rxin
[SPARK-2384] Add tooltips to UI. This patch adds tooltips to clarify some points of confusion in the UI. When users mouse over some of the table headers (shuffle read, write, and input size) as well as over the "scheduler delay" metric shown for each stage, a black tool tip (see image below) p

[1/3] [SPARK-2384] Add tooltips to UI.

2014-07-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master 1114207cc -> 339441f54 http://git-wip-us.apache.org/repos/asf/spark/blob/339441f5/core/src/main/resources/org/apache/spark/ui/static/webui.css -- diff --git a/core/src/main/