[GitHub] spark pull request: [SPARK-10142][STREAMING] Made python checkpoin...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8366#issuecomment-133629112
  
Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-10142][STREAMING] Made python checkpoin...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8366#issuecomment-133629097
  
[Test build #41401 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41401/console) for PR 8366 at commit [`3afa666`](https://github.com/apache/spark/commit/3afa666027a7a657166e77640a2448f5c8c90546).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-9104][SPARK-9105][SPARK-9106][SPARK-910...

2015-08-21 Thread liyezhang556520
Github user liyezhang556520 commented on the pull request:

https://github.com/apache/spark/pull/7753#issuecomment-133629862
  
Jenkins, retest this please





[GitHub] spark pull request: [SPARK-9864] [DOC] [MLlib] [SQL] Replace since...

2015-08-21 Thread MechCoder
Github user MechCoder commented on the pull request:

https://github.com/apache/spark/pull/8352#issuecomment-133635527
  
Thanks for the merge. I will not be available until Monday :(. If this is urgent, please feel free to fix it yourself.





[GitHub] spark pull request: [SPARK-10142][STREAMING] Made python checkpoin...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8366#issuecomment-133629113
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41401/
Test PASSed.





[GitHub] spark pull request: [SPARK-9104][SPARK-9105][SPARK-9106][SPARK-910...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/7753#issuecomment-133630302
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-9104][SPARK-9105][SPARK-9106][SPARK-910...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/7753#issuecomment-133630304
  
Merged build started.





[GitHub] spark pull request: [SPARK-9104][SPARK-9105][SPARK-9106][SPARK-910...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/7753#issuecomment-133630525
  
[Test build #41402 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41402/consoleFull) for PR 7753 at commit [`f21a804`](https://github.com/apache/spark/commit/f21a8047e615c7dc17e5a823ebaabbbc84fffdb3).





[GitHub] spark pull request: [SPARK-10151] [SQL] Support invocation of hive...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8354#issuecomment-133366793
  
[Test build #41360 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41360/console) for PR 8354 at commit [`e1ef530`](https://github.com/apache/spark/commit/e1ef5306006f28a6eed72a512b894334ef6fbd3e).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-10151] [SQL] Support invocation of hive...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8354#issuecomment-133367307
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41360/
Test PASSed.





[GitHub] spark pull request: [SPARK-10151] [SQL] Support invocation of hive...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8354#issuecomment-133367304
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-9401][SQL]Fully implement code generati...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8353#issuecomment-133370445
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-9401][SQL]Fully implement code generati...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8353#issuecomment-133370447
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41361/
Test PASSed.





[GitHub] spark pull request: [SPARK-10154][SQL] remove the no-longer-necess...

2015-08-21 Thread liancheng
Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/8356#issuecomment-133370402
  
LGTM, please fix the MiMa failure by adding

```scala
ProblemFilters.exclude[MissingClassProblem]("org.apache.spark.sql.sources.CatalystScan")
```

to `project/MimaExcludes.scala`.





[GitHub] spark pull request: [SPARK-9401][SQL]Fully implement code generati...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8353#issuecomment-133370369
  
[Test build #41361 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41361/console) for PR 8353 at commit [`affd497`](https://github.com/apache/spark/commit/affd497299a6fab1c6454228a531f99078a2e7ba).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-9864] [DOC] [MLlib] [SQL] Replace since...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8352#issuecomment-133371987
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41365/
Test PASSed.





[GitHub] spark pull request: [SPARK-10152] [SQL] Support Init script for hi...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8355#issuecomment-133371929
  
[Test build #41359 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41359/console) for PR 8355 at commit [`c5ac310`](https://github.com/apache/spark/commit/c5ac3106dadcc53b834c80c316325dbae0b37a2c).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-9864] [DOC] [MLlib] [SQL] Replace since...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8352#issuecomment-133371984
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-10152] [SQL] Support Init script for hi...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8355#issuecomment-133372040
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41359/
Test PASSed.





[GitHub] spark pull request: [SPARK-9864] [DOC] [MLlib] [SQL] Replace since...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8352#issuecomment-133371905
  
[Test build #41365 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41365/console) for PR 8352 at commit [`969bbbd`](https://github.com/apache/spark/commit/969bbbdd5cba4abf1997a031d3efa6f57665ac73).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-10152] [SQL] Support Init script for hi...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8355#issuecomment-133372039
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-8889] [Core]: Fix for OOM for graph cre...

2015-08-21 Thread rekhajoshm
Github user rekhajoshm commented on the pull request:

https://github.com/apache/spark/pull/7602#issuecomment-133373808
  
Thanks @srowen @andrewor14 for the merge.





[GitHub] spark pull request: [SPARK-10122][PySpark][Streaming] Fix getOffse...

2015-08-21 Thread zsxwing
Github user zsxwing commented on the pull request:

https://github.com/apache/spark/pull/8347#issuecomment-133449445
  
LGTM except one minor doc typo.





[GitHub] spark pull request: [SPARK-9439] [yarn] External shuffle service r...

2015-08-21 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/7943





[GitHub] spark pull request: [SPARK-9439] [yarn] External shuffle service r...

2015-08-21 Thread tgravescs
Github user tgravescs commented on the pull request:

https://github.com/apache/spark/pull/7943#issuecomment-133427799
  
committed to master





[GitHub] spark pull request: [SPARK-9708] [MESOS] Spark should create local...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8358#issuecomment-133446766
  
Can one of the admins verify this patch?





[GitHub] spark pull request: [SPARK-10155][SQL]Change SqlParser to object t...

2015-08-21 Thread zsxwing
GitHub user zsxwing opened a pull request:

https://github.com/apache/spark/pull/8357

[SPARK-10155][SQL]Change SqlParser to object to avoid memory leak

Because `scala.util.parsing.combinator.Parsers` has been thread-safe since Scala 
2.10 (see [SI-4929](https://issues.scala-lang.org/browse/SI-4929)), we can 
change SqlParser to an object to avoid a memory leak.

I didn't change other subclasses of `scala.util.parsing.combinator.Parsers` 
because there is only one instance in one SQLContext, which should not be an 
issue.
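
As a rough illustration of the idea, here is a Python sketch. It is an analogy under stated assumptions, not Spark's Scala code: `TinyParser`, `parse_fresh`, and `parse_shared` are hypothetical names, and the sketch assumes that every parser instance retains some state for the life of the process, modeled here by a class-level list. Sharing one instance is only reasonable when the parser is thread-safe, which is the premise stated above.

```python
class TinyParser(object):
    """Hypothetical parser; every instance registers itself in a long-lived list."""

    live_instances = []  # models per-instance state that is never released

    def __init__(self):
        TinyParser.live_instances.append(self)

    def parse(self, text):
        # Trivial "parsing": split the input on whitespace.
        return text.split()


def parse_fresh(text):
    # One new parser per call: retained state grows with every call.
    return TinyParser().parse(text)


# A single module-level instance, loosely analogous to a Scala `object`.
SHARED = TinyParser()


def parse_shared(text):
    # Reuses the single instance: retained state stays constant.
    return SHARED.parse(text)


before = len(TinyParser.live_instances)
for _ in range(3):
    parse_shared("SELECT a FROM t")
shared_growth = len(TinyParser.live_instances) - before

before = len(TinyParser.live_instances)
for _ in range(3):
    parse_fresh("SELECT a FROM t")
fresh_growth = len(TinyParser.live_instances) - before

print(shared_growth, fresh_growth)
```

In this model the shared instance adds nothing to the retained list across calls, while the per-call version adds one entry each time.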

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/zsxwing/spark sql-memory-leak

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/8357.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #8357


commit 4bc33e0440e844980f8119e19bba70e3656c856c
Author: zsxwing zsxw...@gmail.com
Date:   2015-08-21T10:55:28Z

Change SqlParser to object to avoid memory leak







[GitHub] spark pull request: [SPARK-10154][SQL] remove the no-longer-necess...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8356#issuecomment-133403523
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-10155][SQL]Change SqlParser to object t...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8357#issuecomment-133441976
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-10155][SQL]Change SqlParser to object t...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8357#issuecomment-133442004
  
Merged build started.





[GitHub] spark pull request: [SPARK-10122][PySpark][Streaming] Fix getOffse...

2015-08-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request:

https://github.com/apache/spark/pull/8347#discussion_r37640572
  
--- Diff: python/pyspark/streaming/dstream.py ---
@@ -610,7 +610,10 @@ def __init__(self, prev, func):
 self.is_checkpointed = False
 self._jdstream_val = None
 
-if (isinstance(prev, TransformedDStream) and
+# Using type() to avoid folding the functions and compacting the DStreams which is not
+# not strictly a object of TransformedDStream.
--- End diff --

nit: two `not`s
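
The distinction the diff comment describes can be sketched in isolation. This is a minimal illustration with hypothetical stand-in classes (`Transformed`, `KafkaTransformed`), not the PySpark classes themselves: `isinstance()` matches subclasses as well, while an exact `type()` comparison does not.

```python
class Transformed(object):
    """Hypothetical stand-in for a base stream class."""


class KafkaTransformed(Transformed):
    """Hypothetical subclass that should not be treated as the base class."""


def is_exactly(obj, cls):
    # True only when obj's class is cls itself, not a subclass of it.
    return type(obj) is cls


base = Transformed()
sub = KafkaTransformed()

# isinstance() accepts the subclass as well as the base class.
print(isinstance(base, Transformed), isinstance(sub, Transformed))
# The exact type check accepts only the base class.
print(is_exactly(base, Transformed), is_exactly(sub, Transformed))
```

An exact type check like this is the usual choice when a subclass overrides behavior that the base-class shortcut would bypass.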





[GitHub] spark pull request: [SPARK-10154][SQL] remove the no-longer-necess...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8356#issuecomment-133403710
  
Merged build started.





[GitHub] spark pull request: [SPARK-10155][SQL]Change SqlParser to object t...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8357#issuecomment-133445626
  
[Test build #41369 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41369/consoleFull) for PR 8357 at commit [`4bc33e0`](https://github.com/apache/spark/commit/4bc33e0440e844980f8119e19bba70e3656c856c).





[GitHub] spark pull request: [SPARK-9708] [MESOS] Spark should create local...

2015-08-21 Thread Zariel
GitHub user Zariel opened a pull request:

https://github.com/apache/spark/pull/8358

[SPARK-9708] [MESOS] Spark should create local temporary directories in 
Mesos sandbox when launched with Mesos

This is my own original work and I license this to the project under the 
project's open source license

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Zariel/spark mesos-local-dir

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/8358.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #8358


commit 30fe90bfcc71bd74ce7ab44d07900297cba43e42
Author: Chris Bannister chris.bannis...@swiftkey.net
Date:   2015-08-21T14:19:52Z

when running in mesos use the provided scratch space
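
The selection logic implied by the commit message might look roughly like the following Python sketch. It is not the patch itself: `choose_local_dir` is a hypothetical helper, and the environment variable name `MESOS_DIRECTORY` is an assumption used for illustration, since this message does not say how the sandbox path is discovered.

```python
import tempfile


def choose_local_dir(env):
    """Return the sandbox directory when one is provided, else the system temp dir.

    `MESOS_DIRECTORY` is an assumed variable name used for illustration.
    """
    sandbox = env.get("MESOS_DIRECTORY")
    if sandbox:
        return sandbox
    return tempfile.gettempdir()


# With a sandbox path present, the sandbox wins.
print(choose_local_dir({"MESOS_DIRECTORY": "/mnt/sandbox"}))
# Without one, the helper falls back to the system temporary directory.
print(choose_local_dir({}) == tempfile.gettempdir())
```

Passing the environment in as a plain mapping keeps the helper easy to exercise without touching the real process environment.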







[GitHub] spark pull request: [SPARK-9613] [CORE] Ban use of JavaConversions...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8033#issuecomment-133448393
  
[Test build #1679 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1679/console) for PR 8033 at commit [`b4f58d5`](https://github.com/apache/spark/commit/b4f58d5ebb4afc673304250b9309724811a11b5f).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-10137][Streaming]Avoid to restart recei...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8340#issuecomment-133460175
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-9864] [DOC] [MLlib] [SQL] Replace since...

2015-08-21 Thread mengxr
Github user mengxr commented on the pull request:

https://github.com/apache/spark/pull/8352#issuecomment-133490920
  
@MechCoder Could you move the changes in SQL to another PR? I think SQL is 
fine with `@since` tag. Switching to `@Since` will cause regression in the 
generated Java API doc. Let's experiment with `@Since` annotation in MLlib 
first:)





[GitHub] spark pull request: [SPARK-10154][SQL] remove the no-longer-necess...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8356#issuecomment-133408157
  
  [Test build #41368 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41368/consoleFull)
 for   PR 8356 at commit 
[`6ab9de0`](https://github.com/apache/spark/commit/6ab9de09e37600d7dd2073ca6a078b9eab7cc3cf).





[GitHub] spark pull request: [SPARK-9439] [yarn] External shuffle service r...

2015-08-21 Thread tgravescs
Github user tgravescs commented on the pull request:

https://github.com/apache/spark/pull/7943#issuecomment-133426928
  
sounds like we missed 1.5 on this so we can put in master for 1.6





[GitHub] spark pull request: [SPARK-10148][Streaming]Display active and ina...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8351#issuecomment-133481655
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41371/
Test PASSed.





[GitHub] spark pull request: [SPARK-10148][Streaming]Display active and ina...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8351#issuecomment-133481653
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-10155][SQL]Change SqlParser to object t...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8357#issuecomment-133491665
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-10155][SQL]Change SqlParser to object t...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8357#issuecomment-133491668
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41369/
Test PASSed.





[GitHub] spark pull request: [SPARK-9864] [DOC] [MLlib] [SQL] Replace since...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8352#issuecomment-133494112
  
  [Test build #41373 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41373/consoleFull)
 for   PR 8352 at commit 
[`d6dbd4f`](https://github.com/apache/spark/commit/d6dbd4f692d308b66f3903f1828d613957acf353).





[GitHub] spark pull request: [SPARK-8436][SQL] Inconsistent behavior when c...

2015-08-21 Thread x1-
Github user x1- commented on the pull request:

https://github.com/apache/spark/pull/8339#issuecomment-133457237
  
@sarutak 
I tried *casting Timestamp to Long and Long to Timestamp* in Hive, as shown below.

```sql
hive> SELECT CAST(CAST(1.2 AS TIMESTAMP) AS DOUBLE);
OK
1.2
Time taken: 3.637 seconds, Fetched: 1 row(s)

hive> SELECT CAST(CAST(1200 AS TIMESTAMP) AS INT);
OK
1
```

This means that Hive itself is inconsistent when converting Timestamp to Int 
and then Int back to Timestamp.
Because Spark has a different dialect, I think it is better to close this PR.
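
The two results above can be reproduced with a small model. This is only a sketch in Python, and it rests on an assumption inferred from the output rather than from Hive's source: a fractional number cast to a timestamp is read as seconds since the epoch, an integer is read as milliseconds, and a timestamp cast back to a number always yields seconds. The function names are hypothetical.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def double_to_timestamp(seconds: float) -> datetime:
    # Assumption: a DOUBLE is interpreted as seconds since the epoch.
    return EPOCH + timedelta(seconds=seconds)


def int_to_timestamp(millis: int) -> datetime:
    # Assumption: an INT is interpreted as milliseconds since the epoch.
    return EPOCH + timedelta(milliseconds=millis)


def timestamp_to_double(ts: datetime) -> float:
    # Casting back to a number yields seconds, with the fraction kept.
    return (ts - EPOCH).total_seconds()


def timestamp_to_int(ts: datetime) -> int:
    # Casting back to an integer yields whole seconds.
    return int((ts - EPOCH).total_seconds())


# DOUBLE round-trips, because both directions use seconds.
double_round_trip = timestamp_to_double(double_to_timestamp(1.2))

# INT does not round-trip: milliseconds go in, seconds come out.
int_round_trip = timestamp_to_int(int_to_timestamp(1200))

print(double_round_trip, int_round_trip)
```

Under these assumptions the asymmetry comes from the unit mismatch on the integer path only, which matches the `1.2` and `1` shown in the Hive session.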





[GitHub] spark pull request: [SPARK-8436][SQL] Inconsistent behavior when c...

2015-08-21 Thread x1-
Github user x1- closed the pull request at:

https://github.com/apache/spark/pull/8339





[GitHub] spark pull request: [SPARK-8436][SQL] Inconsistent behavior when c...

2015-08-21 Thread x1-
Github user x1- commented on the pull request:

https://github.com/apache/spark/pull/8339#issuecomment-133457361
  
So, now I close this.





[GitHub] spark pull request: [SPARK-10148][Streaming]Display active and ina...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8351#issuecomment-133463951
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-10148][Streaming]Display active and ina...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8351#issuecomment-133463981
  
Merged build started.





[GitHub] spark pull request: [SPARK-10148][Streaming]Display active and ina...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8351#issuecomment-133466150
  
  [Test build #41371 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41371/consoleFull)
 for   PR 8351 at commit 
[`59c56d4`](https://github.com/apache/spark/commit/59c56d48c2c8166bd7fe424d7adf7fbf4f60e2a2).





[GitHub] spark pull request: ENH: Add ability to specify s3 bootstrap scrip...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8359#issuecomment-133469979
  
Can one of the admins verify this patch?





[GitHub] spark pull request: [SPARK-10137][Streaming]Avoid to restart recei...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8340#issuecomment-133478816
  
  [Test build #41370 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41370/console)
 for   PR 8340 at commit 
[`f0d1f6e`](https://github.com/apache/spark/commit/f0d1f6e9992afa2be6f9efc74abf155b9dccbcf2).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-9864] [DOC] [MLlib] [SQL] Replace since...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8352#issuecomment-133492487
  
Merged build started.





[GitHub] spark pull request: [SPARK-9104][SPARK-9105][SPARK-9106][SPARK-910...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/7753#issuecomment-133492452
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-9864] [DOC] [MLlib] [SQL] Replace since...

2015-08-21 Thread MechCoder
Github user MechCoder commented on the pull request:

https://github.com/apache/spark/pull/8352#issuecomment-133492168
  
rebased





[GitHub] spark pull request: [SPARK-9104][SPARK-9105][SPARK-9106][SPARK-910...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/7753#issuecomment-133492490
  
Merged build started.





[GitHub] spark pull request: ENH: Add ability to specify s3 bootstrap scrip...

2015-08-21 Thread mdagost
Github user mdagost commented on the pull request:

https://github.com/apache/spark/pull/8359#issuecomment-133480514
  
Sure.  Sorry that I forgot to open a Jira ticket first: 
https://issues.apache.org/jira/browse/SPARK-10157





[GitHub] spark pull request: [SPARK-10155][SQL]Change SqlParser to object t...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8357#issuecomment-133491439
  
  [Test build #41369 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41369/console)
 for   PR 8357 at commit 
[`4bc33e0`](https://github.com/apache/spark/commit/4bc33e0440e844980f8119e19bba70e3656c856c).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-9864] [DOC] [MLlib] [SQL] Replace since...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8352#issuecomment-133492446
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-10137][Streaming]Avoid to restart recei...

2015-08-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request:

https://github.com/apache/spark/pull/8340#discussion_r37644626
  
--- Diff: 
streaming/src/main/scala/org/apache/spark/streaming/scheduler/ReceiverSchedulingPolicy.scala
 ---
@@ -144,27 +141,31 @@ private[streaming] class ReceiverSchedulingPolicy {
 val scheduledExecutors = mutable.Set[String]()
 scheduledExecutors ++= preferredLocation
 
-val executorWeights = receiverTrackingInfoMap.values.flatMap { receiverTrackingInfo =>
-  receiverTrackingInfo.state match {
-case ReceiverState.INACTIVE => Nil
-case ReceiverState.SCHEDULED =>
-  val scheduledExecutors = receiverTrackingInfo.scheduledExecutors.get
-  // The probability that a scheduled receiver will run in an executor is
-  // 1.0 / scheduledLocations.size
-  scheduledExecutors.map(location => location -> (1.0 / scheduledExecutors.size))
-case ReceiverState.ACTIVE => Seq(receiverTrackingInfo.runningExecutor.get -> 1.0)
-  }
+val executorWeights = receiverTrackingInfoMap.filter(_._1 != receiverId).values.flatMap {
--- End diff --

Updated to use the new approach. Since we don't call rescheduleReceiver 
when the receiver registers for the first time, the filtering is not 
necessary.
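
The weighting rule visible in the removed diff lines can be sketched as follows. This is a Python illustration, not the Scala code from the PR: the dictionary layout and the function name are hypothetical, and summing the per-executor contributions into one total is an assumption about what happens after the `flatMap`, which the diff excerpt does not show.

```python
from collections import defaultdict


def executor_weights(receivers):
    """Sum the expected receiver load per executor.

    Each receiver is a dict with a "state" key and, depending on the state,
    "scheduled_executors" (a list) or "running_executor" (a string).
    """
    weights = defaultdict(float)
    for info in receivers:
        state = info["state"]
        if state == "SCHEDULED":
            # A scheduled receiver is equally likely to run on any of its
            # candidate executors, so each one gets 1.0 / n.
            locations = info["scheduled_executors"]
            for location in locations:
                weights[location] += 1.0 / len(locations)
        elif state == "ACTIVE":
            # A running receiver counts fully against its executor.
            weights[info["running_executor"]] += 1.0
        # INACTIVE receivers contribute nothing.
    return dict(weights)


example = executor_weights([
    {"state": "SCHEDULED", "scheduled_executors": ["host-a", "host-b"]},
    {"state": "ACTIVE", "running_executor": "host-a"},
    {"state": "INACTIVE"},
])
print(example)
```

In this example `host-a` ends up with 1.5 and `host-b` with 0.5, so a scheduler that prefers the lightest executor would pick `host-b` next.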





[GitHub] spark pull request: [SPARK-10154][SQL] remove the no-longer-necess...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8356#issuecomment-133460607
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-10154][SQL] remove the no-longer-necess...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8356#issuecomment-133460611
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41368/
Test FAILed.





[GitHub] spark pull request: [SPARK-10148][Streaming]Display active and ina...

2015-08-21 Thread zsxwing
Github user zsxwing commented on the pull request:

https://github.com/apache/spark/pull/8351#issuecomment-133463286
  

<img width="749" alt="screen shot 2015-08-21 at 11 25 44 pm" src="https://cloud.githubusercontent.com/assets/1000778/9412055/48cf6966-485c-11e5-9ff2-585e7392969e.png">







[GitHub] spark pull request: [SPARK-10137][Streaming]Avoid to restart recei...

2015-08-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request:

https://github.com/apache/spark/pull/8340#discussion_r37644655
  
--- Diff: 
streaming/src/test/scala/org/apache/spark/streaming/scheduler/ReceiverSchedulingPolicySuite.scala
 ---
@@ -127,4 +127,26 @@ class ReceiverSchedulingPolicySuite extends 
SparkFunSuite {
   assert(executors.isEmpty)
 }
   }
+
+  test("when scheduleReceivers return a balanced result, we should not restart receivers") {
--- End diff --

Removed this test since we don't call rescheduleReceiver when the receiver 
registers for the first time.





[GitHub] spark pull request: [SPARK-9613] [CORE] Ban use of JavaConversions...

2015-08-21 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/8033#issuecomment-133462879
  
@JoshRosen wanted to call your attention to this PR since it's ready for a 
look. It is complete and passes tests -- usually. It needs constant rebasing 
though, so wanted to get it reviewed in a passing state that just needs a minor 
rebase.





[GitHub] spark pull request: [SPARK-10154][SQL] remove the no-longer-necess...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8356#issuecomment-133460408
  
**[Test build #41368 timed 
out](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41368/console)**
 for PR 8356 at commit 
[`6ab9de0`](https://github.com/apache/spark/commit/6ab9de09e37600d7dd2073ca6a078b9eab7cc3cf)
 after a configured wait of `175m`.





[GitHub] spark pull request: [SPARK-10137][Streaming]Avoid to restart recei...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8340#issuecomment-133460250
  
Merged build started.





[GitHub] spark pull request: ENH: Add ability to specify s3 bootstrap scrip...

2015-08-21 Thread mdagost
GitHub user mdagost opened a pull request:

https://github.com/apache/spark/pull/8359

ENH: Add ability to specify s3 bootstrap script to spark-ec2

One of the nice features of using EMR to start spark clusters is that you 
can specify bootstrap actions to run on each node.  This is useful for 
installing custom libraries, etc.  However, you pay a per-node/per-hour premium 
for using EMR (https://aws.amazon.com/elasticmapreduce/pricing/).

This pull request adds the `--bootstrap-script` flag to the `spark-ec2` 
script.  This flag specifies an s3 path to a shell script.  The script is 
downloaded and run on each node during the setup process.  The flag can be 
specified multiple times for multiple bootstrap actions.
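
A repeatable flag of this kind could be parsed roughly as below. This is a hedged sketch using Python's standard `argparse` as a stand-in; it does not reproduce the PR's actual code, and the bucket and script names are made up for illustration.

```python
import argparse


def parse_args(argv):
    parser = argparse.ArgumentParser(prog="spark-ec2")
    # action="append" collects every occurrence of the flag into one list,
    # which is what allows several bootstrap actions to be given.
    parser.add_argument(
        "--bootstrap-script",
        action="append",
        default=[],
        metavar="S3_PATH",
        help="s3 path to a shell script to run on each node during setup "
             "(may be given more than once)",
    )
    return parser.parse_args(argv)


opts = parse_args([
    "--bootstrap-script", "s3://example-bucket/install_libs.sh",
    "--bootstrap-script", "s3://example-bucket/tune_node.sh",
])

# The scripts are kept in the order they were given on the command line.
for path in opts.bootstrap_script:
    print("would download and run:", path)
```

Keeping the paths in command-line order lets the setup step run the bootstrap actions in a predictable sequence.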

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/mdagost/spark ec2_bootstrap

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/8359.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #8359


commit d51aec04dc3f700de09cc398fc69b96cb86061b9
Author: Michelangelo D'Agostino mdagost...@civisanalytics.com
Date:   2015-08-21T15:43:10Z

ENH: Add ability to specify s3 bootstrap script to spark-ec2.







[GitHub] spark pull request: [SPARK-9104][SPARK-9105][SPARK-9106][SPARK-910...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/7753#issuecomment-133494106
  
  [Test build #41372 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41372/consoleFull)
 for   PR 7753 at commit 
[`f21a804`](https://github.com/apache/spark/commit/f21a8047e615c7dc17e5a823ebaabbbc84fffdb3).





[GitHub] spark pull request: [SPARK-10137][Streaming]Avoid to restart recei...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8340#issuecomment-133462018
  
  [Test build #41370 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41370/consoleFull)
 for   PR 8340 at commit 
[`f0d1f6e`](https://github.com/apache/spark/commit/f0d1f6e9992afa2be6f9efc74abf155b9dccbcf2).





[GitHub] spark pull request: ENH: Add ability to specify s3 bootstrap scrip...

2015-08-21 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/8359#issuecomment-133478794
  
Please read 
https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark to 
understand how to propose a change





[GitHub] spark pull request: [SPARK-10137][Streaming]Avoid to restart recei...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8340#issuecomment-133478999
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41370/
Test PASSed.





[GitHub] spark pull request: [SPARK-10137][Streaming]Avoid to restart recei...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8340#issuecomment-133478995
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-10148][Streaming]Display active and ina...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8351#issuecomment-133481526
  
  [Test build #41371 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41371/console)
 for   PR 8351 at commit 
[`59c56d4`](https://github.com/apache/spark/commit/59c56d48c2c8166bd7fe424d7adf7fbf4f60e2a2).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-9669][MESOS] Support PySpark on Mesos c...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8349#issuecomment-133534880
  
  [Test build #41379 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41379/consoleFull) for PR 8349 at commit [`a8f2fe9`](https://github.com/apache/spark/commit/a8f2fe9a5b4bc4aaee1a66fac59a2694c2ee583c).





[GitHub] spark pull request: [SPARK-5259][CORE] don't submit stage until it...

2015-08-21 Thread squito
Github user squito commented on the pull request:

https://github.com/apache/spark/pull/7699#issuecomment-133540222
  
@kayousterhout @markhamstra @pwendell @mateiz another one still waiting.
I mostly got buy-in from Mark on https://github.com/apache/spark/pull/4055, but
I'd still appreciate somebody else taking a look, especially since I was more
involved in the code for this one.





[GitHub] spark pull request: [SPARK-9669][MESOS] Support PySpark on Mesos c...

2015-08-21 Thread tarnfeld
Github user tarnfeld commented on the pull request:

https://github.com/apache/spark/pull/8349#issuecomment-133542132
  
:+1: This is awesome! Gave it a little test locally and it works a treat!





[GitHub] spark pull request: [SPARK-5259][CORE] don't submit stage until it...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/7699#issuecomment-133541667
  
  [Test build #41381 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41381/consoleFull) for PR 7699 at commit [`51f3c47`](https://github.com/apache/spark/commit/51f3c47ab98a001a6649e51f58b4ee7bc8b7e721).





[GitHub] spark pull request: [SPARK-5882][GraphX] Add a test for GraphLoade...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4674#issuecomment-133543920
  
 Build triggered.





[GitHub] spark pull request: [SPARK-9741][SQL] Approximate Count Distinct u...

2015-08-21 Thread rxin
Github user rxin commented on the pull request:

https://github.com/apache/spark/pull/8362#issuecomment-133544919
  
This made my day. The approach is super cool.

A couple of suggestions:

1. Can we use HyperLogLogPlus? It's also in stream-lib: 
https://github.com/addthis/stream-lib/blob/master/src/main/java/com/clearspring/analytics/stream/cardinality/HyperLogLogPlus.java

2. Can we write this in a way that makes it more unit-testable?


Beyond this, it would be cool to have a count-min sketch too! (future work)







[GitHub] spark pull request: [SPARK-9851] Support submitting map stages ind...

2015-08-21 Thread squito
Github user squito commented on a diff in the pull request:

https://github.com/apache/spark/pull/8180#discussion_r37672673
  
--- Diff: core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala ---
@@ -47,15 +47,48 @@ import org.apache.spark.storage.BlockManagerMessages.BlockManagerHeartbeat
  * minimal schedule to run the job. It then submits stages as TaskSets to an underlying
  * TaskScheduler implementation that runs them on the cluster.
  *
- * In addition to coming up with a DAG of stages, this class also determines the preferred
+ * Spark stages are created by breaking the RDD graph at shuffle boundaries. RDD operations with
+ * narrow dependencies, like map() and filter(), are pipelined together into one set of tasks
+ * in each stage, but operations with shuffle dependencies require multiple stages (one to write a
+ * set of map output files, and another to read those files after a barrier). In the end, every
+ * stage will have only shuffle dependencies on other stages, and may compute multiple operations
+ * inside it. The actual pipelining of these operations happens in the RDD.compute() functions of
+ * various RDDs (MappedRDD, FilteredRDD, etc).
+ *
+ * In addition to coming up with a DAG of stages, the DAGScheduler also determines the preferred
  * locations to run each task on, based on the current cache status, and passes these to the
  * low-level TaskScheduler. Furthermore, it handles failures due to shuffle output files being
  * lost, in which case old stages may need to be resubmitted. Failures *within* a stage that are
  * not caused by shuffle file loss are handled by the TaskScheduler, which will retry each task
  * a small number of times before cancelling the whole stage.
  *
+ * When looking through this code, there are several key concepts:
+ *
+ *  - Jobs (represented by [[ActiveJob]]) are the top-level work items submitted to the scheduler.
+ *    For example, when the user calls an action, like count(), a job will be submitted through
+ *    submitJob. Each Job may require the execution of multiple stages to build intermediate data.
+ *
+ *  - Stages ([[Stage]]) are sets of tasks that compute intermediate results in jobs, where each
+ *    task computes the same function on partitions of the same RDD. Stages are separated at shuffle
+ *    boundaries, which introduce a barrier (where we must wait for the previous stage to finish to
+ *    fetch outputs). There are two types of stages: [[ResultStage]], for the final stage that
+ *    executes an action, and [[ShuffleMapStage]], which writes map output files for a shuffle.
+ *    Stages are often shared across multiple jobs, if these jobs reuse the same RDDs.
--- End diff --

It's nice to see these expanded comments, but I think we really need to add 
a section on stage attempts.  That is probably the most confusing part of 
the DAG scheduler and where most bugs occur.
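
As a rough illustration of the stage-splitting rule described in the quoted doc comment (narrow dependencies are pipelined into one stage, shuffle dependencies start a new one), here is a toy sketch in plain Scala. The `Op`, `Narrow`, `Shuffle` and `StageSplit` names are hypothetical and are not Spark classes; the model ignores stage attempts, caching and stage sharing across jobs, and it simply attributes the shuffle operation to the stage that reads its output.

```scala
// Toy model only: these types are hypothetical and are not Spark's classes.
sealed trait Op { def name: String }
case class Narrow(name: String) extends Op   // e.g. map, filter
case class Shuffle(name: String) extends Op  // e.g. reduceByKey, groupByKey

object StageSplit {
  // Walk a linear chain of operations in order. A narrow operation is
  // appended to the current (last) stage; a shuffle operation closes the
  // current stage and opens a new one.
  def split(ops: Seq[Op]): Vector[Vector[String]] =
    ops.foldLeft(Vector(Vector.empty[String])) {
      case (stages, Narrow(n))  => stages.init :+ (stages.last :+ n)
      case (stages, Shuffle(n)) => stages :+ Vector(n)
    }
}
```

In this model, a chain such as map, filter, reduceByKey, map yields two stages: the first pipelines map and filter, and the second starts at the shuffle.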





[GitHub] spark pull request: [SPARK-9851] Support submitting map stages ind...

2015-08-21 Thread squito
Github user squito commented on a diff in the pull request:

https://github.com/apache/spark/pull/8180#discussion_r37673715
  
--- Diff: core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala ---
@@ -812,7 +971,7 @@ class DAGScheduler(
 case s: ResultStage =>
   val job = s.resultOfJob.get
   partitionsToCompute.map { id =>
-val p = job.partitions(id)
+val p = s.partitions(id)
--- End diff --

I've wondered about this line many times in the past, but have been too scared 
to change it -- is there any reason why `job.partitions` would be different from 
`resultStage.partitions`?





[GitHub] spark pull request: [SPARK 5280] RDF Loader added + documentation

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4650#issuecomment-133550145
  
Can one of the admins verify this patch?





[GitHub] spark pull request: [SPARK-10136] [SQL] A more robust fix for SPAR...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8361#issuecomment-133550172
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-10136] [SQL] A more robust fix for SPAR...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8361#issuecomment-133550175
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41376/
Test PASSed.





[GitHub] spark pull request: [SPARK-8400] [ML] Added check for positive blo...

2015-08-21 Thread BryanCutler
GitHub user BryanCutler opened a pull request:

https://github.com/apache/spark/pull/8363

[SPARK-8400] [ML] Added check for positive block size

Added check for positive block size with a note that -1 for 
auto-configuring is not supported

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/BryanCutler/spark ml.ALS-neg-blocksize-8400-1.3

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/8363.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #8363


commit a231c96cc9a64010db9440d634f1c711ee510c70
Author: Bryan Cutler bjcut...@us.ibm.com
Date:   2015-08-21T20:33:06Z

[SPARK-8400] Added check for positive block size with a note that -1 
auto-config is not supported







[GitHub] spark pull request: [SPARK-8400] [ML] Added check for positive blo...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8363#issuecomment-133556173
  
Can one of the admins verify this patch?





[GitHub] spark pull request: [SPARK-10121] [SQL] When add jar, also explici...

2015-08-21 Thread yhuai
GitHub user yhuai opened a pull request:

https://github.com/apache/spark/pull/8364

[SPARK-10121] [SQL] When add jar, also explicitly set the classloader of 
metadataHive's HiveConf inside metadataHive's state.

https://issues.apache.org/jira/browse/SPARK-10121

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/yhuai/spark SPARK-10121

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/8364.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #8364


commit 4aecbbaeb9da221a205a002739e1eca008b9756c
Author: Yin Huai yh...@databricks.com
Date:   2015-08-21T20:57:24Z

Expose initialConf.







[GitHub] spark pull request: [SPARK-9741][SQL] Approximate Count Distinct u...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8362#issuecomment-133562454
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41377/
Test PASSed.





[GitHub] spark pull request: [SPARK-9741][SQL] Approximate Count Distinct u...

2015-08-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8362#issuecomment-133562302
  
  [Test build #41377 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41377/console) for PR 8362 at commit [`e178d9e`](https://github.com/apache/spark/commit/e178d9e942720c879dc947a5fd3e6593a351f04f).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `case class HyperLogLog(child: Expression, relativeSD: Double = 0.05)`






[GitHub] spark pull request: [SPARK-9851] Support submitting map stages ind...

2015-08-21 Thread squito
Github user squito commented on a diff in the pull request:

https://github.com/apache/spark/pull/8180#discussion_r37678288
  
--- Diff: core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala ---
@@ -746,6 +848,63 @@ class DAGScheduler(
 submitWaitingStages()
   }
 
+  private[scheduler] def handleMapStageSubmitted(jobId: Int,
+  dependency: ShuffleDependency[_, _, _],
+  callSite: CallSite,
+  listener: JobListener,
+  properties: Properties) {
+// Submitting this map stage might still require the creation of some parent stages, so make
+// sure that happens.
+var finalStage: ShuffleMapStage = null
+try {
+  // New stage creation may throw an exception if, for example, jobs are run on a
+  // HadoopRDD whose underlying HDFS files have been deleted.
+  finalStage = getShuffleMapStage(dependency, jobId)
+} catch {
+  case e: Exception =>
+logWarning("Creating new stage failed due to exception - job: " + jobId, e)
+listener.jobFailed(e)
+return
+}
+if (finalStage != null) {

Can you add a comment on how `finalStage` can be `null`?  It would be nice to 
have this in `handleJobSubmitted` as well, actually.





[GitHub] spark pull request: [SPARK-10004] [shuffle] Perform auth checks wh...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8218#issuecomment-133564829
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-10004] [shuffle] Perform auth checks wh...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8218#issuecomment-133564844
  
Merged build started.





[GitHub] spark pull request: [SPARK-4413][SQL] Parquet support through data...

2015-08-21 Thread marmbrus
Github user marmbrus commented on a diff in the pull request:

https://github.com/apache/spark/pull/3269#discussion_r37668626
  
--- Diff: sql/core/src/main/scala/org/apache/spark/sql/sources/interfaces.scala ---
@@ -93,3 +98,18 @@ abstract class PrunedScan extends BaseRelation {
 abstract class PrunedFilteredScan extends BaseRelation {
   def buildScan(requiredColumns: Array[String], filters: Array[Filter]): RDD[Row]
 }
+
+/**
+ * ::Experimental::
+ * An interface for experimenting with a more direct connection to the query planner.  Compared to
+ * [[PrunedFilteredScan]], this operator receives the raw expressions from the
+ * [[org.apache.spark.sql.catalyst.plans.logical.LogicalPlan]].  Unlike the other APIs this
+ * interface is not designed to be binary compatible across releases and thus should only be used
+ * for experimentation.
+ */
+@Experimental
+abstract class CatalystScan extends BaseRelation {
+  def buildScan(requiredColumns: Seq[Attribute], filters: Seq[Expression]): RDD[Row]
+}
--- End diff --

Why remove it?  It's labeled experimental, it's not broken...





[GitHub] spark pull request: [SPARK-10130] [SQL] type coercion for IF shoul...

2015-08-21 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/8331





[GitHub] spark pull request: [SPARK-5882][GraphX] Add a test for GraphLoade...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4674#issuecomment-133544483
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41382/
Test FAILed.





[GitHub] spark pull request: [SPARK-5882][GraphX] Add a test for GraphLoade...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4674#issuecomment-133544479
  
Build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-10143] [SQL] Use parquet's block size (...

2015-08-21 Thread yhuai
Github user yhuai commented on the pull request:

https://github.com/apache/spark/pull/8346#issuecomment-133547106
  
Tested it on a cluster using:
```
val count = sqlContext.table("store_sales").groupBy().count().queryExecution.executedPlan(3).execute().count
```
Basically, it reads zero columns of the `store_sales` table. My table has 1824 
parquet files with sizes from 80MB to 280MB (1 to 3 row group sizes). Without 
this patch, on a 16-worker cluster, the job had 5023 tasks and took 102s. With 
this patch, the job had 2893 tasks and took 64s. That is still not as good as 
using one mapper per file (1824 tasks and 42s), but it is much better than our 
master.
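
To put the reported numbers side by side, the arithmetic can be sketched in a few lines of Scala. The figures are the ones quoted in the comment above; the `PatchComparison` object and its member names are hypothetical and exist only for illustration.

```scala
// Figures come from the comment above; the object and names are hypothetical.
object PatchComparison {
  // Fractional reduction when going from `before` to `after`.
  def reduction(before: Double, after: Double): Double = (before - after) / before

  // Task count: 5023 without the patch, 2893 with it (roughly 42% fewer).
  val taskReduction: Double = reduction(5023, 2893)

  // Wall-clock time: 102s without the patch, 64s with it (roughly 37% less).
  val timeReduction: Double = reduction(102, 64)

  // Remaining gap versus one mapper per file (42s): about 1.5x slower.
  val gapToPerFile: Double = 64.0 / 42.0
}
```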





[GitHub] spark pull request: [SPARK-10121] [SQL] [WIP] When add jar, also e...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8364#issuecomment-133562976
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-10121] [SQL] [WIP] When add jar, also e...

2015-08-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8364#issuecomment-133562994
  
Merged build started.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org


