[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-15 Thread ahirreddy
Github user ahirreddy commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40452665 Awesome, thanks! On Tue, Apr 15, 2014 at 12:16 AM, asfgit wrote: > Closed #363 via c99bcb7feaa761c5826f2e1d844d0502a

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-15 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/363 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabl

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-15 Thread pwendell
Github user pwendell commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40451813 I've merged this. Thanks @ahirreddy - cool stuff!

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40447100 All automated tests passed. Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14133/

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40447099 Merged build finished. All automated tests passed.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread mateiz
Github user mateiz commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40446008 Hey guys, I looked through the code and tried this out, and it looks good to me. So if we can fix the test issues I'd say it's ready to merge.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40444369 Merged build started.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40444360 Merged build triggered.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread pwendell
Github user pwendell commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40440638 I manually cancelled this build since we'll need to retest.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40440626 Merged build finished.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40440628 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14130/

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread pwendell
Github user pwendell commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40440452 @marmbrus I see- the duration issue was just that we had stopped running hive tests for a bit after Aaron's build change.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40439427 Regarding the longer test time, we should make sure that we aren't just comparing to times when the Hive tests weren't running at all. Should definitely look into

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40439428 Merged build triggered.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40439431 Merged build started.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread pwendell
Github user pwendell commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40439254 So a few concerns on the test output. The first is that the tests took way longer than normal (could just be a slow Jenkins worker) and the second is that there was a bun

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40437824 All automated tests passed. Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14125/

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40437823 Merged build finished. All automated tests passed.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40433881 Merged build started.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40433872 Merged build triggered.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40432706 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14124/

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40432704 Merged build finished.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40432661 Merged build started.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40432650 Merged build triggered.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread ahirreddy
Github user ahirreddy commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40432281 MIMA Checker issue because we now include Hive in the assembly jar when building on Jenkins. See Jira SPARK-1494 for more information. https://issues.apache.org/jira/

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40432159 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14123/

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40432158 Merged build finished.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40432076 Merged build started.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40432063 Merged build triggered.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40411343 Merged build finished.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40411346 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14114/

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40401538 Merged build started.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40401529 Merged build triggered.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-14 Thread pwendell
Github user pwendell commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40401432 Jenkins, retest this please.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread mateiz
Github user mateiz commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40300482 Is this failing due to not cleaning up some files?

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40292065 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14081/

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40292064 Merged build finished.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40291396 Merged build finished.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40291397 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14080/

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40289938 Merged build started.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40289932 Merged build triggered.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40289149 Merged build started.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40289144 Merged build triggered.

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread ahirreddy
Github user ahirreddy commented on a diff in the pull request: https://github.com/apache/spark/pull/363#discussion_r11561164 --- Diff: python/run-tests --- @@ -56,6 +56,9 @@ run_test "pyspark/mllib/clustering.py" run_test "pyspark/mllib/recommendation.py" run_test "pyspark

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread ahirreddy
Github user ahirreddy commented on a diff in the pull request: https://github.com/apache/spark/pull/363#discussion_r11561160 --- Diff: python/pyspark/rdd.py --- @@ -1387,6 +1387,95 @@ def _jrdd(self): def _is_pipelinable(self): return not (self.is_cached or sel

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread ahirreddy
Github user ahirreddy commented on a diff in the pull request: https://github.com/apache/spark/pull/363#discussion_r11561162 --- Diff: python/pyspark/context.py --- @@ -460,6 +463,225 @@ def sparkUser(self): """ return self._jsc.sc().sparkUser() +

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread ahirreddy
Github user ahirreddy commented on a diff in the pull request: https://github.com/apache/spark/pull/363#discussion_r11560881 --- Diff: python/pyspark/rdd.py --- @@ -1387,6 +1387,95 @@ def _jrdd(self): def _is_pipelinable(self): return not (self.is_cached or sel

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread ahirreddy
Github user ahirreddy commented on a diff in the pull request: https://github.com/apache/spark/pull/363#discussion_r11560795 --- Diff: python/pyspark/rdd.py --- @@ -1387,6 +1387,95 @@ def _jrdd(self): def _is_pipelinable(self): return not (self.is_cached or sel

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread ahirreddy
Github user ahirreddy commented on a diff in the pull request: https://github.com/apache/spark/pull/363#discussion_r11560776 --- Diff: python/pyspark/rdd.py --- @@ -1387,6 +1387,95 @@ def _jrdd(self): def _is_pipelinable(self): return not (self.is_cached or sel

[GitHub] spark pull request: SPARK-1374: PySpark API for SparkSQL

2014-04-12 Thread ahirreddy
Github user ahirreddy commented on a diff in the pull request: https://github.com/apache/spark/pull/363#discussion_r11560720 --- Diff: docs/sql-programming-guide.md --- @@ -318,4 +391,24 @@ Row[] results = hiveCtx.hql("FROM src SELECT key, value").collect(); +