[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-12 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/6003 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-12 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101357457 Thanks. Merging in! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this featu

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101232749 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101232750 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-12 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101232738 [Test build #32488 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32488/consoleFull) for PR 6003 at commit [`672efba`](https://git

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-12 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101196407 [Test build #32488 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32488/consoleFull) for PR 6003 at commit [`672efba`](https://gith

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101195177 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101195273 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101167505 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-12 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101167491 [Test build #32469 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32469/consoleFull) for PR 6003 at commit [`4a148f7`](https://git

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101167507 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101150335 [Test build #32469 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32469/consoleFull) for PR 6003 at commit [`4a148f7`](https://gith

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101150198 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101150146 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101147785 Build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does n

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101147783 [Test build #32467 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32467/consoleFull) for PR 6003 at commit [`2bb3b23`](https://git

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101147787 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101147723 LGTM. Can you update it so it can merge cleanly with master? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well.

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101147617 [Test build #32467 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32467/consoleFull) for PR 6003 at commit [`2bb3b23`](https://gith

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101146599 Build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this f

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101146492 Build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have thi

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/6003#discussion_r30088189 --- Diff: python/pyspark/sql/dataframe.py --- @@ -896,6 +900,76 @@ def fillna(self, value, subset=None): return DataFrame(self._jdf.na().fi

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-101064287 Thanks. I finally had time to review this in detail. A few things we need: 1. There are lot of branches for testing various input types. We should test those as wel

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100894151 [Test build #32381 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32381/consoleFull) for PR 6003 at commit [`04209b9`](https://git

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100894167 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100894165 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100869450 [Test build #32381 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32381/consoleFull) for PR 6003 at commit [`04209b9`](https://gith

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100869237 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-11 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100869221 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-10 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/6003#discussion_r2993 --- Diff: python/pyspark/sql/dataframe.py --- @@ -1182,6 +1256,13 @@ def _to_seq(sc, cols, converter=None): return sc._jvm.PythonUtils.toSeq(cols)

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-10 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/6003#discussion_r2995 --- Diff: core/src/main/scala/org/apache/spark/api/python/PythonUtils.scala --- @@ -53,4 +53,11 @@ private[spark] object PythonUtils { def toSeq[T](cols:

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100173921 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100173918 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100173901 [Test build #32218 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32218/consoleFull) for PR 6003 at commit [`484af9e`](https://git

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100140381 [Test build #32218 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32218/consoleFull) for PR 6003 at commit [`484af9e`](https://gith

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100140283 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100140235 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread adrian-wang
Github user adrian-wang commented on a diff in the pull request: https://github.com/apache/spark/pull/6003#discussion_r29921999 --- Diff: python/pyspark/sql/dataframe.py --- @@ -896,6 +900,77 @@ def fillna(self, value, subset=None): return DataFrame(self._jdf.

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread adrian-wang
Github user adrian-wang commented on a diff in the pull request: https://github.com/apache/spark/pull/6003#discussion_r29921893 --- Diff: core/src/main/scala/org/apache/spark/api/python/PythonUtils.scala --- @@ -53,4 +53,11 @@ private[spark] object PythonUtils { def toSeq[T]

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100132906 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100132901 [Test build #32215 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32215/consoleFull) for PR 6003 at commit [`08a07ad`](https://git

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100132904 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100132393 [Test build #32215 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32215/consoleFull) for PR 6003 at commit [`08a07ad`](https://gith

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100132091 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6003#issuecomment-100132082 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/6003#discussion_r29921037 --- Diff: python/pyspark/sql/dataframe.py --- @@ -896,6 +900,77 @@ def fillna(self, value, subset=None): return DataFrame(self._jdf.na().fi

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/6003#discussion_r29921013 --- Diff: core/src/main/scala/org/apache/spark/api/python/PythonUtils.scala --- @@ -53,4 +53,11 @@ private[spark] object PythonUtils { def toSeq[T](cols:

[GitHub] spark pull request: [SPARK-6876] [PySpark] [SQL] add DataFrame na....

2015-05-08 Thread adrian-wang
GitHub user adrian-wang opened a pull request: https://github.com/apache/spark/pull/6003 [SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pyspark You can merge this pull request into a Git repository by running: $ git pull https://github.com/adrian-wang/spark pynarepla