[GitHub] spark pull request: [SPARK-8774] [ML] Add R model formula with bas...

2015-07-15 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/7381#discussion_r34742784 --- Diff: mllib/src/test/scala/org/apache/spark/ml/feature/RModelFormulaSuite.scala --- @@ -0,0 +1,78 @@ +/* + * Licensed to the Apache Software Foundat

[GitHub] spark pull request: [SPARK-8774] [ML] Add R model formula with bas...

2015-07-15 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/7381#discussion_r34742755 --- Diff: mllib/src/test/scala/org/apache/spark/ml/feature/RModelFormulaSuite.scala --- @@ -0,0 +1,78 @@ +/* + * Licensed to the Apache Software Foundat

[GitHub] spark pull request: [SPARK-8774] [ML] Add R model formula with bas...

2015-07-15 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/7381#discussion_r34742729 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/RModelFormula.scala --- @@ -0,0 +1,136 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] spark pull request: [SPARK-8774] [ML] Add R model formula with bas...

2015-07-15 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/7381#discussion_r34742685 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/RModelFormula.scala --- @@ -0,0 +1,136 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] spark pull request: [SPARK-8774] [ML] Add R model formula with bas...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7381#issuecomment-121776841 [Test build #37425 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37425/consoleFull) for PR 7381 at commit [`2db68aa`](https://gith

[GitHub] spark pull request: [SPARK-8995][SQL] cast date strings like '2015...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7353#issuecomment-121776785 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-8995][SQL] cast date strings like '2015...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7353#issuecomment-121776695 [Test build #37415 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37415/console) for PR 7353 at commit [`d20b8b4`](https://github.

[GitHub] spark pull request: [SPARK-8774] [ML] Add R model formula with bas...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7381#issuecomment-121776348 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-9043] Serialize key, value and combiner...

2015-07-15 Thread massie
Github user massie commented on the pull request: https://github.com/apache/spark/pull/7403#issuecomment-121776325 I just pushed an update (5c58b4df9b) which ensures that we keep binary compatibility in `PairRDDFunctions`. The old `combineByKey` methods now call into the new `combineB

[GitHub] spark pull request: [SPARK-8774] [ML] Add R model formula with bas...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7381#issuecomment-121776334 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-9043] Serialize key, value and combiner...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7403#issuecomment-121776225 [Test build #37423 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37423/consoleFull) for PR 7403 at commit [`5c58b4d`](https://gith

[GitHub] spark pull request: [SPARK-9021][PySpark] Having RDD.aggregate() m...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7378#issuecomment-121775914 [Test build #37424 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37424/consoleFull) for PR 7378 at commit [`659bb27`](https://gith

[GitHub] spark pull request: [SPARK-9043] Serialize key, value and combiner...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7403#issuecomment-121775767 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-9043] Serialize key, value and combiner...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7403#issuecomment-121775752 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-9021][PySpark] Having RDD.aggregate() m...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7378#issuecomment-121775780 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-9021][PySpark] Having RDD.aggregate() m...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7378#issuecomment-121775758 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-8443][SQL] Split GenerateMutableProject...

2015-07-15 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/7076#issuecomment-121775149 In reality how many columns would it take to go over the limit? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as w

[GitHub] spark pull request: [SPARK-9060] [SQL] Revert SPARK-8359, SPARK-88...

2015-07-15 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/7426#issuecomment-121774937 @viirya and @davies Can you take a look to double check if I accidentally remove any thing useful? --- If your project is set up for it, you can reply to this email and h

[GitHub] spark pull request: [SPARK-746][CORE] Added Avro Serialization to ...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7004#issuecomment-121774824 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-746][CORE] Added Avro Serialization to ...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7004#issuecomment-121774720 [Test build #37408 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37408/console) for PR 7004 at commit [`fa9298b`](https://github.

[GitHub] spark pull request: [SPARK-9086][SQL] Remove BinaryNode from TreeN...

2015-07-15 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/7433#issuecomment-121774074 Lgtm --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request: [SPARK-9086][SQL] Remove BinaryNode from TreeN...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7433#issuecomment-121773693 [Test build #37422 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37422/consoleFull) for PR 7433 at commit [`1881f78`](https://gith

[GitHub] spark pull request: [SPARK-9086][SQL] Remove BinaryNode from TreeN...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7433#issuecomment-121773524 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-9086][SQL] Remove BinaryNode from TreeN...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7433#issuecomment-121773510 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-5989] [MLlib] Model save/load for LDA

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6948#issuecomment-121773416 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-9086][SQL] Remove BinaryNode from TreeN...

2015-07-15 Thread rxin
GitHub user rxin opened a pull request: https://github.com/apache/spark/pull/7433 [SPARK-9086][SQL] Remove BinaryNode from TreeNode. These traits are not super useful, and yet cause problems with toString in expressions due to the orders they are mixed in. You can merge this pull r

[GitHub] spark pull request: [SPARK-5989] [MLlib] Model save/load for LDA

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6948#issuecomment-121773357 [Test build #37418 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37418/console) for PR 6948 at commit [`cc14054`](https://github.

[GitHub] spark pull request: SPARK-6548 Adding stddev to DataFrame function...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6297#issuecomment-121773151 [Test build #37411 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37411/console) for PR 6297 at commit [`43fb84f`](https://github.

[GitHub] spark pull request: SPARK-6548 Adding stddev to DataFrame function...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6297#issuecomment-121773207 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-9016][ML][WIP] make random forest class...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7432#issuecomment-121772805 [Test build #37419 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37419/console) for PR 7432 at commit [`16aea1c`](https://github.

[GitHub] spark pull request: [SPARK-9016][ML][WIP] make random forest class...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7432#issuecomment-121772817 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-8875] Remove BlockStoreShuffleFetcher c...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7268#issuecomment-121770865 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-8875] Remove BlockStoreShuffleFetcher c...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7268#issuecomment-121770825 [Test build #37403 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37403/console) for PR 7268 at commit [`14bfcbb`](https://github.

[GitHub] spark pull request: [SPARK-3665][GraphX] Java API for GraphX

2015-07-15 Thread ankurdave
Github user ankurdave closed the pull request at: https://github.com/apache/spark/pull/3234 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is e

[GitHub] spark pull request: [SPARK-3665][GraphX] Java API for GraphX

2015-07-15 Thread ankurdave
Github user ankurdave commented on the pull request: https://github.com/apache/spark/pull/3234#issuecomment-121770421 @JoshRosen @srowen Sorry I didn't see your requests to close earlier. Closing now. --- If your project is set up for it, you can reply to this email and have your rep

[GitHub] spark pull request: [SPARK-6910] [SQL] Support for pushing predica...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7421#issuecomment-121769347 [Test build #37421 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37421/consoleFull) for PR 7421 at commit [`5599cc4`](https://gith

[GitHub] spark pull request: [SPARK-6910] [SQL] Support for pushing predica...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7421#issuecomment-121769251 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-8996] [MLlib] [PySpark] Python API for ...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7430#issuecomment-121769315 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6910] [SQL] Support for pushing predica...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7421#issuecomment-121769277 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-8996] [MLlib] [PySpark] Python API for ...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7430#issuecomment-121769311 [Test build #37420 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37420/console) for PR 7430 at commit [`3b03044`](https://github.

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on the pull request: https://github.com/apache/spark/pull/7388#issuecomment-121769029 That's all for now! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have th

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7388#discussion_r34738272 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/CountVectorizerModel.scala --- @@ -19,45 +19,135 @@ package org.apache.spark.ml.feature i

[GitHub] spark pull request: [SPARK-8996] [MLlib] [PySpark] Python API for ...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7430#issuecomment-121768814 [Test build #37420 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37420/consoleFull) for PR 7430 at commit [`3b03044`](https://gith

[GitHub] spark pull request: [SPARK-9016][ML][WIP] make random forest class...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7432#issuecomment-121768798 [Test build #37419 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37419/consoleFull) for PR 7432 at commit [`16aea1c`](https://gith

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7388#discussion_r34738128 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/CountVectorizerModel.scala --- @@ -19,45 +19,135 @@ package org.apache.spark.ml.feature i

[GitHub] spark pull request: [SPARK-8996] [MLlib] [PySpark] Python API for ...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7430#issuecomment-121768545 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-9016][ML][WIP] make random forest class...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7432#issuecomment-121768573 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-8996] [MLlib] [PySpark] Python API for ...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7430#issuecomment-121768572 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-9016][ML][WIP] make random forest class...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7432#issuecomment-121768541 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-9016][ML][WIP] make random forest class...

2015-07-15 Thread holdenk
GitHub user holdenk opened a pull request: https://github.com/apache/spark/pull/7432 [SPARK-9016][ML][WIP] make random forest classifiers implement classification trait You can merge this pull request into a Git repository by running: $ git pull https://github.com/holdenk/spa

[GitHub] spark pull request: [SPARK-5989] [MLlib] Model save/load for LDA

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6948#issuecomment-121768407 [Test build #37418 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37418/consoleFull) for PR 6948 at commit [`cc14054`](https://gith

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7388#discussion_r34737878 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/CountVectorizerModel.scala --- @@ -19,45 +19,135 @@ package org.apache.spark.ml.feature i

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7388#discussion_r34737713 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/CountVectorizerModel.scala --- @@ -19,45 +19,135 @@ package org.apache.spark.ml.feature i

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7388#discussion_r34737694 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/CountVectorizerModel.scala --- @@ -19,45 +19,135 @@ package org.apache.spark.ml.feature i

[GitHub] spark pull request: [SPARK-5989] [MLlib] Model save/load for LDA

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6948#issuecomment-121767881 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-5989] [MLlib] Model save/load for LDA

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6948#issuecomment-121767904 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-746][CORE] Added Avro Serialization to ...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7004#issuecomment-121767711 [Test build #37417 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37417/consoleFull) for PR 7004 at commit [`dd71efe`](https://gith

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7388#discussion_r34737611 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/CountVectorizerModel.scala --- @@ -19,45 +19,135 @@ package org.apache.spark.ml.feature i

[GitHub] spark pull request: [SPARK-6777] [SQL] Implements backwards compat...

2015-07-15 Thread rdblue
Github user rdblue commented on a diff in the pull request: https://github.com/apache/spark/pull/6617#discussion_r34737606 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/parquet/ParquetSchemaSuite.scala --- @@ -109,20 +245,21 @@ class ParquetSchemaSuite extends SparkFunSui

[GitHub] spark pull request: [SPARK-5989] [MLlib] Model save/load for LDA

2015-07-15 Thread MechCoder
Github user MechCoder commented on the pull request: https://github.com/apache/spark/pull/6948#issuecomment-121767752 @jkbradley done ! btw is there a preferred way to manually inspect the Parquet files generated? --- If your project is set up for it, you can reply to this email and

[GitHub] spark pull request: [SPARK-746][CORE] Added Avro Serialization to ...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7004#issuecomment-121767183 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-746][CORE] Added Avro Serialization to ...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7004#issuecomment-121767164 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [WIP][SPARK-7018][Build]: Refactor dev/run-tes...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7401#issuecomment-121766918 **[Test build #37416 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37416/console)** for PR 7401 at commit [`cf110bc`](https://github.c

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34737026 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -43,6 +43,8 @@ class PrefixSpan private ( private var minSuppo

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34737011 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -82,20 +84,70 @@ class PrefixSpan private ( logWarning("Inpu

[GitHub] spark pull request: [WIP][SPARK-7018][Build]: Refactor dev/run-tes...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7401#issuecomment-121766367 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [WIP][SPARK-7018][Build]: Refactor dev/run-tes...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7401#issuecomment-121766342 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34736797 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -82,20 +84,70 @@ class PrefixSpan private ( logWarning("Inpu

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34736588 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -82,20 +84,70 @@ class PrefixSpan private ( logWarning("Inpu

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34736608 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -82,20 +84,70 @@ class PrefixSpan private ( logWarning("Inpu

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34736431 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -82,20 +84,70 @@ class PrefixSpan private ( logWarning("Inpu

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34735768 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -82,20 +84,70 @@ class PrefixSpan private ( logWarning("Inpu

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34735693 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -82,20 +84,70 @@ class PrefixSpan private ( logWarning("Inpu

[GitHub] spark pull request: [SPARK-746][CORE] Added Avro Serialization to ...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7004#issuecomment-121764074 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34735567 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -82,20 +84,70 @@ class PrefixSpan private ( logWarning("Inpu

[GitHub] spark pull request: [SPARK-8995][SQL] cast date strings like '2015...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7353#issuecomment-121763808 [Test build #37415 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37415/consoleFull) for PR 7353 at commit [`d20b8b4`](https://gith

[GitHub] spark pull request: [WIP][SPARK-7018][Build]: Refactor dev/run-tes...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7401#issuecomment-121763554 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [WIP][SPARK-7018][Build]: Refactor dev/run-tes...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7401#issuecomment-121763159 **[Test build #37414 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37414/console)** for PR 7401 at commit [`7b11665`](https://github.c

[GitHub] spark pull request: [WIP][SPARK-7018][Build]: Refactor dev/run-tes...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7401#issuecomment-121762983 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [WIP][SPARK-7018][Build]: Refactor dev/run-tes...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7401#issuecomment-121762919 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-8995][SQL] cast date strings like '2015...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7353#issuecomment-121762982 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on the pull request: https://github.com/apache/spark/pull/7412#issuecomment-121762995 My concern is not projected database > # patterns, rather it is that the `groupByKey` on L103 will overload an executor if some key (prefix) has many values (suffix

[GitHub] spark pull request: [SPARK-8995][SQL] cast date strings like '2015...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7353#issuecomment-121762926 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-8297] [YARN] Scheduler backend is not n...

2015-07-15 Thread mridulm
Github user mridulm commented on the pull request: https://github.com/apache/spark/pull/7431#issuecomment-121762507 Looks good to me ! If Tom does not come back with objections I am +1 on this. --- If your project is set up for it, you can reply to this email and have your reply appe

[GitHub] spark pull request: [SPARK-8103][core] DAGScheduler should not sub...

2015-07-15 Thread kayousterhout
Github user kayousterhout commented on the pull request: https://github.com/apache/spark/pull/6750#issuecomment-121761513 @squito I think there's a simpler fix here: I don't think TaskSchedulerImpl actually needs to know about all of the potential zombie task attempts. Right now ther

[GitHub] spark pull request: [SPARK-9071][SQL] MonotonicallyIncreasingID an...

2015-07-15 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/7428 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [SPARK-746][CORE] Added Avro Serialization to ...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7004#issuecomment-121760780 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-746][CORE] Added Avro Serialization to ...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7004#issuecomment-121760803 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-8297] [YARN] Scheduler backend is not n...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7431#issuecomment-121760479 [Test build #37412 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37412/consoleFull) for PR 7431 at commit [`04dc112`](https://gith

[GitHub] spark pull request: [SPARK-7127] [MLLIB] [WIP] Adding broadcast of...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6300#issuecomment-121760349 [Test build #1077 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1077/console) for PR 6300 at commit [`86e73de`](https://github

[GitHub] spark pull request: [SPARK-8297] [YARN] Scheduler backend is not n...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7431#issuecomment-121759706 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-8297] [YARN] Scheduler backend is not n...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7431#issuecomment-121759734 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-9071][SQL] MonotonicallyIncreasingID an...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7428#issuecomment-121759477 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-9071][SQL] MonotonicallyIncreasingID an...

2015-07-15 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7428#issuecomment-121759374 [Test build #37401 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37401/console) for PR 7428 at commit [`a760827`](https://github.

[GitHub] spark pull request: [SPARK-8297] [YARN] Scheduler backend is not n...

2015-07-15 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/7431#issuecomment-121759019 This is an updated version of #7243, which also works in yarn-client mode. /cc @mridulm @tgravescs I ran a bunch of tests on yarn client and cluster modes

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34733453 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -82,20 +84,70 @@ class PrefixSpan private ( logWarning("Inpu

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34733403 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -82,20 +84,70 @@ class PrefixSpan private ( logWarning("Inpu

[GitHub] spark pull request: [SPARK-8297] [YARN] Scheduler backend is not n...

2015-07-15 Thread vanzin
GitHub user vanzin opened a pull request: https://github.com/apache/spark/pull/7431 [SPARK-8297] [YARN] Scheduler backend is not notified in case node fails in YARN This change adds code to notify the scheduler backend when a container dies in YARN. You can merge this pull reque

[GitHub] spark pull request: [SPARK-8998][MLlib] Collect enough frequent pr...

2015-07-15 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7412#discussion_r34733098 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -86,16 +88,69 @@ class PrefixSpan private ( getFreqItemAndCo

[GitHub] spark pull request: [WIP][SPARK-7018][Build]: Refactor dev/run-tes...

2015-07-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7401#issuecomment-121757988 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

<    1   2   3   4   5   6   7   8   9   10   >