[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/7232 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/7232#issuecomment-118745945 Thanks for looking at this. Going to merge it. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your proj

[GitHub] spark pull request: [SPARK-7190] [SPARK-8804] [SPARK-7815] unsafe ...

2015-07-05 Thread tarekauel
Github user tarekauel commented on a diff in the pull request: https://github.com/apache/spark/pull/7197#discussion_r33909453 --- Diff: unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -44,60 +47,77 @@ 5, 5, 5, 5, 6, 6, 6, 6}; +

[GitHub] spark pull request: [SPARK-8270][SQL] levenshtein distance

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/7214#discussion_r33909072 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/functions.scala --- @@ -1580,22 +1580,37 @@ object functions {

[GitHub] spark pull request: [SPARK-6776] [SPARK-8811] [SQL] Refactors Parq...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7231#issuecomment-118744074 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7232#issuecomment-118744073 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7232#issuecomment-118743990 [Test build #36558 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36558/console) for PR 7232 at commit [`934d3d1`](https://github.

[GitHub] spark pull request: [SPARK-6776] [SPARK-8811] [SQL] Refactors Parq...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7231#issuecomment-118743989 [Test build #36557 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36557/console) for PR 7231 at commit [`8be4723`](https://github.

[GitHub] spark pull request: [SPARK-8753][SQL][WIP] Create an IntervalType ...

2015-07-05 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/7226#issuecomment-118743957 We should add a way to cast interval into string and string to interval too. That can go in a separate pull request though. --- If your project is set up for it, you can

[GitHub] spark pull request: [SPARK-8753][SQL][WIP] Create an IntervalType ...

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/7226#discussion_r33908974 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala --- @@ -1458,4 +1458,14 @@ class SQLQuerySuite extends QueryTest with BeforeAndAfterA

[GitHub] spark pull request: [SPARK-8753][SQL][WIP] Create an IntervalType ...

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/7226#discussion_r33908958 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/sources/ddl.scala --- @@ -303,6 +303,10 @@ private[sql] object ResolvedDataSource { mode: SaveMo

[GitHub] spark pull request: [SPARK-8759][SQL] add default eval to binary a...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7157#issuecomment-118743737 [Test build #36562 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36562/consoleFull) for PR 7157 at commit [`2876cbd`](https://gith

[GitHub] spark pull request: [SPARK-8401] [Build] Scala version switching b...

2015-07-05 Thread mallman
Github user mallman commented on the pull request: https://github.com/apache/spark/pull/6832#issuecomment-118743436 @srowen I just returned from my vacation abroad and am catching up. Sorry for the wait. I'll take a look at this tomorrow. Cheers. --- If your project is set up for it,

[GitHub] spark pull request: [SPARK-8759][SQL] add default eval to binary a...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7157#issuecomment-118743342 Build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this f

[GitHub] spark pull request: [SPARK-8759][SQL] add default eval to binary a...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7157#issuecomment-118743331 Build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have thi

[GitHub] spark pull request: [SPARK-8270][SQL] levenshtein distance

2015-07-05 Thread tarekauel
Github user tarekauel commented on a diff in the pull request: https://github.com/apache/spark/pull/7214#discussion_r33908578 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/functions.scala --- @@ -1580,22 +1580,37 @@ object functions { ///

[GitHub] spark pull request: [SPARK-8018][MLlib]KMeans should accept initia...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6737#issuecomment-118738599 [Test build #36561 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36561/console) for PR 6737 at commit [`06d13ef`](https://github.

[GitHub] spark pull request: [SPARK-8018][MLlib]KMeans should accept initia...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6737#issuecomment-118738773 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-8811][SQL] Read array struct data from ...

2015-07-05 Thread liancheng
Github user liancheng commented on the pull request: https://github.com/apache/spark/pull/7209#issuecomment-118737760 @scwf Yeah, I didn't make the `element` to `array` change in #7231. It would be good to have one, either based on this PR or open a new one. The tricky part is it need

[GitHub] spark pull request: [SPARK-8753][SQL][WIP] Create an IntervalType ...

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/7226#discussion_r33908086 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/types/DayTimeIntervalType.scala --- @@ -0,0 +1,45 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread yijieshen
Github user yijieshen commented on the pull request: https://github.com/apache/spark/pull/7232#issuecomment-118736827 Tried this out locally and works fine. LGTM as well. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If

[GitHub] spark pull request: [Spark-8703] [ML] Add CountVectorizer as a ml ...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7084#issuecomment-118735867 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [Spark-8703] [ML] Add CountVectorizer as a ml ...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7084#issuecomment-118735791 [Test build #36560 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36560/console) for PR 7084 at commit [`24728e4`](https://github.

[GitHub] spark pull request: [SPARK-8271][SQL]string function: soundex

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/7115#discussion_r33907699 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringOperations.scala --- @@ -301,6 +303,27 @@ case class StringLength(child: E

[GitHub] spark pull request: [SPARK-8271][SQL]string function: soundex

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/7115#discussion_r33907684 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringOperations.scala --- @@ -301,6 +303,27 @@ case class StringLength(child: E

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread cloud-fan
Github user cloud-fan commented on the pull request: https://github.com/apache/spark/pull/7232#issuecomment-118735055 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request: [SPARK-8271][SQL]string function: soundex

2015-07-05 Thread HuJiayin
Github user HuJiayin commented on the pull request: https://github.com/apache/spark/pull/7115#issuecomment-118735097 @rxin @davies @liancheng can you trigger the unit test? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well.

[GitHub] spark pull request: [SPARK-8018][MLlib]KMeans should accept initia...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6737#issuecomment-118733142 [Test build #36561 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36561/consoleFull) for PR 6737 at commit [`06d13ef`](https://gith

[GitHub] spark pull request: [SPARK-8018][MLlib]KMeans should accept initia...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6737#issuecomment-118732111 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-8018][MLlib]KMeans should accept initia...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6737#issuecomment-118732126 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-8819] Fix build for maven 3.3.x

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7219#issuecomment-118730797 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-8819] Fix build for maven 3.3.x

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7219#issuecomment-118730681 [Test build #36556 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36556/console) for PR 7219 at commit [`537579a`](https://github.

[GitHub] spark pull request: [SPARK-4176][WIP] Support decimal types with p...

2015-07-05 Thread liancheng
Github user liancheng commented on the pull request: https://github.com/apache/spark/pull/6796#issuecomment-118730215 Hey @rtreffer, just want to make sure whether you are still working on this? I'm asking because I just opened #7231 to refactor Parquet read path for interoperability

[GitHub] spark pull request: [Spark-8703] [ML] Add CountVectorizer as a ml ...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7084#issuecomment-118729997 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: Fixed First and Last aggregates to compute fir...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7233#issuecomment-118727714 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your pr

[GitHub] spark pull request: [Spark-8703] [ML] Add CountVectorizer as a ml ...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7084#issuecomment-118726940 [Test build #36560 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36560/consoleFull) for PR 7084 at commit [`24728e4`](https://gith

[GitHub] spark pull request: Fixed First and Last aggregates to compute fir...

2015-07-05 Thread ggupta81
GitHub user ggupta81 opened a pull request: https://github.com/apache/spark/pull/7233 Fixed First and Last aggregates to compute first and last value per G… In current implementation, First and Last aggregates were calculating the values for entire DataFrame partition and then the

[GitHub] spark pull request: [Spark-8703] [ML] Add CountVectorizer as a ml ...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7084#issuecomment-118725388 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [Spark-8703] [ML] Add CountVectorizer as a ml ...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7084#issuecomment-118725374 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [Spark-8703] [ML] Add CountVectorizer as a ml ...

2015-07-05 Thread hhbyyh
Github user hhbyyh commented on the pull request: https://github.com/apache/spark/pull/7084#issuecomment-118725060 Thank a lot @jkbradley. I sent an update with: 1. change the class name to CountVectorizerModel. 2. make vocab a val. 3. change minTermCount to minTermFreq and

[GitHub] spark pull request: [Spark-8703] [ML] Add CountVectorizer as a ml ...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7084#issuecomment-118724246 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [Spark-8703] [ML] Add CountVectorizer as a ml ...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7084#issuecomment-118724201 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread yijieshen
Github user yijieshen commented on the pull request: https://github.com/apache/spark/pull/7232#issuecomment-118721775 Yep, just what I need :) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-8753][SQL][WIP] Create an IntervalType ...

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/7226#discussion_r33906404 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/SqlParser.scala --- @@ -58,6 +58,7 @@ class SqlParser extends AbstractSparkSQLParser with Da

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7232#issuecomment-118721345 [Test build #36558 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36558/consoleFull) for PR 7232 at commit [`934d3d1`](https://gith

[GitHub] spark pull request: fix percentPruned log message

2015-07-05 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/7227#issuecomment-118720578 cc @liancheng --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature en

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7232#issuecomment-118720476 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: fix percentPruned log message

2015-07-05 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/7227#issuecomment-118720548 Jenkins, test this please. (Did something go wrong with Jenkins?) --- If your project is set up for it, you can reply to this email and have your reply appear on G

[GitHub] spark pull request: [SPARK-7050][build] Keep kafka-assembly maven ...

2015-07-05 Thread jerryshao
Github user jerryshao commented on the pull request: https://github.com/apache/spark/pull/5632#issuecomment-118720543 Hi @srowen , if the assembly jar is not found using this pattern `target/scala-*/spark-streaming-kafka-assembly-*.jar`, the python Kafka test will not be run at all.

[GitHub] spark pull request: [SPARK-8753][SQL][WIP] Create an IntervalType ...

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/7226#discussion_r33906339 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/types/DayTimeIntervalType.scala --- @@ -0,0 +1,45 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7232#issuecomment-118720493 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/7232#issuecomment-118719989 cc @yijieshen I think you were looking for this in round? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread rxin
GitHub user rxin opened a pull request: https://github.com/apache/spark/pull/7232 [SPARK-8831][SQL] Support AbstractDataType in TypeCollection. Otherwise it is impossible to declare an expression supporting DecimalType. You can merge this pull request into a Git repository by runni

[GitHub] spark pull request: [SPARK-8831][SQL] Support AbstractDataType in ...

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/7232#discussion_r33906266 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/HiveTypeCoercion.scala --- @@ -708,8 +708,6 @@ object HiveTypeCoercion {

[GitHub] spark pull request: [SPARK-8233][SQL] misc function: hash

2015-07-05 Thread chenghao-intel
Github user chenghao-intel commented on a diff in the pull request: https://github.com/apache/spark/pull/6971#discussion_r33906234 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/misc.scala --- @@ -47,4 +48,44 @@ case class Md5(child: Expression)

[GitHub] spark pull request: [SPARK-8819] Fix build for maven 3.3.x

2015-07-05 Thread pwendell
Github user pwendell commented on the pull request: https://github.com/apache/spark/pull/7219#issuecomment-118719759 I made a minor comment regarding the name. Pending that small update this LGTM --- If your project is set up for it, you can reply to this email and have your reply ap

[GitHub] spark pull request: [SPARK-8819] Fix build for maven 3.3.x

2015-07-05 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/7219#discussion_r33906211 --- Diff: pom.xml --- @@ -1826,6 +1830,26 @@ + + + release-profile --- End diff -- Actuall

[GitHub] spark pull request: [SPARK-8819] Fix build for maven 3.3.x

2015-07-05 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/7219#discussion_r33906199 --- Diff: pom.xml --- @@ -1826,6 +1830,26 @@ + + + release-profile --- End diff -- Can you

[GitHub] spark pull request: [SPARK-8233][SQL] misc function: hash

2015-07-05 Thread chenghao-intel
Github user chenghao-intel commented on a diff in the pull request: https://github.com/apache/spark/pull/6971#discussion_r33906097 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/misc.scala --- @@ -47,4 +48,44 @@ case class Md5(child: Expression)

[GitHub] spark pull request: [SPARK-8270][SQL] levenshtein distance

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/7214#discussion_r33906065 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/functions.scala --- @@ -1580,22 +1580,37 @@ object functions {

[GitHub] spark pull request: [SQL][Minor] Update the DataFrame API for enco...

2015-07-05 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/7230 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [SPARK-8221][SQL]Add pmod function

2015-07-05 Thread zhichao-li
Github user zhichao-li commented on the pull request: https://github.com/apache/spark/pull/6783#issuecomment-118715210 cc @rxin @davies could you take a look at this? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If y

[GitHub] spark pull request: [SQL][Minor] Update the DataFrame API for enco...

2015-07-05 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/7230#issuecomment-118714652 Thanks - merging this. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this f

[GitHub] spark pull request: [SPARK-8549][SparkR] Fix the line length of Sp...

2015-07-05 Thread yu-iskw
Github user yu-iskw commented on the pull request: https://github.com/apache/spark/pull/7204#issuecomment-118711831 @rxin Thank you for letting me know. I have updated my github email in my account profile. --- If your project is set up for it, you can reply to this email and have yo

[GitHub] spark pull request: [SPARK-6776] [SPARK-8811] [SQL] Refactors Parq...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7231#issuecomment-118710803 [Test build #36557 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36557/consoleFull) for PR 7231 at commit [`8be4723`](https://gith

[GitHub] spark pull request: [SPARK-8233][SQL] misc function: hash

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/6971#discussion_r33905466 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/misc.scala --- @@ -47,4 +48,44 @@ case class Md5(child: Expression) "

[GitHub] spark pull request: [SPARK-6776] [SPARK-8811] [SQL] Refactors Parq...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7231#issuecomment-118710134 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6776] [SPARK-8811] [SQL] Refactors Parq...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7231#issuecomment-118710172 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-8270][SQL] levenshtein distance

2015-07-05 Thread tarekauel
Github user tarekauel commented on a diff in the pull request: https://github.com/apache/spark/pull/7214#discussion_r33905262 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/functions.scala --- @@ -1580,22 +1580,37 @@ object functions { ///

[GitHub] spark pull request: [SPARK-8549][SparkR] Fix the line length of Sp...

2015-07-05 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/7204#issuecomment-118709270 @yu-iskw you might want to update your github email to include the email address you used for your commits. Otherwise it doesn't show up in the github metadata. --- If y

[GitHub] spark pull request: [SPARK-8270][SQL] levenshtein distance

2015-07-05 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/7214#discussion_r33905094 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/functions.scala --- @@ -1580,22 +1580,37 @@ object functions {

[GitHub] spark pull request: [SPARK-8270][SQL] levenshtein distance

2015-07-05 Thread tarekauel
Github user tarekauel commented on a diff in the pull request: https://github.com/apache/spark/pull/7214#discussion_r33904993 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/functions.scala --- @@ -1580,22 +1580,37 @@ object functions { ///

[GitHub] spark pull request: [SPARK-8269][SQL]string function: initcap

2015-07-05 Thread tarekauel
Github user tarekauel commented on a diff in the pull request: https://github.com/apache/spark/pull/7208#discussion_r33904801 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringOperations.scala --- @@ -298,3 +299,56 @@ case class StringLength(chi

[GitHub] spark pull request: [SPARK-5259][CORE]Make sure shuffle metadata a...

2015-07-05 Thread suyanNone
Github user suyanNone commented on the pull request: https://github.com/apache/spark/pull/4055#issuecomment-118708012 @squito oh...I had skipped it... 1) Task attempt now is described in `TaskInfo` in Spark `TaskSetManager`. `TaskSetManager` is responsible for completing task attem

[GitHub] spark pull request: [SPARK-8549][SparkR] Fix the line length of Sp...

2015-07-05 Thread yu-iskw
Github user yu-iskw commented on the pull request: https://github.com/apache/spark/pull/7204#issuecomment-118707328 @shivaram Thank you for merging it! I agree with that. I will check the lintr code and ask Jim it. --- If your project is set up for it, you can reply to this email

[GitHub] spark pull request: [SPARK-8549][SparkR] Fix the line length of Sp...

2015-07-05 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/7204 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [SPARK-8549][SparkR] Fix the line length of Sp...

2015-07-05 Thread shivaram
Github user shivaram commented on the pull request: https://github.com/apache/spark/pull/7204#issuecomment-118706866 Thanks @yu-iskw. BTW We should investigate how to get rid of the false warnings `no visible global function definition` as those functions are package private.

[GitHub] spark pull request: [SPARK-8819] Fix build for maven 3.3.x

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7219#issuecomment-118705402 [Test build #36556 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36556/consoleFull) for PR 7219 at commit [`537579a`](https://gith

[GitHub] spark pull request: [SPARK-8819] Fix build for maven 3.3.x

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7219#issuecomment-118703671 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-8819] Fix build for maven 3.3.x

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7219#issuecomment-118703711 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-8389][Streaming][PySpark] Expose KafkaR...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7185#issuecomment-118703589 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-8389][Streaming][PySpark] Expose KafkaR...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7185#issuecomment-118703559 [Test build #36555 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36555/console) for PR 7185 at commit [`fd13937`](https://github.

[GitHub] spark pull request: [SPARK-8819] Fix build for maven 3.3.x

2015-07-05 Thread andrewor14
Github user andrewor14 commented on the pull request: https://github.com/apache/spark/pull/7219#issuecomment-118703366 @srowen @pwendell please let me know whether I understood your comments correctly in the latest commit. --- If your project is set up for it, you can reply to this e

[GitHub] spark pull request: [SPARK-8819] Fix build for maven 3.3.x

2015-07-05 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request: https://github.com/apache/spark/pull/7219#discussion_r33904037 --- Diff: pom.xml --- @@ -1826,6 +1830,26 @@ + + + release --- End diff -- I still need

[GitHub] spark pull request: [SPARK-8221][SQL]Add pmod function

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6783#issuecomment-118700999 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-8221][SQL]Add pmod function

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6783#issuecomment-118700970 [Test build #36552 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36552/console) for PR 6783 at commit [`cbaf9f5`](https://github.

[GitHub] spark pull request: [SPARK-8271][SQL]string function: soundex

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7115#issuecomment-118700840 [Test build #999 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/999/console) for PR 7115 at commit [`f88d320`](https://github.c

[GitHub] spark pull request: [SPARK-8389][Streaming][PySpark] Expose KafkaR...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7185#issuecomment-118700156 [Test build #36555 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36555/consoleFull) for PR 7185 at commit [`fd13937`](https://gith

[GitHub] spark pull request: [SPARK-8811][SQL] Read array struct data from ...

2015-07-05 Thread Sephiroth-Lin
Github user Sephiroth-Lin commented on the pull request: https://github.com/apache/spark/pull/7209#issuecomment-118699916 @liancheng OK, good, thank you. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project doe

[GitHub] spark pull request: [SPARK-8389][Streaming][PySpark] Expose KafkaR...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7185#issuecomment-118699892 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-8389][Streaming][PySpark] Expose KafkaR...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7185#issuecomment-118699883 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-8233][SQL] misc function: hash

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6971#issuecomment-118699528 [Test build #998 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/998/console) for PR 6971 at commit [`d79088b`](https://github.c

[GitHub] spark pull request: [SPARK-8271][SQL]string function: soundex

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7115#issuecomment-118699444 [Test build #999 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/999/consoleFull) for PR 7115 at commit [`f88d320`](https://githu

[GitHub] spark pull request: [SPARK-8233][SQL] misc function: hash

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6971#issuecomment-118699427 [Test build #998 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/998/consoleFull) for PR 6971 at commit [`d79088b`](https://githu

[GitHub] spark pull request: [SPARK-6776] [SPARK-8811] [SQL] Refactors Parq...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7231#issuecomment-118698676 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6776] [SPARK-8811] [SQL] Refactors Parq...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7231#issuecomment-118698670 [Test build #36554 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36554/console) for PR 7231 at commit [`9b87903`](https://github.

[GitHub] spark pull request: [SPARK-6776] [SPARK-8811] [SQL] Refactors Parq...

2015-07-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7231#issuecomment-118695116 [Test build #36554 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36554/consoleFull) for PR 7231 at commit [`9b87903`](https://gith

[GitHub] spark pull request: [SPARK-6776] [SPARK-8811] [SQL] Refactors Parq...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7231#issuecomment-118694192 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6776] [SPARK-8811] [SQL] Refactors Parq...

2015-07-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7231#issuecomment-118694202 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [Spark-8703] [ML] Add CountVectorizer as a ml ...

2015-07-05 Thread hhbyyh
Github user hhbyyh commented on a diff in the pull request: https://github.com/apache/spark/pull/7084#discussion_r33902864 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/CountVectorizer.scala --- @@ -0,0 +1,79 @@ +/* + * Licensed to the Apache Software Foundatio

  1   2   >