[GitHub] spark pull request: Fix SPARK-1413: Parquet messes up stdout and s...

2014-04-09 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/325#discussion_r11472274 --- Diff: core/src/main/scala/org/apache/spark/Logging.scala --- @@ -135,4 +136,6 @@ trait Logging { private object Logging { @volatile private var in

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40048195 Merged build finished. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40048196 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13983/ --- If your project is set up for it, you can r

[GitHub] spark pull request: Fix SPARK-1413: Parquet messes up stdout and s...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/325#discussion_r11472039 --- Diff: core/src/main/scala/org/apache/spark/Logging.scala --- @@ -135,4 +136,6 @@ trait Logging { private object Logging { @volatile private var

[GitHub] spark pull request: Fix SPARK-1413: Parquet messes up stdout and s...

2014-04-09 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/325#discussion_r11471966 --- Diff: core/src/main/scala/org/apache/spark/Logging.scala --- @@ -135,4 +136,6 @@ trait Logging { private object Logging { @volatile private var in

[GitHub] spark pull request: Fix SPARK-1413: Parquet messes up stdout and s...

2014-04-09 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/325#discussion_r11471958 --- Diff: core/src/main/scala/org/apache/spark/Logging.scala --- @@ -135,4 +136,6 @@ trait Logging { private object Logging { @volatile private var i

[GitHub] spark pull request: Fix SPARK-1413: Parquet messes up stdout and s...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/325#discussion_r11471909 --- Diff: core/src/main/scala/org/apache/spark/Logging.scala --- @@ -135,4 +136,6 @@ trait Logging { private object Logging { @volatile private var

[GitHub] spark pull request: Fix SPARK-1413: Parquet messes up stdout and s...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/325#discussion_r11471870 --- Diff: core/src/main/scala/org/apache/spark/Logging.scala --- @@ -135,4 +136,6 @@ trait Logging { private object Logging { @volatile private var

[GitHub] spark pull request: Fix SPARK-1413: Parquet messes up stdout and s...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/325#discussion_r11471838 --- Diff: core/src/main/scala/org/apache/spark/Logging.scala --- @@ -135,4 +136,6 @@ trait Logging { private object Logging { @volatile private var

[GitHub] spark pull request: [WIP] Spark-1392: Add parameter to reserve min...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/377#issuecomment-40047132 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your proj

[GitHub] spark pull request: [WIP] Spark-1392: Add parameter to reserve min...

2014-04-09 Thread patmcdonough
GitHub user patmcdonough opened a pull request: https://github.com/apache/spark/pull/377 [WIP] Spark-1392: Add parameter to reserve minimum memory for the system and increase default executor memory * create a new property that sets the minimum amount of heap reserved for the syste

[GitHub] spark pull request: PySpark API for SparkSQL

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40046633 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not ha

[GitHub] spark pull request: PySpark API for SparkSQL

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/363#issuecomment-40046637 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: SPARK-1446: Spark examples should not do a Sys...

2014-04-09 Thread techaddict
Github user techaddict commented on the pull request: https://github.com/apache/spark/pull/370#issuecomment-40046601 @tgravescs any changes ? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have t

[GitHub] spark pull request: SPARK-1428: MLlib should convert non-float64 N...

2014-04-09 Thread techaddict
Github user techaddict commented on the pull request: https://github.com/apache/spark/pull/356#issuecomment-40046556 @mateiz is there any other problem ? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project doe

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread andrewor14
Github user andrewor14 commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40046213 Not from my side (unless the tests fail...) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your proj

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40046214 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not ha

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40046220 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40046079 This LGTM pending the small fix of changing Finished --> Completed in the UI. Is there anything else you want to do here or is this okay to merge? --- If your project i

[GitHub] spark pull request: SPARK-1202 - Add a "cancel" button in the UI f...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/246#issuecomment-40045886 Build finished. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this f

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11471050 --- Diff: core/src/main/scala/org/apache/spark/ui/WebUI.scala --- @@ -20,6 +20,25 @@ package org.apache.spark.ui import java.text.SimpleDateFormat im

[GitHub] spark pull request: Fix SPARK-1413: Parquet messes up stdout and s...

2014-04-09 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/325#discussion_r11471046 --- Diff: core/src/main/scala/org/apache/spark/Logging.scala --- @@ -135,4 +136,6 @@ trait Logging { private object Logging { @volatile private var i

[GitHub] spark pull request: SPARK-1202 - Add a "cancel" button in the UI f...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/246#issuecomment-40045887 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13987/ --- If your project is set up for it, you can r

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11471042 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/IndexPage.scala --- @@ -0,0 +1,82 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] spark pull request: SPARK-1202 - Add a "cancel" button in the UI f...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/246#issuecomment-40045814 Build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this fe

[GitHub] spark pull request: SPARK-1202 - Add a "cancel" button in the UI f...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/246#issuecomment-40045810 Build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40045811 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not ha

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40045815 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11471030 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/IndexPage.scala --- @@ -0,0 +1,82 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11471015 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala --- @@ -0,0 +1,287 @@ +/* + * Licensed to the Apache Software Founda

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/376#issuecomment-40045623 Build finished. All automated tests passed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your p

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/268#issuecomment-40045593 Build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/376#issuecomment-40045624 All automated tests passed. Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13982/ --- If your project

[GitHub] spark pull request: [SPARK-1386] Web UI for Spark Streaming

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/290#issuecomment-40045592 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not ha

[GitHub] spark pull request: [SPARK-1386] Web UI for Spark Streaming

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/290#issuecomment-40045602 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/268#issuecomment-40045601 Build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this fe

[GitHub] spark pull request: [SPARK-1386] Web UI for Spark Streaming

2014-04-09 Thread tdas
Github user tdas commented on the pull request: https://github.com/apache/spark/pull/290#issuecomment-40045530 @pwendell This ready for review to be started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-1386] Web UI for Spark Streaming

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/290#issuecomment-40045215 Merged build finished. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-1386] Web UI for Spark Streaming

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/290#issuecomment-40045216 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13984/ --- If your project is set up for it, you can r

[GitHub] spark pull request: [SPARK-1386] Web UI for Spark Streaming [WIP]

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/290#issuecomment-40045150 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not ha

[GitHub] spark pull request: [SPARK-1386] Web UI for Spark Streaming [WIP]

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/290#issuecomment-40045153 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/268#issuecomment-40044966 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13981/ --- If your project is set up for it, you can r

[GitHub] spark pull request: [WIP][BUGFIX] In-memory columnar storage bug f...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/374#issuecomment-40044965 Merged build finished. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/268#issuecomment-40044964 Build finished. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this f

[GitHub] spark pull request: [SPARK-1225, 1241] [MLLIB] Add AreaUnderCurve ...

2014-04-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/364#discussion_r11470702 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/evaluation/binary/BinaryConfusionMatrix.scala --- @@ -0,0 +1,41 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [WIP][BUGFIX] In-memory columnar storage bug f...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/374#issuecomment-40044967 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13980/ --- If your project is set up for it, you can r

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11470537 --- Diff: core/src/main/scala/org/apache/spark/scheduler/EventLoggingListener.scala --- @@ -90,9 +97,115 @@ private[spark] class EventLoggingListener(appName:

[GitHub] spark pull request: SPARK-1310: Start adding k-fold cross validati...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/18#issuecomment-40044574 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13979/ --- If your project is set up for it, you can re

[GitHub] spark pull request: SPARK-1310: Start adding k-fold cross validati...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/18#issuecomment-40044573 Merged build finished. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-1225, 1241] [MLLIB] Add AreaUnderCurve ...

2014-04-09 Thread mateiz
Github user mateiz commented on a diff in the pull request: https://github.com/apache/spark/pull/364#discussion_r11470460 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/evaluation/binary/BinaryConfusionMatrix.scala --- @@ -0,0 +1,41 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread yinxusen
Github user yinxusen commented on the pull request: https://github.com/apache/spark/pull/376#issuecomment-40044469 How about to add a subclass called `WholeTextFileRDD` extends from `NewHadoopRDD`, and use the `setMaxSplitSize` only for this subclass? --- If your project is set up fo

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread yinxusen
Github user yinxusen commented on a diff in the pull request: https://github.com/apache/spark/pull/376#discussion_r11470220 --- Diff: core/src/main/scala/org/apache/spark/input/WholeTextFileInputFormat.scala --- @@ -44,4 +47,15 @@ private[spark] class WholeTextFileInputFormat exten

[GitHub] spark pull request: [SPARK-1225, 1241] [MLLIB] Add AreaUnderCurve ...

2014-04-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/364#discussion_r11470221 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/evaluation/binary/BinaryClassificationMetrics.scala --- @@ -0,0 +1,204 @@ +/* + * Licensed to the

[GitHub] spark pull request: [SPARK-1225, 1241] [MLLIB] Add AreaUnderCurve ...

2014-04-09 Thread mateiz
Github user mateiz commented on a diff in the pull request: https://github.com/apache/spark/pull/364#discussion_r11470216 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/evaluation/binary/BinaryClassificationMetrics.scala --- @@ -0,0 +1,204 @@ +/* + * Licensed to the

[GitHub] spark pull request: [SPARK-1225, 1241] [MLLIB] Add AreaUnderCurve ...

2014-04-09 Thread mateiz
Github user mateiz commented on a diff in the pull request: https://github.com/apache/spark/pull/364#discussion_r11470192 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/evaluation/binary/BinaryClassificationMetrics.scala --- @@ -0,0 +1,204 @@ +/* + * Licensed to the

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/268#discussion_r11470173 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/RowMatrix.scala --- @@ -273,6 +411,27 @@ class RowMatrix( } mat

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40044103 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40044096 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not ha

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread yinxusen
Github user yinxusen commented on a diff in the pull request: https://github.com/apache/spark/pull/376#discussion_r11470152 --- Diff: core/src/main/scala/org/apache/spark/input/WholeTextFileInputFormat.scala --- @@ -44,4 +47,15 @@ private[spark] class WholeTextFileInputFormat exten

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread andrewor14
Github user andrewor14 commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40044025 Jenkins, retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not hav

[GitHub] spark pull request: SPARK-1310: Start adding k-fold cross validati...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/18#issuecomment-40043941 Merged build finished. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: SPARK-1310: Start adding k-fold cross validati...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/18#issuecomment-40043942 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13977/ --- If your project is set up for it, you can re

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread mateiz
Github user mateiz commented on a diff in the pull request: https://github.com/apache/spark/pull/268#discussion_r11470086 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/RowMatrix.scala --- @@ -273,6 +411,27 @@ class RowMatrix( } mat

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread yinxusen
Github user yinxusen commented on a diff in the pull request: https://github.com/apache/spark/pull/268#discussion_r11470053 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/RowMatrix.scala --- @@ -19,13 +19,144 @@ package org.apache.spark.mllib.linalg.distr

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread mateiz
Github user mateiz commented on a diff in the pull request: https://github.com/apache/spark/pull/268#discussion_r11470050 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/RowMatrix.scala --- @@ -19,13 +19,144 @@ package org.apache.spark.mllib.linalg.distrib

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread mateiz
Github user mateiz commented on a diff in the pull request: https://github.com/apache/spark/pull/268#discussion_r11470041 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/RowMatrix.scala --- @@ -19,13 +19,144 @@ package org.apache.spark.mllib.linalg.distrib

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread mateiz
Github user mateiz commented on a diff in the pull request: https://github.com/apache/spark/pull/268#discussion_r11470006 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/RowMatrix.scala --- @@ -19,13 +19,144 @@ package org.apache.spark.mllib.linalg.distrib

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11470003 --- Diff: core/src/main/scala/org/apache/spark/scheduler/EventLoggingListener.scala --- @@ -90,9 +97,115 @@ private[spark] class EventLoggingListener(appNam

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread yinxusen
Github user yinxusen commented on a diff in the pull request: https://github.com/apache/spark/pull/376#discussion_r11469988 --- Diff: core/src/main/scala/org/apache/spark/input/WholeTextFileInputFormat.scala --- @@ -44,4 +47,15 @@ private[spark] class WholeTextFileInputFormat exten

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469982 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala --- @@ -0,0 +1,287 @@ +/* + * Licensed to the Apache Software Foun

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread mateiz
Github user mateiz commented on the pull request: https://github.com/apache/spark/pull/376#issuecomment-40043478 Hey FYI I've decided to wait a bit on this because Lucian wants to cut a stable branch soon, but we'll probably merge it tomorrow. --- If your project is set up for it, yo

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469955 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala --- @@ -0,0 +1,287 @@ +/* + * Licensed to the Apache Software Foun

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469937 --- Diff: core/src/main/scala/org/apache/spark/scheduler/ApplicationEventListener.scala --- @@ -0,0 +1,51 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469925 --- Diff: core/src/main/scala/org/apache/spark/scheduler/EventLoggingListener.scala --- @@ -90,9 +97,115 @@ private[spark] class EventLoggingListener(appNam

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread mateiz
Github user mateiz commented on the pull request: https://github.com/apache/spark/pull/376#issuecomment-40043334 I see, this is indeed a bit more complicated -- I asked some questions on it there. The weird part is that minSplits is passed to NewHadoopRDD but only used for this InputF

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread mateiz
Github user mateiz commented on a diff in the pull request: https://github.com/apache/spark/pull/376#discussion_r11469906 --- Diff: core/src/main/scala/org/apache/spark/input/WholeTextFileInputFormat.scala --- @@ -44,4 +47,15 @@ private[spark] class WholeTextFileInputFormat extends

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread mateiz
Github user mateiz commented on a diff in the pull request: https://github.com/apache/spark/pull/376#discussion_r11469892 --- Diff: core/src/main/scala/org/apache/spark/input/WholeTextFileInputFormat.scala --- @@ -44,4 +47,15 @@ private[spark] class WholeTextFileInputFormat extends

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469880 --- Diff: core/src/main/scala/org/apache/spark/scheduler/EventLoggingListener.scala --- @@ -90,9 +97,115 @@ private[spark] class EventLoggingListener(appNam

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469876 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala --- @@ -0,0 +1,287 @@ +/* + * Licensed to the Apache Software Founda

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread mateiz
Github user mateiz commented on a diff in the pull request: https://github.com/apache/spark/pull/376#discussion_r11469867 --- Diff: core/src/main/scala/org/apache/spark/SparkContext.scala --- @@ -385,7 +385,7 @@ class SparkContext( * hdfs://a-hdfs-path/part-n *

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread mateiz
Github user mateiz commented on a diff in the pull request: https://github.com/apache/spark/pull/376#discussion_r11469873 --- Diff: core/src/main/scala/org/apache/spark/api/java/JavaSparkContext.scala --- @@ -167,7 +167,8 @@ class JavaSparkContext(val sc: SparkContext) extends Jav

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/376#issuecomment-40043096 Build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this fe

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/376#issuecomment-40043092 Build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469853 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala --- @@ -0,0 +1,287 @@ +/* + * Licensed to the Apache Software Foun

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread yinxusen
Github user yinxusen commented on the pull request: https://github.com/apache/spark/pull/376#issuecomment-40043078 @mateiz , I have to modify some APIs so as to add the `minSplits`. I am not sure whether the modification is good or not. Could you have a look at it? --- If your projec

[GitHub] spark pull request: [SPARK-1415] Hadoop min split for wholeTextFil...

2014-04-09 Thread yinxusen
GitHub user yinxusen opened a pull request: https://github.com/apache/spark/pull/376 [SPARK-1415] Hadoop min split for wholeTextFiles() JIRA issue [here](https://issues.apache.org/jira/browse/SPARK-1415). New Hadoop API of `InputFormat` does not provide the `minSplits` param

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40042754 Merged build finished. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/204#issuecomment-40042755 Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13976/ --- If your project is set up for it, you can r

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/268#issuecomment-40042693 @yinxusen Thanks for your work! LGTM and let's wait for Jenkins. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as w

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469740 --- Diff: core/src/main/scala/org/apache/spark/scheduler/EventLoggingListener.scala --- @@ -90,9 +97,115 @@ private[spark] class EventLoggingListener(appName:

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469656 --- Diff: core/src/main/scala/org/apache/spark/scheduler/EventLoggingListener.scala --- @@ -90,9 +97,115 @@ private[spark] class EventLoggingListener(appName:

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469601 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala --- @@ -0,0 +1,287 @@ +/* + * Licensed to the Apache Software Founda

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469526 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServerArguments.scala --- @@ -0,0 +1,72 @@ +/* + * Licensed to the Apache Softwa

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469229 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala --- @@ -0,0 +1,287 @@ +/* + * Licensed to the Apache Software Founda

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469215 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala --- @@ -0,0 +1,287 @@ +/* + * Licensed to the Apache Software Founda

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/268#issuecomment-40041612 Build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this fe

[GitHub] spark pull request: [WIP] [SPARK-1328] Add vector statistics

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/268#issuecomment-40041606 Build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469154 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala --- @@ -0,0 +1,287 @@ +/* + * Licensed to the Apache Software Founda

[GitHub] spark pull request: [SPARK-1276] Add a HistoryServer to render per...

2014-04-09 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/204#discussion_r11469140 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServer.scala --- @@ -0,0 +1,287 @@ +/* + * Licensed to the Apache Software Founda

[GitHub] spark pull request: [WIP][BUGFIX] In-memory columnar storage bug f...

2014-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/374#issuecomment-40041420 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not ha

  1   2   3   4   5   >