[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2015-03-10 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356399#comment-14356399 ] Yu Ishikawa commented on SPARK-2429: [~rnowling] I apologize for the delay in replyin

[jira] [Commented] (SPARK-6275) Miss toDF() function in docs/sql-programming-guide.md

2015-03-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356380#comment-14356380 ] Apache Spark commented on SPARK-6275: - User 'zzcclp' has created a pull request for th

[jira] [Created] (SPARK-6275) Miss toDF() function in docs/sql-programming-guide.md

2015-03-10 Thread zzc (JIRA)
zzc created SPARK-6275: -- Summary: Miss toDF() function in docs/sql-programming-guide.md Key: SPARK-6275 URL: https://issues.apache.org/jira/browse/SPARK-6275 Project: Spark Issue Type: Documentation

[jira] [Commented] (SPARK-6274) Add streaming examples showing integration with DataFrames and SQL

2015-03-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356328#comment-14356328 ] Apache Spark commented on SPARK-6274: - User 'tdas' has created a pull request for this

[jira] [Created] (SPARK-6274) Add streaming examples showing integration with DataFrames and SQL

2015-03-10 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-6274: Summary: Add streaming examples showing integration with DataFrames and SQL Key: SPARK-6274 URL: https://issues.apache.org/jira/browse/SPARK-6274 Project: Spark

[jira] [Created] (SPARK-6273) Got error when do join

2015-03-10 Thread Jeff (JIRA)
Jeff created SPARK-6273: --- Summary: Got error when do join Key: SPARK-6273 URL: https://issues.apache.org/jira/browse/SPARK-6273 Project: Spark Issue Type: Bug Affects Versions: 1.2.1 Re

[jira] [Commented] (SPARK-6268) KMeans parameter getter methods

2015-03-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356270#comment-14356270 ] Apache Spark commented on SPARK-6268: - User 'hhbyyh' has created a pull request for th

[jira] [Commented] (SPARK-4852) Hive query plan deserialization failure caused by shaded hive-exec jar file when generating golden answers

2015-03-10 Thread Kannan Rajah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356173#comment-14356173 ] Kannan Rajah commented on SPARK-4852: - We are hitting this issue in a production case,

[jira] [Issue Comment Deleted] (SPARK-6244) Implement VectorSpace to easy create a complicated feature vector

2015-03-10 Thread Kirill A. Korinskiy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kirill A. Korinskiy updated SPARK-6244: --- Comment: was deleted (was: Yes, this way sounds good. I can use same issue and pull r

[jira] [Commented] (SPARK-6244) Implement VectorSpace to easy create a complicated feature vector

2015-03-10 Thread Kirill A. Korinskiy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356164#comment-14356164 ] Kirill A. Korinskiy commented on SPARK-6244: Yes, this way sounds good. I can

[jira] [Commented] (SPARK-6244) Implement VectorSpace to easy create a complicated feature vector

2015-03-10 Thread Kirill A. Korinskiy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356165#comment-14356165 ] Kirill A. Korinskiy commented on SPARK-6244: Yes, this way sounds good. I can

[jira] [Commented] (SPARK-6271) Sort these tokens in alphabetic order to avoid further duplicate in HiveQl

2015-03-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6271?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356152#comment-14356152 ] Apache Spark commented on SPARK-6271: - User 'DoingDone9' has created a pull request fo

[jira] [Closed] (SPARK-6272) Sort these tokens in alphabetic order to avoid further duplicate in HiveQl

2015-03-10 Thread DoingDone9 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DoingDone9 closed SPARK-6272. - Resolution: Duplicate > Sort these tokens in alphabetic order to avoid further duplicate in HiveQl > -

[jira] [Commented] (SPARK-6270) Standalone Master hangs when streaming job completes

2015-03-10 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356142#comment-14356142 ] Tathagata Das commented on SPARK-6270: -- [~joshrosen] Another user other than Netflix

[jira] [Updated] (SPARK-6270) Standalone Master hangs when streaming job completes

2015-03-10 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-6270: - Description: If the event logging is enabled, the Spark Standalone Master tries to recreate the w

[jira] [Created] (SPARK-6272) Sort these tokens in alphabetic order to avoid further duplicate in HiveQl

2015-03-10 Thread DoingDone9 (JIRA)
DoingDone9 created SPARK-6272: - Summary: Sort these tokens in alphabetic order to avoid further duplicate in HiveQl Key: SPARK-6272 URL: https://issues.apache.org/jira/browse/SPARK-6272 Project: Spark

[jira] [Created] (SPARK-6271) Sort these tokens in alphabetic order to avoid further duplicate in HiveQl

2015-03-10 Thread DoingDone9 (JIRA)
DoingDone9 created SPARK-6271: - Summary: Sort these tokens in alphabetic order to avoid further duplicate in HiveQl Key: SPARK-6271 URL: https://issues.apache.org/jira/browse/SPARK-6271 Project: Spark

[jira] [Updated] (SPARK-6268) KMeans parameter getter methods

2015-03-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6268: - Assignee: yuhao yang > KMeans parameter getter methods > --- >

[jira] [Commented] (SPARK-6268) KMeans parameter getter methods

2015-03-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356125#comment-14356125 ] yuhao yang commented on SPARK-6268: --- Sure, I'll propose a PR very soon. Thanks! > KMean

[jira] [Updated] (SPARK-6270) Standalone Master hangs when streaming job completes

2015-03-10 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-6270: - Description: If the event logging is enabled, the Spark Standalone Master tries to recreate the w

[jira] [Created] (SPARK-6270) Standalone Master hangs when streaming job completes

2015-03-10 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-6270: Summary: Standalone Master hangs when streaming job completes Key: SPARK-6270 URL: https://issues.apache.org/jira/browse/SPARK-6270 Project: Spark Issue Type

[jira] [Commented] (SPARK-6268) KMeans parameter getter methods

2015-03-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356123#comment-14356123 ] Joseph K. Bradley commented on SPARK-6268: -- It's not rude at all! I made a bunch

[jira] [Commented] (SPARK-3438) Support for accessing secured HDFS in Standalone Mode

2015-03-10 Thread Tao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356114#comment-14356114 ] Tao Wang commented on SPARK-3438: - Looks like this issue is same with that in SPARK-5158.

[jira] [Commented] (SPARK-6268) KMeans parameter getter methods

2015-03-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356106#comment-14356106 ] yuhao yang commented on SPARK-6268: --- Hi Bradley, I hope this is not rude. Not sure if yo

[jira] [Comment Edited] (SPARK-6268) KMeans parameter getter methods

2015-03-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356106#comment-14356106 ] yuhao yang edited comment on SPARK-6268 at 3/11/15 2:14 AM: Hi

[jira] [Closed] (SPARK-6177) Add note in LDA example to remind possible coalesce

2015-03-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-6177. - Fix and merged, thanks. > Add note in LDA example to remind possible coalesce >

[jira] [Commented] (SPARK-6269) Using a different implementation of java array reflection for size estimation

2015-03-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356073#comment-14356073 ] Apache Spark commented on SPARK-6269: - User 'mccheah' has created a pull request for t

[jira] [Commented] (SPARK-6245) jsonRDD() of empty RDD results in exception

2015-03-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356030#comment-14356030 ] Apache Spark commented on SPARK-6245: - User 'srowen' has created a pull request for th

[jira] [Commented] (SPARK-5987) Model import/export for GaussianMixtureModel

2015-03-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356028#comment-14356028 ] Joseph K. Bradley commented on SPARK-5987: -- This is because Spark SQL's DataFrame

[jira] [Commented] (SPARK-6234) 10% Performance regression with Breeze upgrade

2015-03-10 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356016#comment-14356016 ] Nishkam Ravi commented on SPARK-6234: - If Spark doesn't directly/indirectly invoke squ

[jira] [Created] (SPARK-6269) Using a different implementation of java array reflection for size estimation

2015-03-10 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-6269: - Summary: Using a different implementation of java array reflection for size estimation Key: SPARK-6269 URL: https://issues.apache.org/jira/browse/SPARK-6269 Project: Spark

[jira] [Commented] (SPARK-6222) [STREAMING] All data may not be recovered from WAL when driver is killed

2015-03-10 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355996#comment-14355996 ] Hari Shreedharan commented on SPARK-6222: - Thinking about it again - markBatchFull

[jira] [Commented] (SPARK-6222) [STREAMING] All data may not be recovered from WAL when driver is killed

2015-03-10 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355965#comment-14355965 ] Hari Shreedharan commented on SPARK-6222: - Another option is to change the way we

[jira] [Created] (SPARK-6268) KMeans parameter getter methods

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6268: Summary: KMeans parameter getter methods Key: SPARK-6268 URL: https://issues.apache.org/jira/browse/SPARK-6268 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-6267) Python API for IsotonicRegression

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6267: Summary: Python API for IsotonicRegression Key: SPARK-6267 URL: https://issues.apache.org/jira/browse/SPARK-6267 Project: Spark Issue Type: New Featu

[jira] [Created] (SPARK-6266) PySpark SparseVector missing doc for size, indices, values

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6266: Summary: PySpark SparseVector missing doc for size, indices, values Key: SPARK-6266 URL: https://issues.apache.org/jira/browse/SPARK-6266 Project: Spark

[jira] [Created] (SPARK-6265) PySpark GLMs missing doc for intercept, weights

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6265: Summary: PySpark GLMs missing doc for intercept, weights Key: SPARK-6265 URL: https://issues.apache.org/jira/browse/SPARK-6265 Project: Spark Issue T

[jira] [Updated] (SPARK-6173) Python doc parity with Scala/Java in MLlib

2015-03-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6173: - Target Version/s: 1.4.0, 1.3.1 (was: 1.4.0) > Python doc parity with Scala/Java in MLlib

[jira] [Updated] (SPARK-6174) Improve doc: Python ALS, MatrixFactorizationModel

2015-03-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6174: - Target Version/s: 1.4.0, 1.3.1 (was: 1.4.0) > Improve doc: Python ALS, MatrixFactorizatio

[jira] [Created] (SPARK-6264) Python API for FPGrowth

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6264: Summary: Python API for FPGrowth Key: SPARK-6264 URL: https://issues.apache.org/jira/browse/SPARK-6264 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-6254) MLlib Python API parity check at 1.3 release

2015-03-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355895#comment-14355895 ] Joseph K. Bradley commented on SPARK-6254: -- *Linear Algebra* Above, I am not lis

[jira] [Created] (SPARK-6263) Python MLlib API missing items: Utils

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6263: Summary: Python MLlib API missing items: Utils Key: SPARK-6263 URL: https://issues.apache.org/jira/browse/SPARK-6263 Project: Spark Issue Type: Sub-t

[jira] [Created] (SPARK-6262) Python MLlib API missing items: Statistics

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6262: Summary: Python MLlib API missing items: Statistics Key: SPARK-6262 URL: https://issues.apache.org/jira/browse/SPARK-6262 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-1503) Implement Nesterov's accelerated first-order method

2015-03-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355883#comment-14355883 ] Sean Owen edited comment on SPARK-1503 at 3/10/15 10:50 PM: I

[jira] [Created] (SPARK-6261) Python MLlib API missing items: Feature

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6261: Summary: Python MLlib API missing items: Feature Key: SPARK-6261 URL: https://issues.apache.org/jira/browse/SPARK-6261 Project: Spark Issue Type: Sub

[jira] [Commented] (SPARK-1503) Implement Nesterov's accelerated first-order method

2015-03-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355883#comment-14355883 ] Sean Owen commented on SPARK-1503: -- I was just today reminded of this other issue, which

[jira] [Created] (SPARK-6260) Python API for PowerIterationClustering

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6260: Summary: Python API for PowerIterationClustering Key: SPARK-6260 URL: https://issues.apache.org/jira/browse/SPARK-6260 Project: Spark Issue Type: Imp

[jira] [Updated] (SPARK-6254) MLlib Python API parity check at 1.3 release

2015-03-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6254: - Description: This is an umbrella JIRA to list MLlib features which are present in the Scal

[jira] [Created] (SPARK-6259) Python API for LDA

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6259: Summary: Python API for LDA Key: SPARK-6259 URL: https://issues.apache.org/jira/browse/SPARK-6259 Project: Spark Issue Type: Improvement Co

[jira] [Created] (SPARK-6258) Python MLlib API missing items: Clustering

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6258: Summary: Python MLlib API missing items: Clustering Key: SPARK-6258 URL: https://issues.apache.org/jira/browse/SPARK-6258 Project: Spark Issue Type:

[jira] [Created] (SPARK-6257) Python MLlib API missing items: Recommendation

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6257: Summary: Python MLlib API missing items: Recommendation Key: SPARK-6257 URL: https://issues.apache.org/jira/browse/SPARK-6257 Project: Spark Issue Ty

[jira] [Created] (SPARK-6256) Python MLlib API missing items: Regression

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6256: Summary: Python MLlib API missing items: Regression Key: SPARK-6256 URL: https://issues.apache.org/jira/browse/SPARK-6256 Project: Spark Issue Type:

[jira] [Created] (SPARK-6255) Python MLlib API missing items: Classification

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6255: Summary: Python MLlib API missing items: Classification Key: SPARK-6255 URL: https://issues.apache.org/jira/browse/SPARK-6255 Project: Spark Issue Ty

[jira] [Created] (SPARK-6254) MLlib Python API parity check at 1.3 release

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6254: Summary: MLlib Python API parity check at 1.3 release Key: SPARK-6254 URL: https://issues.apache.org/jira/browse/SPARK-6254 Project: Spark Issue Type

[jira] [Commented] (SPARK-6245) jsonRDD() of empty RDD results in exception

2015-03-10 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355843#comment-14355843 ] Matthew Farrellee commented on SPARK-6245: -- this is an issue for the scala interf

[jira] [Commented] (SPARK-6252) Scala NaiveBayes should expose getLambda

2015-03-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355834#comment-14355834 ] Apache Spark commented on SPARK-6252: - User 'jkbradley' has created a pull request for

[jira] [Created] (SPARK-6253) Add LassoModel to __all__ in regression.py

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6253: Summary: Add LassoModel to __all__ in regression.py Key: SPARK-6253 URL: https://issues.apache.org/jira/browse/SPARK-6253 Project: Spark Issue Type:

[jira] [Created] (SPARK-6252) Scala NaiveBayes should expose getLambda

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6252: Summary: Scala NaiveBayes should expose getLambda Key: SPARK-6252 URL: https://issues.apache.org/jira/browse/SPARK-6252 Project: Spark Issue Type: Im

[jira] [Commented] (SPARK-5313) Create simple framework for highlighting changes introduced in a PR

2015-03-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355819#comment-14355819 ] Nicholas Chammas commented on SPARK-5313: - I had an idea to generalize the process

[jira] [Commented] (SPARK-6251) Mark parts of LBFGS, GradientDescent as DeveloperApi

2015-03-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355820#comment-14355820 ] Apache Spark commented on SPARK-6251: - User 'jkbradley' has created a pull request for

[jira] [Updated] (SPARK-4325) Improve spark-ec2 cluster launch times

2015-03-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-4325: Description: This is an umbrella task to capture several pieces of work related to signific

[jira] [Updated] (SPARK-4325) Improve spark-ec2 cluster launch times

2015-03-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-4325: Issue Type: Umbrella (was: Improvement) > Improve spark-ec2 cluster launch times >

[jira] [Created] (SPARK-6251) Mark parts of LBFGS, GradientDescent as DeveloperApi

2015-03-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6251: Summary: Mark parts of LBFGS, GradientDescent as DeveloperApi Key: SPARK-6251 URL: https://issues.apache.org/jira/browse/SPARK-6251 Project: Spark Is

[jira] [Created] (SPARK-6250) Types are now reserved words in DDL parser.

2015-03-10 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-6250: --- Summary: Types are now reserved words in DDL parser. Key: SPARK-6250 URL: https://issues.apache.org/jira/browse/SPARK-6250 Project: Spark Issue Type: B

[jira] [Commented] (SPARK-6222) [STREAMING] All data may not be recovered from WAL when driver is killed

2015-03-10 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355731#comment-14355731 ] Hari Shreedharan commented on SPARK-6222: - In the direct connector for Kafka, we c

[jira] [Resolved] (SPARK-6232) Spark Streaming: simple application stalls processing

2015-03-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6232. -- Resolution: Duplicate Fix Version/s: (was: 1.3.0) OK, resolving as a Duplicate then. I mean

[jira] [Commented] (SPARK-6232) Spark Streaming: simple application stalls processing

2015-03-10 Thread Platon Potapov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355692#comment-14355692 ] Platon Potapov commented on SPARK-6232: --- * i've tried 1.3.0 and it seemed to work.

[jira] [Commented] (SPARK-6222) [STREAMING] All data may not be recovered from WAL when driver is killed

2015-03-10 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355652#comment-14355652 ] Hari Shreedharan commented on SPARK-6222: - I could not generate TRACE level logs b

[jira] [Commented] (SPARK-4286) Support External Shuffle Service with Mesos integration

2015-03-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355646#comment-14355646 ] Apache Spark commented on SPARK-4286: - User 'dragos' has created a pull request for th

[jira] [Commented] (SPARK-6246) spark-ec2 can't handle clusters with > 100 nodes

2015-03-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355642#comment-14355642 ] Nicholas Chammas commented on SPARK-6246: - I dunno, I haven't looked into the prob

[jira] [Commented] (SPARK-5845) Time to cleanup spilled shuffle files not included in shuffle write time

2015-03-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355638#comment-14355638 ] Apache Spark commented on SPARK-5845: - User 'ilganeli' has created a pull request for

[jira] [Commented] (SPARK-6222) [STREAMING] All data may not be recovered from WAL when driver is killed

2015-03-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355632#comment-14355632 ] Apache Spark commented on SPARK-6222: - User 'harishreedharan' has created a pull reque

[jira] [Updated] (SPARK-5312) Use sbt to detect new or changed public classes in PRs

2015-03-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5312: Description: We currently use an [unwieldy grep/sed contraption|https://github.com/apache/s

[jira] [Commented] (SPARK-5312) Use sbt to detect new or changed public classes in PRs

2015-03-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355622#comment-14355622 ] Nicholas Chammas commented on SPARK-5312: - Thanks for looking into this [~boyork].

[jira] [Commented] (SPARK-6249) Get Kafka offsets from consumer group in ZK when using direct stream

2015-03-10 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355561#comment-14355561 ] Cody Koeninger commented on SPARK-6249: --- First, I don't think setting group.id shoul

[jira] [Commented] (SPARK-6249) Get Kafka offsets from consumer group in ZK when using direct stream

2015-03-10 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355459#comment-14355459 ] Tathagata Das commented on SPARK-6249: -- [~c...@koeninger.org] [~jerryshao] Let's disc

[jira] [Created] (SPARK-6249) Get Kafka offsets from consumer group in ZK when using direct stream

2015-03-10 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-6249: Summary: Get Kafka offsets from consumer group in ZK when using direct stream Key: SPARK-6249 URL: https://issues.apache.org/jira/browse/SPARK-6249 Project: Spark

[jira] [Resolved] (SPARK-4456) Document why spilling depends on both elements read and memory used

2015-03-10 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza resolved SPARK-4456. --- Resolution: Invalid Closing this in light of changes to when spilling occurs. > Document why spilling

[jira] [Resolved] (SPARK-2819) Difficult to turn on intercept with linear models

2015-03-10 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza resolved SPARK-2819. --- Resolution: Invalid > Difficult to turn on intercept with linear models >

[jira] [Commented] (SPARK-2819) Difficult to turn on intercept with linear models

2015-03-10 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355398#comment-14355398 ] Sandy Ryza commented on SPARK-2819: --- With the pipelines API superceding this, I think we

[jira] [Resolved] (SPARK-1956) Enable shuffle consolidation by default

2015-03-10 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza resolved SPARK-1956. --- Resolution: Won't Fix > Enable shuffle consolidation by default >

[jira] [Commented] (SPARK-1956) Enable shuffle consolidation by default

2015-03-10 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355390#comment-14355390 ] Sandy Ryza commented on SPARK-1956: --- Closing this as "Won't Fix" now that we've moved al

[jira] [Comment Edited] (SPARK-2819) Difficult to turn on intercept with linear models

2015-03-10 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355398#comment-14355398 ] Sandy Ryza edited comment on SPARK-2819 at 3/10/15 6:23 PM: Wi

[jira] [Commented] (SPARK-2114) groupByKey and joins on raw data

2015-03-10 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355385#comment-14355385 ] Sandy Ryza commented on SPARK-2114: --- Closing this in favor of SPARK-4550 and SPARK-2926.

[jira] [Resolved] (SPARK-2114) groupByKey and joins on raw data

2015-03-10 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza resolved SPARK-2114. --- Resolution: Duplicate > groupByKey and joins on raw data > > >

[jira] [Resolved] (SPARK-4921) TaskSetManager mistakenly returns PROCESS_LOCAL for NO_PREF tasks

2015-03-10 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza resolved SPARK-4921. --- Resolution: Won't Fix > TaskSetManager mistakenly returns PROCESS_LOCAL for NO_PREF tasks > --

[jira] [Commented] (SPARK-6232) Spark Streaming: simple application stalls processing

2015-03-10 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355345#comment-14355345 ] Tathagata Das commented on SPARK-6232: -- Yeah, I saw the comment :) I just wanted the

[jira] [Commented] (SPARK-6244) Implement VectorSpace to easy create a complicated feature vector

2015-03-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355337#comment-14355337 ] Xiangrui Meng commented on SPARK-6244: -- Agree with Sean that this is not a vector spa

[jira] [Commented] (SPARK-6232) Spark Streaming: simple application stalls processing

2015-03-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355321#comment-14355321 ] Sean Owen commented on SPARK-6232: -- Yeah I believe he updated the comment above (https:/

[jira] [Comment Edited] (SPARK-5987) Model import/export for GaussianMixtureModel

2015-03-10 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355274#comment-14355274 ] Manoj Kumar edited comment on SPARK-5987 at 3/10/15 5:32 PM: -

[jira] [Created] (SPARK-6248) LocalRelation needs to implement statistics

2015-03-10 Thread Yin Huai (JIRA)
Yin Huai created SPARK-6248: --- Summary: LocalRelation needs to implement statistics Key: SPARK-6248 URL: https://issues.apache.org/jira/browse/SPARK-6248 Project: Spark Issue Type: Bug Com

[jira] [Created] (SPARK-6247) Certain self joins cannot be analyzed

2015-03-10 Thread Yin Huai (JIRA)
Yin Huai created SPARK-6247: --- Summary: Certain self joins cannot be analyzed Key: SPARK-6247 URL: https://issues.apache.org/jira/browse/SPARK-6247 Project: Spark Issue Type: Bug Component

[jira] [Commented] (SPARK-5312) Use sbt to detect new or changed public classes in PRs

2015-03-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355196#comment-14355196 ] Sean Owen commented on SPARK-5312: -- Oh OK, I resolved given the discussion above, but if

[jira] [Commented] (SPARK-4012) Uncaught OOM in ContextCleaner

2015-03-10 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355151#comment-14355151 ] Nan Zhu commented on SPARK-4012: [~srowen], actually I got more understanding on the scena

[jira] [Commented] (SPARK-5986) Model import/export for KMeansModel

2015-03-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355134#comment-14355134 ] Joseph K. Bradley commented on SPARK-5986: -- Yes, that's correct. It doesn't soun

[jira] [Commented] (SPARK-3278) Isotonic regression

2015-03-10 Thread Vladimir Vladimirov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355119#comment-14355119 ] Vladimir Vladimirov commented on SPARK-3278: Martin. This would be really nic

[jira] [Updated] (SPARK-6241) hiveql ANALYZE TABLE doesn't work for external tables

2015-03-10 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6241: Target Version/s: 1.4.0 > hiveql ANALYZE TABLE doesn't work for external tables > --

[jira] [Updated] (SPARK-4122) Add library to write data back to Kafka

2015-03-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4122: - Issue Type: Improvement (was: Bug) > Add library to write data back to Kafka > --

[jira] [Updated] (SPARK-4496) smallint (16 bit value) is being send as a 32 bit value in the thrift interface.

2015-03-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4496: - Component/s: (was: Input/Output) SQL > smallint (16 bit value) is being send as a 3

[jira] [Commented] (SPARK-4496) smallint (16 bit value) is being send as a 32 bit value in the thrift interface.

2015-03-10 Thread Chip Sands (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355084#comment-14355084 ] Chip Sands commented on SPARK-4496: --- I have not look at the spark code. But it would be

  1   2   >