[jira] [Commented] (SPARK-8472) Python API for DCT

2015-06-21 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14594977#comment-14594977 ] Yu Ishikawa commented on SPARK-8472: Please assign this issue to me. > Python API for

[jira] [Resolved] (SPARK-8379) LeaseExpiredException when using dynamic partition with speculative execution

2015-06-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-8379. --- Resolution: Fixed Fix Version/s: 1.4.1 1.5.0 Issue resolved by pull request

[jira] [Created] (SPARK-8508) Test case "SQLQuerySuite.test script transform for stderr" generates super long output

2015-06-21 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-8508: - Summary: Test case "SQLQuerySuite.test script transform for stderr" generates super long output Key: SPARK-8508 URL: https://issues.apache.org/jira/browse/SPARK-8508 Projec

[jira] [Commented] (SPARK-8508) Test case "SQLQuerySuite.test script transform for stderr" generates super long output

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14594985#comment-14594985 ] Apache Spark commented on SPARK-8508: - User 'liancheng' has created a pull request for

[jira] [Assigned] (SPARK-8508) Test case "SQLQuerySuite.test script transform for stderr" generates super long output

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8508: --- Assignee: Cheng Lian (was: Apache Spark) > Test case "SQLQuerySuite.test script transform fo

[jira] [Assigned] (SPARK-8508) Test case "SQLQuerySuite.test script transform for stderr" generates super long output

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8508: --- Assignee: Apache Spark (was: Cheng Lian) > Test case "SQLQuerySuite.test script transform fo

[jira] [Created] (SPARK-8509) Failed to JOIN in pyspark

2015-06-21 Thread afancy (JIRA)
afancy created SPARK-8509: - Summary: Failed to JOIN in pyspark Key: SPARK-8509 URL: https://issues.apache.org/jira/browse/SPARK-8509 Project: Spark Issue Type: Bug Reporter: afancy Hi,

[jira] [Commented] (SPARK-8503) SizeEstimator returns negative value for recursive data structures

2015-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595014#comment-14595014 ] Sean Owen commented on SPARK-8503: -- That looks near to Long.MIN_VALUE. It could be overfl

[jira] [Commented] (SPARK-6112) Provide external block store support through HDFS RAM_DISK

2015-06-21 Thread Bogdan Ghit (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595034#comment-14595034 ] Bogdan Ghit commented on SPARK-6112: This is my configuration: 1. My tmpfs is mounted

[jira] [Created] (SPARK-8510) Store and read NumPy arrays and matrices as values in sequence files

2015-06-21 Thread Peter Aberline (JIRA)
Peter Aberline created SPARK-8510: - Summary: Store and read NumPy arrays and matrices as values in sequence files Key: SPARK-8510 URL: https://issues.apache.org/jira/browse/SPARK-8510 Project: Spark

[jira] [Commented] (SPARK-8419) Statistics.colStats could avoid an extra count()

2015-06-21 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595048#comment-14595048 ] Kai Sasaki commented on SPARK-8419: --- In the {{Statistics#colStats}}, the number of rows

[jira] [Commented] (SPARK-8115) Remove TestData

2015-06-21 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595052#comment-14595052 ] Benjamin Fradet commented on SPARK-8115: I've started working on this. > Remove T

[jira] [Commented] (SPARK-8503) SizeEstimator returns negative value for recursive data structures

2015-06-21 Thread Ilya Rakitsin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595060#comment-14595060 ] Ilya Rakitsin commented on SPARK-8503: -- The structure is a simple cycled graph, like

[jira] [Comment Edited] (SPARK-8503) SizeEstimator returns negative value for recursive data structures

2015-06-21 Thread Ilya Rakitsin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595060#comment-14595060 ] Ilya Rakitsin edited comment on SPARK-8503 at 6/21/15 2:28 PM: -

[jira] [Comment Edited] (SPARK-8503) SizeEstimator returns negative value for recursive data structures

2015-06-21 Thread Ilya Rakitsin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595060#comment-14595060 ] Ilya Rakitsin edited comment on SPARK-8503 at 6/21/15 2:28 PM: -

[jira] [Updated] (SPARK-8510) Support NumPy arrays and matrices as values in sequence files

2015-06-21 Thread Peter Aberline (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Aberline updated SPARK-8510: -- Summary: Support NumPy arrays and matrices as values in sequence files (was: Store and read Num

[jira] [Created] (SPARK-8511) Modify a test to remove a saved model in `regression.py`

2015-06-21 Thread Yu Ishikawa (JIRA)
Yu Ishikawa created SPARK-8511: -- Summary: Modify a test to remove a saved model in `regression.py` Key: SPARK-8511 URL: https://issues.apache.org/jira/browse/SPARK-8511 Project: Spark Issue Type

[jira] [Updated] (SPARK-8511) Modify a test to remove a saved model in `regression.py`

2015-06-21 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Ishikawa updated SPARK-8511: --- Description: According to the reference of python, {{os.removedirs}} doesn't work if there are any fi

[jira] [Commented] (SPARK-8472) Python API for DCT

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595119#comment-14595119 ] Joseph K. Bradley commented on SPARK-8472: -- Let's wait until its dependency is co

[jira] [Commented] (SPARK-8419) Statistics.colStats could avoid an extra count()

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595120#comment-14595120 ] Joseph K. Bradley commented on SPARK-8419: -- Oops, I misread what the count() was

[jira] [Closed] (SPARK-8419) Statistics.colStats could avoid an extra count()

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-8419. Resolution: Not A Problem > Statistics.colStats could avoid an extra count() > -

[jira] [Assigned] (SPARK-8511) Modify a test to remove a saved model in `regression.py`

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8511: --- Assignee: Apache Spark > Modify a test to remove a saved model in `regression.py` > -

[jira] [Assigned] (SPARK-8511) Modify a test to remove a saved model in `regression.py`

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8511: --- Assignee: (was: Apache Spark) > Modify a test to remove a saved model in `regression.py`

[jira] [Commented] (SPARK-8472) Python API for DCT

2015-06-21 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595125#comment-14595125 ] Yu Ishikawa commented on SPARK-8472: Alright. Thanks! > Python API for DCT >

[jira] [Commented] (SPARK-8511) Modify a test to remove a saved model in `regression.py`

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595124#comment-14595124 ] Apache Spark commented on SPARK-8511: - User 'yu-iskw' has created a pull request for t

[jira] [Commented] (SPARK-6813) SparkR style guide

2015-06-21 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595150#comment-14595150 ] Shivaram Venkataraman commented on SPARK-6813: -- Yeah we could disable those t

[jira] [Commented] (SPARK-7499) Investigate how to specify columns in SparkR without $ or strings

2015-06-21 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595153#comment-14595153 ] Shivaram Venkataraman commented on SPARK-7499: -- [~sd2k] Thanks for taking a s

[jira] [Updated] (SPARK-7426) spark.ml AttributeFactory.fromStructField should allow other NumericTypes

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7426: - Assignee: Mike Dusenberry > spark.ml AttributeFactory.fromStructField should allow other N

[jira] [Updated] (SPARK-8509) Failed to JOIN in pyspark

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8509: - Description: Hi, I am writing pyspark stream program. I have the training data set to com

[jira] [Updated] (SPARK-8509) Failed to JOIN in pyspark

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8509: - Description: Hi, I am writing pyspark stream program. I have the training data set to com

[jira] [Commented] (SPARK-6813) SparkR style guide

2015-06-21 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595159#comment-14595159 ] Yu Ishikawa commented on SPARK-6813: I got it. As you mentioned, disabling those two c

[jira] [Updated] (SPARK-8512) Web UI Inconsistent with History Server in Standalone Mode

2015-06-21 Thread Jonathon Cai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathon Cai updated SPARK-8512: Affects Version/s: 1.4.0 > Web UI Inconsistent with History Server in Standalone Mode >

[jira] [Created] (SPARK-8512) Web UI Inconsistent with History Server in Standalone Mode

2015-06-21 Thread Jonathon Cai (JIRA)
Jonathon Cai created SPARK-8512: --- Summary: Web UI Inconsistent with History Server in Standalone Mode Key: SPARK-8512 URL: https://issues.apache.org/jira/browse/SPARK-8512 Project: Spark Issue

[jira] [Closed] (SPARK-8509) Failed to JOIN in pyspark

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-8509. Resolution: Not A Problem > Failed to JOIN in pyspark > - > >

[jira] [Commented] (SPARK-8509) Failed to JOIN in pyspark

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595163#comment-14595163 ] Joseph K. Bradley commented on SPARK-8509: -- I believe the bug is here, where you'

[jira] [Resolved] (SPARK-7604) Python API for PCA and PCAModel

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7604. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6315 [https

[jira] [Commented] (SPARK-7499) Investigate how to specify columns in SparkR without $ or strings

2015-06-21 Thread Ben Sully (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595171#comment-14595171 ] Ben Sully commented on SPARK-7499: -- I just had a quick go and it seems quite possibly: `

[jira] [Updated] (SPARK-8510) NumPy arrays and matrices as values in sequence files

2015-06-21 Thread Peter Aberline (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Aberline updated SPARK-8510: -- Summary: NumPy arrays and matrices as values in sequence files (was: Support NumPy arrays and m

[jira] [Updated] (SPARK-8510) NumPy arrays and matrices as values in sequence files

2015-06-21 Thread Peter Aberline (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Aberline updated SPARK-8510: -- Description: Using the DoubleArrayWritable example, I have added support for storing NumPy doub

[jira] [Resolved] (SPARK-8508) Test case "SQLQuerySuite.test script transform for stderr" generates super long output

2015-06-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-8508. --- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6925 [https://github.com/

[jira] [Commented] (SPARK-5111) HiveContext and Thriftserver cannot work in secure cluster beyond hadoop2.5

2015-06-21 Thread Bolke de Bruin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595205#comment-14595205 ] Bolke de Bruin commented on SPARK-5111: --- This patch does not work on a hadoop 2.6 +

[jira] [Updated] (SPARK-7075) Project Tungsten: Improving Physical Execution

2015-06-21 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7075: --- Summary: Project Tungsten: Improving Physical Execution (was: Project Tungsten: Improving Physical Ex

[jira] [Commented] (SPARK-1503) Implement Nesterov's accelerated first-order method

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595226#comment-14595226 ] Joseph K. Bradley commented on SPARK-1503: -- [~staple] [~lewuathe] Can you please

[jira] [Commented] (SPARK-8072) Better AnalysisException for writing DataFrame with identically named columns

2015-06-21 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595239#comment-14595239 ] Reynold Xin commented on SPARK-8072: I think it's best to say the name of the duplicat

[jira] [Resolved] (SPARK-7715) Update MLlib Programming Guide for 1.4

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7715. -- Resolution: Fixed Fix Version/s: (was: 1.4.0) 1.4.1

[jira] [Assigned] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7888: --- Assignee: Apache Spark (was: holdenk) > Be able to disable intercept in Linear Regression in

[jira] [Assigned] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7888: --- Assignee: holdenk (was: Apache Spark) > Be able to disable intercept in Linear Regression in

[jira] [Commented] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595251#comment-14595251 ] Apache Spark commented on SPARK-7888: - User 'holdenk' has created a pull request for t

[jira] [Resolved] (SPARK-7443) MLlib 1.4 QA plan

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7443. -- Resolution: Fixed Fix Version/s: 1.4.0 I'm closing this and marking it Fixed. I'

[jira] [Assigned] (SPARK-8506) SparkR does not provide an easy way to depend on Spark Packages when performing init from inside of R

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8506: --- Assignee: (was: Apache Spark) > SparkR does not provide an easy way to depend on Spark Pa

[jira] [Commented] (SPARK-8506) SparkR does not provide an easy way to depend on Spark Packages when performing init from inside of R

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595275#comment-14595275 ] Apache Spark commented on SPARK-8506: - User 'holdenk' has created a pull request for t

[jira] [Assigned] (SPARK-8506) SparkR does not provide an easy way to depend on Spark Packages when performing init from inside of R

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8506: --- Assignee: Apache Spark > SparkR does not provide an easy way to depend on Spark Packages when

[jira] [Commented] (SPARK-6412) Add Char support in dataTypes.

2015-06-21 Thread Naden Franciscus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595280#comment-14595280 ] Naden Franciscus commented on SPARK-6412: - This needs to be reopened. The use cas

[jira] [Issue Comment Deleted] (SPARK-6412) Add Char support in dataTypes.

2015-06-21 Thread Naden Franciscus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Naden Franciscus updated SPARK-6412: Comment: was deleted (was: This needs to be reopened. The use case couldn't be more clear.

[jira] [Resolved] (SPARK-7426) spark.ml AttributeFactory.fromStructField should allow other NumericTypes

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7426. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6540 [https

[jira] [Created] (SPARK-8513) _temporary may be left undeleted when a write job committed with FileOutputCommitter fails due to a race condition

2015-06-21 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-8513: - Summary: _temporary may be left undeleted when a write job committed with FileOutputCommitter fails due to a race condition Key: SPARK-8513 URL: https://issues.apache.org/jira/browse/SP

[jira] [Updated] (SPARK-8513) _temporary may be left undeleted when a write job committed with FileOutputCommitter fails due to a race condition

2015-06-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-8513: -- Component/s: SQL > _temporary may be left undeleted when a write job committed with > FileOutputCommitt

[jira] [Updated] (SPARK-8513) _temporary may be left undeleted when a write job committed with FileOutputCommitter fails due to a race condition

2015-06-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-8513: -- Description: To reproduce this issue, we need a node with relatively more cores, say 32 (e.g., Spark Je

[jira] [Updated] (SPARK-8513) _temporary may be left undeleted when a write job committed with FileOutputCommitter fails due to a race condition

2015-06-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-8513: -- Description: To reproduce this issue, we need a node with relatively more cores, say 32 (e.g., Spark Je

[jira] [Updated] (SPARK-8513) _temporary may be left undeleted when a write job committed with FileOutputCommitter fails due to a race condition

2015-06-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-8513: -- Description: To reproduce this issue, we need a node with relatively more cores, say 32 (e.g., Spark Je

[jira] [Commented] (SPARK-8475) SparkSubmit with Ivy jars is very slow to load with no internet access

2015-06-21 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595363#comment-14595363 ] Burak Yavuz commented on SPARK-8475: Me too. I prefer option 1 as well. > SparkSubmit

[jira] [Updated] (SPARK-7398) Add back-pressure to Spark Streaming

2015-06-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7398: - Priority: Critical (was: Major) Target Version/s: 1.5.0 > Add back-pressure to Spark

[jira] [Commented] (SPARK-7398) Add back-pressure to Spark Streaming

2015-06-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595374#comment-14595374 ] Tathagata Das commented on SPARK-7398: -- I took a look at the whole design doc. Its ve

[jira] [Updated] (SPARK-8513) _temporary may be left undeleted when a write job committed with FileOutputCommitter fails due to a race condition

2015-06-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-8513: -- Description: To reproduce this issue, we need a node with relatively more cores, say 32 (e.g., Spark Je

[jira] [Created] (SPARK-8514) LU factorization on BlockMatrix

2015-06-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-8514: Summary: LU factorization on BlockMatrix Key: SPARK-8514 URL: https://issues.apache.org/jira/browse/SPARK-8514 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-8514) LU factorization on BlockMatrix

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8514: - Labels: advanced (was: ) > LU factorization on BlockMatrix > --- > >

[jira] [Created] (SPARK-8515) Improve ML attribute API

2015-06-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-8515: Summary: Improve ML attribute API Key: SPARK-8515 URL: https://issues.apache.org/jira/browse/SPARK-8515 Project: Spark Issue Type: Improvement Comp

[jira] [Updated] (SPARK-8515) Improve ML attribute API

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8515: - Labels: advanced (was: ) > Improve ML attribute API > > >

[jira] [Created] (SPARK-8516) ML attribute API in PySpark

2015-06-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-8516: Summary: ML attribute API in PySpark Key: SPARK-8516 URL: https://issues.apache.org/jira/browse/SPARK-8516 Project: Spark Issue Type: New Feature C

[jira] [Created] (SPARK-8517) Improve the organization and style of MLlib's user guide

2015-06-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-8517: Summary: Improve the organization and style of MLlib's user guide Key: SPARK-8517 URL: https://issues.apache.org/jira/browse/SPARK-8517 Project: Spark Issue

[jira] [Created] (SPARK-8518) Log-linear models for survival analysis

2015-06-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-8518: Summary: Log-linear models for survival analysis Key: SPARK-8518 URL: https://issues.apache.org/jira/browse/SPARK-8518 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-8420) Inconsistent behavior with Dataframe Timestamp between 1.3.1 and 1.4.0

2015-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595412#comment-14595412 ] Apache Spark commented on SPARK-8420: - User 'navis' has created a pull request for thi

[jira] [Created] (SPARK-8519) Blockify distance computation in k-means

2015-06-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-8519: Summary: Blockify distance computation in k-means Key: SPARK-8519 URL: https://issues.apache.org/jira/browse/SPARK-8519 Project: Spark Issue Type: Improvemen

[jira] [Updated] (SPARK-8511) Modify ML Python tests to remove saved models

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8511: - Summary: Modify ML Python tests to remove saved models (was: Modify a test to remove a sa

[jira] [Updated] (SPARK-8511) Modify ML Python tests to remove saved models

2015-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8511: - Target Version/s: 1.4.1, 1.5.0 (was: 1.4.1) > Modify ML Python tests to remove saved mode

[jira] [Created] (SPARK-8520) Improve GLM's scalability on number of features

2015-06-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-8520: Summary: Improve GLM's scalability on number of features Key: SPARK-8520 URL: https://issues.apache.org/jira/browse/SPARK-8520 Project: Spark Issue Type: Imp

[jira] [Assigned] (SPARK-8520) Improve GLM's scalability on number of features

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-8520: Assignee: Xiangrui Meng > Improve GLM's scalability on number of features > ---

[jira] [Updated] (SPARK-5884) Implement feature transformers to ML pipelines for Spark 1.4

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5884: - Summary: Implement feature transformers to ML pipelines for Spark 1.4 (was: Implement feature tra

[jira] [Created] (SPARK-8521) Feature Transformers in 1.5

2015-06-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-8521: Summary: Feature Transformers in 1.5 Key: SPARK-8521 URL: https://issues.apache.org/jira/browse/SPARK-8521 Project: Spark Issue Type: Umbrella Comp

[jira] [Updated] (SPARK-5895) Add VectorSlicer

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5895: - Parent Issue: SPARK-8521 (was: SPARK-5884) > Add VectorSlicer > > >

[jira] [Updated] (SPARK-6530) ChiSqSelector transformer

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6530: - Parent Issue: SPARK-8521 (was: SPARK-5884) > ChiSqSelector transformer >

[jira] [Updated] (SPARK-6530) ChiSqSelector transformer

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6530: - Target Version/s: 1.5.0 > ChiSqSelector transformer > - > >

[jira] [Updated] (SPARK-7290) Add StringVectorizer

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7290: - Parent Issue: SPARK-8521 (was: SPARK-5884) > Add StringVectorizer > > >

[jira] [Updated] (SPARK-5890) Add FeatureDiscretizer

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5890: - Parent Issue: SPARK-8521 (was: SPARK-5884) > Add FeatureDiscretizer > -- > >

[jira] [Closed] (SPARK-5884) Implement feature transformers to ML pipelines for Spark 1.4

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-5884. Resolution: Fixed Fix Version/s: 1.4.0 Closed this JIRA since Spark 1.4 was released. Thanks

[jira] [Updated] (SPARK-8345) Add an SQL node as a feature transformer

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8345: - Issue Type: Sub-task (was: New Feature) Parent: SPARK-8521 > Add an SQL node as a feature

[jira] [Updated] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7008: - Labels: features (was: features patch) > An implementation of Factorization Machine (LibFM) > ---

[jira] [Updated] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7008: - Target Version/s: 1.5.0 Affects Version/s: (was: 1.3.2) (was: 1

[jira] [Updated] (SPARK-2429) Hierarchical Implementation of KMeans

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2429: - Target Version/s: 1.5.0 Shepherd: Xiangrui Meng > Hierarchical Implementation of KMean

[jira] [Updated] (SPARK-6717) Clear shuffle files after checkpointing in ALS

2015-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6717: - Affects Version/s: (was: 1.3.1) > Clear shuffle files after checkpointing in ALS > ---

[jira] [Commented] (SPARK-8521) Feature Transformers in 1.5

2015-06-21 Thread Jao Rabary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595432#comment-14595432 ] Jao Rabary commented on SPARK-8521: --- I suggest to add PCA transformer. It's already in m