[jira] [Resolved] (SPARK-958) When iteration in ALS increases to 10 running in local mode, spark throws out error of StackOverflowError

2014-04-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-958. - Resolution: Duplicate When iteration in ALS increases to 10 running in local mode, spark throws

[jira] [Created] (SPARK-1410) Class not found exception with application launched from sbt 0.13.x

2014-04-03 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1410: Summary: Class not found exception with application launched from sbt 0.13.x Key: SPARK-1410 URL: https://issues.apache.org/jira/browse/SPARK-1410 Project: Spark

[jira] [Commented] (SPARK-1410) Class not found exception with application launched from sbt 0.13.x

2014-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13959363#comment-13959363 ] Xiangrui Meng commented on SPARK-1410: -- The code is available at

[jira] [Updated] (SPARK-1434) Make labelParser Java friendly.

2014-04-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1434: - Component/s: MLlib Make labelParser Java friendly. ---

[jira] [Commented] (SPARK-1406) PMML model evaluation support via MLib

2014-04-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13962048#comment-13962048 ] Xiangrui Meng commented on SPARK-1406: -- I think we should support PMML import/export

[jira] [Resolved] (SPARK-1218) Minibatch SGD with random sampling

2014-04-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1218. -- Resolution: Fixed Fix Version/s: 0.9.0 Fixed in 0.9.0 or an earlier version.

[jira] [Resolved] (SPARK-1217) Add proximal gradient updater.

2014-04-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1217. -- Resolution: Fixed Fix Version/s: 0.9.0 Add proximal gradient updater.

[jira] [Resolved] (SPARK-1219) Minibatch SGD with disjoint partitions

2014-04-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1219. -- Resolution: Fixed Implemented in 0.9.0 or an earlier version. Minibatch SGD with disjoint

[jira] [Commented] (SPARK-1357) [MLLIB] Annotate developer and experimental API's

2014-04-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13964405#comment-13964405 ] Xiangrui Meng commented on SPARK-1357: -- Hi Sean, Actually, you came in just in

[jira] [Commented] (SPARK-1215) Clustering: Index out of bounds error

2014-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13969718#comment-13969718 ] Xiangrui Meng commented on SPARK-1215: -- The error was due to small number of points

[jira] [Created] (SPARK-1503) Implement Nesterov's accelerated first-order method

2014-04-15 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1503: Summary: Implement Nesterov's accelerated first-order method Key: SPARK-1503 URL: https://issues.apache.org/jira/browse/SPARK-1503 Project: Spark Issue

[jira] [Created] (SPARK-1506) Documentation improvements for MLlib 1.0

2014-04-15 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1506: Summary: Documentation improvements for MLlib 1.0 Key: SPARK-1506 URL: https://issues.apache.org/jira/browse/SPARK-1506 Project: Spark Issue Type:

[jira] [Commented] (SPARK-1520) Inclusion of breeze corrupts assembly when compiled with JDK7 and run on JDK6

2014-04-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13973291#comment-13973291 ] Xiangrui Meng commented on SPARK-1520: -- I'm using Java 6 JDK located at

[jira] [Commented] (SPARK-1520) Inclusion of breeze corrupts assembly when compiled with JDK7 and run on JDK6

2014-04-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13973306#comment-13973306 ] Xiangrui Meng commented on SPARK-1520: -- When I try to use jar-1.6 to untar the

[jira] [Comment Edited] (SPARK-1520) Inclusion of breeze corrupts assembly when compiled with JDK7 and run on JDK6

2014-04-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13973326#comment-13973326 ] Xiangrui Meng edited comment on SPARK-1520 at 4/17/14 7:59 PM:

[jira] [Commented] (SPARK-1520) Inclusion of breeze corrupts assembly when compiled with JDK7 and run on JDK6

2014-04-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13973326#comment-13973326 ] Xiangrui Meng commented on SPARK-1520: -- The quick fix may be removing fastutil. In

[jira] [Commented] (SPARK-1520) Assembly Jar with more than 65536 files won't work when compiled on JDK7 and run on JDK6

2014-04-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13973463#comment-13973463 ] Xiangrui Meng commented on SPARK-1520: -- It seems HyperLogLog doesn't need fastutil,

[jira] [Resolved] (SPARK-1464) Update MLLib Examples to Use Breeze

2014-04-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1464. -- Resolution: Duplicate Update MLLib Examples to Use Breeze

[jira] [Assigned] (SPARK-1520) Assembly Jar with more than 65536 files won't work when compiled on JDK7 and run on JDK6

2014-04-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-1520: Assignee: Xiangrui Meng Assembly Jar with more than 65536 files won't work when compiled

[jira] [Created] (SPARK-1533) The (kill) button in the web UI is visible to everyone.

2014-04-18 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1533: Summary: The (kill) button in the web UI is visible to everyone. Key: SPARK-1533 URL: https://issues.apache.org/jira/browse/SPARK-1533 Project: Spark Issue

[jira] [Updated] (SPARK-1485) Implement AllReduce

2014-04-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1485: - Affects Version/s: (was: 1.0.0) Implement AllReduce ---

[jira] [Created] (SPARK-1561) sbt/sbt assembly generates too many local files

2014-04-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1561: Summary: sbt/sbt assembly generates too many local files Key: SPARK-1561 URL: https://issues.apache.org/jira/browse/SPARK-1561 Project: Spark Issue Type:

[jira] [Updated] (SPARK-1561) sbt/sbt assembly generates too many local files

2014-04-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1561: - Description: Running `find ./ | wc -l` after `sbt/sbt assembly` returned This hits the

[jira] [Commented] (SPARK-1561) sbt/sbt assembly generates too many local files

2014-04-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13976487#comment-13976487 ] Xiangrui Meng commented on SPARK-1561: -- Tried adding {code} assemblyOption in

[jira] [Updated] (SPARK-1561) sbt/sbt assembly generates too many local files

2014-04-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1561: - Description: Running `find ./ | wc -l` after `sbt/sbt assembly` returned 564365 This hits the

[jira] [Updated] (SPARK-1595) Remove VectorRDDs

2014-04-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1595: - Affects Version/s: 1.0.0 Remove VectorRDDs - Key: SPARK-1595

[jira] [Created] (SPARK-1599) Allow to use intercept in Ridge and Lasso

2014-04-23 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1599: Summary: Allow to use intercept in Ridge and Lasso Key: SPARK-1599 URL: https://issues.apache.org/jira/browse/SPARK-1599 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-1634) Java API docs contain test cases

2014-04-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1634: - Summary: Java API docs contain test cases (was: JavaDoc contains test cases) Java API docs

[jira] [Updated] (SPARK-1634) Java API docs contain test cases

2014-04-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1634: - Description: The generated Java API docs contain all test cases. (was: The generated Java API

[jira] [Created] (SPARK-1635) Java API docs do not show annotation.

2014-04-25 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1635: Summary: Java API docs do not show annotation. Key: SPARK-1635 URL: https://issues.apache.org/jira/browse/SPARK-1635 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-1636) Move main methods to examples

2014-04-25 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1636: Summary: Move main methods to examples Key: SPARK-1636 URL: https://issues.apache.org/jira/browse/SPARK-1636 Project: Spark Issue Type: Sub-task

[jira] [Resolved] (SPARK-1598) Mark main methods experimental

2014-04-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1598. -- Resolution: Duplicate We will move main methods to examples instead. Mark main methods

[jira] [Closed] (SPARK-1634) Java API docs contain test cases

2014-04-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-1634. Resolution: Not a Problem Fix Version/s: 1.0.0 Assignee: Xiangrui Meng Re-tried

[jira] [Created] (SPARK-1668) Add implicit preference as an option to examples/MovieLensALS

2014-04-29 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1668: Summary: Add implicit preference as an option to examples/MovieLensALS Key: SPARK-1668 URL: https://issues.apache.org/jira/browse/SPARK-1668 Project: Spark

[jira] [Created] (SPARK-1674) Interrupted system call error in pyspark's RDD.pipe

2014-04-29 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1674: Summary: Interrupted system call error in pyspark's RDD.pipe Key: SPARK-1674 URL: https://issues.apache.org/jira/browse/SPARK-1674 Project: Spark Issue

[jira] [Resolved] (SPARK-1674) Interrupted system call error in pyspark's RDD.pipe

2014-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1674. -- Resolution: Fixed Fix Version/s: 1.0.0 Interrupted system call error in pyspark's

[jira] [Commented] (SPARK-1520) Assembly Jar with more than 65536 files won't work when compiled on JDK7 and run on JDK6

2014-05-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13989206#comment-13989206 ] Xiangrui Meng commented on SPARK-1520: -- Koert, which JDK6 did you use? This problem

[jira] [Created] (SPARK-1723) Add saveAsLibSVMFile

2014-05-05 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1723: Summary: Add saveAsLibSVMFile Key: SPARK-1723 URL: https://issues.apache.org/jira/browse/SPARK-1723 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-1724) Add appendBias

2014-05-05 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1724: Summary: Add appendBias Key: SPARK-1724 URL: https://issues.apache.org/jira/browse/SPARK-1724 Project: Spark Issue Type: Sub-task Components:

[jira] [Resolved] (SPARK-1595) Remove VectorRDDs

2014-05-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1595. -- Resolution: Fixed Fix Version/s: 1.0.0 https://github.com/apache/spark/pull/524

[jira] [Resolved] (SPARK-1723) Add saveAsLibSVMFile

2014-05-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1723. -- Resolution: Fixed Fix Version/s: 1.0.0 https://github.com/apache/spark/pull/524 Add

[jira] [Resolved] (SPARK-1596) Re-arrange public methods in evaluation.

2014-05-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1596. -- Resolution: Fixed Fix Version/s: 1.0.0 https://github.com/apache/spark/pull/524

[jira] [Resolved] (SPARK-1724) Add appendBias

2014-05-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1724. -- Resolution: Fixed Fix Version/s: 1.0.0 https://github.com/apache/spark/pull/524 Add

[jira] [Resolved] (SPARK-1599) Allow to use intercept in Ridge and Lasso

2014-05-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1599. -- Resolution: Fixed Fix Version/s: 1.0.0 https://github.com/apache/spark/pull/524 Allow

[jira] [Created] (SPARK-1741) Add predict(JavaRDD) to predictive models

2014-05-06 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1741: Summary: Add predict(JavaRDD) to predictive models Key: SPARK-1741 URL: https://issues.apache.org/jira/browse/SPARK-1741 Project: Spark Issue Type: New

[jira] [Created] (SPARK-1743) Add mllib.util.MLUtils.{loadLibSVMFile, saveAsLibSVMFile} to pyspark

2014-05-06 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1743: Summary: Add mllib.util.MLUtils.{loadLibSVMFile, saveAsLibSVMFile} to pyspark Key: SPARK-1743 URL: https://issues.apache.org/jira/browse/SPARK-1743 Project: Spark

[jira] [Commented] (SPARK-1743) Add mllib.util.MLUtils.{loadLibSVMFile, saveAsLibSVMFile} to pyspark

2014-05-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13991373#comment-13991373 ] Xiangrui Meng commented on SPARK-1743: -- PR: https://github.com/apache/spark/pull/672

[jira] [Commented] (SPARK-1741) Add predict(JavaRDD) to predictive models

2014-05-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13991378#comment-13991378 ] Xiangrui Meng commented on SPARK-1741: -- https://github.com/apache/spark/pull/670

[jira] [Created] (SPARK-1783) Title contains html code in MLlib guide

2014-05-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1783: Summary: Title contains html code in MLlib guide Key: SPARK-1783 URL: https://issues.apache.org/jira/browse/SPARK-1783 Project: Spark Issue Type:

[jira] [Commented] (SPARK-1675) Make clear whether computePrincipalComponents centers data

2014-05-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13998103#comment-13998103 ] Xiangrui Meng commented on SPARK-1675: -- Centering in PCA should be the standard

[jira] [Resolved] (SPARK-1668) Add implicit preference as an option to examples/MovieLensALS

2014-05-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1668. -- Resolution: Implemented Fix Version/s: 1.0.0 Add implicit preference as an option to

[jira] [Updated] (SPARK-1635) Java API docs do not show annotation.

2014-05-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1635: - Priority: Minor (was: Major) Java API docs do not show annotation.

[jira] [Commented] (SPARK-1696) RowMatrix.dspr is not using parameter alpha for DenseVector

2014-05-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13998127#comment-13998127 ] Xiangrui Meng commented on SPARK-1696: -- Thanks! I sent a PR:

[jira] [Resolved] (SPARK-1696) RowMatrix.dspr is not using parameter alpha for DenseVector

2014-05-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1696. -- Resolution: Fixed Fix Version/s: 1.0.0 RowMatrix.dspr is not using parameter alpha for

[jira] [Commented] (SPARK-1605) Improve mllib.linalg.Vector

2014-05-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13998106#comment-13998106 ] Xiangrui Meng commented on SPARK-1605: -- `toBreeze` exposes a breeze type. We might

[jira] [Resolved] (SPARK-1646) ALS micro-optimisation

2014-05-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1646. -- Resolution: Implemented Fix Version/s: 1.0.0 PR:

[jira] [Updated] (SPARK-1359) SGD implementation is not efficient

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1359: - Affects Version/s: 1.0.0 SGD implementation is not efficient

[jira] [Updated] (SPARK-1485) Implement AllReduce

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1485: - Priority: Critical (was: Major) Implement AllReduce ---

[jira] [Commented] (SPARK-1782) svd for sparse matrix using ARPACK

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13999499#comment-13999499 ] Xiangrui Meng commented on SPARK-1782: -- Btw, this approach only gives us \Sigma and

[jira] [Updated] (SPARK-1752) Standardize input/output format for vectors and labeled points

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1752: - Fix Version/s: 1.1.0 Standardize input/output format for vectors and labeled points

[jira] [Updated] (SPARK-1553) Support alternating nonnegative least-squares

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1553: - Fix Version/s: 1.1.0 Support alternating nonnegative least-squares

[jira] [Commented] (SPARK-1585) Not robust Lasso causes Infinity on weights and losses

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13999123#comment-13999123 ] Xiangrui Meng commented on SPARK-1585: -- I think the gradient should pull the weights

[jira] [Created] (SPARK-1855) Provide memory-and-local-disk RDD checkpointing

2014-05-16 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1855: Summary: Provide memory-and-local-disk RDD checkpointing Key: SPARK-1855 URL: https://issues.apache.org/jira/browse/SPARK-1855 Project: Spark Issue Type:

[jira] [Updated] (SPARK-1485) Implement AllReduce

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1485: - Fix Version/s: 1.1.0 Implement AllReduce --- Key: SPARK-1485

[jira] [Commented] (SPARK-1782) svd for sparse matrix using ARPACK

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13999493#comment-13999493 ] Xiangrui Meng commented on SPARK-1782: -- This sounds good to me. Let's assume that A

[jira] [Updated] (SPARK-1580) ALS: Estimate communication and computation costs given a partitioner

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1580: - Fix Version/s: 1.1.0 ALS: Estimate communication and computation costs given a partitioner

[jira] [Created] (SPARK-1856) Standardize MLlib interfaces

2014-05-16 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1856: Summary: Standardize MLlib interfaces Key: SPARK-1856 URL: https://issues.apache.org/jira/browse/SPARK-1856 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-1861) ArrayIndexOutOfBoundsException when reading bzip2 files

2014-05-16 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1861: Summary: ArrayIndexOutOfBoundsException when reading bzip2 files Key: SPARK-1861 URL: https://issues.apache.org/jira/browse/SPARK-1861 Project: Spark Issue

[jira] [Updated] (SPARK-1486) Support multi-model training in MLlib

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1486: - Priority: Critical (was: Major) Support multi-model training in MLlib

[jira] [Updated] (SPARK-1553) Support alternating nonnegative least-squares

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1553: - Priority: Major (was: Minor) Support alternating nonnegative least-squares

[jira] [Commented] (SPARK-1782) svd for sparse matrix using ARPACK

2014-05-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14000620#comment-14000620 ] Xiangrui Meng commented on SPARK-1782: -- If you need the the latest Breeze to use

[jira] [Commented] (SPARK-1861) ArrayIndexOutOfBoundsException when reading bzip2 files

2014-05-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14000677#comment-14000677 ] Xiangrui Meng commented on SPARK-1861: -- Patch available at

[jira] [Comment Edited] (SPARK-1859) Linear, Ridge and Lasso Regressions with SGD yield unexpected results

2014-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14001136#comment-14001136 ] Xiangrui Meng edited comment on SPARK-1859 at 5/18/14 5:27 PM:

[jira] [Commented] (SPARK-1859) Linear, Ridge and Lasso Regressions with SGD yield unexpected results

2014-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14001136#comment-14001136 ] Xiangrui Meng commented on SPARK-1859: -- The step size should be smaller than the

[jira] [Commented] (SPARK-1783) Title contains html code in MLlib guide

2014-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14001183#comment-14001183 ] Xiangrui Meng commented on SPARK-1783: -- Added `displayTitle` variable to the global

[jira] [Updated] (SPARK-1871) Improve MLlib guide for v1.0

2014-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1871: - Summary: Improve MLlib guide for v1.0 (was: Improve MLlib guide) Improve MLlib guide for v1.0

[jira] [Created] (SPARK-1872) Update api links for unidoc

2014-05-18 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1872: Summary: Update api links for unidoc Key: SPARK-1872 URL: https://issues.apache.org/jira/browse/SPARK-1872 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-1783) Title contains html code in MLlib guide

2014-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1783: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-1871 Title contains html code in

[jira] [Updated] (SPARK-1871) Improve MLlib guide for v1.0

2014-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1871: - Description: More improvements to MLlib guide. Improve MLlib guide for v1.0

[jira] [Commented] (SPARK-1870) Jars specified via --jars in spark-submit are not added to executor classpath for YARN

2014-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14001195#comment-14001195 ] Xiangrui Meng commented on SPARK-1870: -- I specified the jar via `--jars` and add it

[jira] [Comment Edited] (SPARK-1870) Jars specified via --jars in spark-submit are not added to executor classpath for YARN

2014-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14001195#comment-14001195 ] Xiangrui Meng edited comment on SPARK-1870 at 5/18/14 8:36 PM:

[jira] [Updated] (SPARK-1871) Improve MLlib guide

2014-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1871: - Component/s: Documentation Improve MLlib guide --- Key:

[jira] [Created] (SPARK-1871) Improve MLlib guide

2014-05-18 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1871: Summary: Improve MLlib guide Key: SPARK-1871 URL: https://issues.apache.org/jira/browse/SPARK-1871 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-1874) Clean up MLlib sample data

2014-05-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14001254#comment-14001254 ] Xiangrui Meng commented on SPARK-1874: -- Is `data/mllib` a better place than

[jira] [Commented] (SPARK-1874) Clean up MLlib sample data

2014-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14002535#comment-14002535 ] Xiangrui Meng commented on SPARK-1874: -- There are three files under `data/`:

[jira] [Commented] (SPARK-1870) Jars specified via --jars in spark-submit are not added to executor classpath for YARN

2014-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14002613#comment-14002613 ] Xiangrui Meng commented on SPARK-1870: -- I tested it on a Spark 1.0RC standalone

[jira] [Resolved] (SPARK-1861) ArrayIndexOutOfBoundsException when reading bzip2 files

2014-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1861. -- Resolution: Implemented Patch will be included for the next Hadoop release (1.3.0, 2.5.0).

[jira] [Updated] (SPARK-1870) Jars specified via --jars in spark-submit are not added to executor classpath for YARN

2014-05-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1870: - Issue Type: Sub-task (was: Bug) Parent: SPARK-1905 Jars specified via --jars in

[jira] [Created] (SPARK-1906) spark-submit doesn't send master URL to Driver in standalone cluster mode

2014-05-22 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1906: Summary: spark-submit doesn't send master URL to Driver in standalone cluster mode Key: SPARK-1906 URL: https://issues.apache.org/jira/browse/SPARK-1906 Project:

[jira] [Created] (SPARK-1908) Support local app jar in standalone cluster mode

2014-05-22 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1908: Summary: Support local app jar in standalone cluster mode Key: SPARK-1908 URL: https://issues.apache.org/jira/browse/SPARK-1908 Project: Spark Issue Type:

[jira] [Created] (SPARK-1909) --jars is not supported in standalone cluster mode

2014-05-22 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-1909: Summary: --jars is not supported in standalone cluster mode Key: SPARK-1909 URL: https://issues.apache.org/jira/browse/SPARK-1909 Project: Spark Issue Type:

[jira] [Updated] (SPARK-1900) Fix running PySpark files on YARN

2014-05-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1900: - Issue Type: Sub-task (was: Bug) Parent: SPARK-1652 Fix running PySpark files on YARN

[jira] [Updated] (SPARK-1900) Fix running PySpark files on YARN

2014-05-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1900: - Issue Type: Bug (was: Sub-task) Parent: (was: SPARK-1905) Fix running PySpark

[jira] [Updated] (SPARK-1870) Jars specified via --jars in spark-submit are not added to executor classpath for YARN

2014-05-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1870: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-1652 Jars specified via --jars

[jira] [Updated] (SPARK-1906) spark-submit doesn't send master URL to Driver in standalone cluster mode

2014-05-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1906: - Issue Type: Improvement (was: Sub-task) Parent: (was: SPARK-1905) spark-submit

[jira] [Updated] (SPARK-1909) --jars is not supported in standalone cluster mode

2014-05-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1909: - Issue Type: Improvement (was: Sub-task) Parent: (was: SPARK-1905) --jars is not

[jira] [Updated] (SPARK-1908) Support local app jar in standalone cluster mode

2014-05-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1908: - Issue Type: Bug (was: Sub-task) Parent: (was: SPARK-1905) Support local app jar in

[jira] [Updated] (SPARK-1908) Support local app jar in standalone cluster mode

2014-05-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1908: - Issue Type: Sub-task (was: Bug) Parent: SPARK-1652 Support local app jar in standalone

[jira] [Updated] (SPARK-1906) spark-submit doesn't send master URL to Driver in standalone cluster mode

2014-05-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1906: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-1652 spark-submit doesn't send

[jira] [Resolved] (SPARK-1905) Issues with `spark-submit`

2014-05-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1905. -- Resolution: Duplicate Issues with `spark-submit` --

  1   2   3   4   5   6   7   8   9   10   >