[jira] [Commented] (SPARK-14315) GLMs model persistence in SparkR

2016-03-31 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15220730#comment-15220730 ] Xiangrui Meng commented on SPARK-14315: --- Hold until SPARK-14303 is done. > GLMs mo

[jira] [Updated] (SPARK-14311) Model persistence in SparkR

2016-03-31 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14311: -- Description: In Spark 2.0, we are going to have 4 ML models in SparkR: GLMs, k-means, naive Ba

[jira] [Created] (SPARK-14313) AFTSurvivalRegression model persistence in SparkR

2016-03-31 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14313: - Summary: AFTSurvivalRegression model persistence in SparkR Key: SPARK-14313 URL: https://issues.apache.org/jira/browse/SPARK-14313 Project: Spark Issue Typ

[jira] [Created] (SPARK-14315) GLMs model persistence in SparkR

2016-03-31 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14315: - Summary: GLMs model persistence in SparkR Key: SPARK-14315 URL: https://issues.apache.org/jira/browse/SPARK-14315 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-14314) K-means model persistence in SparkR

2016-03-31 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14314: - Summary: K-means model persistence in SparkR Key: SPARK-14314 URL: https://issues.apache.org/jira/browse/SPARK-14314 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-14311) Model persistence in SparkR

2016-03-31 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14311: - Summary: Model persistence in SparkR Key: SPARK-14311 URL: https://issues.apache.org/jira/browse/SPARK-14311 Project: Spark Issue Type: Umbrella

[jira] [Created] (SPARK-14312) NaiveBayes model persistence in SparkR

2016-03-31 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14312: - Summary: NaiveBayes model persistence in SparkR Key: SPARK-14312 URL: https://issues.apache.org/jira/browse/SPARK-14312 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-14303) Refactor SparkRWrappers

2016-03-31 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14303: -- Description: We use a single object `SparkRWrappers` (https://github.com/apache/spark/blob/mas

[jira] [Created] (SPARK-14303) Refactor SparkRWrappers

2016-03-31 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14303: - Summary: Refactor SparkRWrappers Key: SPARK-14303 URL: https://issues.apache.org/jira/browse/SPARK-14303 Project: Spark Issue Type: Improvement C

[jira] [Updated] (SPARK-14164) Improve input layer validation of MultilayerPerceptronClassifier

2016-03-31 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14164: -- Assignee: Dongjoon Hyun > Improve input layer validation of MultilayerPerceptronClassifier > --

[jira] [Resolved] (SPARK-14164) Improve input layer validation of MultilayerPerceptronClassifier

2016-03-31 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14164. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11964 [https://g

[jira] [Updated] (SPARK-14187) Incorrect use of binarysearch in SparseMatrix

2016-03-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14187: -- Assignee: Chenliang Xu > Incorrect use of binarysearch in SparseMatrix > --

[jira] [Updated] (SPARK-14187) Incorrect use of binarysearch in SparseMatrix

2016-03-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14187: -- Affects Version/s: 2.0.0 1.2.2 1.3.1

[jira] [Updated] (SPARK-14187) Incorrect use of binarysearch in SparseMatrix

2016-03-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14187: -- Target Version/s: 1.5.3, 1.6.2, 2.0.0 > Incorrect use of binarysearch in SparseMatrix > ---

[jira] [Resolved] (SPARK-14187) Incorrect use of binarysearch in SparseMatrix

2016-03-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14187. --- Resolution: Fixed Fix Version/s: 1.6.2 1.5.3 2.0.

[jira] [Updated] (SPARK-14159) StringIndexerModel sets output column metadata incorrectly

2016-03-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14159: -- Target Version/s: 1.6.2, 2.0.0 (was: 2.0.0) > StringIndexerModel sets output column metadata i

[jira] [Reopened] (SPARK-14159) StringIndexerModel sets output column metadata incorrectly

2016-03-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reopened SPARK-14159: --- > StringIndexerModel sets output column metadata incorrectly > --

[jira] [Resolved] (SPARK-14159) StringIndexerModel sets output column metadata incorrectly

2016-03-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14159. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11965 [https://g

[jira] [Resolved] (SPARK-13010) Survival analysis in SparkR

2016-03-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13010. --- Resolution: Fixed Fix Version/s: 2.0.0 > Survival analysis in SparkR > ---

[jira] [Updated] (SPARK-13949) PySpark ml DecisionTreeClassifier, Regressor support export/import

2016-03-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13949: -- Assignee: Gayathri Murali > PySpark ml DecisionTreeClassifier, Regressor support export/import

[jira] [Resolved] (SPARK-13949) PySpark ml DecisionTreeClassifier, Regressor support export/import

2016-03-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13949. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11892 [https://g

[jira] [Updated] (SPARK-13782) Model export/import for spark.ml: BisectingKMeans

2016-03-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13782: -- Assignee: yuhao yang > Model export/import for spark.ml: BisectingKMeans >

[jira] [Updated] (SPARK-13782) Model export/import for spark.ml: BisectingKMeans

2016-03-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13782: -- Shepherd: Xiangrui Meng Target Version/s: 2.0.0 > Model export/import for spark.ml:

[jira] [Resolved] (SPARK-14107) PySpark spark.ml GBT algs need seed Param

2016-03-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14107. --- Resolution: Fixed Fix Version/s: 1.6.2 2.0.0 Issue resolved by pull

[jira] [Updated] (SPARK-14107) PySpark spark.ml GBT algs need seed Param

2016-03-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14107: -- Assignee: Seth Hendrickson > PySpark spark.ml GBT algs need seed Param > --

[jira] [Resolved] (SPARK-11871) Model export/import for spark.ml: Multilayer Perceptron

2016-03-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11871. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 9854 [https://gi

[jira] [Resolved] (SPARK-13017) Replace example code in mllib-feature-extraction.md using include_example

2016-03-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13017. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11142 [https://g

[jira] [Commented] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-03-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15208770#comment-15208770 ] Xiangrui Meng commented on SPARK-7992: -- [~jodersky] Any updates? > Hide private clas

[jira] [Commented] (SPARK-14074) Use fixed version of install_github in SparkR build

2016-03-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15208698#comment-15208698 ] Xiangrui Meng commented on SPARK-14074: --- Yes, the merge script chose 2.1.0 automati

[jira] [Updated] (SPARK-14074) Use fixed version of install_github in SparkR build

2016-03-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14074: -- Affects Version/s: 1.6.2 > Use fixed version of install_github in SparkR build > --

[jira] [Updated] (SPARK-14074) Use fixed version of install_github in SparkR build

2016-03-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14074: -- Fix Version/s: (was: 2.1.0) 2.0.0 > Use fixed version of install_github

[jira] [Resolved] (SPARK-14074) Use fixed version of install_github in SparkR build

2016-03-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14074. --- Resolution: Fixed Fix Version/s: 1.6.2 2.1.0 Issue resolved by pull

[jira] [Updated] (SPARK-14074) Do not use install_github in SparkR build

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14074: -- Target Version/s: 1.6.2, 2.0.0 > Do not use install_github in SparkR build > --

[jira] [Commented] (SPARK-14074) Do not use install_github in SparkR build

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15207949#comment-15207949 ] Xiangrui Meng commented on SPARK-14074: --- +1 on using commit. We should consider swi

[jira] [Updated] (SPARK-14074) Do not use install_github in SparkR build

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14074: -- Assignee: Sun Rui > Do not use install_github in SparkR build > ---

[jira] [Created] (SPARK-14084) Parallel training jobs in model selection

2016-03-22 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14084: - Summary: Parallel training jobs in model selection Key: SPARK-14084 URL: https://issues.apache.org/jira/browse/SPARK-14084 Project: Spark Issue Type: New F

[jira] [Updated] (SPARK-14084) Parallel training jobs in model selection

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14084: -- Description: In CrossValidator and TrainValidationSplit, we run training jobs one by one. If us

[jira] [Resolved] (SPARK-13449) Naive Bayes wrapper in SparkR

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13449. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11890 [https://g

[jira] [Updated] (SPARK-14074) Do not use install_github in SparkR build

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14074: -- Priority: Critical (was: Major) > Do not use install_github in SparkR build >

[jira] [Updated] (SPARK-14074) Do not use install_github in SparkR build

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14074: -- Priority: Major (was: Critical) > Do not use install_github in SparkR build >

[jira] [Updated] (SPARK-13951) PySpark ml.pipeline support export/import - nested Piplines

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13951: -- Assignee: Xusen Yin > PySpark ml.pipeline support export/import - nested Piplines > ---

[jira] [Resolved] (SPARK-13951) PySpark ml.pipeline support export/import - nested Piplines

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13951. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11866 [https://g

[jira] [Updated] (SPARK-14077) Support weighted instances in naive Bayes

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14077: -- Labels: naive-bayes (was: ) > Support weighted instances in naive Bayes >

[jira] [Created] (SPARK-14077) Support weighted instances in naive Bayes

2016-03-22 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14077: - Summary: Support weighted instances in naive Bayes Key: SPARK-14077 URL: https://issues.apache.org/jira/browse/SPARK-14077 Project: Spark Issue Type: New F

[jira] [Updated] (SPARK-14076) Naive Bayes should output attributes in predictions

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14076: -- Labels: naive-bayes (was: ) > Naive Bayes should output attributes in predictions > --

[jira] [Created] (SPARK-14076) Naive Bayes should output attributes in predictions

2016-03-22 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14076: - Summary: Naive Bayes should output attributes in predictions Key: SPARK-14076 URL: https://issues.apache.org/jira/browse/SPARK-14076 Project: Spark Issue T

[jira] [Updated] (SPARK-14006) Builds of 1.6 branch fail R style check

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14006: -- Assignee: Sun Rui > Builds of 1.6 branch fail R style check > -

[jira] [Resolved] (SPARK-14006) Builds of 1.6 branch fail R style check

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14006. --- Resolution: Fixed Fix Version/s: 1.6.2 Issue resolved by pull request 11884 [https://g

[jira] [Updated] (SPARK-14006) Builds of 1.6 branch fail R style check

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14006: -- Affects Version/s: 1.6.1 > Builds of 1.6 branch fail R style check > --

[jira] [Updated] (SPARK-14074) Do not use install_github in SparkR build

2016-03-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14074: -- Description: In dev/lint-r.R, `install_github` makes our builds depend on a unstable source. W

[jira] [Created] (SPARK-14074) Do not use install_github in SparkR build

2016-03-22 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14074: - Summary: Do not use install_github in SparkR build Key: SPARK-14074 URL: https://issues.apache.org/jira/browse/SPARK-14074 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-14059) Define R wrappers under org.apache.spark.ml.r

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14059: -- Affects Version/s: (was: 1.6.0) 1.6.1 > Define R wrappers under org.

[jira] [Updated] (SPARK-14059) Define R wrappers under org.apache.spark.ml.r

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14059: -- Description: Currently, the wrapper files are under .../ml/r but the wrapper classes are defin

[jira] [Updated] (SPARK-14059) Define R wrappers under org.apache.spark.ml.r

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14059: -- Affects Version/s: 1.6.0 > Define R wrappers under org.apache.spark.ml.r >

[jira] [Created] (SPARK-14059) Define R wrappers under org.apache.spark.ml.r

2016-03-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14059: - Summary: Define R wrappers under org.apache.spark.ml.r Key: SPARK-14059 URL: https://issues.apache.org/jira/browse/SPARK-14059 Project: Spark Issue Type: B

[jira] [Updated] (SPARK-14030) Add parameter check to LBFGS

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14030: -- Assignee: zhengruifeng > Add parameter check to LBFGS > > >

[jira] [Updated] (SPARK-14030) Add parameter check to LBFGS

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14030: -- Target Version/s: 2.0.0 > Add parameter check to LBFGS > > >

[jira] [Created] (SPARK-14053) Merge absTol and relTol into one in MLlib tests

2016-03-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14053: - Summary: Merge absTol and relTol into one in MLlib tests Key: SPARK-14053 URL: https://issues.apache.org/jira/browse/SPARK-14053 Project: Spark Issue Type:

[jira] [Updated] (SPARK-14053) Merge absTol and relTol into one in MLlib tests

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14053: -- Description: We have absTol and relTol in MLlib tests to compare values with possible numerica

[jira] [Resolved] (SPARK-13019) Replace example code in mllib-statistics.md using include_example

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13019. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11108 [https://g

[jira] [Updated] (SPARK-12869) Optimize conversion from BlockMatrix to IndexedRowMatrix

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12869: -- Assignee: Fokko Driesprong > Optimize conversion from BlockMatrix to IndexedRowMatrix > ---

[jira] [Updated] (SPARK-12869) Optimize conversion from BlockMatrix to IndexedRowMatrix

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12869: -- Shepherd: Xiangrui Meng Target Version/s: 2.0.0 > Optimize conversion from BlockMat

[jira] [Updated] (SPARK-14041) Locate possible duplicates and group them into subtasks

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14041: -- Description: Please go through the current example code and list possible duplicates. > Locate

[jira] [Updated] (SPARK-13461) Merge and clean up duplicated MLlib example code

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13461: -- Summary: Merge and clean up duplicated MLlib example code (was: Duplicated example code merge

[jira] [Updated] (SPARK-13461) Duplicated example code merge and cleanup

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13461: -- Target Version/s: 2.0.0 > Duplicated example code merge and cleanup > -

[jira] [Updated] (SPARK-13461) Duplicated example code merge and cleanup

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13461: -- Component/s: MLlib ML > Duplicated example code merge and cleanup > --

[jira] [Commented] (SPARK-13461) Duplicated example code merge and cleanup

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15204841#comment-15204841 ] Xiangrui Meng commented on SPARK-13461: --- I changed the JIRA type to Umbrella. As I

[jira] [Created] (SPARK-14041) Locate possible duplicates and group them into subtasks

2016-03-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14041: - Summary: Locate possible duplicates and group them into subtasks Key: SPARK-14041 URL: https://issues.apache.org/jira/browse/SPARK-14041 Project: Spark Iss

[jira] [Updated] (SPARK-13461) Duplicated example code merge and cleanup

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13461: -- Issue Type: Umbrella (was: Sub-task) Parent: (was: SPARK-11337) > Duplicated examp

[jira] [Updated] (SPARK-8884) 1-sample Anderson-Darling Goodness-of-Fit test

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8884: - Target Version/s: 2.0.0 Priority: Major (was: Minor) > 1-sample Anderson-Darling Good

[jira] [Updated] (SPARK-8884) 1-sample Anderson-Darling Goodness-of-Fit test

2016-03-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8884: - Shepherd: Xiangrui Meng > 1-sample Anderson-Darling Goodness-of-Fit test > ---

[jira] [Updated] (SPARK-12626) MLlib 2.0 Roadmap

2016-03-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12626: -- Description: This is a master list for MLlib improvements we plan to have in Spark 2.0. Please

[jira] [Updated] (SPARK-11011) UserDefinedType serialization should be strongly typed

2016-03-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11011: -- Assignee: Jakob Odersky > UserDefinedType serialization should be strongly typed >

[jira] [Updated] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7992: - Assignee: (was: Xiangrui Meng) > Hide private classes/objects in in generated Java API doc > -

[jira] [Updated] (SPARK-10574) HashingTF should use MurmurHash3

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10574: -- Assignee: Yanbo Liang > HashingTF should use MurmurHash3 > > >

[jira] [Commented] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15200106#comment-15200106 ] Xiangrui Meng commented on SPARK-7992: -- Great! I pinged you on the old genjavadoc PR

[jira] [Created] (SPARK-13944) Separate out local linear algebra as a standalone module without Spark dependency

2016-03-19 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-13944: - Summary: Separate out local linear algebra as a standalone module without Spark dependency Key: SPARK-13944 URL: https://issues.apache.org/jira/browse/SPARK-13944 P

[jira] [Commented] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15197676#comment-15197676 ] Xiangrui Meng commented on SPARK-7992: -- [~jodersky] Do you mind taking a look at this

[jira] [Resolved] (SPARK-13613) Provide ignored tests to export test dataset into CSV format

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13613. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11463 [https://g

[jira] [Updated] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7992: - Target Version/s: 2.0.0 Component/s: Build > Hide private classes/objects in in generated

[jira] [Updated] (SPARK-13613) Provide ignored tests to export test dataset into CSV format

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13613: -- Target Version/s: 2.0.0 > Provide ignored tests to export test dataset into CSV format > --

[jira] [Updated] (SPARK-10574) HashingTF should use MurmurHash3

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10574: -- Target Version/s: 2.0.0 Priority: Major (was: Critical) > HashingTF should use Mur

[jira] [Resolved] (SPARK-11011) UserDefinedType serialization should be strongly typed

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11011. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11379 [https://g

[jira] [Updated] (SPARK-13613) Provide ignored tests to export test dataset into CSV format

2016-03-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13613: -- Assignee: Yanbo Liang > Provide ignored tests to export test dataset into CSV format >

[jira] [Commented] (SPARK-13857) Feature parity for ALS ML with MLLIB

2016-03-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15202240#comment-15202240 ] Xiangrui Meng commented on SPARK-13857: --- +1. We need to figure out the semantics in

[jira] [Updated] (SPARK-10788) Decision Tree duplicates bins for unordered categorical features

2016-03-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10788: -- Target Version/s: 2.0.0 > Decision Tree duplicates bins for unordered categorical features > --

[jira] [Updated] (SPARK-10788) Decision Tree duplicates bins for unordered categorical features

2016-03-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10788: -- Shepherd: Joseph K. Bradley > Decision Tree duplicates bins for unordered categorical features

[jira] [Updated] (SPARK-10788) Decision Tree duplicates bins for unordered categorical features

2016-03-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10788: -- Assignee: Seth Hendrickson > Decision Tree duplicates bins for unordered categorical features >

[jira] [Updated] (SPARK-13927) Add row/column iterator to local matrix

2016-03-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13927: -- Summary: Add row/column iterator to local matrix (was: Add row/column iterator to matrix) > A

[jira] [Updated] (SPARK-13927) Add row/column iterator to local matrices

2016-03-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13927: -- Summary: Add row/column iterator to local matrices (was: Add row/column iterator to local matr

[jira] [Created] (SPARK-13927) Add row/column iterator to matrix

2016-03-15 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-13927: - Summary: Add row/column iterator to matrix Key: SPARK-13927 URL: https://issues.apache.org/jira/browse/SPARK-13927 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-13925) Expose R-like summary statistics in SparkR::glm for more family and link functions

2016-03-15 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-13925: - Summary: Expose R-like summary statistics in SparkR::glm for more family and link functions Key: SPARK-13925 URL: https://issues.apache.org/jira/browse/SPARK-13925

[jira] [Updated] (SPARK-13925) Expose R-like summary statistics in SparkR::glm for more family and link functions

2016-03-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13925: -- Priority: Critical (was: Major) > Expose R-like summary statistics in SparkR::glm for more fam

[jira] [Resolved] (SPARK-9837) Provide R-like summary statistics for GLMs via iteratively reweighted least squares

2016-03-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-9837. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11694 [https://gith

[jira] [Resolved] (SPARK-13686) Add a constructor parameter `regParam` to (Streaming)LinearRegressionWithSGD

2016-03-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13686. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11527 [https://g

[jira] [Updated] (SPARK-13686) Add a constructor parameter `regParam` to (Streaming)LinearRegressionWithSGD

2016-03-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13686: -- Assignee: Dongjoon Hyun > Add a constructor parameter `regParam` to (Streaming)LinearRegression

[jira] [Updated] (SPARK-13715) Remove last usages of jblas in tests

2016-03-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13715: -- Shepherd: Xiangrui Meng > Remove last usages of jblas in tests > --

[jira] [Updated] (SPARK-13715) Remove last usages of jblas in tests

2016-03-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13715: -- Assignee: Sean Owen > Remove last usages of jblas in tests > --

[jira] [Created] (SPARK-13733) Support initial weight distribution in personalized PageRank

2016-03-07 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-13733: - Summary: Support initial weight distribution in personalized PageRank Key: SPARK-13733 URL: https://issues.apache.org/jira/browse/SPARK-13733 Project: Spark

[jira] [Updated] (SPARK-13319) Pyspark VectorSlicer, StopWordsRemvoer should have setDefault

2016-03-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13319: -- Assignee: Xusen Yin > Pyspark VectorSlicer, StopWordsRemvoer should have setDefault > -
