[jira] [Commented] (SPARK-10518) Update code examples in spark.ml user guide to use LIBSVM data source instead of MLUtils

2015-09-12 Thread shimizu yoshihiro (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-10518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14741922#comment-14741922
 ] 

shimizu yoshihiro commented on SPARK-10518:
---

[~mengxr] Thank you for review on github. Here is my account name. Thanks!

> Update code examples in spark.ml user guide to use LIBSVM data source instead 
> of MLUtils
> 
>
> Key: SPARK-10518
> URL: https://issues.apache.org/jira/browse/SPARK-10518
> Project: Spark
>  Issue Type: Improvement
>  Components: Documentation, MLlib
>Affects Versions: 1.5.0
>Reporter: Xiangrui Meng
>Priority: Minor
>  Labels: starter
> Fix For: 1.6.0
>
>
> SPARK-10117 was merged, we should use LIBSVM data source in the example code 
> in spark.ml user guide, e.g.,
> {code}
> val df = sqlContext.read.format("libsvm").load("path")
> {code}
> instead of
> {code}
> val df = MLUtils.loadLibSVMFile(sc, "path").toDF()
> {code}
> We should update the following:
> {code}
> ml-ensembles.md:40:val data = MLUtils.loadLibSVMFile(sc,
> ml-ensembles.md:87:RDD data = MLUtils.loadLibSVMFile(jsc.sc(),
> ml-features.md:866:val data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt").toDF()
> ml-features.md:892:JavaRDD rdd = MLUtils.loadLibSVMFile(sc.sc(),
> ml-features.md:917:data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt").toDF()
> ml-features.md:940:val data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt")
> ml-features.md:964:  MLUtils.loadLibSVMFile(jsc.sc(), 
> "data/mllib/sample_libsvm_data.txt").toJavaRDD();
> ml-features.md:985:data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt")
> ml-features.md:1022:val data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt")
> ml-features.md:1047:  MLUtils.loadLibSVMFile(jsc.sc(), 
> "data/mllib/sample_libsvm_data.txt").toJavaRDD();
> ml-features.md:1068:data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt")
> ml-linear-methods.md:44:val training = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt").toDF()
> ml-linear-methods.md:84:DataFrame training = 
> sql.createDataFrame(MLUtils.loadLibSVMFile(sc, path).toJavaRDD(), 
> LabeledPoint.class);
> ml-linear-methods.md:110:training = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt").toDF()
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-10518) Update code examples in spark.ml user guide to use LIBSVM data source instead of MLUtils

2015-09-10 Thread Apache Spark (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-10518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738625#comment-14738625
 ] 

Apache Spark commented on SPARK-10518:
--

User 'y-shimizu' has created a pull request for this issue:
https://github.com/apache/spark/pull/8697

> Update code examples in spark.ml user guide to use LIBSVM data source instead 
> of MLUtils
> 
>
> Key: SPARK-10518
> URL: https://issues.apache.org/jira/browse/SPARK-10518
> Project: Spark
>  Issue Type: Improvement
>  Components: Documentation, MLlib
>Affects Versions: 1.5.0
>Reporter: Xiangrui Meng
>Priority: Minor
>  Labels: starter
>
> SPARK-10117 was merged, we should use LIBSVM data source in the example code 
> in spark.ml user guide, e.g.,
> {code}
> val df = sqlContext.read.format("libsvm").load("path")
> {code}
> instead of
> {code}
> val df = MLUtils.loadLibSVMFile(sc, "path").toDF()
> {code}
> We should update the following:
> {code}
> ml-ensembles.md:40:val data = MLUtils.loadLibSVMFile(sc,
> ml-ensembles.md:87:RDD data = MLUtils.loadLibSVMFile(jsc.sc(),
> ml-features.md:866:val data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt").toDF()
> ml-features.md:892:JavaRDD rdd = MLUtils.loadLibSVMFile(sc.sc(),
> ml-features.md:917:data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt").toDF()
> ml-features.md:940:val data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt")
> ml-features.md:964:  MLUtils.loadLibSVMFile(jsc.sc(), 
> "data/mllib/sample_libsvm_data.txt").toJavaRDD();
> ml-features.md:985:data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt")
> ml-features.md:1022:val data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt")
> ml-features.md:1047:  MLUtils.loadLibSVMFile(jsc.sc(), 
> "data/mllib/sample_libsvm_data.txt").toJavaRDD();
> ml-features.md:1068:data = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt")
> ml-linear-methods.md:44:val training = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt").toDF()
> ml-linear-methods.md:84:DataFrame training = 
> sql.createDataFrame(MLUtils.loadLibSVMFile(sc, path).toJavaRDD(), 
> LabeledPoint.class);
> ml-linear-methods.md:110:training = MLUtils.loadLibSVMFile(sc, 
> "data/mllib/sample_libsvm_data.txt").toDF()
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org