Repository: spark
Updated Branches:
  refs/heads/master 3e4f1666a -> 6c66ab8b3


[SPARK-24688][EXAMPLES] Modify the comments about LabeledPoint

## What changes were proposed in this pull request?

An RDD is created using LabeledPoint, but the comment is like 
#LabeledPoint(feature, label).
Although in the method ChiSquareTest.test, the second parameter is feature and 
the third parameter is label, it it better to write label in front of feature 
here because if an RDD is created using LabeldPoint, what we get are actually 
(label, feature) pairs.
Now it is changed as LabeledPoint(label, feature).

The comments in Scala and Java example have the same typos.

## How was this patch tested?

tested

https://issues.apache.org/jira/browse/SPARK-24688

Author: Weizhe Huang 492816239qq.com

Please review http://spark.apache.org/contributing.html before opening a pull 
request.

Closes #21665 from uzmijnlm/my_change.

Authored-by: Huangweizhe <huangwei...@bbdservice.com>
Signed-off-by: Sean Owen <sean.o...@databricks.com>


Project: http://git-wip-us.apache.org/repos/asf/spark/repo
Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/6c66ab8b
Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/6c66ab8b
Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/6c66ab8b

Branch: refs/heads/master
Commit: 6c66ab8b334c5358bc77995650f1886e4c43231d
Parents: 3e4f166
Author: Huangweizhe <huangwei...@bbdservice.com>
Authored: Sat Aug 25 09:24:20 2018 -0500
Committer: Sean Owen <sean.o...@databricks.com>
Committed: Sat Aug 25 09:24:20 2018 -0500

----------------------------------------------------------------------
 .../spark/examples/mllib/JavaHypothesisTestingExample.java       | 2 +-
 examples/src/main/python/mllib/hypothesis_testing_example.py     | 2 +-
 .../apache/spark/examples/mllib/HypothesisTestingExample.scala   | 4 ++--
 3 files changed, 4 insertions(+), 4 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/spark/blob/6c66ab8b/examples/src/main/java/org/apache/spark/examples/mllib/JavaHypothesisTestingExample.java
----------------------------------------------------------------------
diff --git 
a/examples/src/main/java/org/apache/spark/examples/mllib/JavaHypothesisTestingExample.java
 
b/examples/src/main/java/org/apache/spark/examples/mllib/JavaHypothesisTestingExample.java
index b48b95f..2732736 100644
--- 
a/examples/src/main/java/org/apache/spark/examples/mllib/JavaHypothesisTestingExample.java
+++ 
b/examples/src/main/java/org/apache/spark/examples/mllib/JavaHypothesisTestingExample.java
@@ -67,7 +67,7 @@ public class JavaHypothesisTestingExample {
       )
     );
 
-    // The contingency table is constructed from the raw (feature, label) 
pairs and used to conduct
+    // The contingency table is constructed from the raw (label, feature) 
pairs and used to conduct
     // the independence test. Returns an array containing the 
ChiSquaredTestResult for every feature
     // against the label.
     ChiSqTestResult[] featureTestResults = Statistics.chiSqTest(obs.rdd());

http://git-wip-us.apache.org/repos/asf/spark/blob/6c66ab8b/examples/src/main/python/mllib/hypothesis_testing_example.py
----------------------------------------------------------------------
diff --git a/examples/src/main/python/mllib/hypothesis_testing_example.py 
b/examples/src/main/python/mllib/hypothesis_testing_example.py
index e566ead..21a5584 100644
--- a/examples/src/main/python/mllib/hypothesis_testing_example.py
+++ b/examples/src/main/python/mllib/hypothesis_testing_example.py
@@ -51,7 +51,7 @@ if __name__ == "__main__":
         [LabeledPoint(1.0, [1.0, 0.0, 3.0]),
          LabeledPoint(1.0, [1.0, 2.0, 0.0]),
          LabeledPoint(1.0, [-1.0, 0.0, -0.5])]
-    )  # LabeledPoint(feature, label)
+    )  # LabeledPoint(label, feature)
 
     # The contingency table is constructed from an RDD of LabeledPoint and 
used to conduct
     # the independence test. Returns an array containing the 
ChiSquaredTestResult for every feature

http://git-wip-us.apache.org/repos/asf/spark/blob/6c66ab8b/examples/src/main/scala/org/apache/spark/examples/mllib/HypothesisTestingExample.scala
----------------------------------------------------------------------
diff --git 
a/examples/src/main/scala/org/apache/spark/examples/mllib/HypothesisTestingExample.scala
 
b/examples/src/main/scala/org/apache/spark/examples/mllib/HypothesisTestingExample.scala
index add1719..9b3c326 100644
--- 
a/examples/src/main/scala/org/apache/spark/examples/mllib/HypothesisTestingExample.scala
+++ 
b/examples/src/main/scala/org/apache/spark/examples/mllib/HypothesisTestingExample.scala
@@ -61,9 +61,9 @@ object HypothesisTestingExample {
           LabeledPoint(-1.0, Vectors.dense(-1.0, 0.0, -0.5)
           )
         )
-      ) // (feature, label) pairs.
+      ) // (label, feature) pairs.
 
-    // The contingency table is constructed from the raw (feature, label) 
pairs and used to conduct
+    // The contingency table is constructed from the raw (label, feature) 
pairs and used to conduct
     // the independence test. Returns an array containing the 
ChiSquaredTestResult for every feature
     // against the label.
     val featureTestResults: Array[ChiSqTestResult] = Statistics.chiSqTest(obs)


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

Reply via email to