zhengruifeng created SPARK-31494:
------------------------------------

             Summary: flatten the result dataframe of ANOVATest
                 Key: SPARK-31494
                 URL: https://issues.apache.org/jira/browse/SPARK-31494
             Project: Spark
          Issue Type: Improvement
          Components: ML
    Affects Versions: 3.1.0
            Reporter: zhengruifeng


add a new method
{code:java}
@Since("3.1.0")
def test(
    dataset: DataFrame,
    featuresCol: String,
    labelCol: String,
    flatten: Boolean): DataFrame {code}
 

Similar to new {{test}} method in {{ChiSquareTest}}, it will:

1, support df operation on the returned df;

2, make driver no longer a bottleneck when dim is high



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to