[ https://issues.apache.org/jira/browse/SPARK-7492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Apache Spark reassigned SPARK-7492: ----------------------------------- Assignee: (was: Apache Spark) > Convert LocalDataFrame to LocalMatrix > ------------------------------------- > > Key: SPARK-7492 > URL: https://issues.apache.org/jira/browse/SPARK-7492 > Project: Spark > Issue Type: New Feature > Components: MLlib, SQL > Reporter: Burak Yavuz > > Having a method like, > {code:java} > Matrices.fromDataFrame(df) > {code} > would provide users the ability to perform feature selection with DataFrames. > Users will be able to chain operations like below: > {code:java} > import org.apache.spark.mllib.linalg.Matrices > import org.apache.spark.mllib.stat.Statistics > import org.apache.spark.sql.DataFrame > val df = ... // the DataFrame > val contingencyTable = df.stat.crosstab(col1, col2) > val ct = Matrices.fromDataFrame(contingencyTable) > val result: ChiSqTestResult = Statistics.chiSqTest(ct) > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org