[ https://issues.apache.org/jira/browse/SPARK-1752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Xiangrui Meng updated SPARK-1752:
---------------------------------
    Fix Version/s: 1.1.0

> Standardize input/output format for vectors and labeled points
> --------------------------------------------------------------
>
>                 Key: SPARK-1752
>                 URL: https://issues.apache.org/jira/browse/SPARK-1752
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>            Reporter: Xiangrui Meng
>            Assignee: Xiangrui Meng
>             Fix For: 1.1.0
>
> We should standardize the text format used to represent vectors and labeled
> points. The proposed formats are the following:
> 1. dense vector: [v0,v1,..]
> 2. sparse vector: (size,[i0,i1],[v0,v1])
> 3. labeled point: (label,vector)
> where "(...)" indicates a tuple and "[...]" indicates an array. These formats
> are compatible with Python's syntax and can be easily parsed using `eval`.
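
As an illustration of the three proposed formats, here is a minimal Python sketch of how such strings might be parsed. It is not taken from the issue or from Spark: the helper names `parse_vector` and `parse_labeled_point` and the tagged-tuple return shape are hypothetical, and it swaps the `eval` mentioned in the description for `ast.literal_eval`, which accepts only Python literals and so avoids executing arbitrary input.

```python
import ast


def _classify(obj):
    """Turn an already-parsed literal into a tagged vector tuple.

    Hypothetical helper: a list is treated as a dense vector, a 3-tuple
    (size, indices, values) as a sparse vector.
    """
    if isinstance(obj, list):
        return ("dense", [float(v) for v in obj])
    if isinstance(obj, tuple) and len(obj) == 3:
        size, indices, values = obj
        if len(indices) != len(values):
            raise ValueError("indices and values must have the same length")
        return ("sparse", int(size),
                [int(i) for i in indices],
                [float(v) for v in values])
    raise ValueError("not a vector: %r" % (obj,))


def parse_vector(text):
    """Parse '[v0,v1,..]' or '(size,[i0,i1],[v0,v1])'."""
    return _classify(ast.literal_eval(text))


def parse_labeled_point(text):
    """Parse '(label,vector)' into (label, tagged_vector)."""
    obj = ast.literal_eval(text)
    if not (isinstance(obj, tuple) and len(obj) == 2):
        raise ValueError("not a labeled point: %r" % (obj,))
    label, vector = obj
    return float(label), _classify(vector)


dense = parse_vector("[1.0,2.0,3.0]")
sparse = parse_vector("(4,[0,2],[1.5,3.0])")
point = parse_labeled_point("(1.0,(4,[0,2],[1.5,3.0]))")
```

Because a sparse vector is a 3-tuple and a labeled point is a 2-tuple, the two can be told apart by length alone, which is what this sketch relies on.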