[ https://issues.apache.org/jira/browse/SPARK-15009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15796737#comment-15796737 ]
Bryan Cutler commented on SPARK-15009:
--------------------------------------

Hi [~sueann], I pretty much have this done but have been trying to get SPARK-17161 in first with no luck. That change makes additions like this much cleaner in PySpark, so I'll try again to get some interest.

> PySpark CountVectorizerModel should be able to construct from vocabulary list
> -----------------------------------------------------------------------------
>
>                 Key: SPARK-15009
>                 URL: https://issues.apache.org/jira/browse/SPARK-15009
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML, PySpark
>            Reporter: Bryan Cutler
>            Priority: Minor
>
> Like the Scala version, PySpark CountVectorizerModel should be able to
> construct the model from a given vocabulary list.
> For example:
> {noformat}
> cvm = CountVectorizerModel(["a", "b", "c"])
> {noformat}

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)