[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15933534#comment-15933534 ] Apache Spark commented on SPARK-19962: -- User 'yupbank' has created a pull request fo

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-16 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15928123#comment-15928123 ] yu peng commented on SPARK-19962: - yeah, exactly.. i would love to use FeatureHasher when

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-16 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15927601#comment-15927601 ] Nick Pentreath commented on SPARK-19962: You may also want to take a look at htt

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926723#comment-15926723 ] yu peng commented on SPARK-19962: - yeah, with one indexer that performing like mutually e

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926680#comment-15926680 ] Sean Owen commented on SPARK-19962: --- You would have n indexers and n models for n diffe

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926559#comment-15926559 ] yu peng commented on SPARK-19962: - >Well, it's maintained by the StringIndexerModel for y

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926543#comment-15926543 ] Sean Owen commented on SPARK-19962: --- Well, it's maintained by the StringIndexerModel fo

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926537#comment-15926537 ] yu peng commented on SPARK-19962: - yeah.. StringIndexer does some job on single column an

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926518#comment-15926518 ] Sean Owen commented on SPARK-19962: --- That sounds like what https://spark.apache.org/do

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926458#comment-15926458 ] yu peng commented on SPARK-19962: - ^ i have updated the description > add DictVectorizor

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926393#comment-15926393 ] Sean Owen commented on SPARK-19962: --- Can you describe what this does, and give an examp