[ https://issues.apache.org/jira/browse/SPARK-7352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14531572#comment-14531572 ]
Xiangrui Meng commented on SPARK-7352: -------------------------------------- +1 on keeping `minDocFreq`, which is well-defined, e.g., in the IR book: http://nlp.stanford.edu/IR-book/html/htmledition/inverse-document-frequency-1.html > ml.feature.IDF should rename minDocFreq to minDocCount > ------------------------------------------------------ > > Key: SPARK-7352 > URL: https://issues.apache.org/jira/browse/SPARK-7352 > Project: Spark > Issue Type: Improvement > Components: ML > Reporter: Joseph K. Bradley > Priority: Trivial > > The parameter refers to a count of documents, not a frequency. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org