[GitHub] spark pull request: [SPARK-10105] Add most frequent k parameter to...

2015-11-25 Thread tmnd1991
GitHub user tmnd1991 closed the pull request at: https://github.com/apache/spark/pull/8301 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is en

[GitHub] spark pull request: [SPARK-10105] Add most frequent k parameter to...

2015-08-18 Thread AmplabJenkins
GitHub user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8301#issuecomment-132439893 Can one of the admins verify this patch?

[GitHub] spark pull request: [SPARK-10105] Add most frequent k parameter to...

2015-08-18 Thread tmnd1991
GitHub user tmnd1991 opened a pull request: https://github.com/apache/spark/pull/8301 [SPARK-10105] Add most frequent k parameter to Word2Vec When training Word2Vec on a very large dataset, it is hard to choose the right minCount parameter; it would really help having a para
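
The description above is cut off, but the pull request title suggests capping the vocabulary at the k most frequent words instead of tuning minCount. The contrast can be sketched in plain Python as follows. This is a minimal illustration under that assumption, not Spark's implementation; the function names `vocab_by_min_count` and `vocab_by_top_k` and the sample tokens are hypothetical.

```python
from collections import Counter


def vocab_by_min_count(tokens, min_count):
    """Keep every word that occurs at least min_count times.

    The resulting vocabulary size depends on the corpus, so a good
    threshold is hard to guess in advance for a very large dataset.
    """
    counts = Counter(tokens)
    return {word for word, count in counts.items() if count >= min_count}


def vocab_by_top_k(tokens, k):
    """Keep only the k most frequent words (hypothetical 'most frequent k').

    The vocabulary size is bounded by k regardless of corpus size.
    """
    counts = Counter(tokens)
    return {word for word, _ in counts.most_common(k)}


# Toy corpus with distinct frequencies: the=4, cat=3, sat=2, mat=1.
tokens = "the the the the cat cat cat sat sat mat".split()

by_min_count = vocab_by_min_count(tokens, min_count=2)
by_top_k = vocab_by_top_k(tokens, k=2)
```

With a frequency threshold the caller has to know the count distribution to predict how many words survive; with a top-k cap the caller states the vocabulary size directly, which is presumably the convenience the request is after.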