Hi Xiangrui, Spark people,
I recently got round to writing an evaluation framework for Spark that I
was hoping to PR into MLlib, and which would solve some of the aforementioned
issues. I have put the code on GitHub in a separate repo for now, as I
would like to get some sandboxed feedback. The
Firstly, apologies for the header of my email containing some junk; I
believe it's due to a copy-and-paste error on a smartphone.
Thanks for your response. I will indeed make the PR you suggest, though
glancing at the code I realize it's not just a case of making these public
since the types are
LabeledPoint was used for both classification and regression, where the label
type is Double for simplicity. So in BinaryClassificationMetrics, we still
use Double for labels. We compute the confusion matrix at each threshold
internally, but this is not exposed to users (
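
To make the per-threshold confusion matrix idea concrete, here is a minimal plain-Python sketch. It is not the MLlib implementation and does not use Spark. The function name `confusion_at_thresholds`, the `ConfusionMatrix` tuple, the "score >= threshold means positive" rule, and the "label > 0.5 means positive" rule are all assumptions made for illustration.

```python
from collections import namedtuple

# Hypothetical container for illustration; not an MLlib type.
ConfusionMatrix = namedtuple("ConfusionMatrix", "tp fp tn fn")


def confusion_at_thresholds(score_and_labels):
    """Return [(threshold, ConfusionMatrix)] for each distinct score, descending.

    Assumptions: labels are floats (1.0 positive, 0.0 negative), and a point
    is predicted positive when its score is >= the threshold.
    """
    pairs = sorted(score_and_labels, key=lambda p: p[0], reverse=True)
    total_pos = sum(1 for _, label in pairs if label > 0.5)
    total_neg = len(pairs) - total_pos

    results = []
    tp = fp = 0
    i = 0
    while i < len(pairs):
        threshold = pairs[i][0]
        # Consume every point that shares this score, so ties are counted together.
        while i < len(pairs) and pairs[i][0] == threshold:
            if pairs[i][1] > 0.5:
                tp += 1
            else:
                fp += 1
            i += 1
        results.append(
            (threshold, ConfusionMatrix(tp, fp, total_neg - fp, total_pos - tp))
        )
    return results


if __name__ == "__main__":
    data = [(0.9, 1.0), (0.8, 0.0), (0.8, 1.0), (0.3, 0.0)]
    for threshold, cm in confusion_at_thresholds(data):
        print(threshold, cm)
```

Sorting once and keeping running counts means each threshold's matrix follows from the previous one, rather than rescanning the data for every threshold.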