[ https://issues.apache.org/jira/browse/SPARK-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14330458#comment-14330458 ]
Feynman Liang edited comment on SPARK-3147 at 2/21/15 10:32 PM:
----------------------------------------------------------------

Hi [~mengxr], I've made a PR for this at https://github.com/apache/spark/pull/4716. Could you assign this issue to me? Thanks.

was (Author: fliang):
Hi [~mengxr], Could you assign this issue to me? Thanks.

> Implement A/B testing
> ---------------------
>
>                 Key: SPARK-3147
>                 URL: https://issues.apache.org/jira/browse/SPARK-3147
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib, Streaming
>            Reporter: Xiangrui Meng
>
> A/B testing is widely used to compare online models. We can implement A/B
> testing in MLlib and integrate it with Spark Streaming. For example, we have
> a PairDStream[String, Double], whose keys are model ids and values are
> observations (click or not, or revenue associated with the event). With A/B
> testing, we can tell whether one model is significantly better than another
> at a certain time. There are some caveats. For example, we should avoid
> multiple testing and support A/A testing as a sanity check.
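
For illustration only, the comparison described in the issue can be sketched in plain Python over one batch of (model id, observation) pairs. This is not Spark or MLlib API and not the approach taken in the linked PR. The function names `group_by_model` and `welch_z_test` are hypothetical, and the p-value uses a large-sample normal approximation to Welch's statistic rather than an exact t distribution. It assumes each model has at least two observations.

```python
import math
from collections import defaultdict
from statistics import mean, variance


def group_by_model(pairs):
    """Collect observations per model id from (model_id, value) pairs."""
    groups = defaultdict(list)
    for model_id, value in pairs:
        groups[model_id].append(value)
    return dict(groups)


def welch_z_test(a, b):
    """Return (statistic, two-sided p-value) for the difference in means.

    Uses unequal-variance (Welch) standard error and a normal
    approximation for the p-value, which is an assumption that is
    only reasonable for large samples.
    """
    diff = mean(a) - mean(b)
    se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    if se == 0.0:
        # Both samples are constant: either identical or trivially different.
        return (0.0, 1.0) if diff == 0.0 else (math.copysign(math.inf, diff), 0.0)
    stat = diff / se
    # Two-sided tail probability under N(0, 1): 2 * (1 - Phi(|stat|)).
    p_value = math.erfc(abs(stat) / math.sqrt(2))
    return stat, p_value


if __name__ == "__main__":
    # Hypothetical click data: 1.0 = click, 0.0 = no click.
    pairs = [("A", 1.0)] * 50 + [("A", 0.0)] * 50
    pairs += [("B", 1.0)] * 10 + [("B", 0.0)] * 90
    groups = group_by_model(pairs)

    stat, p = welch_z_test(groups["A"], groups["B"])
    print("A vs B:", stat, p)

    # A/A sanity check: a model compared with itself should show no effect.
    stat, p = welch_z_test(groups["A"], groups["A"])
    print("A vs A:", stat, p)
```

Running the test repeatedly as new batches arrive and stopping at the first small p-value is a form of the multiple testing the issue warns about, so a real implementation would need a correction or a fixed evaluation point.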