[jira] [Commented] (SPARK-14077) Support weighted instances in naive Bayes
[ https://issues.apache.org/jira/browse/SPARK-14077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15650350#comment-15650350 ]

Apache Spark commented on SPARK-14077:

User 'yanboliang' has created a pull request for this issue:
https://github.com/apache/spark/pull/15826

> Support weighted instances in naive Bayes
>
> Key: SPARK-14077
> URL: https://issues.apache.org/jira/browse/SPARK-14077
> Project: Spark
> Issue Type: New Feature
> Components: ML
> Reporter: Xiangrui Meng
> Assignee: zhengruifeng
> Labels: naive-bayes
> Fix For: 2.1.0
>
> In naive Bayes, we expect inputs to be individual observations. In practice,
> people may have the frequency table instead. It is useful for us to support
> instance weights to handle this case.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)

To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-14077) Support weighted instances in naive Bayes
[ https://issues.apache.org/jira/browse/SPARK-14077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15536076#comment-15536076 ]

Apache Spark commented on SPARK-14077:

User 'zhengruifeng' has created a pull request for this issue:
https://github.com/apache/spark/pull/15313
[jira] [Commented] (SPARK-14077) Support weighted instances in naive Bayes
[ https://issues.apache.org/jira/browse/SPARK-14077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15266253#comment-15266253 ]

Apache Spark commented on SPARK-14077:

User 'zhengruifeng' has created a pull request for this issue:
https://github.com/apache/spark/pull/12819
[jira] [Commented] (SPARK-14077) Support weighted instances in naive Bayes
[ https://issues.apache.org/jira/browse/SPARK-14077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15265652#comment-15265652 ]

zhengruifeng commented on SPARK-14077:

OK, I will give it a try.
[jira] [Commented] (SPARK-14077) Support weighted instances in naive Bayes
[ https://issues.apache.org/jira/browse/SPARK-14077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15256247#comment-15256247 ]

Mohamed Baddar commented on SPARK-14077:

I have suspended work on it for the time being.
[jira] [Commented] (SPARK-14077) Support weighted instances in naive Bayes
[ https://issues.apache.org/jira/browse/SPARK-14077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15256228#comment-15256228 ]

zhengruifeng commented on SPARK-14077:

Are you still working on this task?
[jira] [Commented] (SPARK-14077) Support weighted instances in naive Bayes
[ https://issues.apache.org/jira/browse/SPARK-14077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15213481#comment-15213481 ]

Mohamed Baddar commented on SPARK-14077:

[~mengxr] [~josephkb] scikit-learn implements the same feature by scaling the target variable by the sample weights after binarization. Here is the source code link:
https://github.com/scikit-learn/scikit-learn/blob/51a765a/sklearn/naive_bayes.py#L507

I think we can follow the scikit-learn implementation as a guideline, which would also help with the unit tests. Any thoughts?
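For readers following the thread, the idea in the comment above can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the scikit-learn or Spark code: the function name `fit_weighted_multinomial_nb`, the toy data, and the choice of an unsmoothed class prior are all assumptions made for the example. Scaling the one-hot (binarized) label of each row by its weight means that a row with weight w contributes w times to the class total and to the per-class feature totals, which is what a frequency table needs.

```python
import math

def fit_weighted_multinomial_nb(X, y, weights=None, smoothing=1.0):
    """Hypothetical helper: multinomial naive Bayes with per-row weights.

    Returns (log_prior, log_likelihood), both keyed by class label.
    """
    if weights is None:
        weights = [1.0] * len(X)  # unweighted case: every row counts once
    classes = sorted(set(y))
    n_features = len(X[0])
    class_weight = {c: 0.0 for c in classes}
    feature_weight = {c: [0.0] * n_features for c in classes}
    for row, label, w in zip(X, y, weights):
        # Equivalent to multiplying the row's one-hot label by w:
        # only the row's own class receives the weighted contribution.
        class_weight[label] += w
        for j, value in enumerate(row):
            feature_weight[label][j] += w * value
    total = sum(class_weight.values())
    # Assumption: the class prior is not smoothed in this sketch.
    log_prior = {c: math.log(class_weight[c] / total) for c in classes}
    log_likelihood = {}
    for c in classes:
        denom = sum(feature_weight[c]) + smoothing * n_features
        log_likelihood[c] = [
            math.log((fw + smoothing) / denom) for fw in feature_weight[c]
        ]
    return log_prior, log_likelihood

# Toy frequency table: each distinct row appears counts[i] times.
table_X = [[2, 0], [0, 3], [1, 1]]
table_y = [0, 1, 0]
counts = [3, 2, 1]
weighted = fit_weighted_multinomial_nb(table_X, table_y, counts)

# The same data expanded into individual observations.
expanded_X = [row for row, n in zip(table_X, counts) for _ in range(n)]
expanded_y = [label for label, n in zip(table_y, counts) for _ in range(n)]
expanded = fit_weighted_multinomial_nb(expanded_X, expanded_y)
```

With integer weights, fitting on the frequency table should give the same parameters as fitting on the expanded rows, which suggests one possible shape for a unit test of the feature.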
[jira] [Commented] (SPARK-14077) Support weighted instances in naive Bayes
[ https://issues.apache.org/jira/browse/SPARK-14077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15210069#comment-15210069 ]

Mohamed Baddar commented on SPARK-14077:

[~mengxr] If nobody is working on this task, can I work on it?