[jira] [Commented] (SPARK-6164) CrossValidatorModel should keep stats from fitting

2015-06-02 Thread Leah McGuire (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-6164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14569714#comment-14569714
 ] 

Leah McGuire commented on SPARK-6164:
-

I fixed the merge conflict. Should be good to go now.

 CrossValidatorModel should keep stats from fitting
 --

 Key: SPARK-6164
 URL: https://issues.apache.org/jira/browse/SPARK-6164
 Project: Spark
  Issue Type: Improvement
  Components: ML
Affects Versions: 1.4.0
Reporter: Joseph K. Bradley
Priority: Minor

 CrossValidator computes stats for each (model, fold) pair, but they are 
 thrown out by the created model.  CrossValidatorModel should keep this info 
 and expose it to users.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-7545) Bernoulli NaiveBayes should validate data

2015-05-11 Thread Leah McGuire (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-7545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14539078#comment-14539078
 ] 

Leah McGuire commented on SPARK-7545:
-

Yes, I think I can get it in. 

 Bernoulli NaiveBayes should validate data
 -

 Key: SPARK-7545
 URL: https://issues.apache.org/jira/browse/SPARK-7545
 Project: Spark
  Issue Type: Improvement
  Components: MLlib
Affects Versions: 1.4.0
Reporter: Joseph K. Bradley
Assignee: Leah McGuire
Priority: Minor

 Bernoulli NaiveBayes expects input features to take values 0 or 1, but it 
 does not actually check that.  It should check and throw an exception if it 
 finds invalid values.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-7545) Bernoulli NaiveBayes should validate data

2015-05-11 Thread Leah McGuire (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-7545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14539077#comment-14539077
 ] 

Leah McGuire commented on SPARK-7545:
-

I think I can get it in :-)

On Mon, May 11, 2015 at 4:45 PM, Joseph K. Bradley (JIRA) j...@apache.org



 Bernoulli NaiveBayes should validate data
 -

 Key: SPARK-7545
 URL: https://issues.apache.org/jira/browse/SPARK-7545
 Project: Spark
  Issue Type: Improvement
  Components: MLlib
Affects Versions: 1.4.0
Reporter: Joseph K. Bradley
Assignee: Leah McGuire
Priority: Minor

 Bernoulli NaiveBayes expects input features to take values 0 or 1, but it 
 does not actually check that.  It should check and throw an exception if it 
 finds invalid values.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Issue Comment Deleted] (SPARK-7545) Bernoulli NaiveBayes should validate data

2015-05-11 Thread Leah McGuire (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-7545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Leah McGuire updated SPARK-7545:

Comment: was deleted

(was: Yes, I think I can get it in. )

 Bernoulli NaiveBayes should validate data
 -

 Key: SPARK-7545
 URL: https://issues.apache.org/jira/browse/SPARK-7545
 Project: Spark
  Issue Type: Improvement
  Components: MLlib
Affects Versions: 1.4.0
Reporter: Joseph K. Bradley
Assignee: Leah McGuire
Priority: Minor

 Bernoulli NaiveBayes expects input features to take values 0 or 1, but it 
 does not actually check that.  It should check and throw an exception if it 
 finds invalid values.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-6164) CrossValidatorModel should keep stats from fitting

2015-05-05 Thread Leah McGuire (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-6164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14529110#comment-14529110
 ] 

Leah McGuire commented on SPARK-6164:
-

Closed this - accidentally pulled in pieces of another request

 CrossValidatorModel should keep stats from fitting
 --

 Key: SPARK-6164
 URL: https://issues.apache.org/jira/browse/SPARK-6164
 Project: Spark
  Issue Type: Improvement
  Components: ML
Affects Versions: 1.4.0
Reporter: Joseph K. Bradley
Priority: Minor

 CrossValidator computes stats for each (model, fold) pair, but they are 
 thrown out by the created model.  CrossValidatorModel should keep this info 
 and expose it to users.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-5328) Update PySpark MLlib NaiveBayes API to take model type parameter for Bernoulli fit

2015-01-19 Thread Leah McGuire (JIRA)
Leah McGuire created SPARK-5328:
---

 Summary: Update PySpark MLlib NaiveBayes API to take model type 
parameter for Bernoulli fit
 Key: SPARK-5328
 URL: https://issues.apache.org/jira/browse/SPARK-5328
 Project: Spark
  Issue Type: Improvement
  Components: MLlib
Reporter: Leah McGuire
Priority: Minor


[SPARK-4894] Add Bernoulli-variant of Naive Bayes adds Bernoulli fitting to 
NaiveBayes.scala need to update python API to accept model type parameter.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-5328) Update PySpark MLlib NaiveBayes API to take model type parameter for Bernoulli fit

2015-01-19 Thread Leah McGuire (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-5328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Leah McGuire updated SPARK-5328:

Description: [SPARK-4894] Adds Bernoulli-variant of Naive Bayes adds 
Bernoulli fitting to NaiveBayes.scala need to update python API to accept model 
type parameter.  (was: [SPARK-4894] Add Bernoulli-variant of Naive Bayes adds 
Bernoulli fitting to NaiveBayes.scala need to update python API to accept model 
type parameter.)

 Update PySpark MLlib NaiveBayes API to take model type parameter for 
 Bernoulli fit
 --

 Key: SPARK-5328
 URL: https://issues.apache.org/jira/browse/SPARK-5328
 Project: Spark
  Issue Type: Improvement
  Components: MLlib
Reporter: Leah McGuire
Priority: Minor
  Labels: mllib

 [SPARK-4894] Adds Bernoulli-variant of Naive Bayes adds Bernoulli fitting to 
 NaiveBayes.scala need to update python API to accept model type parameter.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-17 Thread Leah McGuire (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14281444#comment-14281444
 ] 

Leah McGuire commented on SPARK-4894:
-

Hi [~rnowling],

I submitted a pull request to add just the Bernoulli NB to the existing code. 
It follows the outline you suggested above with the exemption of the fact that 
I used an enumeration for the model type rather than a simple string. If you 
would have time to review it I would appreciate the feedback!



 Add Bernoulli-variant of Naive Bayes
 

 Key: SPARK-4894
 URL: https://issues.apache.org/jira/browse/SPARK-4894
 Project: Spark
  Issue Type: New Feature
  Components: MLlib
Affects Versions: 1.2.0
Reporter: RJ Nowling
Assignee: RJ Nowling

 MLlib only supports the multinomial-variant of Naive Bayes.  The Bernoulli 
 version of Naive Bayes is more useful for situations where the features are 
 binary values.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread Leah McGuire (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277590#comment-14277590
 ] 

Leah McGuire commented on SPARK-4894:
-

Thanks [~rnowling]!

I can take a look at it this week and at least confirm that the docs are 
incorrect and outline the changes that need to occur. If I have time I will try 
to send you a patch this week, if not I'm happy to help with review and testing 
for your patch.

 Add Bernoulli-variant of Naive Bayes
 

 Key: SPARK-4894
 URL: https://issues.apache.org/jira/browse/SPARK-4894
 Project: Spark
  Issue Type: New Feature
  Components: MLlib
Affects Versions: 1.2.0
Reporter: RJ Nowling
Assignee: RJ Nowling

 MLlib only supports the multinomial-variant of Naive Bayes.  The Bernoulli 
 version of Naive Bayes is more useful for situations where the features are 
 binary values.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-13 Thread Leah McGuire (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276161#comment-14276161
 ] 

Leah McGuire commented on SPARK-4894:
-

Are you guys working on this? I would like to contribute as well.

 Add Bernoulli-variant of Naive Bayes
 

 Key: SPARK-4894
 URL: https://issues.apache.org/jira/browse/SPARK-4894
 Project: Spark
  Issue Type: New Feature
  Components: MLlib
Affects Versions: 1.1.1
Reporter: RJ Nowling

 MLlib only supports the multinomial-variant of Naive Bayes.  The Bernoulli 
 version of Naive Bayes is more useful for situations where the features are 
 binary values.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org