[jira] [Commented] (SPARK-8546) PMML export for Naive Bayes

2016-06-13 Thread Villu Ruusmann (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15327205#comment-15327205
 ] 

Villu Ruusmann commented on SPARK-8546:
---

Hi [~rgasiorek] - would it be an option to re-build your models in Spark ML 
instead of MLlib? I have been working on Spark ML pipelines-to-PMML converter 
called JPMML-SparkML (https://github.com/jpmml/jpmml-sparkml), which could 
fully address your use case then. JPMML-SparkML supports all tree-based models 
and the majority of non-NLP domain transformations. It would be possible to add 
support for the `classification.NaiveBayesModel` model type in a day or two if 
needed.

> PMML export for Naive Bayes
> ---
>
> Key: SPARK-8546
> URL: https://issues.apache.org/jira/browse/SPARK-8546
> Project: Spark
>  Issue Type: New Feature
>  Components: MLlib
>Reporter: Joseph K. Bradley
>Assignee: Xusen Yin
>Priority: Minor
>
> The naive Bayes section of PMML standard can be found at 
> http://www.dmg.org/v4-1/NaiveBayes.html. We should first figure out how to 
> generate PMML for both binomial and multinomial naive Bayes models using 
> JPMML (maybe [~vfed] can help).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8546) PMML export for Naive Bayes

2016-06-13 Thread Radoslaw Gasiorek (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15327167#comment-15327167
 ] 

Radoslaw Gasiorek commented on SPARK-8546:
--

hi there, [~josephkb]
We would like to use Mllib built models to classify outside spark therefore 
without Spark context available. We would like to export the models built in 
spark into PMML format, that then would be read by a stand alone java 
application without spark context (but with Mllib jar). 
The java application would load the model from the PMML file and would use the 
model to 'predict'  or rather 'classify' the new data we get. 
This feature would enable us to proceed without big architectural and 
operational changes, without this feature we might need get the the 
sparkContext available to the standalone application that would be bigger 
operational and architectural overhead.

We might need to use the plain java serialization for the proof of concept 
anyways, but surely not for produtionized product.

Can we prioritize this feature as well as 
https://issues.apache.org/jira/browse/SPARK-8542 and 
https://issues.apache.org/jira/browse/SPARK-8543 ?
What would be LOE and EAT for these?
thanks guys in advance for responses, and feedback.

> PMML export for Naive Bayes
> ---
>
> Key: SPARK-8546
> URL: https://issues.apache.org/jira/browse/SPARK-8546
> Project: Spark
>  Issue Type: New Feature
>  Components: MLlib
>Reporter: Joseph K. Bradley
>Assignee: Xusen Yin
>Priority: Minor
>
> The naive Bayes section of PMML standard can be found at 
> http://www.dmg.org/v4-1/NaiveBayes.html. We should first figure out how to 
> generate PMML for both binomial and multinomial naive Bayes models using 
> JPMML (maybe [~vfed] can help).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8546) PMML export for Naive Bayes

2016-04-26 Thread Joseph K. Bradley (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15258889#comment-15258889
 ] 

Joseph K. Bradley commented on SPARK-8546:
--

I'm removing the target version.  It will be good to add but will slip for 2.0

> PMML export for Naive Bayes
> ---
>
> Key: SPARK-8546
> URL: https://issues.apache.org/jira/browse/SPARK-8546
> Project: Spark
>  Issue Type: New Feature
>  Components: MLlib
>Reporter: Joseph K. Bradley
>Assignee: Xusen Yin
>Priority: Minor
>
> The naive Bayes section of PMML standard can be found at 
> http://www.dmg.org/v4-1/NaiveBayes.html. We should first figure out how to 
> generate PMML for both binomial and multinomial naive Bayes models using 
> JPMML (maybe [~vfed] can help).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8546) PMML export for Naive Bayes

2015-10-09 Thread Xiangrui Meng (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14951018#comment-14951018
 ] 

Xiangrui Meng commented on SPARK-8546:
--

Assigned. Thanks!

> PMML export for Naive Bayes
> ---
>
> Key: SPARK-8546
> URL: https://issues.apache.org/jira/browse/SPARK-8546
> Project: Spark
>  Issue Type: New Feature
>  Components: MLlib
>Reporter: Joseph K. Bradley
>Assignee: Xusen Yin
>Priority: Minor
>
> The naive Bayes section of PMML standard can be found at 
> http://www.dmg.org/v4-1/NaiveBayes.html. We should first figure out how to 
> generate PMML for both binomial and multinomial naive Bayes models using 
> JPMML (maybe [~vfed] can help).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8546) PMML export for Naive Bayes

2015-10-08 Thread Xusen Yin (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14949361#comment-14949361
 ] 

Xusen Yin commented on SPARK-8546:
--

Hi [~mengxr], I'd like to work on it.

> PMML export for Naive Bayes
> ---
>
> Key: SPARK-8546
> URL: https://issues.apache.org/jira/browse/SPARK-8546
> Project: Spark
>  Issue Type: New Feature
>  Components: MLlib
>Reporter: Joseph K. Bradley
>Priority: Minor
>
> The naive Bayes section of PMML standard can be found at 
> http://www.dmg.org/v4-1/NaiveBayes.html. We should first figure out how to 
> generate PMML for both binomial and multinomial naive Bayes models using 
> JPMML (maybe [~vfed] can help).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8546) PMML export for Naive Bayes

2015-06-25 Thread Villu Ruusmann (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14602023#comment-14602023
 ] 

Villu Ruusmann commented on SPARK-8546:
---

The NB exporter should be easier to implement than Logistic regression and SVM 
exporters, because the PMML representation of NB models does not distinguish 
between binary and multi-class classification cases.

I'm currently developing PMML exporters for the Python ML module called 
Scikit-Learn. Unfortunately, I haven't made to Scikit-Learn classes GaussianNB 
and MultinomialNB yet. I shall be much wiser once I get them done.

 PMML export for Naive Bayes
 ---

 Key: SPARK-8546
 URL: https://issues.apache.org/jira/browse/SPARK-8546
 Project: Spark
  Issue Type: New Feature
  Components: MLlib
Reporter: Joseph K. Bradley
Priority: Minor

 The naive Bayes section of PMML standard can be found at 
 http://www.dmg.org/v4-1/NaiveBayes.html. We should first figure out how to 
 generate PMML for both binomial and multinomial naive Bayes models using 
 JPMML (maybe [~vfed] can help).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org