Trevor Grant created MAHOUT-1937:
Summary: Model should be able to import/export to PMML
Key: MAHOUT-1937
URL: https://issues.apache.org/jira/browse/MAHOUT-1937
Project: Mahout
Issue Type
posted. FYI, as of Mahout v0.10 we also
have a Spark backed implementation of Naive Bayes in our new engine neutral
environment.
Support for PMML
Key: MAHOUT-1041
URL: https://issues.apache.org/jira/browse/MAHOUT-1041
Project
Yay Mahout!
Support for PMML
Key: MAHOUT-1041
URL: https://issues.apache.org/jira/browse/MAHOUT-1041
Project: Mahout
Issue Type: Improvement
Components: Integration
Environment: Software
on my team for DARPA memex in doing this.
We'll take a look at jpmml and report back.
Support for PMML
Key: MAHOUT-1041
URL: https://issues.apache.org/jira/browse/MAHOUT-1041
Project: Mahout
Issue Type: Improvement
into Nutch in our Naive Bayes ParseFilter here:
https://github.com/apache/nutch/blob/trunk/src/plugin/parsefilter-naivebayes/src/java/org/apache/nutch/parsefilter/naivebayes/NaiveBayesParseFilter.java
Yay Mahout!
Support for PMML
Key: MAHOUT-1041
PMML doesn’t make a lot of sense when the model is a potentially massive
matrix. One reason is that it will be pretty hard (impossible?) to parallelize
read/write with the engines we use. JSON has the same problem and the only way
SchemaRDD can read JSON is by bending the rules.
Seems like
Yes, it makes sense having one for Naive Bayes and KMeans (when we have
that !!).
On Thu, Mar 5, 2015 at 11:49 AM, Pat Ferrel p...@occamsmachete.com wrote:
PMML doesn’t make a lot of sense when the model is a potentially massive
matrix. One reason is that it will be pretty hard (impossible
PMML is a machine-to-machine mechanism, not intended really for human
consumption or production. Based on XML, it is, of course, bloated, but
that doesn't really matter for readability since reading isn't the goal.
The vision of making models easy to transfer from system to system is nice
I am willing to +1 any contribution at this point.
my previous company used pmml to serialize simple stuff, but i don't
have first hand experience. Its flexibility is ultimately pretty
limited, isn't it? And xml is ultimately a media which is too ugly and
too verbose at the same time to represent
Yes, the limitations are often an issue for people doing things that aren't
in the PMML spec yet; there could be room for suggesting new features in
the spec by building them though, I suppose.
Also agree that XML is a lousy/bloated way of representing stuff like this,
but in the end it's just
Sure, those would be options.
On Wed, Mar 4, 2015 at 3:41 PM, Saikat Kanjilal sxk1...@hotmail.com wrote:
Question, is there a way to introduce PMML with using a more lightweight
format like yaml or json?
Date: Wed, 4 Mar 2015 13:25:29 -0800
Subject: Re: PMML
From: andrew.mussel
Next question: Is the audience for PMML programmers or could it be folks that
can script? I'm wondering how this intersects with a simple spark like DSL ,
could Mahout implement an intersection between the two? If there's interest I
can go into examples.
Sent from my iPhone
On Mar 4, 2015
Question, is there a way to introduce PMML with using a more lightweight format
like yaml or json?
Date: Wed, 4 Mar 2015 13:25:29 -0800
Subject: Re: PMML
From: andrew.mussel...@gmail.com
To: dev@mahout.apache.org
Yes, the limitations are often an issue for people doing things that aren't
I think keeping it simple is best, try implementing one or two models in
XML and then get fancy if it makes sense.
On Wednesday, March 4, 2015, Saikat Kanjilal sxk1...@hotmail.com wrote:
Next question: Is the audience for PMML programmers or could it be folks
that can script? I'm wondering
How much interest is there in a mahout-pmml module, with a starting point
to be able to export a few analytic/scoring jobs to PMML representation?
I've seen a lot of interest at in being able to use PMML to translate
analytic work into production(though I think people talk about it more than
It makes sense to support PMML for classification and clustering tasks to
be able to share and distribute trained models. Sean, Pat, Dmitriy and Ted
please chime in.
PMML support in Mahout was talked about for a long time now but never
really got any traction to take off.
+1 to build
:
---
In the meantime one could use something like cascading:pattern
(http://www.cascading.org/pattern/) or jpmml-cascading
(https://github.com/jpmml/jpmml-cascading) to execute PMML models in Hadoop.
was (Author: thomasd):
In the meantime one could use something like cascading:pattern
:
---
As this wasn't mentioned yet: one could use something like cascading:pattern
(http://www.cascading.org/pattern/) or jpmml-cascading
(https://github.com/jpmml/jpmml-cascading) to execute PMML models in Hadoop.
was (Author: thomasd):
In the meantime one could use something like
Hi Pranay,
as Ted already said there were already multiple times the request for PMML
support.
I would recommend that you read all the JIRA issues about PMML they contain a
lot of information what have happened so far:
https://issues.apache.org/jira/browse/MAHOUT-1041 Support for PMML
https
The ability to export PMML for streaming k-means, Naive Bayes and the
logistic regression classifiers would be useful.
Nobody has worked on this much yet, but demand, on the other hand, is
pretty sporadic.
On Mon, Sep 2, 2013 at 6:06 AM, Pranay Tonpay pranay_ton...@yahoo.comwrote:
Hi
the benefit of
storing largish models in XML. I could see a specific issue that can do I/O of
PMML into Mahout's, but I don't see any thing running natively off of PMML.
Support for PMML
Key: MAHOUT-1041
URL: https
for PMML
Key: MAHOUT-1041
URL: https://issues.apache.org/jira/browse/MAHOUT-1041
Project: Mahout
Issue Type: Improvement
Components: Integration
Affects Versions: Backlog
Environment: Software Platform
[
https://issues.apache.org/jira/browse/MAHOUT-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Schelter updated MAHOUT-1041:
---
Fix Version/s: Backlog
Support for PMML
[
https://issues.apache.org/jira/browse/MAHOUT-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Schelter updated MAHOUT-1041:
---
Affects Version/s: (was: Backlog)
Support for PMML
this module independently so that you don't have to wait for
others to commit partial results.
On Wed, Dec 26, 2012 at 6:52 PM, Marty Kube
martyk...@beavercreekconsulting.com wrote:
I took a look at JPMML... At the bottom of it they have ran a JAXB
compiler on the PMML V4 schema to generate Java
I took a look at JPMML... At the bottom of it they have ran a JAXB
compiler on the PMML V4 schema to generate Java bindings. I didn't see
a lot of value add in JPMML beyond that.
I'd say just add the schema and bindings generation to Mahout. The
value add here is model mapping from
PM, Marty Kube
martyk...@beavercreekconsulting.com wrote:
I took a look at JPMML... At the bottom of it they have ran a JAXB
compiler on the PMML V4 schema to generate Java bindings. I didn't see a
lot of value add in JPMML beyond that.
I'd say just add the schema and bindings generation
From looking at PMML (http://www.dmg.org/v4-1/GeneralStructure.html), it seems
that JPMML is not going to really get us there if it only supports the 4 models
listed below. I would think we could go through the structures supported in
the link above and then map it to the Algorithms
Hi All,
as Grant suggested, I forward the email about mahout-pmml.
I already tried jpmml standalone and works fine for me, the next important
point is to understand or maybe create some example for each model
described before:
- NeuralNetwork
- RandomForest (implemented via Segmentation
when there wasn't much support
in terms of code contributions.
What sort of contributions can you provide? Problem specification? PMML
parser? Model code?
Support for PMML
Key: MAHOUT-1041
URL: https://issues.apache.org
[
https://issues.apache.org/jira/browse/MAHOUT-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13407972#comment-13407972
]
Duraimurugan commented on MAHOUT-1041:
--
Sure, I can contribute on PMML parser
:
* https://cwiki.apache.org/MAHOUT/how-to-contribute.html
Support for PMML
Key: MAHOUT-1041
URL: https://issues.apache.org/jira/browse/MAHOUT-1041
Project: Mahout
Issue Type: Improvement
Components
Duraimurugan created MAHOUT-1041:
Summary: Support for PMML
Key: MAHOUT-1041
URL: https://issues.apache.org/jira/browse/MAHOUT-1041
Project: Mahout
Issue Type: Improvement
33 matches
Mail list logo