Vincenzo sent a PR and included k-means as an example. Sean is helping
review it. PMML standard is quite large. So we may start with simple
model export, like linear methods, then move forward to tree-based.
-Xiangrui

On Mon, Nov 10, 2014 at 11:27 AM, Aris <arisofala...@gmail.com> wrote:
> Hello Spark and MLLib folks,
>
> So a common problem in the real world of using machine learning is that some
> data analysis use tools like R, but the more "data engineers" out there will
> use more advanced systems like Spark MLLib or even Python Scikit Learn.
>
> In the real world, I want to have "a system" where multiple different
> modeling environments can learn from data / build models, represent the
> models in a common language, and then have a layer which just takes the
> model and run model.predict() all day long -- scores the models in other
> words.
>
> It looks like the project openscoring.io and jpmml-evaluator are some
> amazing systems for this, but they fundamentally use PMML as the model
> representation here.
>
> I have read some JIRA tickets that Xiangrui Meng is interested in getting
> PMML implemented to export MLLib models, is that happening? Further, would
> something like Manish Amde's boosted ensemble tree methods be representable
> in PMML?
>
> Thank you!!
> Aris

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to