Re: [scikit-learn] [ANN] Scikit-learn 0.20.0

Andreas Mueller Fri, 28 Sep 2018 10:41:10 -0700



On 09/28/2018 12:10 PM, Sebastian Raschka wrote:

I think model serialization should be a priority.

There is also the ONNX specification that is gaining industrial adoption and 
that already includes open source exporters for several families of 
scikit-learn models:

https://github.com/onnx/onnxmltools


Didn't know about that. This is really nice! What do you think about referring 
to it under http://scikit-learn.org/stable/modules/model_persistence.html to 
make people aware that this option exists?
Would be happy to add a PR.

I don't think an open source runtime has been announced yet (or theydidn't email me like they promised lol).

I'm quite excited about this as well.

Javier:

The problem is not so much storing the "model" but storing how to makepredictions. Different versions could act differentlyon the same data structure - and the data structure could change. Bothhappen in scikit-learn.So if you want to make sure the right thing happens across versions, youeither need to provide serialization and deserialization forevery version and conversion between those or you need to provide a wayto store the prediction function,which basically means you need a turing-complete language (that's whatONNX does).

We basically said doing the first is not feasible within scikit-learngiven our current amount of resources, and no-one

has even tried doing it outside of scikit-learn (which would be possible).

Implementing a complete prediction serialization language (the secondoption) is definitely outside the scope of sklearn.



_______________________________________________
scikit-learn mailing list
[email protected]
https://mail.python.org/mailman/listinfo/scikit-learn

Re: [scikit-learn] [ANN] Scikit-learn 0.20.0

Reply via email to