Re: [scikit-learn] [ANN] Scikit-learn 0.20.0

Andreas Mueller Fri, 28 Sep 2018 10:44:20 -0700



On 09/28/2018 01:38 PM, Andreas Mueller wrote:

On 09/28/2018 12:10 PM, Sebastian Raschka wrote:
I think model serialization should be a priority.
There is also the ONNX specification that is gaining industrialadoption and that already includes open source exporters for severalfamilies of scikit-learn models:
https://github.com/onnx/onnxmltools
Didn't know about that. This is really nice! What do you think aboutreferring to it underhttp://scikit-learn.org/stable/modules/model_persistence.html to makepeople aware that this option exists?
Would be happy to add a PR.
I don't think an open source runtime has been announced yet (or theydidn't email me like they promised lol).
I'm quite excited about this as well.

Javier:
The problem is not so much storing the "model" but storing how to makepredictions. Different versions could act differentlyon the same data structure - and the data structure could change. Bothhappen in scikit-learn.So if you want to make sure the right thing happens across versions,you either need to provide serialization and deserialization forevery version and conversion between those or you need to provide away to store the prediction function,which basically means you need a turing-complete language (that's whatONNX does).
We basically said doing the first is not feasible within scikit-learngiven our current amount of resources, and no-onehas even tried doing it outside of scikit-learn (which would bepossible).Implementing a complete prediction serialization language (the secondoption) is definitely outside the scope of sklearn.

Maybe we should add to the FAQ why serialization is hard?
_______________________________________________
scikit-learn mailing list
[email protected]
https://mail.python.org/mailman/listinfo/scikit-learn

Re: [scikit-learn] [ANN] Scikit-learn 0.20.0

Reply via email to