[scikit-learn] Example of a scikit-learn compatible classifier with C++ implementation of the algorithms

drh Wed, 15 May 2019 13:13:47 -0700

I use a PYTHON BASED ECOSYSTEM (SCIKIT-LEARN, … ) FOR PROTOTYPING andI have a C++ BASED PRODUCTION SYSTEM. A scikit-learn compatibleinterface allows me to take advantage of scikit-learn’s ecosystem.Implementing the algorithm in C++ allows me to develop and test myalgorithms already during prototyping.

I started with scikit-learn’s project template to roll my own decisiontree and forest classifier and implemented the algorithms in a C++library, using Cython to create the Python bindings.

Starting out with a Python implementation, I experimented a little bitwith implementing the algorithms in Cython. But I found that if youare proficient in Python and C++ coding, that implementing thealgorithm directly in C++ was much faster than writing it in Cython.

I made this project available to everybody, because I think it couldserve as an example or template for anybody who would like to rolltheir own scikit-learn compatible classifier with a C++ basedimplementation of the algorithms to be re-used in a production system.At least version 1.0.0 should be useful, after that it might becometoo complex to be used as an example.


Check it out:

READTHEDOCs: https://koho.readthedocs.io

 GITHUB: https://github.com/AIWerkstatt/koho

I tried to be consistent with scikit-learn’s decision tree andensemble modules, and the basic concepts, including stack, samples LUTwith in-place partitioning, incremental histogram updates, for theimplementation of the classifiers are based on: G. Louppe,Understanding Random Forests, PhD Thesis, 2014. Thanks a lot Gillesfor that comprehensive work on random forests!

_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

[scikit-learn] Example of a scikit-learn compatible classifier with C++ implementation of the algorithms

Reply via email to