Hello,

This is my first post to this list, so I hope I won't violate any (un)written rules.

I recently started working with SparkNLP for a larger project. SparkNLP in turn is based on Apache Spark's MLlib. One thing I found missing is the ability to store custom parameters in a Spark pipeline. It seems only the parameters predefined on each class are allowed (e.g. "stages" for the Pipeline class).
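
To illustrate what I mean, here is a minimal PySpark sketch (the Tokenizer stage is just an arbitrary example):

    from pyspark.ml import Pipeline
    from pyspark.ml.feature import Tokenizer

    pipeline = Pipeline(stages=[Tokenizer(inputCol="text", outputCol="words")])

    # Only the parameters declared on the class are exposed:
    print([p.name for p in pipeline.params])  # ['stages']

    # There is no public way to attach an arbitrary key/value;
    # asking for an undeclared param just raises an error:
    # pipeline.getParam("modelVersion")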

IMHO, it would be handy to be able to attach custom parameters, e.g. model versions or other metadata, so that they are persisted together with a trained pipeline. This could also be used to store evaluation results, such as accuracy, alongside trained ML models.
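
The closest workaround I've found is to declare the metadata as a regular Param on a no-op custom stage, so that the pipeline's own persistence picks it up. A rough sketch (MetadataHolder and modelVersion are names I made up for illustration, not an official API):

    from pyspark import keyword_only
    from pyspark.ml import Transformer
    from pyspark.ml.param import Param, Params
    from pyspark.ml.util import DefaultParamsReadable, DefaultParamsWritable

    class MetadataHolder(Transformer, DefaultParamsReadable, DefaultParamsWritable):
        """No-op stage whose only job is to carry metadata as a declared
        Param, so it survives Pipeline.write()/load()."""

        modelVersion = Param(Params._dummy(), "modelVersion",
                             "version tag for the trained model")

        @keyword_only
        def __init__(self, modelVersion=None):
            super(MetadataHolder, self).__init__()
            kwargs = self._input_kwargs
            self._set(**kwargs)

        def getModelVersion(self):
            return self.getOrDefault(self.modelVersion)

        def _transform(self, dataset):
            return dataset  # pass-through, transforms nothing

One can then add MetadataHolder(modelVersion="1.2.0") as an extra pipeline stage, and the value comes back after save/load. But that's clearly a hack compared to first-class support for custom parameters.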

(I also asked this on Stackoverflow, but didn't get a response yet: https://stackoverflow.com/questions/69627820/setting-custom-parameters-for-a-spark-mllib-pipeline)

What does the community think about this proposal? Has it perhaps been discussed before? Any thoughts?

Cheers,

Martin
