Egor Pakhomov created SPARK-3878:
------------------------------------

             Summary: Benchmarks and common tests for mllib algorithm
                 Key: SPARK-3878
                 URL: https://issues.apache.org/jira/browse/SPARK-3878
             Project: Spark
          Issue Type: New Feature
          Components: MLlib
            Reporter: Egor Pakhomov


MLlib has no common practice for testing algorithms: every model 
generates its own random test data. There are no easily extractable test cases 
applicable to another algorithm. There are no benchmarks for comparing algorithms. 
After implementing a new algorithm, it is very hard to understand how it should be 
tested. 
Lack of serialization testing: MLlib algorithms don't contain tests that check 
that a model still works after serialization. 
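
As a rough illustration of the kind of shared check meant here, the sketch below generates deterministic test data from a fixed seed and verifies that a model gives identical predictions after a serialization round trip. It is plain Python with no Spark dependency. ToyLinearModel, shared_test_points and survives_round_trip are hypothetical names invented for this example, and JSON stands in for whatever serialization mechanism a real MLlib test would exercise.

```python
import json
import random


class ToyLinearModel:
    """Hypothetical stand-in for a trained model; not an MLlib class."""

    def __init__(self, weights, intercept):
        self.weights = list(weights)
        self.intercept = intercept

    def predict(self, features):
        return sum(w * x for w, x in zip(self.weights, features)) + self.intercept

    def to_json(self):
        # Assumption: the model state is fully described by these two fields.
        return json.dumps({"weights": self.weights, "intercept": self.intercept})

    @classmethod
    def from_json(cls, text):
        state = json.loads(text)
        return cls(state["weights"], state["intercept"])


def shared_test_points(n, dim, seed=42):
    """Deterministic test data that tests for different algorithms could reuse."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(n)]


def survives_round_trip(model, points):
    """True if the restored model predicts exactly like the original on all points."""
    restored = type(model).from_json(model.to_json())
    return all(model.predict(p) == restored.predict(p) for p in points)


model = ToyLinearModel([0.5, -1.25, 2.0], intercept=0.1)
points = shared_test_points(n=100, dim=3)
print(survives_round_trip(model, points))
```

Because the data generator is seeded, every algorithm's test would see the same points, so the same helper could in principle be reused for any model that exposes a predict method and a save/load pair.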



