Egor Pakhomov created SPARK-3878: ------------------------------------ Summary: Benchmarks and common tests for mllib algorithm Key: SPARK-3878 URL: https://issues.apache.org/jira/browse/SPARK-3878 Project: Spark Issue Type: New Feature Components: MLlib Reporter: Egor Pakhomov
There is no common practice among MLlib for testing algorithms: every model generates it's own random test data. There is no easy extractable test cases applible to another algorithm. There is no benchmarks for comparing algorithms. After implementing new algorithm it's very hard to understand how it should be tested. Lack of serialization testing: MLlib algorithms don't contain tests which test that model work after serialization. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org