GitHub user BryanCutler opened a pull request: https://github.com/apache/spark/pull/23063
[SPARK-26033][PYTHON][TESTS] Break large ml/tests.py file into smaller files

## What changes were proposed in this pull request?

This PR breaks down the large ml/tests.py file that contains all Python ML unit tests into several smaller test files to be easier to read and maintain.

The tests are broken down as follows:
```
pyspark
├── __init__.py
...
├── ml
│   ├── __init__.py
...
│   ├── tests
│   │   ├── __init__.py
│   │   ├── test_algorithms.py
│   │   ├── test_base.py
│   │   ├── test_evaluation.py
│   │   ├── test_feature.py
│   │   ├── test_image.py
│   │   ├── test_linalg.py
│   │   ├── test_param.py
│   │   ├── test_persistence.py
│   │   ├── test_pipeline.py
│   │   ├── test_stat.py
│   │   ├── test_training_summary.py
│   │   ├── test_tuning.py
│   │   └── test_wrapper.py
...
├── testing
...
│   ├── mlutils.py
...
```

## How was this patch tested?

Ran tests manually by module to ensure test count was the same, and ran `python/run-tests --modules=pyspark-ml` to verify all passing with Python 2.7 and Python 3.6.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/BryanCutler/spark python-test-breakup-ml-SPARK-26033

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/23063.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #23063

----
commit a4f8f12f6357861572ffbf34190947983545ba98
Author: Bryan Cutler <cutlerb@...>
Date:   2018-11-17T01:30:29Z

    separated out ml tests

----