Nicholas Chammas created SPARK-5161: ---------------------------------------
Summary: Parallelize Python test execution Key: SPARK-5161 URL: https://issues.apache.org/jira/browse/SPARK-5161 Project: Spark Issue Type: Improvement Components: Project Infra Affects Versions: 1.2.0 Reporter: Nicholas Chammas [Original discussion here.|https://github.com/apache/spark/pull/3564#issuecomment-67785676] As of 1.2.0, Python tests take around 10-12 minutes to run. Once [SPARK-3431] is complete, this will become a significant fraction of the total test time. There are 2 separate approaches to explore for parallelizing the execution of Python unit tests: * Use GNU parallel to run each Python test file in parallel. * Use [{{nose}}|http://nose.readthedocs.org/en/latest/doc_tests/test_multiprocess/multiprocess.html] to parallelize all Python tests in a more extensible/configurable way. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org