Hello, I was running python tests and found that pyspark.sql.tests.GroupbyAggPandasUDFTests.test_unsupported_types <https://github.com/apache/spark/blob/52e00f70663a87b5837235bdf72a3e6f84e11411/python/pyspark/sql/tests.py#L4528-L4548> does not run with Python 2 because the test uses "assertRaisesRegex" (supported by Python 3) instead of "assertRaisesRegexp" (supported by Python 2). However, spark jenkins does not fail because of this issue (see run history at here <https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test/job/spark-master-test-sbt-hadoop-2.7/>). After looking into this issue, seems test script will skip tests related to pandas if pandas is not installed <https://github.com/apache/spark/blob/2ac895be909de7e58e1051dc2a1bba98a25bf4be/python/pyspark/sql/tests.py#L51-L63>, which means that jenkins does not have pandas installed.
@Shane, can you help us check if jenkins workers have pandas installed? Thanks, Yin