Hello,

I was running python tests and found that
pyspark.sql.tests.GroupbyAggPandasUDFTests.test_unsupported_types
<https://github.com/apache/spark/blob/52e00f70663a87b5837235bdf72a3e6f84e11411/python/pyspark/sql/tests.py#L4528-L4548>
does
not run with Python 2 because the test uses "assertRaisesRegex" (supported
by Python 3) instead of "assertRaisesRegexp" (supported by Python 2).
However, spark jenkins does not fail because of this issue (see run history
at here
<https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test/job/spark-master-test-sbt-hadoop-2.7/>).
After looking into this issue, seems test script will skip tests related to
pandas if pandas is not installed
<https://github.com/apache/spark/blob/2ac895be909de7e58e1051dc2a1bba98a25bf4be/python/pyspark/sql/tests.py#L51-L63>,
which means that jenkins does not have pandas installed.

@Shane, can you help us check if jenkins workers have pandas installed?

Thanks,

Yin

Reply via email to