I created https://issues.apache.org/jira/browse/SPARK-23292 for this issue.

On Wed, Jan 31, 2018 at 8:17 PM, Yin Huai <yh...@databricks.com> wrote:

> btw, seems we also have the same skipping logic for pyarrow. But, I have
> not looked into if tests related to pyarrow are skipped or not.
>
> On Wed, Jan 31, 2018 at 8:15 PM, Yin Huai <yh...@databricks.com> wrote:
>
>> Hello,
>>
>> I was running python tests and found that pyspark.sql.tests.Groupby
>> AggPandasUDFTests.test_unsupported_types
>> <https://github.com/apache/spark/blob/52e00f70663a87b5837235bdf72a3e6f84e11411/python/pyspark/sql/tests.py#L4528-L4548>
>>  does
>> not run with Python 2 because the test uses "assertRaisesRegex" (supported
>> by Python 3) instead of "assertRaisesRegexp" (supported by Python 2).
>> However, spark jenkins does not fail because of this issue (see run history
>> at here
>> <https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test/job/spark-master-test-sbt-hadoop-2.7/>).
>> After looking into this issue, seems test script will skip tests related
>> to pandas if pandas is not installed
>> <https://github.com/apache/spark/blob/2ac895be909de7e58e1051dc2a1bba98a25bf4be/python/pyspark/sql/tests.py#L51-L63>,
>> which means that jenkins does not have pandas installed.
>>
>> @Shane, can you help us check if jenkins workers have pandas installed?
>>
>> Thanks,
>>
>> Yin
>>
>
>

Reply via email to