Github user shaneknapp commented on the issue:
https://github.com/apache/spark/pull/19884
@BryanCutler @HyukjinKwon
pandas and pyarrow are most definitely installed on all of the jenkins
workers. the 'missing' packages happened after we had a power outage at the
colo, and the jenkins workers rebooted while the master (on UPS) didn't. this
causes the PATH env var to be dropped, which means that instead of seeing the
anaconda installation in PATH, jenkins defaults to system python (which has the
absolute minimum of packages installed).
regarding the pyarrow upgrade: let's schedule it for wednesday (tomorrow)
morning, EST. i'm about to get on another plane and have a few more hours of
traveling left today.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]