Hi,

I have a CDH cluster and am running a PySpark script in client mode. Different Python versions are installed on the client and worker nodes, and I was getting a Python version mismatch error. To resolve this, I followed the Cloudera document below:

https://www.cloudera.com/documentation/data-science-workbench/latest/topics/cdsw_troubleshooting.html#workloads__job_fail_python

and added the following lines:

export PYSPARK_PYTHON=/usr/bin/python/
export PYSPARK_DRIVER_PYTHON=python

I am still getting the version mismatch error. Has anybody encountered this issue? Could you please share how you resolved it? I would really appreciate the help.

PS: I am attaching a screenshot of the code I added.

Thanks,
Divya
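
In case it helps with diagnosing this, here is a minimal sketch of a check that could be run with the interpreter on the client and on each worker node, to see which Python is actually picked up and what the two environment variables resolve to. The helper name describe_python is only an illustration (it is not part of PySpark), and the sketch assumes the mismatch is between major.minor versions, which is what PySpark compares between driver and workers.

```python
import os
import sys


def describe_python(env=None):
    """Hypothetical helper: collect the interpreter details relevant to a
    PySpark driver/worker version mismatch on the node where it runs."""
    # Use the real process environment unless a mapping is passed in.
    env = os.environ if env is None else env
    return {
        # Path of the interpreter that is executing this script.
        "executable": sys.executable,
        # PySpark compares major.minor between driver and workers.
        "version": "%d.%d" % (sys.version_info[0], sys.version_info[1]),
        # Values seen by this process; "<not set>" means the export did not reach it.
        "PYSPARK_PYTHON": env.get("PYSPARK_PYTHON", "<not set>"),
        "PYSPARK_DRIVER_PYTHON": env.get("PYSPARK_DRIVER_PYTHON", "<not set>"),
    }


if __name__ == "__main__":
    for key, value in describe_python().items():
        print("%s = %s" % (key, value))
```

Running the same file on the client and on a worker and comparing the "version" lines should show whether the two sides really resolve to different interpreters, and whether the exported variables are visible to the process at all.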