Re: PySpark Pandas UDF

2019-11-17 Thread Gourav Sengupta
Hi, sorry a completely unrelated question. when is the upcoming release of SPARK 3.0. There are several parallel distributed deep learning frameworks that are being developed, do you think that we could use SPARK 3.0 for distributed deep learning using Pytorch or Tensorflow? Is there any place

Re: PySpark Pandas UDF

2019-11-17 Thread Bryan Cutler
There was a change in the binary format of Arrow 0.15.1 and there is an environment variable you can set to make pyarrow 0.15.1 compatible with current Spark, which looks to be your problem. Please see the doc below for instructions added in SPARK-2936. Note, this will not be required for the