Re: py4j.protocol.Py4JJavaError: An error occurred while calling o794.parquet

2018-01-10 Thread Felix Cheung
Caused by: org.apache.spark.SparkException: Task not serializable. That's the answer :) What are you trying to save? Is it empty or None / null? On Wed, Jan 10, 2018 at 4:58 PM, Liana Napalkova <liana.napalk

Re: py4j.protocol.Py4JJavaError: An error occurred while calling o794.parquet

2018-01-10 Thread Liana Napalkova
py4j.protocol.Py4JJavaError: An error occurred while calling o794.parquet.
: org.apache.spark.SparkException: Job aborted.
    at org.apache.spark.sql.execution.datasources.FileFormatWriter$$anonfun$write$1.apply$mcV$sp(FileFormatWriter.scala:215)
    at org.apache.spark.sql.execution.datasourc

Re: py4j.protocol.Py4JJavaError: An error occurred while calling o794.parquet

2018-01-10 Thread Timur Shenkao
following problem in PySpark? (Python 2.7.12):
> > df.show() # works fine and shows the first 5 rows of DataFrame
> > df.write.parquet(outputPath + '/data.parquet', mode="overwrite") # throws the error
> >
> > The last line throws the following error:
> >
> > py4j.

py4j.protocol.Py4JJavaError: An error occurred while calling o794.parquet

2018-01-10 Thread Liana Napalkova
wing error:
py4j.protocol.Py4JJavaError: An error occurred while calling o794.parquet.
: org.apache.spark.SparkException: Job aborted.
    at org.apache.spark.sql.execution.datasources.FileFormatWriter$$anonfun$write$1.apply$mcV$sp(FileFormatWriter