[jira] [Created] (SPARK-21226) Save empty dataframe in pyspark prints nothing

Carlos M. Casas (JIRA) Tue, 27 Jun 2017 03:43:19 -0700

Carlos M. Casas created SPARK-21226:
---------------------------------------


             Summary: Save empty dataframe in pyspark prints nothing
                 Key: SPARK-21226
                 URL: https://issues.apache.org/jira/browse/SPARK-21226
             Project: Spark
          Issue Type: Bug
          Components: PySpark
    Affects Versions: 2.1.0, 2.0.0
            Reporter: Carlos M. Casas


I try the following:

schema = whatever schema you want
df1 = sqlContext.createDataFrame(sc.emptyRDD(), schema)
df1.write.parquet("as1")

and I just get a directory as1 with a _SUCCESS file in it. If I try to read 
that file, I get an exception.

On the other hand, if I run:

schema = whatever schema you want
df2 = sqlContext.createDataFrame([], schema)
df2.write.parquet("as2")

I get a directory as2 with some files on it (representing field type 
information?). If I try to read it, it works: it read an empty df with the 
proper schema.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Created] (SPARK-21226) Save empty dataframe in pyspark prints nothing

Reply via email to