[ 
https://issues.apache.org/jira/browse/SPARK-26846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arvind Krishnan Iyer updated SPARK-26846:
-----------------------------------------
    Description: 
 
{code:java}
import spark.implicits._
val sc = spark.sparkContext
val df = Seq((8,"100","sfd"),(0,"","sfd"),(8, null, 
"asfasd")).toDF("num","str_num","word").toDF().coalesce(1)
df.write.mode(SaveMode.Overwrite).csv("/Users/arvind.iyer/abcd.csv")
{code}
We are writing the contents of this CSV into a DB, and the contents of that 
column are going in as "". 

+Output+ 

8,100,sfd
 0,"",sfd
 8,"",asfasd

  was:
 
{code:java}
import spark.implicits._
val sc = spark.sparkContext
val df = Seq((8,"100","sfd"),(0,"","sfd"),(8, null, 
"asfasd")).toDF("num","str_num","word").toDF().coalesce(1)
df.write.mode(SaveMode.Overwrite).csv("/Users/arvind.iyer/abcd.csv")
{code}
+Output+ 

8,100,sfd
0,"",sfd
8,"",asfasd


> Empty Strings in dataframe are written as "" in CSV
> ---------------------------------------------------
>
>                 Key: SPARK-26846
>                 URL: https://issues.apache.org/jira/browse/SPARK-26846
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.4.0
>            Reporter: Arvind Krishnan Iyer
>            Priority: Major
>
>  
> {code:java}
> import spark.implicits._
> val sc = spark.sparkContext
> val df = Seq((8,"100","sfd"),(0,"","sfd"),(8, null, 
> "asfasd")).toDF("num","str_num","word").toDF().coalesce(1)
> df.write.mode(SaveMode.Overwrite).csv("/Users/arvind.iyer/abcd.csv")
> {code}
> We are writing the contents of this CSV into a DB, and the contents of that 
> column are going in as "". 
> +Output+ 
> 8,100,sfd
>  0,"",sfd
>  8,"",asfasd



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to