[ https://issues.apache.org/jira/browse/SPARK-15226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16081154#comment-16081154 ]
Andrew Ash edited comment on SPARK-15226 at 7/10/17 9:07 PM:
-------------------------------------------------------------

Fixed by https://issues.apache.org/jira/browse/SPARK-19610

was (Author: aash):
Fixed by Fixed by https://issues.apache.org/jira/browse/SPARK-19610

> CSV file data-line with newline at first line load error
> --------------------------------------------------------
>
>                 Key: SPARK-15226
>                 URL: https://issues.apache.org/jira/browse/SPARK-15226
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.0.0, 2.1.0
>            Reporter: Weichen Xu
>             Fix For: 2.2.0
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> Consider a CSV file such as:
> ---------------
> v1,v2,"v
> 3",v4,v5
> a,b,c,d,e
> ---------------
> It contains two rows. The first row is:
> v1, v2, v\n3, v4, v5 (the value v\n3 contains a newline character, which is
> legal inside a quoted field)
> The second row is:
> a,b,c,d,e
> Then, in spark-shell, run commands like:
> val sqlContext = new org.apache.spark.sql.SQLContext(sc);
> var reader = sqlContext.read
> var df = reader.csv("path/to/csvfile")
> df.collect
> The loaded data is wrong: it has only 3 columns, but the file actually has
> 5 columns.
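
The symptom in the report is consistent with the input being split on newlines before quoting is taken into account: the first physical line, `v1,v2,"v`, has only three comma-separated fields. The following is a minimal sketch in plain Scala of quote-aware record splitting, written only to illustrate the expected result for the sample file. It is not Spark's implementation, and the names `QuotedCsv` and `parse` are hypothetical.

```scala
object QuotedCsv {
  /** Splits CSV text into records, treating newlines inside double quotes as data.
    * Illustrative only: handles commas, double-quoted fields, and "" escapes.
    */
  def parse(text: String): Vector[Vector[String]] = {
    val records = Vector.newBuilder[Vector[String]]
    var fields = Vector.empty[String]   // fields of the record being built
    val field = new StringBuilder       // characters of the field being built
    var inQuotes = false
    var i = 0
    while (i < text.length) {
      val c = text.charAt(i)
      if (inQuotes) {
        if (c == '"') {
          if (i + 1 < text.length && text.charAt(i + 1) == '"') {
            field.append('"')           // "" inside quotes is an escaped quote
            i += 1
          } else {
            inQuotes = false            // closing quote
          }
        } else {
          field.append(c)               // includes '\n' inside a quoted field
        }
      } else {
        c match {
          case '"' =>
            inQuotes = true
          case ',' =>
            fields = fields :+ field.toString
            field.clear()
          case '\n' =>
            // An unquoted newline ends the current record.
            fields = fields :+ field.toString
            field.clear()
            records += fields
            fields = Vector.empty
          case '\r' =>
            ()                          // ignore carriage returns outside quotes
          case other =>
            field.append(other)
        }
      }
      i += 1
    }
    // Flush a final record that has no trailing newline.
    if (field.nonEmpty || fields.nonEmpty) {
      fields = fields :+ field.toString
      records += fields
    }
    records.result()
  }
}
```

Applied to the sample file above, `QuotedCsv.parse` yields two records of five fields each, with the third field of the first record containing the embedded newline. In Spark itself, to the best of my knowledge, the change referenced in SPARK-19610 is exposed from 2.2.0 onward as the CSV reader option `multiLine`, for example `spark.read.option("multiLine", "true").csv("path/to/csvfile")`; check the documentation for the version in use.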