Bomi Kim created SPARK-14726:
--------------------------------

             Summary: Support for sampling when inferring schema in CSV data source
                 Key: SPARK-14726
                 URL: https://issues.apache.org/jira/browse/SPARK-14726
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 2.0.0
            Reporter: Bomi Kim

I am currently using the CSV data source while getting used to Spark 2.0, which ships with a built-in CSV data source. I noticed that the CSV data source infers the schema by reading all of the data. The JSON data source supports a sampling ratio option. It would be great if the CSV data source had this option too (or is it already supported?).
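
To make the request concrete, here is a minimal sketch in plain Python of what sampling during schema inference could look like. It is not Spark's implementation: the function names, the `sampling_ratio` parameter, and the three-type widening order (int, double, string) are all assumptions chosen for illustration.

```python
import csv
import io
import random

# Hypothetical widening order: a column's type only ever moves to the right.
_RANK = {"int": 0, "double": 1, "string": 2}


def infer_type(value):
    """Return the narrowest of the three illustrative types that fits one field."""
    for name, cast in (("int", int), ("double", float)):
        try:
            cast(value)
            return name
        except ValueError:
            pass
    return "string"


def infer_schema(text, sampling_ratio=1.0, seed=0):
    """Infer (column, type) pairs from CSV text, inspecting only a sample of rows.

    Each data row is kept with probability `sampling_ratio`, so a ratio of 1.0
    inspects every row and a small ratio inspects only a small fraction.
    """
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    rng = random.Random(seed)
    types = [None] * len(header)
    for row in reader:
        if rng.random() >= sampling_ratio:
            continue  # row is not part of the sample
        for i, field in enumerate(row[: len(header)]):
            observed = infer_type(field)
            current = types[i]
            if current is None or _RANK[observed] > _RANK[current]:
                types[i] = observed
    # Columns never observed in the sample fall back to string.
    return [(name, t or "string") for name, t in zip(header, types)]
```

The trade-off is the usual one for sampling: a lower ratio reads fewer rows but can miss a rare value that would have widened a column's type, so the inferred schema may be narrower than the full data requires.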