Bomi Kim created SPARK-14726:
--------------------------------

             Summary: Support for sampling when inferring schema in CSV data 
source
                 Key: SPARK-14726
                 URL: https://issues.apache.org/jira/browse/SPARK-14726
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 2.0.0
            Reporter: Bomi Kim


Currently, I am using CSV data source and trying to get used to Spark 2.0 
because it has built-in CSV data source.

I realized that CSV data source infers schema with all the data. JSON data 
source supports sampling ratio option.

It would be great if CSV data source has this option too (or is this supported 
already?).




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to