GitHub user jaceklaskowski opened a pull request:

    https://github.com/apache/spark/pull/14680

    [SPARK-17101][SQL] Provide format identifier for TextFileFormat

    ## What changes were proposed in this pull request?
    
    Define the format identifier that is used in Optimized Logical Plan in 
explain for text file format (following CSV and JSON formats).
    
    ```
    scala> spark.read.text("people.csv").cache.explain(extended = true)
    == Parsed Logical Plan ==
    Relation[value#0] text
    
    == Analyzed Logical Plan ==
    value: string
    Relation[value#0] text
    
    == Optimized Logical Plan ==
    InMemoryRelation [value#0], true, 10000, StorageLevel(disk, memory, 
deserialized, 1 replicas)
       +- *FileScan text [value#0] Batched: false, Format: TEXT, InputPaths: 
file:/Users/jacek/dev/oss/spark/people.csv, PartitionFilters: [], 
PushedFilters: [], ReadSchema: struct<value:string>
    
    == Physical Plan ==
    InMemoryTableScan [value#0]
       +- InMemoryRelation [value#0], true, 10000, StorageLevel(disk, memory, 
deserialized, 1 replicas)
             +- *FileScan text [value#0] Batched: false, Format: TEXT, 
InputPaths: file:/Users/jacek/dev/oss/spark/people.csv, PartitionFilters: [], 
PushedFilters: [], ReadSchema: struct<value:string>
    ```
    
    ## How was this patch tested?
    
    Local build.
    
    


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jaceklaskowski/spark SPARK-17101

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/14680.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #14680
    
----
commit 133e5deff497ce1be92497344bb7e0d4e7d57c21
Author: Jacek Laskowski <ja...@japila.pl>
Date:   2016-08-17T06:43:34Z

    [SPARK-17101][SQL] Provide format identifier for TextFileFormat

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to