[ 
https://issues.apache.org/jira/browse/FLINK-2061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560962#comment-14560962
 ] 

ASF GitHub Bot commented on FLINK-2061:
---------------------------------------

GitHub user chiwanpark opened a pull request:

    https://github.com/apache/flink/pull/734

    [FLINK-2061] CSVReader: quotedStringParsing and includeFields yields 
ParseException

    Fix the bug in `GenericCsvInputFormat` when skipped field is quoted string. 
I also added a unit test for this case.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/chiwanpark/flink FLINK-2061

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/734.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #734
    
----
commit 99fb79beda88c73d80e630aa5e22e9ee401538ed
Author: Chiwan Park <chiwanp...@icloud.com>
Date:   2015-05-27T13:24:59Z

    [FLINK-2061] [java api] Fix GenericCsvInputFormat skipping fields error 
with quoted string

----


> CSVReader: quotedStringParsing and includeFields yields ParseException
> ----------------------------------------------------------------------
>
>                 Key: FLINK-2061
>                 URL: https://issues.apache.org/jira/browse/FLINK-2061
>             Project: Flink
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.9
>            Reporter: Fabian Hueske
>            Assignee: Chiwan Park
>
> Fields in a CSV file with quoted String cannot be skipped.
> Parsing a line such as: 
> {code}
> "20:41:52-1-3-2015"|"Re: Taskmanager memory error in Eclipse"|"Stephan Ewen 
> <se...@apache.org>"|"bla"|"blubb"
> {code}
> with a CSVReader configured as: 
> {code}
> DataSet<Tuple2<String, String>> data =
>                       env.readCsvFile("/path/to/my/data")
>                               .lineDelimiter("\n")
>                               .fieldDelimiter("|")
>                               .parseQuotedStrings('"')
>                               .includeFields("101")
>                               .types(String.class, String.class);
> {code}
> gives a {{ParseException}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to