[ https://issues.apache.org/jira/browse/DRILL-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14904841#comment-14904841 ]
Chun Chang commented on DRILL-3795: ----------------------------------- With tab between columns, this is what I got: {noformat} 0: jdbc:drill:schema=dfs.tpch_maprdb> select columns[0], columns[1] from dfs.tmp.`drill-3718.tsv`; +------------------------------------------+---------------------------------+ | EXPR$0 | EXPR$1 | +------------------------------------------+---------------------------------+ | another no quote | ""another with quote"" | | "another with double quotes" no quotes | null | +------------------------------------------+---------------------------------+ 2 rows selected (0.419 seconds) 0: jdbc:drill:schema=dfs.tpch_maprdb> select * from dfs.tmp.`drill-3718.tsv`; +------------------------------------------------------------+ | columns | +------------------------------------------------------------+ | ["another no quote"," \"\"another with quote\"\""] | | ["\"another with double quotes\"\tno quotes\n"] | +------------------------------------------------------------+ 2 rows selected (0.428 seconds) {noformat} > TextReader can't read .tsv file contains multiple double quotes > --------------------------------------------------------------- > > Key: DRILL-3795 > URL: https://issues.apache.org/jira/browse/DRILL-3795 > Project: Apache Drill > Issue Type: Bug > Components: Storage - Text & CSV > Affects Versions: 1.2.0 > Reporter: Chun Chang > Assignee: Sean Hsuan-Yi Chu > Attachments: drill-3795.tsv > > > commit_id: 69c73af54ac3d15b8e7c21e8a3c35b4a62ebc844 > I have a simple tab delimitated file contains multiple double quoted text: > {noformat} > another no quote ""another with quote"" > ""another with double quotes"" no quotes > {noformat} > This cause the following error: > {noformat} > 0: jdbc:drill:schema=dfs.drillTestDirDropTabl> select columns[0], columns[1] > from dfs.tmp.`drill-3718.tsv`; > Error: SYSTEM ERROR: TextParsingException: Error processing input: Cannot use > newline character within quoted string, line=2, char=61. Content parsed: [ ] > Fragment 0:0 > [Error Id: c631eccc-038c-4d61-bda8-e7037c3677e8 on 10.10.30.166:31010] > (state=,code=0) > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)