Hi,
I am working on a sample csv data on which I want to do query. The CSV data
contains xml data in one field , second field has string that enclosed by
double quote third field that has backslash in string and forth field (Should
be last Last Field) that also has data same as second field string enclosed
by double quote.
*Apache drill settings:-*
- Drill version :- 1.10.0
- CSV File Settings :-
> * "csv": {*
> * "type": "text",*
> * "extensions": [*
> * "csv",*
> * "gz",*
> * "tar.gz",*
> * "tar"*
> * ],*
> * "quote": "x",*
> * "escape": "`",*
> * "skipFirstLine": true,*
> * "delimiter": "^"** },*
*Result*:
- *CASE 1:*- In Apache drill when we execute query on the file then if
double quote("") exists in column data then it was replacing it with
triple slash(\\\) .
- *CASE 2*:- If XML data are exists in column then html tag were not
showing in results with version 1.10.0 . After updating version 1.13.0 then
it was working fine.
- *CASE 3*:- If double quote exists in end of the column data then next
row of file are merging with first column .
PFA for Apache Drill Query Result.
*Please suggest proper solution.*
Thanks