[ https://issues.apache.org/jira/browse/DRILL-5674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Arina Ielchiieva updated DRILL-5674: ------------------------------------ Reviewer: Vova Vysotskyi > Drill should support .zip compression > ------------------------------------- > > Key: DRILL-5674 > URL: https://issues.apache.org/jira/browse/DRILL-5674 > Project: Apache Drill > Issue Type: Improvement > Components: Storage - Text & CSV > Affects Versions: 1.10.0 > Reporter: Paul Rogers > Assignee: Arina Ielchiieva > Priority: Major > Labels: doc-impacting > Fix For: 1.17.0 > > > Zip is a very common compression format. Create a compressed CSV file with > column headers: data.csv.zip. > Define a storage plugin config for the file, call it "dfs.myws", set > delimiter = ",", extract header = true, skip header = false. > Run a simple query: > SELECT * FROM dfs.myws.`data.csv.zip` > The result is garbage as the CSV reader is trying to parse Zipped data as if > it were text. > DRILL-5506 asks how to do this; the responder said to add a library to the > path. Better would be to simply support zip out-of-the-box as a default > format. -- This message was sent by Atlassian Jira (v8.3.4#803005)