Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/8418 )

Change subject: IMPALA-4539: [DOCS] Add known issue for uncompressed Parquet 
correctness
......................................................................


Patch Set 1:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/8418/1/docs/topics/impala_known_issues.xml
File docs/topics/impala_known_issues.xml:

http://gerrit.cloudera.org:8080/#/c/8418/1/docs/topics/impala_known_issues.xml@936
PS1, Line 936:             Examine the <codeph>HDFS_SCAN_NODE</codeph> portion 
of a query profile that scans the
This unfortunately won't give accurate info for all queries: if the query isn't 
materialising any columns (e.g. count(*)) or the file is filtered out by 
runtime filters, the file compression was inaccurate in previous versions - see 
IMPALA-5311 and IMPALA-4863 and respectively.

One way to tell for sure is to run something like "select * from table" and 
then look. Or, say, "select min(string_col) from table"


http://gerrit.cloudera.org:8080/#/c/8418/1/docs/topics/impala_known_issues.xml@937
PS1, Line 937:             suspected table. Look for <q>File Formats</q>. A 
value containing <codeph>PARQUET/NONE</codeph>
It might be helpful to note common cases where uncompressed Parquet is/isn't 
created. Impala generates snappy-compressed Parquet by default unless 
compression_codec is changed. Most uncompressed parquet we see in the wild is 
generated by Hive or other non-Impala tools.



--
To view, visit http://gerrit.cloudera.org:8080/8418
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I731eb0e029dc3cc251f4df0c5a8ad281c81595cb
Gerrit-Change-Number: 8418
Gerrit-PatchSet: 1
Gerrit-Owner: John Russell <jruss...@cloudera.com>
Gerrit-Reviewer: Greg Rahn <gr...@cloudera.com>
Gerrit-Reviewer: John Russell <jruss...@cloudera.com>
Gerrit-Reviewer: Tim Armstrong <tarmstr...@cloudera.com>
Gerrit-Comment-Date: Mon, 30 Oct 2017 21:35:39 +0000
Gerrit-HasComments: Yes

Reply via email to