[ https://issues.apache.org/jira/browse/IMPALA-7098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16495949#comment-16495949 ]
Tianyi Wang edited comment on IMPALA-7098 at 5/31/18 1:04 AM: -------------------------------------------------------------- [~dhecht] A block group is the logical unit in HDFS. I'm checking the Hadoop source code but I believe non-splittable file formats or parquet row groups are stored in such units - it doesn't make sense otherwise. was (Author: tianyiwang): [~dhecht] A block group is the logical unit in HDFS. I'm checking the Hadoop source code but I believe non-splittable file formats or parquet row groups are stored in such units. > Re-enable blocksize-related tests under EC > ------------------------------------------ > > Key: IMPALA-7098 > URL: https://issues.apache.org/jira/browse/IMPALA-7098 > Project: IMPALA > Issue Type: Sub-task > Components: Frontend, Infrastructure > Affects Versions: Impala 3.1.0 > Reporter: Tianyi Wang > Assignee: Taras Bobrovytsky > Priority: Major > > In EC the unit of a scan range is a block group. So in our mini cluster a > scan range is of 3X the size of a regular block. This breaks some tests under > EC. One way to address this problem is to shrink the blocks to 1/3 of the > original size in those tests. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org For additional commands, e-mail: issues-all-h...@impala.apache.org