[ 
https://issues.apache.org/jira/browse/IMPALA-7004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tianyi Wang resolved IMPALA-7004.
---------------------------------
       Resolution: Fixed
    Fix Version/s: Impala 3.1.0

Commit 13a1acd7e42b533c39b9f4eea1c17823bde4c1c5 in impala's branch 
refs/heads/master from [~tianyiwang]
[ [https://git-wip-us.apache.org/repos/asf?p=impala.git;h=13a1acd] ]

IMPALA-7003: Deflake erasure coding data loading

Erasure coding data loading is flaky in two ways:
1. HBase sometimes doesn't work because of HBase-19369
2. Nested data loading sometimes fails because the HDFS namenode cannot
find enough good datanodes.

For problem 1, this patch enables erasure coding only on /test-warehouse
directory. For problem 2, this patch sets
dfs.namenode.redundancy.considerLoad to false, preventing namenode from
excluding heavily-loaded datanodes.

Change-Id: I219106cd3ec7ffab7a834700f2a722b165e5f66c
Reviewed-on: [http://gerrit.cloudera.org:8080/10362]
Reviewed-by: Alex Behm <alex.b...@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenk...@cloudera.com>

> Deflake erasure coding data loading
> -----------------------------------
>
>                 Key: IMPALA-7004
>                 URL: https://issues.apache.org/jira/browse/IMPALA-7004
>             Project: IMPALA
>          Issue Type: Sub-task
>    Affects Versions: Impala 3.1.0
>            Reporter: Tianyi Wang
>            Assignee: Tianyi Wang
>            Priority: Major
>             Fix For: Impala 3.1.0
>
>
> Currently the data loading of erasure coding is flaky. There are 2 separate 
> problems:
>  # HBase data loading sometimes fails: HBASE-19369
>  # Nested data loading sometimes fails because there are not enough "Good" 
> data nodes. The root cause might be same as IMPALA-6394



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to