[jira] [Commented] (IMPALA-7236) Erasure coding dataload broken by IMPALA-7102

2019-03-07 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16787336#comment-16787336
 ] 

ASF subversion and git services commented on IMPALA-7236:
-

Commit 1d8f9ee16bbdbb43d68db5007e22c41aaa3b8de7 in impala's branch 
refs/heads/2.x from stiga-huang
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=1d8f9ee ]

Ignore "IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES"

Change-Id: I68677c3bd8cc70f0e6e6880f514502d787a39efc
Reviewed-on: http://gerrit.cloudera.org:8080/12697
Reviewed-by: Tim Armstrong 
Tested-by: Tim Armstrong 


> Erasure coding dataload broken by IMPALA-7102
> -
>
> Key: IMPALA-7236
> URL: https://issues.apache.org/jira/browse/IMPALA-7236
> Project: IMPALA
>  Issue Type: Bug
>  Components: Infrastructure
>Affects Versions: Impala 3.1.0
>Reporter: Joe McDonnell
>Assignee: Tianyi Wang
>Priority: Blocker
>  Labels: broken-build
> Fix For: Impala 3.1.0
>
>
> Loading nested data now fails on erasure coding:
> {code:java}
> 16:58:45 Loading nested data (logging to 
> /data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/logs/data_loading/load-nested.log)...
>  
> 17:01:22 FAILED (Took: 2 min 37 sec)
> 17:01:22 
> '/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/testdata/bin/load_nested.py'
>  failed. Tail of log:
> 17:01:22 2018-06-30 16:59:54,241 INFO:load_nested[277]:Executing: 
> 17:01:22 
> 17:01:22 CREATE TABLE customer
> 17:01:22 STORED AS PARQUET
> 17:01:22 TBLPROPERTIES('parquet.compression'='SNAPPY')
> 17:01:22 AS SELECT * FROM tmp_customer
> 17:01:22 2018-06-30 17:01:00,454 INFO:load_nested[277]:Executing: 
> 17:01:22 
> 17:01:22 CREATE TABLE region
> 17:01:22 STORED AS PARQUET
> 17:01:22 TBLPROPERTIES('parquet.compression'='SNAPPY')
> 17:01:22 AS SELECT * FROM tmp_region
> 17:01:22 2018-06-30 17:01:01,744 INFO:load_nested[277]:Executing: 
> 17:01:22 
> 17:01:22 CREATE TABLE supplier
> 17:01:22 STORED AS PARQUET
> 17:01:22 TBLPROPERTIES('parquet.compression'='SNAPPY')
> 17:01:22 AS SELECT * FROM tmp_supplier
> 17:01:22 2018-06-30 17:01:06,033 INFO:load_nested[296]:Executing: 
> 17:01:22 DROP TABLE tmp_orders_string
> 17:01:22 2018-06-30 17:01:06,097 INFO:load_nested[296]:Executing: 
> 17:01:22 DROP TABLE tmp_customer_string
> 17:01:22 2018-06-30 17:01:06,127 INFO:load_nested[296]:Executing: 
> 17:01:22 DROP TABLE tmp_customer
> 17:01:22 2018-06-30 17:01:10,067 INFO:load_nested[296]:Executing: 
> 17:01:22 
> 17:01:22 DROP TABLE tmp_region_string
> 17:01:22 2018-06-30 17:01:10,097 INFO:load_nested[296]:Executing: 
> 17:01:22 DROP TABLE tmp_region
> 17:01:22 2018-06-30 17:01:14,071 INFO:load_nested[296]:Executing: 
> 17:01:22 
> 17:01:22 DROP TABLE tmp_supplier
> 17:01:22 2018-06-30 17:01:18,075 INFO:load_nested[296]:Executing: 
> 17:01:22 DROP TABLE tmp_supplier_string
> 17:01:22 Traceback (most recent call last):
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/testdata/bin/load_nested.py",
>  line 331, in 
> 17:01:22 load()
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/testdata/bin/load_nested.py",
>  line 303, in load
> 17:01:22 impala.compute_stats()
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/tests/comparison/db_connection.py",
>  line 760, in compute_stats
> 17:01:22 self.execute("COMPUTE STATS %s" % table_name)
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/tests/comparison/db_connection.py",
>  line 206, in execute
> 17:01:22 return self._cursor.execute(sql, *args, **kwargs)
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/infra/python/env/lib/python2.7/site-packages/impala/hiveserver2.py",
>  line 304, in execute
> 17:01:22 self._wait_to_finish()  # make execute synchronous
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/infra/python/env/lib/python2.7/site-packages/impala/hiveserver2.py",
>  line 380, in _wait_to_finish
> 17:01:22 raise OperationalError(resp.errorMessage)
> 17:01:22 impala.error.OperationalError: ImpalaRuntimeException: Scanning of 
> HDFS erasure-coded file 
> (hdfs://localhost:20500/test-warehouse/tpch_nested_parquet.db/customer/00_0)
>  is not supported
> 17:01:22 
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: 

[jira] [Commented] (IMPALA-7236) Erasure coding dataload broken by IMPALA-7102

2018-07-03 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16532120#comment-16532120
 ] 

ASF subversion and git services commented on IMPALA-7236:
-

Commit 61e6a47776ba7f14139b69f91a49d2072a76178b in impala's branch 
refs/heads/master from [~tianyiwang]
[ https://git-wip-us.apache.org/repos/asf?p=impala.git;h=61e6a47 ]

IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES

This patch adds a missing "break" statement in a switch statement
changed by IMPALA-7102.
Also fixes an non-deterministic test case.

Change-Id: Ife1e791541e3f4fed6bec00945390c7d7681e824
Reviewed-on: http://gerrit.cloudera.org:8080/10857
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


> Erasure coding dataload broken by IMPALA-7102
> -
>
> Key: IMPALA-7236
> URL: https://issues.apache.org/jira/browse/IMPALA-7236
> Project: IMPALA
>  Issue Type: Bug
>  Components: Infrastructure
>Affects Versions: Impala 3.1.0
>Reporter: Joe McDonnell
>Assignee: Tianyi Wang
>Priority: Blocker
>  Labels: broken-build
>
> Loading nested data now fails on erasure coding:
> {code:java}
> 16:58:45 Loading nested data (logging to 
> /data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/logs/data_loading/load-nested.log)...
>  
> 17:01:22 FAILED (Took: 2 min 37 sec)
> 17:01:22 
> '/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/testdata/bin/load_nested.py'
>  failed. Tail of log:
> 17:01:22 2018-06-30 16:59:54,241 INFO:load_nested[277]:Executing: 
> 17:01:22 
> 17:01:22 CREATE TABLE customer
> 17:01:22 STORED AS PARQUET
> 17:01:22 TBLPROPERTIES('parquet.compression'='SNAPPY')
> 17:01:22 AS SELECT * FROM tmp_customer
> 17:01:22 2018-06-30 17:01:00,454 INFO:load_nested[277]:Executing: 
> 17:01:22 
> 17:01:22 CREATE TABLE region
> 17:01:22 STORED AS PARQUET
> 17:01:22 TBLPROPERTIES('parquet.compression'='SNAPPY')
> 17:01:22 AS SELECT * FROM tmp_region
> 17:01:22 2018-06-30 17:01:01,744 INFO:load_nested[277]:Executing: 
> 17:01:22 
> 17:01:22 CREATE TABLE supplier
> 17:01:22 STORED AS PARQUET
> 17:01:22 TBLPROPERTIES('parquet.compression'='SNAPPY')
> 17:01:22 AS SELECT * FROM tmp_supplier
> 17:01:22 2018-06-30 17:01:06,033 INFO:load_nested[296]:Executing: 
> 17:01:22 DROP TABLE tmp_orders_string
> 17:01:22 2018-06-30 17:01:06,097 INFO:load_nested[296]:Executing: 
> 17:01:22 DROP TABLE tmp_customer_string
> 17:01:22 2018-06-30 17:01:06,127 INFO:load_nested[296]:Executing: 
> 17:01:22 DROP TABLE tmp_customer
> 17:01:22 2018-06-30 17:01:10,067 INFO:load_nested[296]:Executing: 
> 17:01:22 
> 17:01:22 DROP TABLE tmp_region_string
> 17:01:22 2018-06-30 17:01:10,097 INFO:load_nested[296]:Executing: 
> 17:01:22 DROP TABLE tmp_region
> 17:01:22 2018-06-30 17:01:14,071 INFO:load_nested[296]:Executing: 
> 17:01:22 
> 17:01:22 DROP TABLE tmp_supplier
> 17:01:22 2018-06-30 17:01:18,075 INFO:load_nested[296]:Executing: 
> 17:01:22 DROP TABLE tmp_supplier_string
> 17:01:22 Traceback (most recent call last):
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/testdata/bin/load_nested.py",
>  line 331, in 
> 17:01:22 load()
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/testdata/bin/load_nested.py",
>  line 303, in load
> 17:01:22 impala.compute_stats()
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/tests/comparison/db_connection.py",
>  line 760, in compute_stats
> 17:01:22 self.execute("COMPUTE STATS %s" % table_name)
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/tests/comparison/db_connection.py",
>  line 206, in execute
> 17:01:22 return self._cursor.execute(sql, *args, **kwargs)
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/infra/python/env/lib/python2.7/site-packages/impala/hiveserver2.py",
>  line 304, in execute
> 17:01:22 self._wait_to_finish()  # make execute synchronous
> 17:01:22   File 
> "/data/jenkins/workspace/impala-asf-master-core-erasure-coding/repos/Impala/infra/python/env/lib/python2.7/site-packages/impala/hiveserver2.py",
>  line 380, in _wait_to_finish
> 17:01:22 raise OperationalError(resp.errorMessage)
> 17:01:22 impala.error.OperationalError: ImpalaRuntimeException: Scanning of 
> HDFS erasure-coded file 
> (hdfs://localhost:20500/test-warehouse/tpch_nested_parquet.db/customer/00_0)
>  is not supported
> 17:01:22 
> {code}



--
This message was sent by Atlassian JIRA