[ https://issues.apache.org/jira/browse/HAWQ-1426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15960282#comment-15960282 ]
Lili Ma edited comment on HAWQ-1426 at 4/7/17 5:12 AM: ------------------------------------------------------- RCA: When hawq extract tries to find the HDFS files information, it wrongly treated the pg_aoseg.pg_aoseg_$relid as the catalog table for storing those information. Here is the correct processing steps: 1. Find the directory on HDFS which stores the actual data for the table. This can be achieved by following the column "relfilenode" in pg_class table. 2. Find the detailed file name for the table under above directory. This can be achieved by searching the catalog table pg_aoseg.pg_aoseg(paqseg)_$. The table name suffix is neither $relid nor $relfilenode under some circumstances. We should get it by referring the column "segrelid" in catalog table "pg_appendonly", and then looking up the table "pg_class" to get the accurate table name. was (Author: lilima): RCA: When hawq extract tries to find the HDFS files information, it wrongly treated the pg_aoseg.pg_aoseg_$relid as the catalog table for storing those information. When determining the file path of a table, hawq extract should follow below steps: 1. Find the directory on HDFS which stores the actual data for the table. This can be achieved by following the column "relfilenode" in pg_class table. 2. Find the detailed file name for the table under above directory. This can be achieved by searching the catalog table pg_aoseg.pg_aoseg(paqseg)_$. The table name suffix is neither $relid nor $relfilenode under some circumstances. We should get it by referring the column "segrelid" in catalog table "pg_appendonly", and then looking up the table "pg_class" to get the accurate table name. > hawq extract meets error after the table was reorganized. > --------------------------------------------------------- > > Key: HAWQ-1426 > URL: https://issues.apache.org/jira/browse/HAWQ-1426 > Project: Apache HAWQ > Issue Type: Bug > Components: Command Line Tools > Reporter: Lili Ma > Assignee: Ed Espino > Fix For: 2.3.0.0-incubating > > > After one table is reorganized, hawq extract the table will meet error. > Reproduce Steps: > 1. create an AO table > 2. insert into several records into it > 3. Get the table reorganized. "alter table a set with (reorganize=true);" > 4. run hawq extract, error thrown out. > For the bug fix, we should also guarantee that hawq extract should work if > the table is truncated and re-inserted. -- This message was sent by Atlassian JIRA (v6.3.15#6346)