Quanlong Huang created IMPALA-11124:
---------------------------------------

             Summary: testdata loading should reuse TPCH/TPCDS local data if 
they exist
                 Key: IMPALA-11124
                 URL: https://issues.apache.org/jira/browse/IMPALA-11124
             Project: IMPALA
          Issue Type: Improvement
          Components: Infrastructure
            Reporter: Quanlong Huang
            Assignee: Quanlong Huang


When loading testdata for TPC-H/TPC-DS, we first run a preload script to 
generate local data, and then upload them to HDFS to be used by Hive. It's 
time-consuming to run the preload script in large scale factors (e.g. 30). We 
should reuse them if they exist.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to