[ https://issues.apache.org/jira/browse/HIVE-13496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15238128#comment-15238128 ]
Siddharth Seth commented on HIVE-13496: --------------------------------------- Couple of options 1. Checkin the derby file that is generated. (This would create another update step if anyone changes the generation scripts. This may not be a problem, given that q_test_init was last modified in November 2014) 2. [~ashutoshc] - was mentioning some other way to load derby which is cheaper. 3. Eventually - automate this, i.e. look for the existence of the data - and create it only if it does not exist. > Create initial test data once across multiple runs > -------------------------------------------------- > > Key: HIVE-13496 > URL: https://issues.apache.org/jira/browse/HIVE-13496 > Project: Hive > Issue Type: Improvement > Components: Test > Reporter: Siddharth Seth > Assignee: Siddharth Seth > > All TestCliDriver, TezMiniTezCliDriver etc tests create a standard data set > when they start up. When running on a box with SSDs - this step takes over a > minute. > Running a single qtest cannot be faster than this. On the ptest framework - > all batches end up doing this which is a lot of wastage. > Instead, this data generation should be shared across runs. -- This message was sent by Atlassian JIRA (v6.3.4#6332)