Input Sampling By Splits
------------------------
Key: HIVE-2121
URL: https://issues.apache.org/jira/browse/HIVE-2121
Project: Hive
Issue Type: New Feature
Reporter: Siying Dong
Assignee: Siying Dong
We need better input sampling to serve at least two purposes. It should let
users:
1. test their queries against a smaller data set
2. understand more about what the data looks like without scanning the whole
table.
A simple function that returns a subset of the splits will help in those
cases. It doesn't have to be strict sampling.
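
To make the idea concrete, here is a minimal sketch of what "a subset of the splits" could mean. It is not Hive code and not a proposed implementation: the `Split` class, the `sample_splits` function, and the `fraction` parameter are all hypothetical names made up for illustration. The sketch takes splits in order until roughly the requested share of the input bytes is covered, which is non-strict sampling in the sense described above.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Split:
    """Hypothetical stand-in for an input split: a byte range within a file."""
    path: str
    start: int
    length: int


def sample_splits(splits: List[Split], fraction: float) -> List[Split]:
    """Return leading splits until about `fraction` of the total bytes is covered.

    This is not statistically strict sampling: whole splits are taken in
    their original order, so the result can overshoot the target by up to
    one split.
    """
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")

    total_bytes = sum(s.length for s in splits)
    target_bytes = total_bytes * fraction

    chosen: List[Split] = []
    covered = 0
    for s in splits:
        # Stop as soon as the splits chosen so far reach the target size.
        if covered >= target_bytes:
            break
        chosen.append(s)
        covered += s.length
    return chosen


# Example: ten 100-byte splits of one (made-up) file.
splits = [Split("part-00000", i * 100, 100) for i in range(10)]
subset = sample_splits(splits, 0.25)
```

Working at split granularity means the unselected splits are never opened, so a query over the subset avoids scanning the whole table. The cost is that the subset is biased toward whichever splits come first.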