[
https://issues.apache.org/jira/browse/HIVE-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Phabricator updated HIVE-2780:
------------------------------
Attachment: HIVE-2780.D1623.3.patch
navis updated the revision "HIVE-2780 [jira] Implement more restrictive table
sampler".
Reviewers: JIRA
Rebased to trunk & add support for total length sampling
REVISION DETAIL
https://reviews.facebook.net/D1623
AFFECTED FILES
ql/src/java/org/apache/hadoop/hive/ql/io/CombineHiveInputFormat.java
ql/src/java/org/apache/hadoop/hive/ql/io/SplitSampler.java
ql/src/test/queries/clientpositive/split_sample_sampler.q
ql/src/test/results/clientpositive/split_sample_sampler.q.out
To: JIRA, navis
> Implement more restrictive table sampler
> ----------------------------------------
>
> Key: HIVE-2780
> URL: https://issues.apache.org/jira/browse/HIVE-2780
> Project: Hive
> Issue Type: Improvement
> Reporter: Navis
> Assignee: Navis
> Priority: Trivial
> Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.1.patch,
> ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.2.patch, HIVE-2780.D1623.3.patch
>
>
> Current table sampling scans whole block, making more rows included than
> expected especially for small tables.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira