[
https://issues.apache.org/jira/browse/HIVE-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Phabricator updated HIVE-2780:
------------------------------
Attachment: HIVE-2780.D1623.4.patch
navis updated the revision "HIVE-2780 [jira] Implement more restrictive table
sampler".
Addressed comments & rebased to trunk
Reviewers: ashutoshc, JIRA
REVISION DETAIL
https://reviews.facebook.net/D1623
CHANGE SINCE LAST DIFF
https://reviews.facebook.net/D1623?vs=25479&id=28587#toc
AFFECTED FILES
common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
conf/hive-default.xml.template
ql/src/java/org/apache/hadoop/hive/ql/io/CombineHiveInputFormat.java
ql/src/java/org/apache/hadoop/hive/ql/io/DefaultSplitSampler.java
ql/src/java/org/apache/hadoop/hive/ql/io/ShirinkSplitSampler.java
ql/src/java/org/apache/hadoop/hive/ql/io/SplitSampler.java
ql/src/test/queries/clientpositive/split_sample_custom.q
ql/src/test/results/clientpositive/split_sample_custom.q.out
To: JIRA, ashutoshc, navis
> Implement more restrictive table sampler
> ----------------------------------------
>
> Key: HIVE-2780
> URL: https://issues.apache.org/jira/browse/HIVE-2780
> Project: Hive
> Issue Type: Improvement
> Reporter: Navis
> Assignee: Navis
> Priority: Trivial
> Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.1.patch,
> ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.2.patch, HIVE-2780.D1623.3.patch,
> HIVE-2780.D1623.4.patch
>
>
> Current table sampling scans whole block, making more rows included than
> expected especially for small tables.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira