-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/25190/
-----------------------------------------------------------
(Updated Aug. 29, 2014, 12:27 p.m.)
Review request for drill.
Changes
-------
Correcting the issue id and description.
Summary (updated)
-----------------
DRILL-1346: Use HBase table size information to improve scan parallelization
Bugs: DRILL-1346
https://issues.apache.org/jira/browse/DRILL-1346
Repository: drill-git
Description
-------
This patch enables a better estimation of the size of HBase data under scan by
computing the total data size of all regions and sample. This estimation can be
disabled by setting "drill.exec.hbase.scan.sizecalculator.enabled" to false.
Diffs
-----
contrib/storage-hbase/src/main/java/org/apache/drill/exec/store/hbase/HBaseGroupScan.java
8e9ae18
contrib/storage-hbase/src/main/java/org/apache/drill/exec/store/hbase/HBaseScanSpec.java
c2ee723
contrib/storage-hbase/src/main/java/org/apache/drill/exec/store/hbase/TableStatsCalculator.java
PRE-CREATION
contrib/storage-hbase/src/main/resources/drill-module.conf 0edceaf
contrib/storage-hbase/src/test/java/org/apache/drill/hbase/TestTableGenerator.java
3678c78
Diff: https://reviews.apache.org/r/25190/diff/
Testing
-------
Manual testing with a full/partial scan on a large HBase table.
Thanks,
Aditya Kishore