[ https://issues.apache.org/jira/browse/PIG-1828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12988040#action_12988040 ]
Lukas commented on PIG-1828:
----------------------------

Hi there,

I set pig.splitCombination to false in pig.properties, and the table is now fully processed; the bug went away. Pig issued one map job for each region.

> HBaseStorage has problems with processing multiregion tables
> ------------------------------------------------------------
>
>                 Key: PIG-1828
>                 URL: https://issues.apache.org/jira/browse/PIG-1828
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.8.0
>         Environment: Hadoop 0.20.2, HBase 0.20.6, distributed mode
>            Reporter: Lukas
>
> As brought up on the Pig user mailing list
> (http://www.mail-archive.com/user%40pig.apache.org/msg00606.html), Pig
> sometimes does not scan the full HBase table.
> It seems that HBaseStorage has problems scanning large tables. It issues just
> one mapper job instead of one mapper job per table region.
> Ian Stevens, who brought this issue up on the mailing list, attached a script
> to reproduce the problem (https://gist.github.com/766929).
> However, in my case, the problem only occurred after the table was split
> into more than one region.
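
For reference, the workaround described in the comment amounts to a one-line configuration change. A minimal sketch, assuming the property goes into the pig.properties file that the Pig installation actually reads (its location varies by setup and is not stated in this thread):

```properties
# Workaround reported for PIG-1828: turn off combining of small input
# splits, so that (as observed in the comment) Pig issues one map job
# per HBase region instead of a single one.
pig.splitCombination=false
```

Split combination exists to reduce the number of map tasks when there are many small inputs, so disabling it globally may increase the task count for other jobs. It may be worth limiting the change to scripts that load from HBase.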