Sergey Soldatov created PHOENIX-2649:
----------------------------------------

             Summary: GC/OOM during BulkLoad
                 Key: PHOENIX-2649
                 URL: https://issues.apache.org/jira/browse/PHOENIX-2649
             Project: Phoenix
          Issue Type: Bug
    Affects Versions: 4.7.0
         Environment: Mac OS, Hadoop 2.7.2, HBase 1.1.2
            Reporter: Sergey Soldatov
            Priority: Critical


Phoenix fails to complete  bulk load of 40Mb csv data with GC heap error during 
Reduce phase. The problem is in the comparator for TableRowkeyPair. It expects 
that the serialized value was written using zero-compressed encoding, but at 
least in my case it was written in regular way. So, trying to obtain length for 
table name and row key it always get zero and reports that those byte arrays 
are equal. As the result, the reducer receives all data produced by mappers in 
one reduce call and fails with OOM. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to