[ 
https://issues.apache.org/jira/browse/TEZ-3402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gopal V updated TEZ-3402:
-------------------------
    Description: 
Bad configs triggers integer overflow. This is a 5Tb query which tries to group 
by max-size of 4096.

{code}        
        // splits too small to work. Need to override with size.
        int newDesiredNumSplits = (int)(totalLength/minLengthPerGroup) + 1;
{code}

{code}
diagnostics=[Vertex vertex_1470081722620_0072_3_00 [Map 2] killed/failed due 
to:ROOT_INPUT_INIT_FAILURE, Vertex Input: srvc_fee initializer failed, 
vertex=vertex_1470081722620_0072_3_00 [Map 2], 
java.lang.IllegalArgumentException: Illegal Capacity: -1401168103
        at java.util.ArrayList.<init>(ArrayList.java:156)
        at 
org.apache.hadoop.mapred.split.TezMapredSplitsGrouper.getGroupedSplits(TezMapredSplitsGrouper.java:230)
        at 
org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.group(SplitGrouper.java:89)
        at 
org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.generateGroupedSplits(SplitGrouper.java:168)
        at 
org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.generateGroupedSplits(SplitGrouper.java:138)
        at 
org.apache.hadoop.hive.ql.exec.tez.HiveSplitGenerator.initialize(HiveSplitGenerator.java:159)
        at 
org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:273)
        at 
org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:266)
{code}

  was:
Bad configs triggers integer overflow. This is a 5Tb query which tries to group 
by max-size of 4096.

{code}
    groupedSplits = new ArrayList<GroupedSplitContainer>(desiredNumSplits);
{code}

{code}
diagnostics=[Vertex vertex_1470081722620_0072_3_00 [Map 2] killed/failed due 
to:ROOT_INPUT_INIT_FAILURE, Vertex Input: srvc_fee initializer failed, 
vertex=vertex_1470081722620_0072_3_00 [Map 2], 
java.lang.IllegalArgumentException: Illegal Capacity: -1401168103
        at java.util.ArrayList.<init>(ArrayList.java:156)
        at 
org.apache.hadoop.mapred.split.TezMapredSplitsGrouper.getGroupedSplits(TezMapredSplitsGrouper.java:230)
        at 
org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.group(SplitGrouper.java:89)
        at 
org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.generateGroupedSplits(SplitGrouper.java:168)
        at 
org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.generateGroupedSplits(SplitGrouper.java:138)
        at 
org.apache.hadoop.hive.ql.exec.tez.HiveSplitGenerator.initialize(HiveSplitGenerator.java:159)
        at 
org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:273)
        at 
org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:266)
{code}


> SplitGrouper: Integer overflow
> ------------------------------
>
>                 Key: TEZ-3402
>                 URL: https://issues.apache.org/jira/browse/TEZ-3402
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.8.4
>            Reporter: Gopal V
>
> Bad configs triggers integer overflow. This is a 5Tb query which tries to 
> group by max-size of 4096.
> {code}        
>         // splits too small to work. Need to override with size.
>         int newDesiredNumSplits = (int)(totalLength/minLengthPerGroup) + 1;
> {code}
> {code}
> diagnostics=[Vertex vertex_1470081722620_0072_3_00 [Map 2] killed/failed due 
> to:ROOT_INPUT_INIT_FAILURE, Vertex Input: srvc_fee initializer failed, 
> vertex=vertex_1470081722620_0072_3_00 [Map 2], 
> java.lang.IllegalArgumentException: Illegal Capacity: -1401168103
>         at java.util.ArrayList.<init>(ArrayList.java:156)
>         at 
> org.apache.hadoop.mapred.split.TezMapredSplitsGrouper.getGroupedSplits(TezMapredSplitsGrouper.java:230)
>         at 
> org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.group(SplitGrouper.java:89)
>         at 
> org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.generateGroupedSplits(SplitGrouper.java:168)
>         at 
> org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.generateGroupedSplits(SplitGrouper.java:138)
>         at 
> org.apache.hadoop.hive.ql.exec.tez.HiveSplitGenerator.initialize(HiveSplitGenerator.java:159)
>         at 
> org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:273)
>         at 
> org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:266)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to