[jira] [Commented] (DRILL-5564) IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
[ https://issues.apache.org/jira/browse/DRILL-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16188470#comment-16188470 ]

ASF GitHub Bot commented on DRILL-5564:
---

Github user asfgit closed the pull request at:

    https://github.com/apache/drill/pull/967

> IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
> ---
>
>                 Key: DRILL-5564
>                 URL: https://issues.apache.org/jira/browse/DRILL-5564
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Flow
>    Affects Versions: 1.11.0
>         Environment: 3 node CentOS cluster
>            Reporter: Khurram Faraaz
>            Assignee: Roman Kulyk
>              Labels: ready-to-commit
>             Fix For: 1.12.0
>
>
> Run a concurrent Java program that executes TPCDS query11
> while the above concurrent java program is under execution
> stop foreman Drillbit (from another shell, using below command)
> ./bin/drillbit.sh stop
> and you will see the IllegalStateException: allocator[op:21:1:5:HashJoinPOP]:
> and another assertion error, in the drillbit.log
> AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
> Drill 1.11.0 git commit ID: d11aba2 (with assertions enabled)
>
> details from drillbit.log from the foreman Drillbit node.
> {noformat}
> 2017-06-05 18:38:33,838 [26ca5afa-7f6d-991b-1fdf-6196faddc229:frag:23:1] INFO o.a.d.e.w.fragment.FragmentExecutor - 26ca5afa-7f6d-991b-1fdf-6196faddc229:23:1: State change requested RUNNING --> FAILED
> 2017-06-05 18:38:33,849 [26ca5afa-7f6d-991b-1fdf-6196faddc229:frag:23:1] INFO o.a.d.e.w.fragment.FragmentExecutor - 26ca5afa-7f6d-991b-1fdf-6196faddc229:23:1: State change requested FAILED --> FINISHED
> 2017-06-05 18:38:33,852 [26ca5afa-7f6d-991b-1fdf-6196faddc229:frag:23:1] ERROR o.a.d.e.w.fragment.FragmentExecutor - SYSTEM ERROR: AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
> Fragment 23:1
> [Error Id: a116b326-43ed-4569-a20e-a10ba03d215e on centos-01.qa.lab:31010]
> org.apache.drill.common.exceptions.UserException: SYSTEM ERROR: AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
> Fragment 23:1
> [Error Id: a116b326-43ed-4569-a20e-a10ba03d215e on centos-01.qa.lab:31010]
>     at org.apache.drill.common.exceptions.UserException$Builder.build(UserException.java:544) ~[drill-common-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:295) [drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.work.fragment.FragmentExecutor.cleanup(FragmentExecutor.java:160) [drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:264) [drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_91]
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_91]
>     at java.lang.Thread.run(Thread.java:745) [na:1.8.0_91]
> Caused by: java.lang.RuntimeException: java.lang.AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
>     at org.apache.drill.common.DeferredException.addThrowable(DeferredException.java:101) ~[drill-common-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.work.fragment.FragmentExecutor.fail(FragmentExecutor.java:409) [drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:250) [drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     ... 4 common frames omitted
> Caused by: java.lang.AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
>     at org.apache.drill.exec.ops.OperatorStats.stopProcessing(OperatorStats.java:167) ~[drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.physical.impl.ScanBatch.next(ScanBatch.java:255)
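The exception message in the issue title states an accounting identity that failed to hold: buffer space plus preallocated space plus child space should equal the allocated total. As a rough illustration of the size of the mismatch, the sketch below redoes that arithmetic with the four numbers from the message. The class and method names are hypothetical and are not part of Drill's allocator code.

```java
// Hypothetical helper, not Drill's allocator: it only repeats the arithmetic
// from the IllegalStateException message quoted in the issue title.
final class AllocatorAccounting {

    /** Bytes reported as allocated that are not covered by buffer, prealloc, or child space. */
    static long unaccounted(long buffer, long prealloc, long child, long allocated) {
        return allocated - (buffer + prealloc + child);
    }

    public static void main(String[] args) {
        // Values copied from the exception message.
        long gap = unaccounted(16_674_816L, 0L, 0L, 16_740_352L);
        System.out.println(gap + " bytes unaccounted (" + (gap / 1024) + " KiB)");
        // prints "65536 bytes unaccounted (64 KiB)"
    }
}
```

The gap is exactly 64 KiB. The thread does not say which buffer that corresponds to, so the sketch makes no claim about its origin.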
[jira] [Commented] (DRILL-5564) IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
[ https://issues.apache.org/jira/browse/DRILL-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16186195#comment-16186195 ]

ASF GitHub Bot commented on DRILL-5564:
---

Github user priteshm commented on the issue:

    https://github.com/apache/drill/pull/967

    @parthchandra can you review this?
[jira] [Commented] (DRILL-5564) IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
[ https://issues.apache.org/jira/browse/DRILL-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16186143#comment-16186143 ]

ASF GitHub Bot commented on DRILL-5564:
---

GitHub user KulykRoman opened a pull request:

    https://github.com/apache/drill/pull/967

    DRILL-5564: Added finally block for stopWait() to avoid all situations where Drill able to miss stopWait() in case of exceptions (it can lead to assertions).

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/KulykRoman/drill DRILL-5564

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/drill/pull/967.patch

To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message:

    This closes #967

commit c0e567cefe7ad9bf9745ed41fb87abba60dbf1b9
Author: Roman Kulyk
Date:   2017-09-29T17:26:39Z

    DRILL-5564: Added finally block for stopWait() to avoid all situations where Drill able to miss stopWait() in case of exceptions (it can lead to assertions).
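The pull request describes its fix as wrapping stopWait() in a finally block so that an exception during a wait cannot leave the operator in the waiting state. The following is a minimal sketch of that pattern, not the actual patch: the Stats class is a simplified stand-in for an operator's timing state, and the assumption that startWait() pauses processing and stopWait() resumes it is made here only for illustration.

```java
// Simplified model, NOT Drill's OperatorStats. All names and state
// transitions below are assumptions made for this illustration.
final class WaitStateSketch {

    static final class Stats {
        boolean processing;
        boolean waiting;

        void startProcessing() { processing = true; }

        void stopProcessing() {
            if (!processing) {
                throw new IllegalStateException(
                    "stopProcessing with processing:" + processing + ", waiting:" + waiting);
            }
            processing = false;
        }

        void startWait() { processing = false; waiting = true; }  // assumed: waiting pauses processing
        void stopWait()  { waiting = false; processing = true; }  // assumed: processing resumes
    }

    /** Without finally: if the blocking call throws, stopWait() is skipped. */
    static void waitUnsafe(Stats stats, Runnable blockingCall) {
        stats.startWait();
        blockingCall.run();
        stats.stopWait();
    }

    /** With finally: stopWait() runs on normal return and on exception. */
    static void waitSafe(Stats stats, Runnable blockingCall) {
        stats.startWait();
        try {
            blockingCall.run();
        } finally {
            stats.stopWait();
        }
    }

    /** Simulates a wait that fails, then reports whether stopProcessing() still succeeds. */
    static boolean stopsCleanly(boolean useFinally) {
        Stats stats = new Stats();
        stats.startProcessing();
        Runnable failing = () -> { throw new RuntimeException("simulated failure during wait"); };
        try {
            if (useFinally) {
                waitSafe(stats, failing);
            } else {
                waitUnsafe(stats, failing);
            }
        } catch (RuntimeException expected) {
            // The caller still closes out its statistics after the failure.
        }
        try {
            stats.stopProcessing();
            return true;
        } catch (IllegalStateException e) {
            return false;  // state was left as processing:false, waiting:true
        }
    }

    public static void main(String[] args) {
        System.out.println("without finally: stops cleanly = " + stopsCleanly(false));
        System.out.println("with finally:    stops cleanly = " + stopsCleanly(true));
    }
}
```

In this model the unguarded variant leaves the state at processing:false, waiting:true, which is the combination reported in the AssertionError above, while the finally variant restores it before the caller stops processing.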
[jira] [Commented] (DRILL-5564) IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
[ https://issues.apache.org/jira/browse/DRILL-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16183001#comment-16183001 ]

Khurram Faraaz commented on DRILL-5564:
---

Adding this information here, the same issue is also reproducible on an older build Drill 1.10.0 commit 38ef562b1ced59efe57b0dc606f2c36694569102, on same test environment.
[jira] [Commented] (DRILL-5564) IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
[ https://issues.apache.org/jira/browse/DRILL-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16180762#comment-16180762 ]

Roman Kulyk commented on DRILL-5564:

[~khfaraaz], thank you for the comment! It seems I have reproduced it on my local 2-node cluster:

{code:xml}
Fragment 36:1

[Error Id: 42bb75c2-fc75-4f53-9187-77b15d2554c7 on node4:31010]
org.apache.drill.common.exceptions.UserException: SYSTEM ERROR: AssertionError: Failure while stopping processing for operator id 2. Currently have states of processing:false, setup:false, waiting:true.

Fragment 36:1

[Error Id: 42bb75c2-fc75-4f53-9187-77b15d2554c7 on node4:31010]
    at org.apache.drill.common.exceptions.UserException$Builder.build(UserException.java:550) ~[drill-common-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:298) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.cleanup(FragmentExecutor.java:160) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:264) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_111]
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_111]
    at java.lang.Thread.run(Thread.java:745) [na:1.8.0_111]
Caused by: java.lang.RuntimeException: java.lang.AssertionError: Failure while stopping processing for operator id 2. Currently have states of processing:false, setup:false, waiting:true.
    at org.apache.drill.common.DeferredException.addThrowable(DeferredException.java:101) ~[drill-common-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.fail(FragmentExecutor.java:410) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:250) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    ... 4 common frames omitted
{code}

As far as I can see, the reproduction is unstable (sometimes I can reproduce it and sometimes not). Investigating the root cause.
[jira] [Commented] (DRILL-5564) IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
[ https://issues.apache.org/jira/browse/DRILL-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16179532#comment-16179532 ]

Khurram Faraaz commented on DRILL-5564:
---

[~RomanKulyk] The AssertionError is reproducible on Drill 1.12.0 commit id : aaff1b35b7339fb4e6ab480dd517994ff9f0a5c5

{noformat}
1. schema used = dfs.tpcds_sf1_parquet_views
2. number of Threads used were
   for (int i = 1; i <= 8; i++) {
       executor.submit(new ConcurrentQuery(conn));
   }
3. Drillbit stopped was the foreman drillbit
4. Assertions were enabled in drill-env.sh
5. TPC-DS query 11 was executed from the concurrent Java program
6. No other additional system options were set during or before the test was executed, except the below option
   drill.exec.hashagg.fallback.enabled that needs to be set to true
7. TPC-DS SF1 views over parquet data were used in test.
8. Expected result is the query should fail once the foreman drillbit is stopped, bin/drillbit.sh stop
9. Yes, we only need to fix the AssertionError (below)

2017-09-25 17:55:54,440 [2636bcf9-ebd6-3dde-ba9c-e008ebcffb80:frag:21:1] ERROR o.a.d.e.w.fragment.FragmentExecutor - SYSTEM ERROR: AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.

Fragment 21:1

[Error Id: 2d36c6a2-4ccc-4081-adf0-bc7592e0ed6f on centos-01.qa.lab:31010]
org.apache.drill.common.exceptions.UserException: SYSTEM ERROR: AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.

Fragment 21:1

[Error Id: 2d36c6a2-4ccc-4081-adf0-bc7592e0ed6f on centos-01.qa.lab:31010]
    at org.apache.drill.common.exceptions.UserException$Builder.build(UserException.java:550) ~[drill-common-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:298) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.cleanup(FragmentExecutor.java:160) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:264) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_91]
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_91]
    at java.lang.Thread.run(Thread.java:745) [na:1.8.0_91]
Caused by: java.lang.RuntimeException: java.lang.AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
    at org.apache.drill.common.DeferredException.addThrowable(DeferredException.java:101) ~[drill-common-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.fail(FragmentExecutor.java:409) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:250) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    ... 4 common frames omitted
Caused by: java.lang.AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
    at org.apache.drill.exec.ops.OperatorStats.stopProcessing(OperatorStats.java:167) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.physical.impl.ScanBatch.next(ScanBatch.java:220) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:225) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:119) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:109) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.record.AbstractSingleRecordBatch.innerNext(AbstractSingleRecordBatch.java:51) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.innerNext(ProjectRecordBatch.java:141) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:164) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at
[jira] [Commented] (DRILL-5564) IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
[ https://issues.apache.org/jira/browse/DRILL-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176699#comment-16176699 ]

Roman Kulyk commented on DRILL-5564:

[~khfaraaz], I still cannot reproduce this issue. I use a 2-node cluster with 8G RAM and 2 cores on each node. I tried to reproduce it using your steps: run the concurrent program (with the same number of threads) on TPC-DS query11 and stop the foreman drillbit. I tried stopping the foreman at different times, using different numbers of concurrent threads, and running the query on tables of different sizes, but I never got SYSTEM ERROR: AssertionError. I only get the expected errors associated with a connection failure:

* CONNECTION ERROR: Connection /192.168.122.103:58259 <--> node3/192.168.122.103:31010 (user client) closed unexpectedly. Drillbit down?
* ERROR o.apache.drill.exec.rpc.BasicClient - Failed to establish connection
* ERROR o.a.drill.exec.ops.FragmentContext - Received an unexpected interrupt while waiting for the data send to complete.
* ERROR o.a.d.e.w.fragment.FragmentExecutor - SYSTEM ERROR: InterruptedException
* ERROR o.a.d.e.w.fragment.FragmentExecutor - SYSTEM ERROR: ConnectException: Connection refused: node3/192.168.122.103:31012
* ERROR o.a.d.e.w.fragment.FragmentExecutor - SYSTEM ERROR: IOException: Broken pipe

So could you provide some details of the reproduction? It would be great to know the size of the tables, the time at which you stopped the foreman, and on which drillbit you got this error (foreman or not). Did you also set any additional system options?

Also, I want to know what the expected result should be. Since in this case the query does not hang and has no chance to finish (the query fails when the drillbit stops), I need to fix the assertion error (while we should still get some connection errors), am I right?
[jira] [Commented] (DRILL-5564) IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
[ https://issues.apache.org/jira/browse/DRILL-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16173886#comment-16173886 ]

Roman Kulyk commented on DRILL-5564:

[~khfaraaz], I can't reproduce these errors. Could you please provide more information about your environment and dataset size? Also, as far as I can see, there should be no hangs; in this case we only get errors, am I right? And what should the expected result be: should the query finish correctly without one drillbit, or should it fail without the assertion error?
[jira] [Commented] (DRILL-5564) IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
[ https://issues.apache.org/jira/browse/DRILL-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16172474#comment-16172474 ]

Khurram Faraaz commented on DRILL-5564:
---

The same issue is reproducible. It didn't show up in the first run, but it does show up in later runs, hence I am reopening this issue.

The test setup was:
{noformat}
1. three node cluster Drill 1.12.0 commit : aaff1b3
2. start drillbits
3. run concurrent java program that runs TP-CDS query11
4. stop the foreman drillbit
$DRILL_HOME/bin/drillbit.sh stop
5. you should see the same AssertionError
{noformat}

Stack trace from drillbit.log
{noformat}
2017-09-19 20:51:21,902 [263e7cd2-56e8-dd4c-d6c8-64de0a4626b0:frag:21:1] DEBUG o.a.d.exec.ops.OperatorContextImpl - Closing context for org.apache.drill.exec.physical.config.SingleSender
2017-09-19 20:51:21,902 [263e7cd2-56e8-dd4c-d6c8-64de0a4626b0:frag:21:1] DEBUG o.a.drill.exec.memory.BaseAllocator - closed allocator[op:21:1:0:SingleSender].
2017-09-19 20:51:21,902 [263e7cd2-56e8-dd4c-d6c8-64de0a4626b0:frag:21:1] DEBUG o.a.drill.exec.memory.BaseAllocator - closed allocator[frag:21:1].
2017-09-19 20:51:21,902 [263e7cd2-56e8-dd4c-d6c8-64de0a4626b0:frag:21:1] INFO o.a.d.e.w.fragment.FragmentExecutor - 263e7cd2-56e8-dd4c-d6c8-64de0a4626b0:21:1: State change requested FAILED --> FINISHED
2017-09-19 20:51:21,904 [263e7cd2-56e8-dd4c-d6c8-64de0a4626b0:frag:21:1] ERROR o.a.d.e.w.fragment.FragmentExecutor - SYSTEM ERROR: AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
Fragment 21:1
[Error Id: f14abd2f-d8c3-466c-a2c1-fdf622b7cee6 on centos-01.qa.lab:31010]
org.apache.drill.common.exceptions.UserException: SYSTEM ERROR: AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
Fragment 21:1
[Error Id: f14abd2f-d8c3-466c-a2c1-fdf622b7cee6 on centos-01.qa.lab:31010]
    at org.apache.drill.common.exceptions.UserException$Builder.build(UserException.java:550) ~[drill-common-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:298) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.cleanup(FragmentExecutor.java:160) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:264) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_91]
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_91]
    at java.lang.Thread.run(Thread.java:745) [na:1.8.0_91]
Caused by: java.lang.RuntimeException: java.lang.AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
    at org.apache.drill.common.DeferredException.addThrowable(DeferredException.java:101) ~[drill-common-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.fail(FragmentExecutor.java:409) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:250) [drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    ... 4 common frames omitted
Caused by: java.lang.AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
    at org.apache.drill.exec.ops.OperatorStats.stopProcessing(OperatorStats.java:167) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.physical.impl.ScanBatch.next(ScanBatch.java:220) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:225) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:119) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:109) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.record.AbstractSingleRecordBatch.innerNext(AbstractSingleRecordBatch.java:51) ~[drill-java-exec-1.12.0-SNAPSHOT.jar:1.12.0-SNAPSHOT]
    at org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.innerNext(ProjectRecordBatch.java:141)
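
The assertion in the trace above reports three flags (processing, setup, waiting) at the moment stopProcessing was called. The class below is a simplified model of that kind of state check, written only to show how the reported message can arise when "stop processing" is called while no processing interval is open. It is not Drill's OperatorStats source: the class name and method bodies are hypothetical, and it throws AssertionError explicitly instead of using the `assert` keyword so that it behaves the same with or without `-ea`. The thread does not establish which call sequence actually occurs inside ScanBatch.next.

```java
// Hypothetical, simplified model of per-operator timing state.
final class OperatorTimerSketch {
    private final int operatorId;
    private boolean inProcessing;
    private boolean inSetup;
    private boolean inWait;

    OperatorTimerSketch(int operatorId) {
        this.operatorId = operatorId;
    }

    void startProcessing() {
        check(!inProcessing, "starting processing");
        inProcessing = true;
    }

    void stopProcessing() {
        // Stopping is only legal while a processing interval is open.
        check(inProcessing, "stopping processing");
        inProcessing = false;
    }

    void startWait() {
        check(!inWait, "starting waiting");
        inWait = true;
    }

    void stopWait() {
        check(inWait, "stopping waiting");
        inWait = false;
    }

    // Fails with a message that includes all three flags, in the same
    // shape as the message seen in drillbit.log.
    private void check(boolean ok, String action) {
        if (!ok) {
            throw new AssertionError(String.format(
                    "Failure while %s for operator id %d. Currently have states of "
                            + "processing:%s, setup:%s, waiting:%s.",
                    action, operatorId, inProcessing, inSetup, inWait));
        }
    }
}
```

In this model, calling startWait() and then stopProcessing() on an instance created with operator id 10 fails with processing:false, setup:false, waiting:true, which matches the flags in the log: a wait was open and no processing interval was.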
[jira] [Commented] (DRILL-5564) IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
[ https://issues.apache.org/jira/browse/DRILL-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16166528#comment-16166528 ]

Khurram Faraaz commented on DRILL-5564:
---

Verified by executing same concurrent test on Drill 1.12.0 commit aaff1b3
The Exception is not seen.

> IllegalStateException: allocator[op:21:1:5:HashJoinPOP]: buffer space (16674816) + prealloc space (0) + child space (0) != allocated (16740352)
> ---
>
> Key: DRILL-5564
> URL: https://issues.apache.org/jira/browse/DRILL-5564
> Project: Apache Drill
> Issue Type: Bug
> Components: Execution - Flow
> Affects Versions: 1.11.0
> Environment: 3 node CentOS cluster
> Reporter: Khurram Faraaz
>
> Run a concurrent Java program that executes TPCDS query11
> while the above concurrent java program is under execution
> stop foreman Drillbit (from another shell, using below command)
> ./bin/drillbit.sh stop
> and you will see the IllegalStateException: allocator[op:21:1:5:HashJoinPOP]:
> and another assertion error, in the drillbit.log
> AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
> Drill 1.11.0 git commit ID: d11aba2 (with assertions enabled)
>
> details from drillbit.log from the foreman Drillbit node.
> {noformat}
> 2017-06-05 18:38:33,838 [26ca5afa-7f6d-991b-1fdf-6196faddc229:frag:23:1] INFO o.a.d.e.w.fragment.FragmentExecutor - 26ca5afa-7f6d-991b-1fdf-6196faddc229:23:1: State change requested RUNNING --> FAILED
> 2017-06-05 18:38:33,849 [26ca5afa-7f6d-991b-1fdf-6196faddc229:frag:23:1] INFO o.a.d.e.w.fragment.FragmentExecutor - 26ca5afa-7f6d-991b-1fdf-6196faddc229:23:1: State change requested FAILED --> FINISHED
> 2017-06-05 18:38:33,852 [26ca5afa-7f6d-991b-1fdf-6196faddc229:frag:23:1] ERROR o.a.d.e.w.fragment.FragmentExecutor - SYSTEM ERROR: AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
> Fragment 23:1
> [Error Id: a116b326-43ed-4569-a20e-a10ba03d215e on centos-01.qa.lab:31010]
> org.apache.drill.common.exceptions.UserException: SYSTEM ERROR: AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
> Fragment 23:1
> [Error Id: a116b326-43ed-4569-a20e-a10ba03d215e on centos-01.qa.lab:31010]
>     at org.apache.drill.common.exceptions.UserException$Builder.build(UserException.java:544) ~[drill-common-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:295) [drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.work.fragment.FragmentExecutor.cleanup(FragmentExecutor.java:160) [drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:264) [drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_91]
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_91]
>     at java.lang.Thread.run(Thread.java:745) [na:1.8.0_91]
> Caused by: java.lang.RuntimeException: java.lang.AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
>     at org.apache.drill.common.DeferredException.addThrowable(DeferredException.java:101) ~[drill-common-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.work.fragment.FragmentExecutor.fail(FragmentExecutor.java:409) [drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:250) [drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     ... 4 common frames omitted
> Caused by: java.lang.AssertionError: Failure while stopping processing for operator id 10. Currently have states of processing:false, setup:false, waiting:true.
>     at org.apache.drill.exec.ops.OperatorStats.stopProcessing(OperatorStats.java:167) ~[drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at org.apache.drill.exec.physical.impl.ScanBatch.next(ScanBatch.java:255) ~[drill-java-exec-1.11.0-SNAPSHOT.jar:1.11.0-SNAPSHOT]
>     at