[jira] [Assigned] (IMPALA-5746) Remote fragments continue to hold onto memory after stopping the coordinator daemon
[ https://issues.apache.org/jira/browse/IMPALA-5746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sahil Takiar reassigned IMPALA-5746: Assignee: Wenzhe Zhou (was: Sahil Takiar) > Remote fragments continue to hold onto memory after stopping the coordinator > daemon > --- > > Key: IMPALA-5746 > URL: https://issues.apache.org/jira/browse/IMPALA-5746 > Project: IMPALA > Issue Type: Bug > Components: Distributed Exec >Affects Versions: Impala 2.10.0 >Reporter: Mostafa Mokhtar >Assignee: Wenzhe Zhou >Priority: Critical > Attachments: remote_fragments_holding_memory.txt > > > Repro > # Start running queries > # Kill the coordinator node > # On the running Impalad check the memz tab, remote fragments continue to run > and hold on to resources > Remote fragments held on to memory +30 minutes after stopping the coordinator > service. > Attached thread dump from an Impalad running remote fragments . > Snapshot of memz tab 30 minutes after killing the coordinator > {code} > Process: Limit=201.73 GB Total=5.32 GB Peak=179.36 GB > Free Disk IO Buffers: Total=1.87 GB Peak=1.87 GB > RequestPool=root.default: Total=1.35 GB Peak=178.51 GB > Query(f64169d4bb3c901c:3a21d8ae): Total=2.64 MB Peak=104.73 MB > Fragment f64169d4bb3c901c:3a21d8ae0051: Total=2.64 MB Peak=2.67 MB > AGGREGATION_NODE (id=15): Total=2.54 MB Peak=2.57 MB > Exprs: Total=30.12 KB Peak=30.12 KB > EXCHANGE_NODE (id=14): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=12.29 KB > DataStreamSender (dst_id=17): Total=85.31 KB Peak=85.31 KB > CodeGen: Total=1.53 KB Peak=374.50 KB > Block Manager: Limit=161.39 GB Total=512.00 KB Peak=1.54 MB > Query(2a4f12b3b4b1dc8c:db7e8cf2): Total=258.29 MB Peak=412.98 MB > Fragment 2a4f12b3b4b1dc8c:db7e8cf2008c: Total=2.29 MB Peak=2.29 MB > SORT_NODE (id=11): Total=4.00 KB Peak=4.00 KB > AGGREGATION_NODE (id=20): Total=2.27 MB Peak=2.27 MB > Exprs: Total=25.12 KB Peak=25.12 KB > EXCHANGE_NODE (id=19): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=0 > DataStreamSender (dst_id=21): Total=3.88 KB Peak=3.88 KB > CodeGen: Total=4.17 KB Peak=1.05 MB > Block Manager: Limit=161.39 GB Total=256.25 MB Peak=321.66 MB > Query(68421d2a5dea0775:83f5d972): Total=282.77 MB Peak=443.53 MB > Fragment 68421d2a5dea0775:83f5d972004a: Total=26.77 MB Peak=26.92 MB > SORT_NODE (id=8): Total=8.00 KB Peak=8.00 KB > Exprs: Total=4.00 KB Peak=4.00 KB > ANALYTIC_EVAL_NODE (id=7): Total=4.00 KB Peak=4.00 KB > Exprs: Total=4.00 KB Peak=4.00 KB > SORT_NODE (id=6): Total=24.00 MB Peak=24.00 MB > AGGREGATION_NODE (id=12): Total=2.72 MB Peak=2.83 MB > Exprs: Total=85.12 KB Peak=85.12 KB > EXCHANGE_NODE (id=11): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=84.80 KB > DataStreamSender (dst_id=13): Total=1.27 KB Peak=1.27 KB > CodeGen: Total=24.80 KB Peak=4.13 MB > Block Manager: Limit=161.39 GB Total=280.50 MB Peak=286.52 MB > Query(e94c89fa89a74d27:82812bf9): Total=258.29 MB Peak=436.85 MB > Fragment e94c89fa89a74d27:82812bf9008e: Total=2.29 MB Peak=2.29 MB > SORT_NODE (id=11): Total=4.00 KB Peak=4.00 KB > AGGREGATION_NODE (id=20): Total=2.27 MB Peak=2.27 MB > Exprs: Total=25.12 KB Peak=25.12 KB > EXCHANGE_NODE (id=19): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=0 > DataStreamSender (dst_id=21): Total=3.88 KB Peak=3.88 KB > CodeGen: Total=4.17 KB Peak=1.05 MB > Block Manager: Limit=161.39 GB Total=256.25 MB Peak=321.62 MB > Query(4e43dad3bdc935d8:938b8b7e): Total=2.65 MB Peak=105.60 MB > Fragment 4e43dad3bdc935d8:938b8b7e0052: Total=2.65 MB Peak=2.68 MB > AGGREGATION_NODE (id=15): Total=2.55 MB Peak=2.57 MB > Exprs: Total=30.12 KB Peak=30.12 KB > EXCHANGE_NODE (id=14): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=13.68 KB > DataStreamSender (dst_id=17): Total=91.41 KB Peak=91.41 KB > CodeGen: Total=1.53 KB Peak=374.50 KB > Block Manager: Limit=161.39 GB Total=512.00 KB Peak=1.30 MB > Query(b34bdd65f1ed017e:5a0291bd): Total=2.37 MB Peak=106.56 MB > Fragment b34bdd65f1ed017e:5a0291bd004b: Total=2.37 MB Peak=2.37 MB > SORT_NODE (id=6): Total=4.00 KB Peak=4.00 KB > AGGREGATION_NODE (id=10): Total=2.35 MB Peak=2.35 MB > Exprs: Total=34.12 KB Peak=34.12 KB > EXCHANGE_NODE (id=9): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=4.23 KB > DataStreamSender (dst_id=11): Total=3.45 K
[jira] [Assigned] (IMPALA-5746) Remote fragments continue to hold onto memory after stopping the coordinator daemon
[ https://issues.apache.org/jira/browse/IMPALA-5746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sahil Takiar reassigned IMPALA-5746: Assignee: Sahil Takiar (was: Joe McDonnell) > Remote fragments continue to hold onto memory after stopping the coordinator > daemon > --- > > Key: IMPALA-5746 > URL: https://issues.apache.org/jira/browse/IMPALA-5746 > Project: IMPALA > Issue Type: Bug > Components: Distributed Exec >Affects Versions: Impala 2.10.0 >Reporter: Mostafa Mokhtar >Assignee: Sahil Takiar >Priority: Critical > Attachments: remote_fragments_holding_memory.txt > > > Repro > # Start running queries > # Kill the coordinator node > # On the running Impalad check the memz tab, remote fragments continue to run > and hold on to resources > Remote fragments held on to memory +30 minutes after stopping the coordinator > service. > Attached thread dump from an Impalad running remote fragments . > Snapshot of memz tab 30 minutes after killing the coordinator > {code} > Process: Limit=201.73 GB Total=5.32 GB Peak=179.36 GB > Free Disk IO Buffers: Total=1.87 GB Peak=1.87 GB > RequestPool=root.default: Total=1.35 GB Peak=178.51 GB > Query(f64169d4bb3c901c:3a21d8ae): Total=2.64 MB Peak=104.73 MB > Fragment f64169d4bb3c901c:3a21d8ae0051: Total=2.64 MB Peak=2.67 MB > AGGREGATION_NODE (id=15): Total=2.54 MB Peak=2.57 MB > Exprs: Total=30.12 KB Peak=30.12 KB > EXCHANGE_NODE (id=14): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=12.29 KB > DataStreamSender (dst_id=17): Total=85.31 KB Peak=85.31 KB > CodeGen: Total=1.53 KB Peak=374.50 KB > Block Manager: Limit=161.39 GB Total=512.00 KB Peak=1.54 MB > Query(2a4f12b3b4b1dc8c:db7e8cf2): Total=258.29 MB Peak=412.98 MB > Fragment 2a4f12b3b4b1dc8c:db7e8cf2008c: Total=2.29 MB Peak=2.29 MB > SORT_NODE (id=11): Total=4.00 KB Peak=4.00 KB > AGGREGATION_NODE (id=20): Total=2.27 MB Peak=2.27 MB > Exprs: Total=25.12 KB Peak=25.12 KB > EXCHANGE_NODE (id=19): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=0 > DataStreamSender (dst_id=21): Total=3.88 KB Peak=3.88 KB > CodeGen: Total=4.17 KB Peak=1.05 MB > Block Manager: Limit=161.39 GB Total=256.25 MB Peak=321.66 MB > Query(68421d2a5dea0775:83f5d972): Total=282.77 MB Peak=443.53 MB > Fragment 68421d2a5dea0775:83f5d972004a: Total=26.77 MB Peak=26.92 MB > SORT_NODE (id=8): Total=8.00 KB Peak=8.00 KB > Exprs: Total=4.00 KB Peak=4.00 KB > ANALYTIC_EVAL_NODE (id=7): Total=4.00 KB Peak=4.00 KB > Exprs: Total=4.00 KB Peak=4.00 KB > SORT_NODE (id=6): Total=24.00 MB Peak=24.00 MB > AGGREGATION_NODE (id=12): Total=2.72 MB Peak=2.83 MB > Exprs: Total=85.12 KB Peak=85.12 KB > EXCHANGE_NODE (id=11): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=84.80 KB > DataStreamSender (dst_id=13): Total=1.27 KB Peak=1.27 KB > CodeGen: Total=24.80 KB Peak=4.13 MB > Block Manager: Limit=161.39 GB Total=280.50 MB Peak=286.52 MB > Query(e94c89fa89a74d27:82812bf9): Total=258.29 MB Peak=436.85 MB > Fragment e94c89fa89a74d27:82812bf9008e: Total=2.29 MB Peak=2.29 MB > SORT_NODE (id=11): Total=4.00 KB Peak=4.00 KB > AGGREGATION_NODE (id=20): Total=2.27 MB Peak=2.27 MB > Exprs: Total=25.12 KB Peak=25.12 KB > EXCHANGE_NODE (id=19): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=0 > DataStreamSender (dst_id=21): Total=3.88 KB Peak=3.88 KB > CodeGen: Total=4.17 KB Peak=1.05 MB > Block Manager: Limit=161.39 GB Total=256.25 MB Peak=321.62 MB > Query(4e43dad3bdc935d8:938b8b7e): Total=2.65 MB Peak=105.60 MB > Fragment 4e43dad3bdc935d8:938b8b7e0052: Total=2.65 MB Peak=2.68 MB > AGGREGATION_NODE (id=15): Total=2.55 MB Peak=2.57 MB > Exprs: Total=30.12 KB Peak=30.12 KB > EXCHANGE_NODE (id=14): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=13.68 KB > DataStreamSender (dst_id=17): Total=91.41 KB Peak=91.41 KB > CodeGen: Total=1.53 KB Peak=374.50 KB > Block Manager: Limit=161.39 GB Total=512.00 KB Peak=1.30 MB > Query(b34bdd65f1ed017e:5a0291bd): Total=2.37 MB Peak=106.56 MB > Fragment b34bdd65f1ed017e:5a0291bd004b: Total=2.37 MB Peak=2.37 MB > SORT_NODE (id=6): Total=4.00 KB Peak=4.00 KB > AGGREGATION_NODE (id=10): Total=2.35 MB Peak=2.35 MB > Exprs: Total=34.12 KB Peak=34.12 KB > EXCHANGE_NODE (id=9): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=4.23 KB > DataStreamSender (dst_id=11): Total=3.4
[jira] [Assigned] (IMPALA-5746) Remote fragments continue to hold onto memory after stopping the coordinator daemon
[ https://issues.apache.org/jira/browse/IMPALA-5746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Ho reassigned IMPALA-5746: -- Assignee: Joe McDonnell (was: Michael Ho) > Remote fragments continue to hold onto memory after stopping the coordinator > daemon > --- > > Key: IMPALA-5746 > URL: https://issues.apache.org/jira/browse/IMPALA-5746 > Project: IMPALA > Issue Type: Bug > Components: Distributed Exec >Affects Versions: Impala 2.10.0 >Reporter: Mostafa Mokhtar >Assignee: Joe McDonnell >Priority: Critical > Attachments: remote_fragments_holding_memory.txt > > > Repro > # Start running queries > # Kill the coordinator node > # On the running Impalad check the memz tab, remote fragments continue to run > and hold on to resources > Remote fragments held on to memory +30 minutes after stopping the coordinator > service. > Attached thread dump from an Impalad running remote fragments . > Snapshot of memz tab 30 minutes after killing the coordinator > {code} > Process: Limit=201.73 GB Total=5.32 GB Peak=179.36 GB > Free Disk IO Buffers: Total=1.87 GB Peak=1.87 GB > RequestPool=root.default: Total=1.35 GB Peak=178.51 GB > Query(f64169d4bb3c901c:3a21d8ae): Total=2.64 MB Peak=104.73 MB > Fragment f64169d4bb3c901c:3a21d8ae0051: Total=2.64 MB Peak=2.67 MB > AGGREGATION_NODE (id=15): Total=2.54 MB Peak=2.57 MB > Exprs: Total=30.12 KB Peak=30.12 KB > EXCHANGE_NODE (id=14): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=12.29 KB > DataStreamSender (dst_id=17): Total=85.31 KB Peak=85.31 KB > CodeGen: Total=1.53 KB Peak=374.50 KB > Block Manager: Limit=161.39 GB Total=512.00 KB Peak=1.54 MB > Query(2a4f12b3b4b1dc8c:db7e8cf2): Total=258.29 MB Peak=412.98 MB > Fragment 2a4f12b3b4b1dc8c:db7e8cf2008c: Total=2.29 MB Peak=2.29 MB > SORT_NODE (id=11): Total=4.00 KB Peak=4.00 KB > AGGREGATION_NODE (id=20): Total=2.27 MB Peak=2.27 MB > Exprs: Total=25.12 KB Peak=25.12 KB > EXCHANGE_NODE (id=19): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=0 > DataStreamSender (dst_id=21): Total=3.88 KB Peak=3.88 KB > CodeGen: Total=4.17 KB Peak=1.05 MB > Block Manager: Limit=161.39 GB Total=256.25 MB Peak=321.66 MB > Query(68421d2a5dea0775:83f5d972): Total=282.77 MB Peak=443.53 MB > Fragment 68421d2a5dea0775:83f5d972004a: Total=26.77 MB Peak=26.92 MB > SORT_NODE (id=8): Total=8.00 KB Peak=8.00 KB > Exprs: Total=4.00 KB Peak=4.00 KB > ANALYTIC_EVAL_NODE (id=7): Total=4.00 KB Peak=4.00 KB > Exprs: Total=4.00 KB Peak=4.00 KB > SORT_NODE (id=6): Total=24.00 MB Peak=24.00 MB > AGGREGATION_NODE (id=12): Total=2.72 MB Peak=2.83 MB > Exprs: Total=85.12 KB Peak=85.12 KB > EXCHANGE_NODE (id=11): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=84.80 KB > DataStreamSender (dst_id=13): Total=1.27 KB Peak=1.27 KB > CodeGen: Total=24.80 KB Peak=4.13 MB > Block Manager: Limit=161.39 GB Total=280.50 MB Peak=286.52 MB > Query(e94c89fa89a74d27:82812bf9): Total=258.29 MB Peak=436.85 MB > Fragment e94c89fa89a74d27:82812bf9008e: Total=2.29 MB Peak=2.29 MB > SORT_NODE (id=11): Total=4.00 KB Peak=4.00 KB > AGGREGATION_NODE (id=20): Total=2.27 MB Peak=2.27 MB > Exprs: Total=25.12 KB Peak=25.12 KB > EXCHANGE_NODE (id=19): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=0 > DataStreamSender (dst_id=21): Total=3.88 KB Peak=3.88 KB > CodeGen: Total=4.17 KB Peak=1.05 MB > Block Manager: Limit=161.39 GB Total=256.25 MB Peak=321.62 MB > Query(4e43dad3bdc935d8:938b8b7e): Total=2.65 MB Peak=105.60 MB > Fragment 4e43dad3bdc935d8:938b8b7e0052: Total=2.65 MB Peak=2.68 MB > AGGREGATION_NODE (id=15): Total=2.55 MB Peak=2.57 MB > Exprs: Total=30.12 KB Peak=30.12 KB > EXCHANGE_NODE (id=14): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=13.68 KB > DataStreamSender (dst_id=17): Total=91.41 KB Peak=91.41 KB > CodeGen: Total=1.53 KB Peak=374.50 KB > Block Manager: Limit=161.39 GB Total=512.00 KB Peak=1.30 MB > Query(b34bdd65f1ed017e:5a0291bd): Total=2.37 MB Peak=106.56 MB > Fragment b34bdd65f1ed017e:5a0291bd004b: Total=2.37 MB Peak=2.37 MB > SORT_NODE (id=6): Total=4.00 KB Peak=4.00 KB > AGGREGATION_NODE (id=10): Total=2.35 MB Peak=2.35 MB > Exprs: Total=34.12 KB Peak=34.12 KB > EXCHANGE_NODE (id=9): Total=0 Peak=0 > DataStreamRecvr: Total=0 Peak=4.23 KB > DataStreamSender (dst_id=11): Total=3.45 KB