Austin Haas created BEAM-3481: --------------------------------- Summary: Query with subquery and aggregates cannot be implemented. Key: BEAM-3481 URL: https://issues.apache.org/jira/browse/BEAM-3481 Project: Beam Issue Type: Bug Components: dsl-sql Affects Versions: 2.2.0 Reporter: Austin Haas Assignee: Xu Mingmin
This query results in the error below: {noformat} "SELECT (COUNT(`p`)) FROM (SELECT `p` FROM `contains` GROUP BY `p`) AS `t1`"{noformat} This works correctly: {noformat} "SELECT (COUNT(`p`)) FROM (SELECT `p`, CURRENT_TIME FROM `contains` GROUP BY `p`) AS `t1`"{noformat} Error: {noformat} [nREPL-worker-5] INFO org.apache.beam.sdk.extensions.sql.impl.planner.BeamQueryPlanner - SQL: SELECT COUNT(`t1`.`p`) FROM (SELECT `contains`.`p` FROM `contains` AS `contains` GROUP BY `contains`.`p`) AS `t1` [nREPL-worker-5] INFO org.apache.beam.sdk.extensions.sql.impl.planner.BeamQueryPlanner - SQLPlan> LogicalAggregate(group=[{}], EXPR$0=[COUNT()]) LogicalAggregate(group=[{0}]) LogicalProject(p=[$0]) LogicalTableScan(table=[[contains]]) CannotPlanException Node [rel#157:Subset#3.BEAM_LOGICAL.[]] could not be implemented; planner state: Root: rel#157:Subset#3.BEAM_LOGICAL.[] Original rel: LogicalAggregate(subset=[rel#157:Subset#3.BEAM_LOGICAL.[]], group=[{}], EXPR$0=[COUNT()]): rowcount = 1.0, cumulative cost = {1.125 rows, 0.0 cpu, 0.0 io}, id = 155 LogicalAggregate(subset=[rel#154:Subset#2.NONE.[]], group=[{0}]): rowcount = 10.0, cumulative cost = {10.0 rows, 0.0 cpu, 0.0 io}, id = 153 LogicalProject(subset=[rel#152:Subset#1.NONE.[]], p=[$0]): rowcount = 100.0, cumulative cost = {100.0 rows, 100.0 cpu, 0.0 io}, id = 151 LogicalTableScan(subset=[rel#150:Subset#0.NONE.[]], table=[[contains]]): rowcount = 100.0, cumulative cost = {100.0 rows, 101.0 cpu, 0.0 io}, id = 146 Sets: Set#0, type: RecordType(VARCHAR p, VARCHAR s, BIGINT c) rel#150:Subset#0.NONE.[], best=null, importance=0.6561 rel#146:LogicalTableScan.NONE.[](table=[contains]), rowcount=100.0, cumulative cost={inf} rel#162:Subset#0.BEAM_LOGICAL.[], best=rel#164, importance=0.32805 rel#164:BeamIOSourceRel.BEAM_LOGICAL.[](table=[contains]), rowcount=100.0, cumulative cost={100.0 rows, 101.0 cpu, 0.0 io} Set#1, type: RecordType(VARCHAR p) rel#152:Subset#1.NONE.[], best=null, importance=0.7290000000000001 rel#151:LogicalProject.NONE.[](input=rel#150:Subset#0.NONE.[],p=$0), rowcount=100.0, cumulative cost={inf} rel#159:Subset#1.BEAM_LOGICAL.[], best=rel#163, importance=0.36450000000000005 rel#163:BeamProjectRel.BEAM_LOGICAL.[](input=rel#162:Subset#0.BEAM_LOGICAL.[],p=$0), rowcount=100.0, cumulative cost={200.0 rows, 201.0 cpu, 0.0 io} Set#2, type: RecordType(VARCHAR p) rel#154:Subset#2.NONE.[], best=null, importance=0.81 rel#153:LogicalAggregate.NONE.[](input=rel#152:Subset#1.NONE.[],group={0}), rowcount=10.0, cumulative cost={inf} rel#161:Subset#2.BEAM_LOGICAL.[], best=rel#160, importance=0.405 rel#160:BeamAggregationRel.BEAM_LOGICAL.[](group={0},window=org.apache.beam.sdk.transforms.windowing.GlobalWindows,trigger=Repeatedly.forever(AfterWatermark.pastEndOfWindow())), rowcount=10.0, cumulative cost={210.0 rows, 201.0 cpu, 0.0 io} Set#3, type: RecordType(BIGINT EXPR$0) rel#156:Subset#3.NONE.[], best=null, importance=0.9 rel#155:LogicalAggregate.NONE.[](input=rel#154:Subset#2.NONE.[],group={},EXPR$0=COUNT()), rowcount=1.0, cumulative cost={inf} rel#157:Subset#3.BEAM_LOGICAL.[], best=null, importance=1.0 rel#158:AbstractConverter.BEAM_LOGICAL.[](input=rel#156:Subset#3.NONE.[],convention=BEAM_LOGICAL,sort=[]), rowcount=1.0, cumulative cost={inf} org.apache.beam.sdks.java.extensions.sql.repackaged.org.apache.calcite.plan.volcano.RelSubset$CheapestPlanReplacer.visit (RelSubset.java:441) {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)