[ 
https://issues.apache.org/jira/browse/DRILL-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jacques Nadeau updated DRILL-60:
--------------------------------
    Component/s: Execution - Operators

> multiple query support
> ----------------------
>
>                 Key: DRILL-60
>                 URL: https://issues.apache.org/jira/browse/DRILL-60
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Execution - Operators
>            Reporter: Lisen Mu
>             Fix For: Future
>
>         Attachments: DRILL-60.0.patch.txt
>
>
> parse/optimize several SQL(or other QL)s into one logical plan and execute 
> them once.
>  
> * use case
> If a report generating system is used as a frontend system of drill instead 
> of a command line, There would be a set of similar SQL to execute to generate 
> a report(consider each day's DAU statistics with geoip distribution(groupby) 
> for 7 continuous days). If processed individually, plans for SQLs in this 
> kind of set are similar(scan pv log in these days) or same(scan GeoIP table) 
> in majority. If processed together, duplicated scan(GeoIP) could be 
> eliminated to reduce I/O, or similar scan(pv) could be merged to achieve 
> better throughput, and provides chances of other intermediate result reuse.
> * possibility
> Query in a QL is represented as a tree. Drill models logical plan and 
> physical plan as DAG, which is more expressive than tree. More precisely, a 
> set of trees(with common nodes or common edges) can be union-ed into a dag. 
> So, the current drill logical/physical plan model supports multiple query 
> already. Each sql would have a corresponding StoreOP as its result.
> * problems
> To do so, physical operators must support multiple iteration over its output. 
> That would have side effect on mem management.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to