[ 
https://issues.apache.org/jira/browse/TAJO-540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13892877#comment-13892877
 ] 

Min Zhou commented on TAJO-540:
-------------------------------

I'd like to write more about the code implementation.

Currently, Tajo has two scheduling modes. In standalone mode, Tajo schedules 
queries itself; in YARN mode, query resources are allocated by a YARN 
cluster.

Regarding the standalone mode: when TajoMaster receives a query, it first 
uses TajoWorkerResourceManager to choose an idle QueryMaster for this query. 
The query is then sent to that QueryMaster. QueryMaster breaks the query 
into several execution blocks, and each execution block consists of several 
tasks. The TajoResourceAllocator residing in that QueryMaster sends an RPC 
call, TajoMasterProtocol.allocateWorkerResources(), to TajoMaster. TajoMaster 
then uses TajoWorkerResourceManager again to allocate workers for the 
execution block. YARN mode is quite similar.
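
To make the standalone flow easier to follow, here is a rough toy model of 
it in Python. It is only a sketch, not Tajo's actual Java code: the class 
names mirror the components mentioned above, but every method name, the 
Worker type, and the "fewest running tasks" selection policy are assumptions 
made for illustration. The RPC is modeled as a plain method call.

```python
from dataclasses import dataclass


@dataclass
class Worker:
    # Hypothetical worker record; real Tajo tracks far more state.
    name: str
    running_tasks: int = 0


class TajoWorkerResourceManager:
    """Toy stand-in that tracks workers and hands out the least-loaded ones."""

    def __init__(self, workers):
        self.workers = list(workers)

    def _least_loaded(self):
        # Assumption: "idle" is modeled as fewest running tasks.
        return min(self.workers, key=lambda w: w.running_tasks)

    def choose_query_master(self):
        return self._least_loaded()

    def allocate_workers(self, num_tasks):
        # One slot per task, each given to the currently least-loaded worker.
        allocated = []
        for _ in range(num_tasks):
            worker = self._least_loaded()
            worker.running_tasks += 1
            allocated.append(worker.name)
        return allocated


class QueryMaster:
    def __init__(self, host, master):
        self.host = host
        self.master = master

    def run(self, execution_blocks):
        # execution_blocks: list of task counts, one entry per execution block.
        plan = []
        for num_tasks in execution_blocks:
            # Stands in for the TajoResourceAllocator calling back to TajoMaster.
            plan.append(self.master.allocate_worker_resources(num_tasks))
        return self.host.name, plan


class TajoMaster:
    def __init__(self, resource_manager):
        self.resource_manager = resource_manager

    def submit_query(self, execution_blocks):
        # Step 1: pick a QueryMaster for the query.
        host = self.resource_manager.choose_query_master()
        # Step 2: hand the query to that QueryMaster.
        return QueryMaster(host, self).run(execution_blocks)

    def allocate_worker_resources(self, num_tasks):
        # Stands in for the TajoMasterProtocol.allocateWorkerResources() RPC.
        return self.resource_manager.allocate_workers(num_tasks)
```

Calling TajoMaster(...).submit_query([2, 1]) in this model returns the chosen 
QueryMaster host and, for each execution block, the workers assigned to its 
tasks. Note that every allocation goes back through TajoMaster, which is 
where a query scheduler could be placed.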

The change we want to make is for the standalone mode, right?

> (Umbrella) Implement Tajo Query Scheduler
> -----------------------------------------
>
>                 Key: TAJO-540
>                 URL: https://issues.apache.org/jira/browse/TAJO-540
>             Project: Tajo
>          Issue Type: New Feature
>            Reporter: Hyunsik Choi
>
> Currently, there is no Tajo query scheduler. So, all queries launched 
> simultaneously compete for cluster resources, which are managed by 
> TajoResourceManager.
> In this issue, we will investigate,  design, and implement a Tajo query 
> scheduler. This is an umbrella issue for that. We will create subtasks for 
> them.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
