Re: Questions about Tez under the hood

Fabio Wed, 29 Oct 2014 02:35:09 -0700

Thanks Bikas for your answer and suggestion, actually my work deals morewith high level modeling/behavior/performance of Tez, but there isanother guy who is goign to handle Tez sources, I will suggest him tocontribute.I've just found many commented configuration parameters inorg.apache.tez.dag.api.TezConfiguration that I didn't even know, theywill help.


Right now I have another question that came to my mind while modeling Tez:

Situation: I have a DAG with 2 tasks waiting to run, the cluster isquite overloaded. The Tez AM will ask for 2 containers at the ResourceManager and wait for them. At some point a single container becomesavailable and a task can run and finish, so Tez (I guess) will exploitthat same container for reuse, but what about the other request sent tothe RM? Is it somehow actively voided by Tez or at some point it willjust get another container that wont be used (and possibly discardedafterward)? I don't even know if YARN have such a feature for removing apreviously submitted request to the RM.

I would keep this thread for future generic questions about Tez behaviorif it's ok.


Thanks so far :)

Fabio

On 10/27/2014 05:48 PM, Bikas Saha wrote:

Also, any contributions to the project via your thesis work would bewelcome. Please do first open a jira and provide a design overviewbefore submitting code.
*From:*Bikas Saha [mailto:[email protected]<mailto:[email protected]>]
*Sent:* Monday, October 27, 2014 9:47 AM
*To:* [email protected] <mailto:[email protected]>
*Subject:* RE: Questions about Tez under the hood

Answers inline.

*From:*Fabio C. [mailto:[email protected] <mailto:[email protected]>]
*Sent:* Monday, October 27, 2014 7:08 AM
*To:* [email protected] <mailto:[email protected]>
*Subject:* Questions about Tez under the hood
Hi guys, I'm currently working at my master degree thesis on Tez, andI am trying to understand how Tez works under the hood. I have somequestions, I hope someone can help with this:
1) How does Tez handle containers for reuse? Are they kept for someseconds (how long?) in a sort of buffer waiting for tasks which willneed them? Or a container is sent back to the RM if no task isimmediately ready to take it?
*/[Bikas] Yes they wait around for a buffer period of time. Idlecontainers are released back the RM randomly between a mix and a maxrelease time until a minimum held container threshold is met. So thebehavior can be customized using the min/max timeouts and the min heldthreshold./*
2) Let's say I have a DAG with two branches proceeding in parallelbefore joining in a root node (such as the example on the tez homepage http://tez.apache.org/images/PigHiveQueryOnTez.png ). In thiscase, we will have both branches running at the same time. At somepoint we may have the first branch that is almost complete, while thesecond is still at an early stage. In this case, does Tez knows that"soon or later" the two branches will merge, thus there will be acommon consumer waiting for the slower branch to complete? Actuallythe real question is: does Tez prioritize the scheduling/resourceallocation of tasks belonging to slower branches? If yes, what kind ofpolicy is adopted? Is it configurable?
*/[Bikas] Currently the priority of a vertex is the distance from thesource of the DAG. So vertices can run in parallel. On the roadmap areitems like critical path scheduling where the vertex that is holdingup the job the most or that’s going to unblock the most amount ofdownstream work to be given higher priority./*
3) tez.am.shuffle-vertex-manager.min-src-fraction: if I have a dagmade of two producer vertexes, each one running 10 tasks, and belowthem a consumer vertex, let's say running 5 tasks, so if this propertyis set to 0.2, does it mean that before running any consumer task weneed 2 producer tasks to complete for each of the producer vertexes?Or are they considered as a whole and we need just 4 tasks completed(even just from one vertex)?
*/[Bikas] It currently looks at the fraction of the whole (bothcombined) but we are going to change it to look at the fraction persource vertex. Again, this is just a hint. (With auto-parallelism on)the vertex also looks at whether enough data has been produced beforetriggering the tasks because the real intention is to have enough dataavailable for the reduce to pull so that it can overlap the pull withthe completion of the map tasks. /*
4) As far as I understand, a single Tez Application Master can handlemultiple DAGs at the same time, but only if the user-application hasbeen coded to do so (for example, if I run two wordcount with the sameuser, it simply creates two different Tez App Master). Is this correct?
*/[Bikas] If the TezClient is started in session mode then it re-usesthe App Master for multiple DAGs. The code is the same in session andnon-session mode. The behavior can be changed via configuration (orhard coded in the code if desired). So you can use both modes with thesame code. To be clear, the AppMaster does not run dags concurrently.It runs one DAG at a time./*
Thanks in advance

Fabio


CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual orentity to which it is addressed and may contain information that isconfidential, privileged and exempt from disclosure under applicablelaw. If the reader of this message is not the intended recipient, youare hereby notified that any printing, copying, dissemination,distribution, disclosure or forwarding of this communication isstrictly prohibited. If you have received this communication in error,please contact the sender immediately and delete it from your system.Thank You.

Re: Questions about Tez under the hood

Reply via email to