Re: [gradle-dev] Task Optimization

Hans Dockter Fri, 26 Jun 2009 14:18:14 -0700

4) I would like to be able to specify that a chain of dependent tasks only execute a task if Task.didWork is true for all of its dependents. Note that this is not always desired, so you need to be able to turn this on and off. I'm not sure of the best way to configure this. If we use the onlyIf method suggested above, it might take another closure to check this that would be returned from a "needed" method. This would look like:
 myTask.onlyIf(needed())
This probably should be the default for tests, but perhaps not for all Tasks.
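
A minimal sketch of how such a "needed" predicate could behave, written in Python only to model the idea. The Task class, only_if and needed below are hypothetical stand-ins, not Gradle's API, and the check follows the wording above (every dependency must have done work):

```python
class Task:
    """Hypothetical stand-in for a build task; not Gradle's Task class."""

    def __init__(self, name, action, depends_on=()):
        self.name = name
        self.action = action              # callable returning True if it did work
        self.depends_on = list(depends_on)
        self.did_work = False
        self.predicates = []

    def only_if(self, predicate):
        # Register a predicate; the task runs only if all predicates pass.
        self.predicates.append(predicate)
        return self

    def execute(self):
        # Run dependencies first, then this task if every predicate allows it.
        for dep in self.depends_on:
            dep.execute()
        if all(p(self) for p in self.predicates):
            self.did_work = bool(self.action())
        else:
            self.did_work = False
        return self.did_work


def needed():
    """Return a predicate that passes when every dependency did work."""
    return lambda task: all(d.did_work for d in task.depends_on)
```

Replacing all with any in needed() gives the variant where work done by any one dependency is enough to trigger the task.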
I'm not sure about this approach.
The tests should run if either the test classes or the classes under test have changed since last time we successfully ran the tests. Arguably a change to the test runtime classpath should also cause the tests to run. In other words, the tests should be run only if the input artifacts have changed since last time we ran the tests. Checking whether all the dependencies of the test task have executed or not is only an approximation of this, and not a general solution. For example, if I assemble my classes under test using, say, 2 independent Compile tasks, then the test task should run if either task has done something. Or, I may assemble my classes using some other build tool, so that there's no task which we can use to check whether or not the classes have changed.
To me, the key to task optimisation is to base it on the input and output artifacts of a task. If we make it easy to declare both the input and output artifacts of a task, we make the model much richer, and from this we get a lot of goodness.
For example, if we know what the input artifacts for a task are, Gradle can apply change detection to those input artifacts on the task's behalf. If we also know which tasks produce those artifacts, then Gradle can optimise the change detection. Gradle could, for example, when it knows which task produces a given artifact, simply use the fact that the producer task executed an action or not to decide whether the input artifacts have changed, and only fall back to hashing or timestamps or a Java 7 file watcher or whatever when it doesn't know how the artifact is produced. Similarly, it could use the fact that a Jar was downloaded by the dependency management system to decide whether the input artifacts have changed.
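
The fallback strategy described above could be sketched roughly as follows. This is an illustration in Python, not Gradle code; inputs_changed, the producers map and the history map are all hypothetical names, and content hashing stands in for whichever fallback (hashes, timestamps, file watcher) is actually chosen:

```python
import hashlib


def file_hash(path):
    """SHA-256 of a file's contents."""
    with open(path, "rb") as f:
        return hashlib.sha256(f.read()).hexdigest()


def inputs_changed(input_paths, producers, history):
    """Decide whether any input artifact changed since the last run.

    producers: maps a path to the did-work flag of the task known to produce it.
    history:   maps a path to the hash recorded last time (updated in place).
    """
    changed = False
    for path in input_paths:
        if path in producers:
            # Producer is known: trust its did-work flag and skip hashing.
            changed = changed or producers[path]
        else:
            # Producer is unknown: fall back to comparing content hashes.
            digest = file_hash(path)
            if history.get(path) != digest:
                changed = True
            history[path] = digest
    return changed
```

An artifact with no recorded hash counts as changed, so a task would always run the first time.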

This is very interesting. I'm just trying to play a little with some terminology. There are output-affecting input values (e.g. classpaths, src dirs, compiler options, ...) and also some non-output-affecting input values like log level. The output-affecting input values can be subdivided into those belonging to something like an Outputter and something like plain input values. Outputters can tell if they did some work; for plain input values the task needs its own history and change detection management. By providing a rich domain model, important types of plain input values can be turned into outputters (e.g. SourceDir). And for a subset of the remaining range of input value types we should be able to provide a nice toolkit that makes it easy to define change detection.

With the above model, the default behavior of onlyIf is inputValues.haveChanged == true
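
A rough sketch of what the history behind inputValues.haveChanged might look like for plain input values, in Python for illustration only. The InputValues class and the key names are hypothetical; non-output-affecting values such as the log level are filtered out before comparing:

```python
class InputValues:
    """Hypothetical history of a task's output-affecting input values."""

    def __init__(self, ignored=("log_level",)):
        self.ignored = set(ignored)   # non-output-affecting keys
        self.previous = None          # snapshot taken at the last check

    def have_changed(self, current):
        # Compare only the output-affecting values against the last snapshot.
        relevant = {k: v for k, v in current.items() if k not in self.ignored}
        changed = relevant != self.previous
        # Simplification: the snapshot is recorded on every check, whereas a
        # real implementation would record it only after a successful run.
        self.previous = relevant
        return changed
```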

There are also scenarios like: This task should not be executed on Friday. I think they don't fit into the input value model. So we still need to accommodate custom onlyIf rules.
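
As an illustration, a custom rule could sit alongside the input-based default like this. The sketch is in Python and every name in it (not_on_friday, should_run) is hypothetical; it assumes custom rules are combined with the default by logical AND, which is only one possible design:

```python
import datetime


def not_on_friday(today=None):
    """Custom rule: allow execution on any day except Friday."""
    today = today or datetime.date.today()
    return today.weekday() != 4   # Monday is 0, so Friday is 4


def should_run(inputs_changed, custom_rules):
    """Run only if the inputs changed and every custom rule agrees."""
    return inputs_changed and all(rule() for rule in custom_rules)
```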


One of the interesting issues is to make it easy to write such tasks.

Adding input and output artifacts to the model also lets us use this information to build the DAG, and to be smart about skipping tasks. For example, if the test task were to declare that it uses the test classes directory and the test runtime configuration as input artifacts, then Gradle would be able to automatically add the tasks that produce these (if any) to the task dependencies of the test task.
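
The dependency inference described here might be sketched as below. This is a Python illustration with hypothetical names and task specs, not how Gradle builds its DAG: each task declares input and output artifacts, and a task depends on every other task that produces one of its inputs:

```python
def infer_dependencies(tasks):
    """Derive task dependencies from declared input and output artifacts.

    tasks: maps a task name to {"inputs": set, "outputs": set}.
    Returns a map from task name to the set of producer task names.
    """
    # Index every artifact by the tasks that produce it.
    producers = {}
    for name, spec in tasks.items():
        for artifact in spec["outputs"]:
            producers.setdefault(artifact, set()).add(name)

    # A task depends on the producers of each of its inputs, if any exist.
    deps = {}
    for name, spec in tasks.items():
        found = set()
        for artifact in spec["inputs"]:
            found |= producers.get(artifact, set())
        found.discard(name)   # a task never depends on itself
        deps[name] = found
    return deps
```

Inputs with no known producer, such as hand-written sources, simply contribute no dependency.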

One thing that comes to my mind is a scenario where two tasks output into classesDir, but a third task only wants to be dependent on one of those other tasks. Yet I see your point. It is a very interesting question how to integrate the concepts of the input/output model with the DAG model. Again, a richer domain model can help. If the test task declares that it uses, for example, a SourceDir object as an input value, my scenario from above could easily be solved. But you could ask: why not declare a dependsOn relation from test to SourceDir? I think this is basically what we do with this new input model, with the difference that it is more specific. Instead of just providing a dependsOn method, an input value of type SourceDir could be translated into: This is a dependsOn for the purpose of having the classpath of the production code in the runtime classpath of the tests.


- Hans

--
Hans Dockter
Gradle Project Manager
http://www.gradle.org


