Re: [gradle-dev] Task Optimization

Hans Dockter Fri, 25 Sep 2009 00:58:19 -0700


On Sep 25, 2009, at 2:10 AM, Adam Murdoch wrote:

Hi,
It sounds to me like the generic solution might actually be easier than the hard-coded solution, once you chase down all the edge cases, and will also end up more accurate and reusable. Given that we want to throw away the hard-coded solution as soon as 0.8 is out and replace it with a generic solution, I wonder if it's worth pursuing the hard-coded solution at all.
Hans Dockter wrote:
Hi,
I have implemented task optimization functionality that we might put into 0.8. I have uploaded my branch to: http://github.com/hansd/gradle/tree/optim
A couple of comments:
1.) The task history is now stored in the Gradle user home with a hash that relates it to the actual project. The base for the hash is the path of the root dir. We might have issues if a subproject takes part in multiple multi-project builds, if the output is sensitive to the respective multi-project build. The only way I see to solve such a problem would be to have multiple output dirs.
We want a unique identifier for the build, not for the project. At this stage, the settings dir path would do. Or the project dir of the root project.

That's the way it is done (I was not precise enough when I said 'actual project' above. It is the build.). The base for the hash is the path of the root dir.
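
A minimal sketch of how a per-build history location could be derived from the root dir path. The class name, the "task-history" directory name and the choice of SHA-1 are assumptions made for illustration; they are not taken from the branch.

```java
import java.io.File;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper, not part of Gradle: maps a build (identified by its
// root dir path) to a history directory under the Gradle user home.
final class BuildHistoryLocation {

    // Hex-encoded SHA-1 of the root dir path; serves as the build identifier.
    static String buildId(String rootDirPath) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-1");
            byte[] digest = md.digest(rootDirPath.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder();
            for (byte b : digest) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    // Assumed layout: <gradleUserHome>/task-history/<buildId>
    static File historyDir(File gradleUserHome, File rootDir) {
        File base = new File(gradleUserHome, "task-history");
        return new File(base, buildId(rootDir.getAbsolutePath()));
    }

    public static void main(String[] args) {
        File userHome = new File(System.getProperty("user.home"), ".gradle");
        System.out.println(historyDir(userHome, new File("")));
    }
}
```

Because the identifier depends only on the root dir path, a subproject that takes part in two multi-project builds gets two separate histories, one per build.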

2.) Each task now has a doesOutputExists() method which defaults to false. So far all archive tasks have a custom implementation which checks for the existence of the archive. The test task also has a custom implementation which checks for at least one test results file. I hope that we find a way to automate this in 0.9 by introducing a generic notion of task output.
We already have the notion to some degree: properties can be marked with @OutputFile and @OutputDirectory. The default doesOutputExists() could make use of these.


Right.
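
A rough sketch of what an annotation-driven default for doesOutputExists() could look like. The two annotations are declared locally as field-level stand-ins so that the example compiles on its own; the ArchiveTask class and the rule that an output directory must contain at least one entry are assumptions.

```java
import java.io.File;
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Field;

final class OutputExistsSketch {

    // Local stand-ins for the real annotations (assumption: field-level here).
    @Retention(RetentionPolicy.RUNTIME) @Target(ElementType.FIELD) @interface OutputFile {}
    @Retention(RetentionPolicy.RUNTIME) @Target(ElementType.FIELD) @interface OutputDirectory {}

    // Hypothetical task type with a single declared output.
    static class ArchiveTask {
        @OutputFile File archivePath;
        ArchiveTask(File archivePath) { this.archivePath = archivePath; }
    }

    // True only if the task declares at least one output and all of them exist.
    // Assumes every annotated field is of type File.
    static boolean doesOutputExists(Object task) {
        boolean sawOutput = false;
        for (Field field : task.getClass().getDeclaredFields()) {
            boolean isFile = field.isAnnotationPresent(OutputFile.class);
            boolean isDir = field.isAnnotationPresent(OutputDirectory.class);
            if (!isFile && !isDir) {
                continue;
            }
            sawOutput = true;
            File output;
            try {
                field.setAccessible(true);
                output = (File) field.get(task);
            } catch (IllegalAccessException e) {
                throw new IllegalStateException(e);
            }
            if (output == null) {
                return false;
            }
            if (isFile && !output.isFile()) {
                return false;
            }
            if (isDir) {
                // list() returns null when the path is not a directory.
                String[] entries = output.list();
                if (entries == null || entries.length == 0) {
                    return false;
                }
            }
        }
        // No declared outputs: keep the current default of false.
        return sawOutput;
    }
}
```

A task without any annotated property still reports false, which matches the current default and means such a task is never skipped.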

3.) So far there are onlyIf implementations only for the test and the jar task provided by the Java plugin. Tomorrow I will add an onlyIf modification for the test task when the Groovy plugin is applied. For 0.9 we want to automate the onlyIf statements based on the information we have on the input arguments of a task.
4.) What about the other tasks? For java compile the Ant javac task has its own optimization checking for changed files. I'm not sure about groovyc, I need to check. The Ant Javadoc/Groovydoc tasks do not check for changed files. To optimize them we would need to check for changed source files. The same is true for the code quality stuff. I'm not sure whether I will have time to get this done before 0.8. I would use Tom's change detection stuff. I haven't had a look at that yet. For 0.9 I guess the SourceSets will be a good place for source change detection. For 0.8 it might already be good enough to distinguish between no changes/do nothing and do the full thing.
I think you can pretty quickly do something general for all tasks with file inputs:
- In the onlyIf predicate, calculate the set of (file path, timestamp) for all input files in the history. You could create a hash from this.
- In the onlyIf predicate, skip the task if the input files hash == the input files hash from last successful execution and task.doesOutputExists()
- execute the task

- store the input files hash in the history.


Yes.
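
The steps above could be sketched roughly as follows. This is only an illustration: the class and method names are made up, and SHA-1 over sorted (path, timestamp) entries is one possible choice of hash.

```java
import java.io.File;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.Collection;
import java.util.Collections;
import java.util.List;

// Hypothetical helper for the onlyIf predicate; not existing Gradle API.
final class InputHashSketch {

    // Hash over the set of (file path, timestamp) pairs of all input files.
    static String inputFilesHash(Collection<File> inputFiles) {
        List<String> entries = new ArrayList<>();
        for (File file : inputFiles) {
            // lastModified() returns 0 for a file that does not exist.
            entries.add(file.getAbsolutePath() + "\n" + file.lastModified());
        }
        Collections.sort(entries); // make the hash independent of iteration order
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-1");
            for (String entry : entries) {
                md.update(entry.getBytes(StandardCharsets.UTF_8));
                md.update((byte) 0); // separator between entries
            }
            StringBuilder hex = new StringBuilder();
            for (byte b : md.digest()) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    // Skip only if the inputs are unchanged since the last successful
    // execution and the output is still there. A null history never matches.
    static boolean canSkip(String currentHash, String lastSuccessfulHash, boolean outputExists) {
        return outputExists && currentHash.equals(lastSuccessfulHash);
    }
}
```

After a successful execution the current hash would be written to the history, so that the next invocation compares against it.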

5.) The onlyIf optimization needs to be disabled if any build.gradle which is part of the multi-project build, the settings.gradle or an init.gradle changes. Therefore a ScriptSource object now has a method hasChanged which defaults to true. The DefaultScriptCompilerFactory sets it to false if a script is read from the cache. I'm not very happy about the latter mechanism. To me this looks like a hint that the ScriptSource should be responsible for the compilation, instead of the compile class having a side effect on the state of ScriptSource. I will think about this in more detail tomorrow.
I think a better approach is to use the properties of the task. This is more accurate, in that it catches changes to the task configuration that aren't the result of changes to the build/init/settings scripts. Some types of changes we don't catch by checking if the scripts have changed:
* Task is configured using -PsomeProperty=value, and that value is different to last execution.
* Task is configured using system property, and that value is different to last execution.
* Task is configured based on the DAG, and the DAG contains different tasks to last execution.
* Task is configured by a 3rd party plugin, and that plugin has changed since last execution
* Task is configured by buildSrc code, and that code has changed since last execution
* Task is configured using properties from an imported build.xml, and that build.xml has changed
* Task is configured using properties from gradle.properties, ...
* ... you get the idea ...
So, checking whether the scripts have changed since last execution doesn't come close to accurately detecting if we need to re-execute a task. It also means we unnecessarily re-execute tasks when an unrelated change has been made to the build script.
I think accuracy is really important with this stuff. It absolutely must be reliable, or people will just run clean all the time to get a reliable build. We want to avoid this.
I would suggest instead that we add an @Input annotation which one can use to mark up the properties of a task which contribute in some significant way to the output of the task. The input of a task is stored in the history, and the set of input files is simply treated as one piece of input.

I agree. I guess that is what we should do. And with the annotations it looks rather straightforward to implement.
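
To illustrate the @Input idea, here is a small sketch. The annotation is declared locally as a field-level stand-in since it does not exist yet; JavadocTask, its properties, and the use of String.valueOf to capture values are assumptions.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Field;
import java.util.Map;
import java.util.TreeMap;

final class InputSnapshotSketch {

    // Proposed annotation, declared here only for the sketch.
    @Retention(RetentionPolicy.RUNTIME) @Target(ElementType.FIELD) @interface Input {}

    // Hypothetical task: only annotated properties affect the output.
    static class JavadocTask {
        @Input String title;
        @Input boolean includePrivate;
        String description; // not an input, changes here are ignored
    }

    // Captures the current value of every @Input property, keyed by name.
    static Map<String, String> snapshot(Object task) {
        Map<String, String> values = new TreeMap<>();
        for (Field field : task.getClass().getDeclaredFields()) {
            if (!field.isAnnotationPresent(Input.class)) {
                continue;
            }
            try {
                field.setAccessible(true);
                values.put(field.getName(), String.valueOf(field.get(task)));
            } catch (IllegalAccessException e) {
                throw new IllegalStateException(e);
            }
        }
        return values;
    }

    // Compare against the snapshot stored after the last successful execution.
    static boolean inputsUnchanged(Object task, Map<String, String> lastSuccessful) {
        return snapshot(task).equals(lastSuccessful);
    }
}
```

It does not matter where a value came from (build script, -P property, system property, plugin), only whether it differs from the last successful execution. A hash of the input files could be stored as one more entry in the same snapshot.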

6.) The GradleInternal class now exposes the settings and the init script ScriptSource objects. It also provides a convenience method to check whether any ScriptSource object has changed. To get hold of the settings object it registers as a BuildListener. I think there should be a better way. I will think more about this tomorrow.
Remove the settings file, perhaps? :)
I'm not completely sure whether we want to push this into 0.8 or not. Feedback is welcome.
I don't think it will be reliable enough.


I also think we should leave it out for 0.8.

- Hans

--
Hans Dockter
Gradle Project Manager
http://www.gradle.org


