Hi,

I sent a mail a few days ago but it didn't make it to the list.

One very important feature would be the separation of build artifacts
(maven plugins and their dependencies), and project artifacts.
The separation isn't clear in maven itself - repo's get mixed up,
wrong repo's consulted; build artifacts interfering with plugin artifacts.
Having a separate directory containing information on what build artifacts
are used makes it easy to freeze a maven environment, and see what
was used runtime.

Also see [1], which you, Brett, weren't in favour of back then. Perhaps
now this can be taken into consideration with this proposal.


[1] http://jira.codehaus.org/browse/MNG-724


Brett Porter wrote:
See: http://docs.codehaus.org/display/MAVEN/Local+repository+separation

Text included below for inline comments (which I'll feed back into the document as needed).

Context

The current local repository is a single file structure, stored typically in an individual user's home directory.

The suffers from the following problems:
* there is no locking, so if multiple Maven instances attempt to run on the same machine they can corrupt each other's metadata * it serves multiple purposes - it is both a cache of remote repository artifacts, and a place to locally install artifacts that you build. Because of this, it is possible that the local cache does not always accurately reflect the state of the remote repository ** downloading a snapshot from a remote repository also writes the chosen version out as -SNAPSHOT, meaning that continues to get used even if the snapshot repository is removed ** downloading an artifact from a remote repository with a fixed version does not write metadata, so if that repository is later removed the artifact is still used though a clean build would fail. This particularly affects testing staged releases * it can be different to isolate differences in the local repository without deleting the entire cache, requiring time consuming downloads. * it isn't possible have multiple checkouts of the same development version and build them independently (particularly important for CI servers).
* it isn't possible to easily clean out a subsection of the repository
* the artifact code is over-complicated to implement the logic for sharing the storage

Solution

General Considerations

This solution aims to not change the current behaviour, other than to make it easier/possible to correct things considered known bugs as documented above. Resolution behaviour should not otherwise be affected and any such changes should be in the related proposal.

This proposal simply alters the storage of the artifacts.

Locking

Locking should be implemented at the individual artifact level. This can be done with a lockfile in the artifact top level directory (rather than the individual version), locking both the metadata and artifact.

An artifact operation should be done with files in a temporary location, and moved to the final location in one operation, wrapped by the creation of the lockfile. This makes the duration of the lock relatively short, so that Maven can simply block on the existence of a lockfile (both read and write operations), and remove it after a short period of time.

Local repository separation

The structure of the local repository should become:

.
|-- cache
|-- remote
|   |-- apache.snapshots
|   |-- central
|   |-- codehaus.snapshots
|   `-- ...
`-- workspace
    |-- default
    |-- workspace1
    `-- ...

The purposes of these directories are as follows:

* cache: immutable artifacts downloaded from a remote repository. No metadata is stored in this directory tree.

* remote: contains a directory for each remote repository (by repository identifier). This contains the metadata and mutable artifacts from that repository. Metadata files will return to the format {{maven-metadata.xml}} instead of the current {{maven-metadata-<id>.xml}} file format. Files in these repositories will typically be snapshots and metadata for releases, since actual releases are not mutable and can be stored in the {{cache}} directory

* workspace: contains a directory for each local workspace, with the primary one being {{default}}. This contains the metadata and files for any artifact built by maven (both snapshots, and releases).

Under each of these locations, the standard layout remains as it is now:

.
|-- cache
|   |-- com
|   |   `-- example
|   |       `-- ...
|   `-- org
|       |-- apache
|       |   `-- ...
|       `-- codehaus
|           `-- ...
|-- remote
|   |-- apache.snapshots
|   |   `-- org
|   |       `-- apache
|   |           `-- ...
|   |-- central
|   |-- codehaus.snapshots
|   |   `-- org
|   |       `-- codehaus
|   |           `-- ...
|   `-- ...
`-- workspace
    |-- default
    |   `-- com
    |       `-- example
    |           `-- ...
    |-- workspace1
    `-- ...

Search sequence

As current behaviour is to be retained when correct, the solution should aim to merge the metadata across the current workspace and active remote repositories to decide what artifact to use. The artifact can then be utilised from either the workspace, remote, or cache repositories.

Existence in the {{cache}} directory is not a decision point for using an artifact - this must be achieved and the artifact from there used if possible. This will help enable better utilising the remote repository metadata for tracking the source of an artifact in the future to resolve some of the problems listed in the context section of this proposal.

Deployment and Merging

Content will be deployed from the workspace directory, but will not be merged to other sections - there should be no need to migrate and data among the repository sections.

Rolling back a reactor build

While this would be a separate feature, and not the default behaviour, it would now be possible to use a temporary workspace to build artifacts during a reactor build and merge them into another workspace on completion, making an entire reactor build "atomic" with respect to the local repository.

Co-existence with Maven 2.0.x

A best practice should be to change your Maven 2.0.x configuration to use {{~/.m2/repository/cache}} as the local repository, and move the existing content, as this will co-exist properly with Maven 2.1.

Upgrade path

There is no need to upgrade existing local repositories - first use of Maven 2.1 will only mean users need to rebuild any local software, as remote artifacts will be redownloaded (however see above for the minimisation of this).

Shared Local Repositories

Though locking will now make this possible, it is still not a recommended practice to share the local repository. However, this structure will allow people to share the {{cache}} location safely to reduce disk usage if desired.

Cheers,
Brett




--
Brett Porter - [EMAIL PROTECTED]
Blog: http://www.devzuz.org/blogs/bporter/



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to