Re: [DISCUSS] Incorporating an ArchitectureId into the GAVCT of the repository

Robert Scholte Sun, 11 Sep 2016 05:20:08 -0700

Go ahead.

Maybe I get the chance to share these thoughts with some people at JavaOnetoo.And let's make a list of third parties we want to contact, direct contactprobably works better than broadcasting.


Robert

On Sun, 11 Sep 2016 12:49:18 +0200, Stephen Connolly<[email protected]> wrote:

I've not seen any major issues identified with this scheme (other than
perhaps platformId might be a better name than architectureId)

Will I take a stab at writing this up more formally then?

Should we circulate it more widely?

Did anyone involved with the NMaven effort have any thoughts?


On Thursday 1 September 2016, Stephen Connolly <
[email protected]> wrote:

One of the things I feel is necessary to grow Maven in the modelVersion

5.0.0 world is to start taking account of architecture specificartifacts.


Currently, the Maven repository layout does not handle architecture
specific dependencies well.

So, for example:

Say I have a foo.jar that depends on a native library... bar.dll /
libbar.so / etc

Ideally we'd like to say that foo just depends on bar...

A consumer of foo that is running on, say my local machine, could thenseethat I am running on os-x- x86_64 and because I am wanting to runtests...

it would look for bar with the architecture of `os-x-x86_64` to get the
native library for me

When I am building the installer for windows on my os-x machine (usingsay.NET and the WiX toolchain) the corresponding (future does not existyet)maven plugin could request the win-x86 architecture of the dependencyand

the rpm plugin could request the linux-ppc, linux-arm64, linux-x86 and
linux-x86_64 artifacts in order to produce the corresponding rpm
architecture artifacts

So when I think about this concept... I feel it is important that wefind

a way to introduce the architectureId into the GACVT of the repository.

When we do this, to my mind, we need to be mindful that modelVersion4.0.0

consumers would like to be able to consume these architecture specific
dependencies also... and the 4.0.0 GAV constraints will constrain the
possible solutions that we can pick if we value letting 4.0.0 consumers
access these architecture specific artifacts via the `default` layout we
currently employ for the maven repository.

So the first things first... our current `default` layout transforms the
GroupId:ArtifactId:Version:Classifier:Type into a repository URL of

`${groupId.replaceAll('.','/')}/${artifactId}/${version}/${
artifactId}-${version}${classifier==null?'':'-'+classifier}.${type}`

If we want to add architectureId into that URL Path and still have that
resolvable by GAVCT at a modelVersion 4.0.0 coordinate, we are basically

left with stuffing the architectureId into one of the existingcomponents...

Now when we think about an architecture specific artifact, the firstthingthat comes to mind is that each architecture specific artifact mostlikely

has different dependencies... hopefully the .pdt file (that would be

deployed at the GAV without an architecture... modulo multi-machinebuilds)

would provide the architecture specific dependency trees so that
modelVersion 5.0.0 aware consumers would - just naturally - be aware of
those differences in dependencies

But - if we want to give the modelVersion 4.0.0 consumers our besteffort- we probably need to give each architectureId it's own modelVersion4.0.0

pom.

In other words, I do not think we should try to munge the architectureId
into either classifier or type as both of those would force the
dependencies to be viewed as having the same dependencies in the
modelVersion 4.0.0 world

So that leaves us with groupId, artifactId and version...

I personally think version is a non-runner. In modelVersion 4.0.0 youcan

only depend on one version of a dependency at a time... version ranges
would become completely and utterly unusable (never mind that they are
unusable now)... plus my gut tells me that it would be a total mess!

So that leaves groupId and artifactId... our choices basically boildown to


legacyGroupId == '${groupId}'; legacyArtifactId == '${architectureId}.${
artifactId}'
legacyGroupId == '${groupId}'; legacyArtifactId == '${architectureId}-${
artifactId}'
legacyGroupId == '${groupId}'; legacyArtifactId == '${artifactId}.${
architectureId}'
legacyGroupId == '${groupId}'; legacyArtifactId == '${artifactId}-${
architectureId}'
legacyGroupId == '${groupId}.${architectureId}'; legacyArtifactId ==
'${artifactId}'
legacyGroupId == '${groupId}.${artifactId}'; legacyArtifactId ==
'${architectureId}'

I personally think that the ones that place `architectureId` lexically

before `artifactId` are not "right"... the most important coordinate isthegroupId, the next most is the artifactId, then the architecture, thenthe

version, etc

So to my mind that leaves us with:

legacyGroupId == '${groupId}'; legacyArtifactId == '${artifactId}.${
architectureId}'
legacyGroupId == '${groupId}'; legacyArtifactId == '${artifactId}-${
architectureId}'
legacyGroupId == '${groupId}.${artifactId}'; legacyArtifactId ==
'${architectureId}'

Now when we look at how, say, a modelVersion 4.0.0 consumer would use
these dependencies... the variant where we shift the artifactId into the
groupId would mean that you would end up with loads of `linux-arm`
"legacyArtifactId" dependencies in your modelVersion 4.0.0 consumer...
which would presumably be ugly (just like now if you have two matching

`artifactId` dependencies in your .war which forces us to disambiguateby

prefixing the groupId when copying into WEB-INF/lib)... so I am going to
reject that one also.

The convention seems to be that the artifactId does not contain a `.`withmost artifacts that I am aware of using `-` as the separator... thiscould

be used to argue either way... my preference is to run with `-` as the
separator... though I am open to using `.` to provide a convention that
architecture is distinguished using a `.`

So how would this work...

Ok, I have my foobar project that builds a .jar and the native libraries
that are required by that .jar

So from the reactor for that project we want to deploy

com.example:foobar:::1.0:pom (the legacy pom for the .jar to allow
modelVersion 4.0.0 consumption of the jar)

com.example:foobar:::1.0:pdt (the modern project dependency trees forall

attached artifacts)
com.example:foobar:::1.0:jar (the jar)
com.example:foobar::javadoc:1.0:jar (the javadoc jar)
com.example:foobar::sources:1.0:jar (the source jar)
com.example:foobar:win_x86::1.0:pom (the legacy pom for the 32-bit DLL)
com.example:foobar:win_x86::1.0:dll (the 32-bit DLL... alternatively the
type might be `native-library` or `lib` but let's assume DLL)

com.example:foobar:win_x86_64::1.0:pom (the legacy pom for the 64-bitDLL)

com.example:foobar:win_x86_64::1.0:dll (the 64-bit DLL)
com.example:foobar:osx_x86_64::1.0:pom (the legacy pom for the 64-bit
OS-X .dylib)
com.example:foobar:osx_x86_64::1.0:dylib (the 64-bit .dylib...

alternatively the type might be `native-library` or `lib` but let'sassume

dylib)

com.example:foobar:elf_arm::1.0:pom (the legacy pom for the linux ARM.so)

com.example:foobar:elf_arm::1.0:so (the ARM .so ... alternatively the
type might be `native-library` or `lib` but let's assume so)
com.example:foobar:elf_x86::1.0:pom (the legacy pom for the linux x86
32-bit .so)
com.example:foobar:elf_x86::1.0:so (the x86-32-bit .so)
com.example:foobar:elf_x86_64::1.0:pom (the legacy pom for the linux x86
64-bit .so)
com.example:foobar:elf_x86_64::1.0:so (the x86 64-bit .so)

My main build machine cannot cross-compile for PPC or ARM64... so wehavetwo other build machines that will want to produce the extraarchitecture

specific artifacts...

com.example:foobar:elf_ppc::1.0:pom (the legacy pom for the linux PPC.so)

com.example:foobar:elf_ppc::1.0:so (the PPC .so)

and

com.example:foobar:elf_arm_64::1.0:pom (the legacy pom for the linux ARM
64-bit .so)
com.example:foobar:elf_arm_64::1.0:so (the ARM 64-bit .so)

In order to accommodate delayed deployment, I am going to suggest thatthePPC and ARM64 deployments should publish their *supplemental* pdts attheir

coordinates, e.g.

com.example:foobar:elf_ppc::1.0:pdt (the suplemental project dependency
trees for the PPC reactor artifacts)

and

com.example:foobar:elf_arm_64::1.0:pdt (the suplemental project
dependency trees for the ARM64 reactor artifacts)

So ultimately we would end up with the following files being deployed(in

three "atomic" deployments):

com/example/foobar/1.0/foobar-1.0.pom
com/example/foobar/1.0/foobar-1.0.pdt
com/example/foobar/1.0/foobar-1.0.jar
com/example/foobar/1.0/foobar-1.0-javadoc.jar
com/example/foobar/1.0/foobar-1.0-sources.jar
com/example/foobar-win_x86/1.0/foobar-win_x86-1.0.pom
com/example/foobar-win_x86/1.0/foobar-win_x86-1.0.dll
com/example/foobar-win_x86_64/1.0/foobar-win_x86_64-1.0.pom
com/example/foobar-win_x86_64/1.0/foobar-win_x86_64-1.0.dll
com/example/foobar-osx_x86_64/1.0/foobar-win_x86_64-1.0.pom
com/example/foobar-osx_x86_64/1.0/foobar-win_x86_64-1.0.dylib
com/example/foobar-elf_arm/1.0/foobar-elf_arm-1.0.pom
com/example/foobar-elf_arm/1.0/foobar-elf_arm-1.0.so
com/example/foobar-elf_x86/1.0/foobar-elf_x86-1.0.pom
com/example/foobar-elf_x86/1.0/foobar-elf_x86-1.0.so
com/example/foobar-elf_x86_64/1.0/foobar-elf_x86_64-1.0.pom
com/example/foobar-elf_x86_64/1.0/foobar-elf_x86_64-1.0.so

com/example/foobar-elf_ppc/1.0/foobar-elf_ppc-1.0.pom
com/example/foobar-elf_ppc/1.0/foobar-elf_ppc-1.0.pdt
com/example/foobar-elf_ppc/1.0/foobar-elf_ppc-1.0.so

com/example/foobar-elf_arm_64/1.0/foobar-elf_arm_64-1.0.pom
com/example/foobar-elf_arm_64/1.0/foobar-elf_arm_64-1.0.pdt
com/example/foobar-elf_arm_64/1.0/foobar-elf_arm_64-1.0.so

When a modelVersion 5.0.0 consumer does something like:

compile: {
  dependencies: ["com.example:foobar:1.0:jar"]
}
test: {
  dependencies: ["org.junit:junit:5.0:jar"]
}

and wants to run its tests on linux ARM64 it will start by resolving
`com/example/foobar/1.0/foobar-1.0.pdt` this will give it the dependency
tree of the `.jar` which will declare an architecture dependent native
library dependency (somehow or other... this is why we may use

`native-library` as the "type")... because it knows that it is runningon

ARM64 architecture it will then know that it needs
`com.example:foobar:elf_arm_64::1.0:so` since this is not available in

the `com/example/foobar/1.0/foobar-1.0.pdt` trees it will then attempttodownload `com/example/foobar-elf_arm_64/1.0/foobar-elf_arm_64-1.0.pdt`if

that exists, it will use that tree... if it doesn't exist... we fail the
build (technically we could fall back to checking for
`com/example/foobar-elf_arm_64/1.0/foobar-elf_arm_64-1.0.pom` and
`com/example/foobar-elf_arm_64/1.0/foobar-elf_arm_64-1.0.so` before

failing the build... but as we know the artifacts were produced by a5.0.0

aware producer - as we have `com/example/foobar/1.0/foobar-1.0.pdt`
resolved)

A modelVersion 4.0.0 consumer is not really going to be able to have as
flexible a build... but at least they can - through declarations such as

<dependency>
  <groupId>com.example</groupId>
  <artifactId>foobar-elf_arm64</artifactId>
  <version>1.0</version>
  <type>so</type>
</dependency>

grab the .so to bundle into a .zip or installer and if they want towrite

a pom with architecture based profile activation injecting test scoped
dependencies they can do that also

WDYT?

If anyone has any experience from the NMaven experiments, or learnings

from .deb or .rpm attempts to solve architecture dependent artifactsmixed

with noarch artifacts... please step forward and join the discussion.

-Stephen

Notes:

1. I am not saying what conventions will be used to define the
`architectureId` values here
2. I am not discussing the schema for the .pdt files here... other than

the general priciple that they will contain multiple dependency treesfor

each artifact produced by the project

3. I am not discussing how a modelVersion 5.0.0 build would be invokedor

detect that it should just do the PPC deployment
4. This proposal does not include the new metadata schema that we would
likely require to assist with such a deployment format

5. I am not discussing or proposing a modelVersion 5.0.0 schema here...I

use a non-XML format to help people mentally disassociate thinking about
the architectureId specific things from the current 4.0.0 way of doing
things


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [DISCUSS] Incorporating an ArchitectureId into the GAVCT of the repository

Reply via email to