On 15/02/2022 8:29 pm, Michał Kłeczek wrote:
Hi Peter,
JGDMS uses a new implementation of a subset of the Java
Serialization stream format, with input validation and defenses
against malicious data (all connections are first authenticated when
using secure endpoints). Codebase annotations are no longer
appended in serialization streams; this feature is deprecated, but it
can still be enabled.
How does the client know which code is needed to deserialise?
The service provides this information, typically in a service's
configuration; by default it is a space separated list of URIs, similar
to a codebase annotation, but it doesn't have to be. JERI manages the
deserialization of code through a default ProxyCodebaseSpi
implementation; the client applies constraints to ensure that input
validation is used, along with any other constraints, such as principals
or encryption strength. ProxyCodebaseSpi can be customized by the
client, so the client may implement ProxyCodebaseSpi if it wants to do
something different, e.g. use OSGi or Maven to manage dependency resolution.
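As a rough illustration only (this is not the JGDMS ProxyCodebaseSpi
interface; the class and method names below are hypothetical), the
default annotation is just a space separated list of URIs that gets
resolved into the ClassLoader used for proxy deserialization:

    import java.net.URL;
    import java.net.URLClassLoader;
    import java.util.Arrays;

    // Hypothetical helper, for illustration only; JGDMS performs this
    // step through its ProxyCodebaseSpi provider, not through this class.
    final class CodebaseResolver {

        // Turns a space separated URI list (the default annotation format
        // described above) into a ClassLoader for proxy deserialization.
        static ClassLoader toClassLoader(String annotation, ClassLoader parent) {
            URL[] urls = Arrays.stream(annotation.trim().split("\\s+"))
                    .map(CodebaseResolver::toUrl)
                    .toArray(URL[]::new);
            return new URLClassLoader(urls, parent);
        }

        private static URL toUrl(String uri) {
            try {
                return new URL(uri);
            } catch (java.net.MalformedURLException e) {
                throw new IllegalArgumentException("Bad codebase URI: " + uri, e);
            }
        }
    }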
Does it require having it installed in advance? If so, how?
Only the JGDMS platform and JERI.
How is the following scenario handled:
class JavaSpaceEventPublisher implements RemoteEventListener, Serializable {
    private final JavaSpace space;
    // … publish event in JavaSpace implementation
}
The smart proxy class has dependencies on RemoteEventListener and on
JavaSpace. How do you properly resolve classes in this case?
Typically the client already has the service API it needs installed
locally, but this may not always be the case. Depending on how you
want to resolve the proxy classes and how much you want to share with
the client, you can include additional jar files in the annotation and
use a preferred.list, or you can use Maven or OSGi to resolve dependencies
and provision the ClassLoader used for proxy deserialization.
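For example, a preferred list (META-INF/PREFERRED.LIST inside the
codebase jar) marks which classes the client should load from the
codebase rather than from its local class path. A minimal sketch with
hypothetical entries; check the Jini preferred class specification for
the exact format:

    PreferredResources-Version: 1.0

    Name: com/example/publisher/JavaSpaceEventPublisher.class
    Preferred: true

    Name: net/jini/core/event/RemoteEventListener.class
    Preferred: false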
This paper documents the problems with this approach:
https://dl.acm.org/doi/pdf/10.5555/1698139
JGDMS provisions a ClassLoader at each Endpoint; once it has been
assigned to the relevant ObjectEndpoint, that ClassLoader is solely
responsible for class resolution. A provider mechanism allows customization.
JGDMS doesn't suffer from codebase annotation loss, nor from class
resolution issues, but it did have to give up some functionality:
it cannot resolve classes that do not belong to a service proxy or
its service API, are not resolvable from the Endpoint ClassLoader,
and are not present on the remote machine. The solution is to
always use a service for parameters passed to a service when they are
not part of the service API, e.g. when the client overrides the type of
a parameter argument. This means that if the parameter type
is not an interface, you cannot create a service that implements it
and pass it as an argument. That's why it is still possible, though not
recommended, to use codebase annotations appended to the serialization
stream. The better solution is to design service API that uses only
interface types for parameter arguments; remote events and
listeners use this pattern, for example. To prevent unexpected breakages,
use interfaces, final classes, or both for service API remote
method parameters. Then you won't get into the situation where you
need codebase annotations appended to the stream.
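A minimal sketch of that pattern (OrderMonitor is a hypothetical name,
not a JGDMS or River type; RemoteEventListener is the standard
net.jini.core.event interface):

    import java.rmi.Remote;
    import java.rmi.RemoteException;
    import net.jini.core.event.RemoteEventListener;

    // Service API whose remote method parameters are interface types only,
    // so a client can pass its own listener service (or the default one)
    // without codebase annotations appended to the stream.
    public interface OrderMonitor extends Remote {
        void subscribe(RemoteEventListener listener) throws RemoteException;
    }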
I am not sure I follow but...
What I am trying to achieve is exactly the opposite: place as few
constraints as possible on service implementors and make the whole
thing “magically work” :)
JGDMS only applies this by default with AtomicILFactory; you aren't
constrained to using it. You can enable codebase annotations in the
stream, override BasicILFactory if you want to do something
different, or just use BasicILFactory as is. You can avoid applying this
restriction to service parameter arguments, but then you have to accept
the compromises that come with that, such as codebase annotation loss,
which can spoil the magic.
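For reference, the stock JERI export with BasicILFactory looks roughly
like this (a minimal sketch; the AtomicILFactory variant and any
configuration entry names are left out because they depend on your
JGDMS setup):

    import java.rmi.Remote;
    import java.rmi.server.ExportException;
    import net.jini.export.Exporter;
    import net.jini.jeri.BasicILFactory;
    import net.jini.jeri.BasicJeriExporter;
    import net.jini.jeri.tcp.TcpServerEndpoint;

    // Exports a service implementation with the standard invocation layer;
    // swap in AtomicILFactory where atomic input validation is wanted.
    public class ExportSketch {
        public static Remote export(Remote impl) throws ExportException {
            Exporter exporter = new BasicJeriExporter(
                    TcpServerEndpoint.getInstance(0), // ephemeral port
                    new BasicILFactory());            // standard JERI invocation layer
            return exporter.export(impl);
        }
    }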
For me it is simpler to use interface types for service method arguments
and provide a final implementation class as part of the service API;
this allows the client either to use the default service API classes or
to implement the interface with another service.
If you want to use non-final classes for your service method arguments
and allow clients to override these classes, then you will need to
enable codebase annotations in AtomicILFactory in your configuration.
The caveat is that there is no guarantee the service will be able to
resolve these classes at the server endpoint, or that codebase annotation
loss won't occur; it will try using existing mechanisms, such as
RMIClassLoaderSpi, which is probably fine for seasoned Jini vets, but
not so user friendly for a newbie, who now has to debug
ClassNotFoundExceptions.
It's like Java serialization: magic comes with compromises.
For example, if a service proxy is serialized within a serialization
stream, it will be replaced by a proxy serializer and assigned its own
independent stream and ClassLoader, independent of the stream in which
it was serialized. This is based on the ObjectEndpoint identity, so it
will always resolve to the same ClassLoader. Note that ProxyCodebaseSpi
can be a provider or an OSGi service.
Does it mean you cannot provide services that don’t have any
ObjectEndpoint (ie. local only)?
This would be IMO unacceptable constraint. For example:
- How do you provide the above mentioned JavaSpaceEventPublisher
- How would you provide a java.sql.DataSource as a service?
If you don't have an ObjectEndpoint, then there is no one to
authenticate; you only have bytes to deserialize, so how do you
establish trust, i.e. who did the bytes come from? However, it is possible
to have a service that has an ObjectEndpoint and only uses it for
authentication of the proxy serializer and for codebase provisioning,
after which the deserialized bytes become objects and don't make remote
method calls. I think that would be an acceptable alternative; someone
needs to vouch for the serialized bytes.
Now the proxy serializer is itself a service (a bootstrap proxy), which
is authenticated when using secure endpoints. You could quite easily
add an interface to the proxy serializer to return your object
annotation.
Note that I use a string, because I also use it in secure multicast
discovery protocols (typically IPv6), which don't include objects,
for authentication and for provisioning a ClassLoader for a lookup
service proxy prior to any object deserialization.
https://www.iana.org/assignments/ipv6-multicast-addresses/ipv6-multicast-addresses.xhtml
Summing up, to simplify JGDMS and solve some very difficult issues, it
had to give up:
1. Support for circular references in serialized object graphs.
My solution supports any object graph.
What capability do you need circular references for? For example,
Throwable creates a circular object graph. These circular relationships
can be reconstructed by code during deserialization, rather than encoded
as part of the serialized bytes. Allowing circular object graphs sounds
like a neat feature; however, it is not possible to validate circular
relationships atomically, in which case, if validation checks fail, you
have fully constructed objects that an attacker might manipulate for
gadget attacks. Atomic input validation prevents object creation, so
no gadget attack. This is a low risk if you are using authentication,
but it is still nice to harden the API, in the event that the identity
of someone you trust has been stolen.
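To make that concrete, here is a minimal sketch (a hypothetical class,
nothing to do with the JGDMS API) of rebuilding a circular parent/child
relationship in code during deserialization, so the serialized graph
itself stays acyclic:

    import java.io.IOException;
    import java.io.ObjectInputStream;
    import java.io.Serializable;
    import java.util.ArrayList;
    import java.util.List;

    // The back-reference (parent) is transient, so only an acyclic graph
    // is written; readObject re-establishes the cycle after the data is read.
    public class TreeNode implements Serializable {
        private static final long serialVersionUID = 1L;

        private final List<TreeNode> children = new ArrayList<>();
        private transient TreeNode parent;

        public void add(TreeNode child) {
            children.add(child);
            child.parent = this;
        }

        private void readObject(ObjectInputStream in)
                throws IOException, ClassNotFoundException {
            in.defaultReadObject();
            for (TreeNode child : children) {
                child.parent = this;   // rebuild the circular link in code
            }
        }
    }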
2. Extensible classes in service API method parameters are not advised.
Yes - this one is tricky although my solution supports that as well
(there are edge cases though).
3. ProxyTrust - deprecated and replaced with secure authentication
and httpmd (SHA-256) or signer certificates using ProxySerializer.
Deprecated in my solution as well: code is validated _before_ execution.
Good decision; it wasn't a good solution and caused unnecessary
complexity for zero benefit.
4. Untrusted machines are not allowed in a djinn; some level of
trust is required, with authentication and authorisation constraints.
Not necessary in my solution.
In general I think the differences are caused by our different
perspectives:
I see software distribution (i.e. mobile code) as orthogonal to networking.
You see River primarily as a networking solution with mobile code
dependent on it.
It would be great to be able to merge the ideas and work on a common
solution.
The question is whether it should be the Apache River project…
I think the PMC has already decided River's fate, and I tend to agree
with their decision. The problem is that, historically, it hasn't been
possible to innovate inside the Apache River project; innovation has
been forced to happen elsewhere, and it wasn't just what I was doing:
there was an attempt to do some container work in River, but that also
got shut down. People had trouble in the past agreeing on River's
direction, and there are no currently active developers. It is still
possible to get a group of people together to create an Apache project,
but I don't think the code needs it. GitHub and other sites like it
are better for loose collaboration, where developers can feed off each
other's ideas and innovations and the best solutions survive.
BTW, the name of the software I am working on is CodeSpaces (and yes, I
am aware of the MS/GitHub product of the same name, but I came up with it
earlier and even registered the domain net.codespaces for this purpose).
Michal
I'll keep an eye out for your work. I think you'll be able to add this
functionality as an extension to JGDMS, as a downstream project dependent
on a few JGDMS modules; that way you can focus on what you need to achieve
without worrying about managing the whole repository. There may be
non-JGDMS forks of River out there too, but I don't think they will support
IPv6 discovery and other features that would be of benefit, such as
support for modern stateless TLS 1.3 session tickets.
Cheers,
Peter.