Re: OSGi - deserialization remote invocation strategy

Michał Kłeczek (XPro Sp. z o. o.) Tue, 07 Feb 2017 11:51:43 -0800

So I must have misunderstood the part about smart proxies being obtainedvia "reflection proxies" or MarshalledInstances.


What are these "reflection proxies"?


Thanks,
Michal

Peter wrote:

No, no bootstrap objects.

Cheers,

Peter.



Sent from my Samsung device.
Include original message
---- Original message ----
From: "Michał Kłeczek (XPro Sp. z o. o.)"<michal.klec...@xpro.biz>
Sent: 08/02/2017 12:28:50 am
To: dev@river.apache.org
Subject: Re: OSGi - deserialization remote invocation strategy
Are you proposing to provide a bootstrap object that will download somemeta information prior to class resolution?
How does it differ from simply changing annotations to be those"bootstrap objects" instead of Strings?
Thanks,
Michal

Peter wrote:
  Proposed JERI OSGi class loading strategy during deserialization.
Record caller context - this is the default bundle at the beginning ofthe stack. It is obtained by the InvocationHandler on theclient side. The InvocationDispatcher on the server side has thecalling context of the Remoteimplementation. The reflection dynamic proxy must be installed in theclient's class loader, so theInvocationHandler knows exactly what it is, it will be passed to theMarshalInputStream. Anyinterfaces not found in the client's bundle can be safely shed. For asmart proxy the reflection proxy willbe installed in the smart proxy loader. The smart proxy is obtainedeither via a reflection proxy or a MarshalledInstance.MarshalledInstance also passes in the callers loader to theMarshalInputStream.
The smart proxy classloader is not a child loader of the clientsloader, instead it's a bundle that importsservice api packages, with a version range that overlaps those alreadyimported by the client.
Both Invocationhandler and InvocationDispatcher utiliseMarshalInputStream and MarshalOutputStream, for marshalling parametersand return values.
The codebase annotation bundle's manifest contains a list of packageimports.
Do we need to make a list of package imports for every new bundle thatwe load?Do we need to record the wiring and packages and their imports fromthe remote end?
I don't think so, the bundles themselves contain this information, Ithink we just need to keep the view of available classes relevant tothe current object being deserialized.
Codebase Annotations are exact versions! They need to be to allow theservice to ensure the correct proxy codebase is used. Other proxycodebases will be installed in the client, possibly differentversions, but these won't be visible through the resolveddependencies, because the proxy codebases only import packages at theclient and OSGi restricts visibility to the current bundle's ownclasses and any imported packages.Instead of appending dependencies to the codebase annotation they'llneed be defined in the proxy's bundle manifest. Of course if anidentical version of a proxy codebase bundle is already installed atthe client, this will be used again.
Because a bundle generally imports packages (importing entire bundlesis discouraged in OSGi), there may be classesthat aren't visible from those bundles, such as transient imports, butalso including private packages that aren't exported, privateimplementations need to be deserialized, but is it possible to do sosafely, without causing packageconflicts? Private implementation classes can be used as fieldswithin an exported public object, but cannot and should notescape their private scope, doing so risks them being resolved to abundle with the version of the remote end, instead of the locallyresolved / wired package, causing ClassClassExceptions.
Initial (naive) first pass strategy of class resolution (for eachbranch in the serialized object graph)?:1. Try current bundle on the stack (which will be the callersbundle if we haven't loaded any new bundles yet).2. Then use the package name of a class to determine if the packageis loaded by any of the bundlesreferenced by the callers bundle imports (to handle any privateimplementation packagesthat aren't in the current imports). Is this a good idea? Or shouldwe go straight to step 3and let the framework resolve common classes, what if we use adifferent version to theclient's imported bundle? Should we first compare our bundleannotation to the currentlyimported bundles and select one of those if it's a compatibleversion? Yes, this could be an
  application bundle, otherwise goto 3.
3. Load bundle from annotation (if already loaded, it will be anexact version match). Place thenew bundle on top of the bundle stack, remove this bundle from thestack once all fields ofthis object have been deserialized, returning to the previous bundlecontext. We are relyingon the current bundle to wire itself up to the same package versionsof the clients bundleimports, for shared classes. Classes that use different bundles willnot be visible to the client,
  but will need to be visible to the current object's bundle.
4. Place a bundle reference on the stack when a new object isdeserialized from the stream andremove it once all fields have been deserialized. (we might need toremember stack depth).5. Don't place non bundle references on the stack. For examplesystem class loader or anyother class loader, we want resolution to occur via the OSGiresolution process.
What about a simpler strategy (again naive), where we don't attempt toresolve private implementation classes?
  1.    The calling class' bundle, is given priority.
2. Load bundle from annotation (exact version), when not found incalling class.3. No stack, what if an application bundle from server is loadedthat conflicts with an existing
  bundle resolved by the client?
4. What about walking back through the stack? Probablyunnecessary, as the containing objectwill reference the class by a common interface, the outer object maynot need to referenceit at all. But what if the outer object passed it in duringconstruction?
  Revised strategy:
1. Attempt to load from current bundle on stack (the stack beginswith the client's Bundle, eachnode in the graph has its bundle added to the stack and is alsoremoved after that node is completely deserialized.2. If unsuccessful, walk back through deserialized bundle referencestack and attempt to load class.Why not start at the beginning of the stack? We are expecting bundlesto wire up tocurrently loaded versions, but bundles can import different packageversions forimplementation, safest to start with current bundle and consult parentif not found in the current bundledependency graph, ie possibly passed in during object construction oran handbackimplemented in the client, from an earlier invocation or dependencyinjected.3. The client is responsible for determining compatibility with theservice api it's interested in
  from the Import Package Entry's, prior to unmarshalling a service proxy.
4. If a bundle previously on the stack resolves a class, then thisobject's bundle reference is placedon the top of the stack, it is removed once the current object and allit's fields have been completely deserialized.
  5.    Load bundle from annotation (exact version).
6. No attempt will be made to directly load from wired bundles,always rely on wires,
  otherwise we may utilise an incompatible package / bundle.

  Do we need a graph of the wiring from the remote end?
During serialization (from the remote end) do we need to determine ifa bundle has dependants and send some sort of version range information?When a class descriptor is read in from a stream, the class descriptorcontains information
  about fields and it's serializable supertype class (if it exists)
are also read in from the stream, before any field objects are readin, the declared field typesare visible from the bundle of the current object being deserialized.The objects that will beassigned to those field types must also resolve to those types. Hencebundles being resolved as part
  of deserialization must favour already resolved packages for imports.
What if a bundle requires a specific package version? This is why thebundle must be given firstattempt to resolve an objects class and rely on the bundle dependencyresolution process.OSGi must be allowed to wire up dependencies, we must avoid attemptingto make decisions about
  compatibility and use the current bundle wires instead (our stack).
The BundleReference stack is designed to follow the wires (dependencylinks between bundles),to allow private classes to be resolved, as they're not visible fromother bundles.
We can't rely on annotations to resolve private classes, because wecan't predict the way bundle
  dependency's are resolved in remote JVM's.

  General recommendations for OSGi:
* The service should use as wide a version range as possible forservice api.* It is better to create new service api in a new bundle than toevolve in a backward compatible manner, asan incremental change may not be compatible if additional classes andmethods are missing
  from the client, that the service proxy depends on.
  *    Don't split packages.
* Private implementation classes are ok, provided they remainwithin public exported classes and don't escape, otherwise
  they may not link up properly upon deserialization.
  *    The proxy should minimise the package imports it uses.
* There must be only one compatible service api version installedalready in the client.
  *    Duplicates of incompatible versions of service api are ok.
The catch is, it may not be possible to build the bundle stack withoutsome programming hooks in ObjectInputStream.
Unfortunately we don't have any control over OIS, the necessary hookscould however be added to AtomicMarshalInputStream.
  Cheers,

  Peter.

Re: OSGi - deserialization remote invocation strategy

Reply via email to