Re: Cache invalidation

Joerg von Frantzius Mon, 06 Feb 2006 16:39:08 -0800

Hello Wes,

thanks for that analysis, I must admit I haven't really thought about what this means with optimistic txs!

While it is good that a stale read under optimistic transactions won't lead to wrong persistent data under any circumstances (your #2), I'd rather avoid optimistic verification exceptions where possible, though.

Concerning #1: the point of reading from P-NT instances outside transactions, for me, is that read requests can be satisfied from cache without round-tripping to the DB. For a typical web application that you can browse, and where most requests are read requests, I find this the most effective. Now if my P-NT instances are never invalidated/hollowed out, e.g. because they never take part in a datastore transaction, I might be showing wrong information to the user, which, depending on the application, can be highly undesirable.

The next thing I'd consequently ask for is that P-NT instances are in fact made transactionally consistent, in that they should also be hollowed out automatically if they have siblings by id that went from P-dirty to P-clean in some transaction. That's what I currently must imitate in a clumsy way after each transaction.
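
To illustrate the behaviour I am asking for, here is a minimal sketch in plain Java. It is a toy model only: ToyInstance, ToyManager and ToyFactory are hypothetical names that merely stand in for a persistent instance, a PM and a PMF, and none of them is part of the JDO API. It assumes the factory knows every manager it has handed out, and that a commit hollows the cached instance with the same id in each of them.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy model only: none of these types are part of the JDO API.
final class ToyInstance {
    final Object id;
    String value;    // stands in for the loaded field state
    boolean hollow;  // true once the cached state has been discarded

    ToyInstance(Object id, String value) {
        this.id = id;
        this.value = value;
    }
}

// Stands in for one manager's first-level cache.
final class ToyManager {
    private final Map<Object, ToyInstance> cache = new HashMap<>();
    private final Map<Object, String> datastore;

    ToyManager(Map<Object, String> datastore) {
        this.datastore = datastore;
    }

    // Nontransactional read: served from the cache unless the instance is hollow.
    String read(Object id) {
        ToyInstance inst = cache.computeIfAbsent(id, k -> new ToyInstance(k, datastore.get(k)));
        if (inst.hollow) {
            inst.value = datastore.get(id); // reload state on first access after hollowing
            inst.hollow = false;
        }
        return inst.value;
    }

    // Discard the cached state but keep the instance itself in the cache.
    void hollow(Object id) {
        ToyInstance inst = cache.get(id);
        if (inst != null) {
            inst.hollow = true;
            inst.value = null;
        }
    }
}

// Stands in for the factory, which remembers every manager it has created.
final class ToyFactory {
    final Map<Object, String> datastore = new HashMap<>();
    private final List<ToyManager> managers = new ArrayList<>();

    ToyManager newManager() {
        ToyManager m = new ToyManager(datastore);
        managers.add(m);
        return m;
    }

    // Write the new value, then hollow the instance with that id in every
    // manager (for simplicity this includes the manager that did the write).
    void commit(Object id, String newValue) {
        datastore.put(id, newValue);
        for (ToyManager m : managers) {
            m.hollow(id);
        }
    }
}
```

With two managers from the same factory, a read through the first one after a commit made elsewhere reloads the new value instead of returning the stale cached one. That is the effect I currently have to imitate by hand.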

This all would make P-NT much safer to use, and more useful in consequence, IMHO...


Regards,
Jörg

Wes Biggs schrieb:

I agree that it would be nice to change the method signatures to "evictById" for those that take OIDs in order to avoid confusion.
To clarify what I mean about persistent nontransactional objects, see section 5.6.1 of the spec: "A persistent-nontransactional instance transitions to persistent-clean if any managed field is accessed when a datastore transaction is in progress. The state of the instance in memory is discarded and the state is loaded from the datastore."
If you are running with an optimistic transaction instead, you'll get an optimistic verification exception at commit time. So I guess it is possible to read stale data from the instance in the PM cache under a couple of scenarios:
1. Reading previously loaded fields of a P-NT instance outside of a transaction.
2. Reading previously loaded fields of a P-NT instance inside an optimistic transaction.
In these cases, I think you're right that it would be necessary to hollow the instances in order to be absolutely sure that no stale data is read after an L2 cache evict().
On the other hand, if you're in an optimistic transaction, don't you want to retain the previously read values (they represent the ACID guarantee from the optimistic transaction)? So the only case where it might make sense to me is #1 above, and that seems debatable to me. Do most people using P-NT objects expect them to be consistent with the L2 cache at all times? Or are they expected to act like a limited form of an optimistic transaction?
I don't have a strong opinion about this, I'm just trying to fully articulate the question.
Wes

Joerg von Frantzius wrote:
Hi Wes,

thanks for your answer, please see my comments below.

Wesley Biggs schrieb:
Joerg von Frantzius wrote:
The problem here is that either evict() accepts only PC objects, not object ids, so we have to call PM.getObjectById() beforehand. If no object for that id was present, we're instantiating a hollow object here only to discard it afterwards, which is not very efficient.
I'm not quite parsing your "either" here, sorry. But DataStoreCache.evict() accepts object IDs. I'm not sure I see the necessity of calling PM.evict() as well, unless you have some particularly long-lived transactions.
We're doing nontransactional reads on long-living objects, so I guessed we needed to call PM.evict() to avoid accessing stale field data.
You're of course right about DataStoreCache.evict() accepting IDs, thanks for pointing that out. I had just seen the same method signature, and so I assumed the parameter semantics were also the same.
Calling it evictById() probably would be less misleading, even more so as a mistake here won't show up immediately. Also, if you only have a jar without source code, the signatures are absolutely indistinguishable (which of course is not an excuse for not having read the spec thoroughly enough ;)
As we really want cache invalidation here, not eviction, this is even worse. For this purpose, it would be far more convenient to have some method like invalidateCachesFor(Object id) on PersistenceManagerFactory.
That's the intention of DataStoreCache.evict(). The semantics are different from those of PM.evict().
Only now I start understanding that I was misled by the word evict() for the L2 cache: as the user never gets hold of an L2 cache object anyway (an L1-cache object will be created for that), he shouldn't need to care whether the L2 cache internally needs to throw away (evict) some object in order to invalidate cached state. Spec says "/The evict methods are hints to the implementation that the instances referred to by the object ids are stale and should be evicted from the cache./" It might be nit-picking, but I think it would be clearer if the method was called invalidateById(), which would be natural for some cache interface, and if the explanation said "/that the object state referred to by the object ids should be discarded/"
Also, the spec doesn't say anything about DataStoreCache.evict() having any impact on P-nontrans instances. So I still need to go to every PM and evict there as well, which is very inconvenient.
Or does the "evict" row in table 2 for P-nontrans really apply to /both/ evict() methods, not only PM.evict()!? The RI JPOX isn't doing anything like that, by the way.
To make our wish complete ;) this method would transition all non-transactional instances to hollow for that id, for all the PMs the PMF has given out. All transactional objects with that id should be transitioned to hollow after their transaction has completed (either with commit or rollback).
Persistent nontransactional instances will have to be revalidated against the datastore (or cache thereof) before being re-enlisted in a transaction anyway. The behavior you mention is a good way to implement that, but it doesn't need to be mandated (hollow is not a user-visible state).
I'm not sure what you mean by mandating here? I'd just like to make sure that invalidated non-transactional instances will reload state upon next read access, without having to iterate all PMs. Also, I'd rather not have a call to PM.getObjectById() afterwards return a new Java object for the same id, which I guess is the case after calling PM.evict(PM.getObjectById(id)).
If a method invalidateById() existed, I'd see the sense of evict() in releasing the associated memory. evict() currently does two things at the same time: evicting and transitioning to hollow. For (distributed) cache invalidation, I find it sensible to desire only the latter.
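
The distinction I have in mind can be sketched in a few lines of plain Java. Again this is only a toy model under my own assumptions: CachedObject and IdentityCache are hypothetical names, not JDO types, and evict() here models the behaviour I am guessing at above (the cache forgets the object entirely), which may not be what a given implementation really does.

```java
import java.util.HashMap;
import java.util.Map;

// Toy model only: these types are not part of the JDO API.
final class CachedObject {
    final Object id;
    String state; // null means "hollow": state must be reloaded on next read

    CachedObject(Object id) {
        this.id = id;
    }
}

final class IdentityCache {
    private final Map<Object, CachedObject> byId = new HashMap<>();
    private final Map<Object, String> datastore;

    IdentityCache(Map<Object, String> datastore) {
        this.datastore = datastore;
    }

    // Returns the cached object for this id, creating a hollow one if absent.
    CachedObject lookup(Object id) {
        return byId.computeIfAbsent(id, CachedObject::new);
    }

    // Reads the state, loading it from the datastore only when hollow.
    String read(CachedObject o) {
        if (o.state == null) {
            o.state = datastore.get(o.id);
        }
        return o.state;
    }

    // Invalidation: discard the state, keep the object and its identity.
    void invalidate(Object id) {
        CachedObject o = byId.get(id);
        if (o != null) {
            o.state = null;
        }
    }

    // Eviction as modelled here: forget the object, so the next lookup
    // creates a different Java object for the same id.
    void evict(Object id) {
        byId.remove(id);
    }
}
```

After invalidate(id), lookup(id) still returns the same Java object and the next read reloads its state. After evict(id), lookup(id) hands back a new object, which is what I'd like to avoid for pure cache invalidation.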
Regards,
Jörg
