[hibernate-dev] Re: [infinispan-dev] Hibernate Search alternative Directory distribution

Manik Surtani Thu, 09 Jul 2009 07:19:24 -0700


On 9 Jul 2009, at 14:57, Emmanuel Bernard wrote:

Here are the concall notes on how to cluster and copy Hibernate indexes using non-file-system approaches.
Forget JBoss Cache, forget plain JGroups, and focus on Infinispan.
Start with Infinispan in replication mode (the most stable code) and then try distribution. It should be interesting to test the dist algo and see how well the L1 cache behaves in a search environment.
For the architecture, we will try the following approaches in decreasing order of interest (if the first one works like a charm, we stick with it):
1. Share the same grid cache between the master and the slaves.
2. Have a local cache on the master where indexing is done, and manually copy the chunks of changed data over to the grid. This requires storing some metadata (namely the list of chunks for a given index and the last update for each chunk) to implement the same algorithm as the one implemented in FSMaster/SlaveDirectoryProvider (incremental copy).
3. Have a local cache on the master where indexing is done, and manually copy the chunks of changed data over to the grid. Each slave copies from the grid to a local version of the index and uses the local version for search.
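To make the incremental copy in options 2 and 3 concrete, here is a minimal sketch, not the actual FSMaster/SlaveDirectoryProvider code. It assumes each chunk carries its own last-update timestamp, and it uses plain java.util.Map instances as stand-ins for the local cache and the grid. The names IncrementalCopy, Chunk and sync are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

public class IncrementalCopy {

    /** Hypothetical chunk holder: the payload plus the time it was last written. */
    public static final class Chunk {
        final byte[] data;
        final long lastUpdate;

        public Chunk(byte[] data, long lastUpdate) {
            this.data = data;
            this.lastUpdate = lastUpdate;
        }
    }

    /**
     * Copies chunks that are missing from the target or older than the source
     * version, then drops target chunks the source no longer lists.
     * Returns the number of chunks copied.
     */
    public static int sync(Map<String, Chunk> source, Map<String, Chunk> target) {
        int copied = 0;
        for (Map.Entry<String, Chunk> entry : source.entrySet()) {
            Chunk existing = target.get(entry.getKey());
            if (existing == null || existing.lastUpdate < entry.getValue().lastUpdate) {
                target.put(entry.getKey(), entry.getValue());
                copied++;
            }
        }
        // Remove chunks that were deleted on the source side.
        target.keySet().retainAll(source.keySet());
        return copied;
    }
}
```

With a real grid, the source's key set would have to come from the stored chunk list rather than from iterating the cache, which is why the metadata question matters.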
When writing the InfinispanDirectory (inspired by the RAMDirectory and the JBossCacheDirectory), one needs to consider that Infinispan has a flat structure. The key has to contain:
- the index name
- the chunk name
Together they will essentially be the unique identifier.
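A composite key along these lines could look like the sketch below. The class name ChunkKey and its fields are hypothetical, and the only assumption is that keys must be serializable and have value-based equals/hashCode so that the same (index name, chunk name) pair always addresses the same entry in a flat key space.

```java
import java.io.Serializable;
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;

/** Hypothetical flat-cache key: index name plus chunk name. */
public final class ChunkKey implements Serializable {
    private static final long serialVersionUID = 1L;

    private final String indexName;
    private final String chunkName;

    public ChunkKey(String indexName, String chunkName) {
        this.indexName = Objects.requireNonNull(indexName, "indexName");
        this.chunkName = Objects.requireNonNull(chunkName, "chunkName");
    }

    public String getIndexName() { return indexName; }

    public String getChunkName() { return chunkName; }

    @Override
    public boolean equals(Object other) {
        if (this == other) return true;
        if (!(other instanceof ChunkKey)) return false;
        ChunkKey that = (ChunkKey) other;
        return indexName.equals(that.indexName) && chunkName.equals(that.chunkName);
    }

    @Override
    public int hashCode() {
        return Objects.hash(indexName, chunkName);
    }

    @Override
    public String toString() {
        return indexName + "/" + chunkName;
    }

    public static void main(String[] args) {
        // A plain Map stands in for the flat cache here.
        Map<ChunkKey, byte[]> cache = new HashMap<>();
        cache.put(new ChunkKey("books", "chunk-0"), new byte[] {1, 2, 3});
        // An equal key built elsewhere finds the same entry.
        System.out.println(cache.containsKey(new ChunkKey("books", "chunk-0")));
    }
}
```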
Each chunk should have its size limited (Lucene does that already, AFAIK).
Question on the metadata: one needs to keep the last update and the list of chunks. Because Infinispan is not queryable, we need to store that as metadata. Should it be:
- on each chunk (i.e. the last update time and the size stored with each chunk)
- on a dedicated metadata chunk, i.e. one metadata chunk per chunk plus a chunk containing the list
- on a single metadata chunk (I fear conflicts and inconsistencies)
On changes or reads, explore the use of Infinispan transactions to ensure RR (repeatable read) semantics. Is it necessary? A file system does not guarantee that anyway.
In the case of replication, make sure an FD back end can be activated in case the grid goes to the unreachable clouds of total inactivity.

FD backend? I presume you mean a cache store. Have a look at the different cache stores we ship with; I reckon a FileCacheStore would do the trick for you.


http://infinispan.sourceforge.net/4.0/apidocs/org/infinispan/loaders/CacheStore.html
http://infinispan.sourceforge.net/4.0/apidocs/org/infinispan/loaders/file/FileCacheStore.html

Question to Manik: do you have a cluster to play with once we reach this stage?

The cluster team does have a set of lab servers used to test, benchmark, etc. You will need to "book" time on this cluster, though, since it is shared between JBC/Infinispan, JGroups and JBoss AS clustering devs.


Cheers
--
Manik Surtani
ma...@jboss.org
Lead, Infinispan
Lead, JBoss Cache
http://www.infinispan.org
http://www.jbosscache.org




_______________________________________________
hibernate-dev mailing list
hibernate-dev@lists.jboss.org
https://lists.jboss.org/mailman/listinfo/hibernate-dev
