[hibernate-dev] Re: [jbosscache-dev] JBoss Cache Lucene Directory

Manik Surtani Tue, 26 May 2009 08:37:16 -0700

Sanne,

Agreed. Could everyone involved please make sure we post to both
hibernate-dev and infinispan-dev (rather than jbosscache-dev) when
discussing anything to do with such integration work, as there are
parallel efforts which can be brought together.


Cheers
Manik

On 25 May 2009, at 10:53, Sanne Grinovero wrote:

Hello,
I'm forwarding this email to Emmanuel and Hibernate Search dev, as I
believe we should join the discussion.
Could we keep both dev-lists (jbosscache-...@lists.jboss.org,
hibernate-dev@lists.jboss.org ) on CC ?

Sanne

2009/4/29 Manik Surtani <ma...@jboss.org>:
On 27 Apr 2009, at 05:18, Andrew Duckworth wrote:
Hello,
I have been working on a Lucene Directory provider based on JBossCache.
My starting point was an implementation Manik had already written, which
pretty much worked with a few minor tweaks. Our use case was to cluster
a Lucene index being used with Hibernate Search in our application, with
the requirements that searching needed to be fast, there was no shared
file system, and it was important that the index was consistent across
the cluster in a relatively short time frame.
Manik's code used a token node in the cache to implement the distributed
lock. During my testing I set up multiple cache copies with multiple
threads reading/writing to each cache copy. I was finding that a lot of
transactions to acquire or release this lock were timing out. Not
understanding JBC well, I modified the distributed lock to use the
JGroups DistributedLockManager. This worked quite well, however the time
taken to acquire/release the lock (~100 ms for both) dwarfed the time to
process the index update, lowering throughput. Even using Hibernate
Search with an async worker thread, there was still a lot of contention
for the single lock, which seemed to limit the scalability of the
solution. I think part of the problem was that our use of HB Search
generates a lot of small units of work (remove index entry, add index
entry) and each of these UOW acquires a new IndexWriter and a new write
lock on the underlying Lucene Directory implementation.
Out of curiosity, I created an alternative implementation based on the
Hibernate Search JMS clustering strategy. Inside JBoss Cache I created a
queue node and each slave node in the cluster creates a separate queue
underneath where indexing work is written:

 /queue/slave1/[work0, work1, work2 ....]
           /slave2
           /slave3

etc
In each cluster member a background thread runs continuously. When it
wakes up, it decides if it is the master node or not (currently it
checks if it is the view coordinator, but I'm considering changing it to
use a longer lived distributed lock). If it is the master, it merges the
tasks from each slave queue and updates the JBCDirectory in one go; it
can safely do this with only local VM locking. This approach means that
all the slave nodes can write to their queue without needing a global
lock that any other slave or the master would be using. On the master,
it can perform multiple updates in the context of a single Lucene index
writer. With a cache loader configured, work that is written into the
slave queue is persistent, so it can survive the master node crashing,
with automatic fail over to a new master, meaning that eventually all
updates should be applied to the index. Each work element in the queue
is time stamped to allow them to be processed in order (requires time
synchronisation across the cluster) by the master. For our workload the
master/slave pattern seems to improve the throughput of the system.
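The merge step described above can be sketched roughly as follows. This
is only an illustration of ordering work from several slave queues by
timestamp, not the code discussed in this thread: the class
QueueMergeSketch, the Work type and its fields are hypothetical names,
and plain in-memory collections stand in for the /queue/slaveN cache
nodes.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Illustrative only: plain collections stand in for the /queue/slaveN cache nodes. */
final class QueueMergeSketch {

    /** Hypothetical work item; the timestamp is set by the slave that queued it. */
    static final class Work {
        final long timestamp;
        final String operation;

        Work(long timestamp, String operation) {
            this.timestamp = timestamp;
            this.operation = operation;
        }
    }

    /** Gathers the work from every slave queue and orders it by timestamp. */
    static List<Work> mergeInOrder(Map<String, List<Work>> slaveQueues) {
        List<Work> merged = new ArrayList<>();
        for (List<Work> queue : slaveQueues.values()) {
            merged.addAll(queue);
        }
        // The sort is stable: equal timestamps keep the order in which they were gathered.
        merged.sort(Comparator.comparingLong((Work w) -> w.timestamp));
        return merged;
    }

    public static void main(String[] args) {
        Map<String, List<Work>> queues = new LinkedHashMap<>();
        queues.put("slave1", Arrays.asList(new Work(10, "remove doc 7"), new Work(30, "add doc 7")));
        queues.put("slave2", Arrays.asList(new Work(20, "add doc 9")));

        // A master would apply these through a single index writer, in this order.
        for (Work w : mergeInOrder(queues)) {
            System.out.println(w.timestamp + " " + w.operation);
        }
    }
}
```

Because the ordering relies only on the timestamps written by each
slave, clock skew between cluster members can reorder work items, which
is why the design depends on time synchronisation.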

Currently I'm refining the code and I have a few JBoss Cache questions
which I hope you can help me with:
1) I have noticed that under high load I get LockTimeoutExceptions
writing to /queue/slave0 when the lock owner is a transaction working on
/queue/slave1, i.e. the same lock seems to be used for 2 unrelated nodes
in the cache. I'm assuming this is a result of the lock striping
algorithm; if you could give me some insight into how this works that
would be very helpful. Bumping up the cache concurrency level from 500
to 2000 seemed to reduce this problem, however I'm not sure if it just
reduces the probability of a random event or if there is some level that
will be sufficient to eliminate the issue.

It could well be the lock striping at work. As of JBoss Cache 3.1.0 you
can disable lock striping and have one lock per node. While this is
expensive in that if you have a lot of nodes, you end up with a lot of
locks, if you have a finite number of nodes this may help you a lot.
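For readers unfamiliar with the technique, lock striping in general can
be sketched as below. This is a generic illustration and not JBoss
Cache's actual implementation: the class LockStripingSketch, its methods
and the use of String.hashCode() to pick a stripe are all assumptions
made for the example.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.concurrent.locks.ReentrantLock;

/** Illustrative only: a generic striped lock pool, not JBoss Cache's actual code. */
final class LockStripingSketch {

    private final ReentrantLock[] stripes;

    /** concurrencyLevel is the fixed number of locks shared by all nodes. */
    LockStripingSketch(int concurrencyLevel) {
        stripes = new ReentrantLock[concurrencyLevel];
        for (int i = 0; i < stripes.length; i++) {
            stripes[i] = new ReentrantLock();
        }
    }

    /** Maps a node path to one of the shared locks via its hash code (an assumed scheme). */
    ReentrantLock lockFor(String nodePath) {
        return stripes[Math.floorMod(nodePath.hashCode(), stripes.length)];
    }

    /** Counts how many distinct locks guard the given node paths. */
    int distinctLocks(List<String> nodePaths) {
        Set<ReentrantLock> used = new HashSet<>();
        for (String path : nodePaths) {
            used.add(lockFor(path));
        }
        return used.size();
    }

    public static void main(String[] args) {
        List<String> paths = Arrays.asList(
                "/queue/slave0", "/queue/slave1", "/queue/slave2", "/queue/slave3", "/queue/slave4");
        for (int level : new int[] {1, 4, 500, 2000}) {
            int locks = new LockStripingSketch(level).distinctLocks(paths);
            System.out.println(level + " stripes -> " + locks + " distinct locks for "
                    + paths.size() + " nodes");
        }
    }
}
```

In a scheme like this, two unrelated nodes contend whenever their hashes
land on the same stripe. Adding stripes makes that less likely, but only
a dedicated lock per node rules it out.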

2) Is there a reason to use separate nodes for each slave queue? Will it
help with locking, or can each slave safely insert to the same parent
node in separate transactions without interfering or blocking each
other? If I can reduce it to a single queue I think that would be a more
elegant solution. I am setting lockParentForChildInsertRemove to false
for the queue nodes.

It depends. Are the work objects attributes in /queue/slaveN? Remember
that the granularity for all locks is the node itself, so if all slaves
write to a single node, they will all compete for the same lock.

3) Similarly, is there any reason why the master should/shouldn't take
responsibility for removing work nodes that have been processed?

Not quite sure I understand your design - so this distributes the work
objects and each cluster member maintains indexes locally? If so, you
need to know when all members have processed the work objects before
removing these.

Thanks in advance for help, I hope to make this solution general purpose
enough to be able to contribute back to Hibernate Search and JBC teams.

Thanks for offering to contribute. :-) One other thing that may be of
interest is that I just launched Infinispan [1] [2] - a new data grid
product. You could implement a directory provider on Infinispan too - it
is a lot more efficient than JBC at many things, including concurrency.
Also, Infinispan's lock granularity is per-key/value pair. So a single
distributed cache would be all you need for work objects. Also, another
thing that could help is the eager locking we have on the roadmap [3]
which may make a more traditional approach of locking + writing indexes
to the cache more feasible. I'd encourage you to check it out.

[1] http://www.infinispan.org
[2] http://infinispan.blogspot.com/2009/04/infinispan-start-of-new-era-in-open.html
[3] https://jira.jboss.org/jira/browse/ISPN-48
--
Manik Surtani
ma...@jboss.org
Lead, Infinispan
Lead, JBoss Cache
http://www.infinispan.org
http://www.jbosscache.org




_______________________________________________
jbosscache-dev mailing list
jbosscache-...@lists.jboss.org
https://lists.jboss.org/mailman/listinfo/jbosscache-dev






_______________________________________________
hibernate-dev mailing list
hibernate-dev@lists.jboss.org
https://lists.jboss.org/mailman/listinfo/hibernate-dev
