Andy Armstrong wrote:
Vadim Gritsenko wrote:
Firstly thanks folks for Xindice; it seems to provide a load of
functionality that we're looking for.
Secondly: has there been any thought about supporting replication
between xindice servers? If not then it's something we'd be
interested in working on.
No, there is no replication now. How do you want to approach it?
Slowly :)
Before doing anything else we need to alter our product to use Xindice
- at the moment it's just storing documents on the filesystem. Once
that's done I'll be thinking about the best ways to:
* build full text indexes of the documents in a collection
* replicate between Xindice servers
We currently solve the full text problem by loading our whole document
tree into memory at startup and building an index at that time. We
then periodically scan the filesystem for updated documents and
reload/reindex them accordingly so we need to be able to replace that
mechanism without merely loading all the documents from Xindice at
startup.
The replication mechanism I'm most familiar with is the one Lotus
Domino uses. The main problem if you're doing peer to peer replication
is how to resolve update conflicts and I suspect that the 'right'
solution depends to a significant extent on the application so my
feeling is that that's the area that needs the most consideration.
The full text indexing thing actually takes precedence at the moment.
I'll be having a look at that next week if anyone has any bright ideas :)
The Apache Lucene project is a full text search engine. It is included
within Cocoon for doing full text searches of Cocoon based web sites
(see the Cocoon Lucene block in Cocoon under src/blocks/lucene for the
code). With the right interface, allowing Lucene to crawl documents in
your Xindice database, it could then build for you a full text index
that you can query at your leasure. Probably something that could be
quite useful as an option for all XIndice users.
Regards, Upayavira