In the past I hit a scalability limit with BaseX - a billion+ nodes kind of a 
made querying it a bit slower than I liked.

I thought I'd try and address this, so I’ve written some code and placed it in 
GitHub: https://github.com/jameshnsears/xqa-documentation

What I've done is proof of concept, that's all - no way "finished". I'm 
emailing the list in the hope that what I've done so far might generate some 
constructive criticism. Maybe my approach has potential, maybe it doesn't?

There are only four components so far, the first three are Docker containers:
* an ActiveMQ instance
* a load balancer
* a shard
* a command line client exists to load the XML, from file, into an ActiveMQ 
queue.

The software requires close to zero configuration. For example, each shard you 
start will automatically receive XML from the load balancer. And the load 
balancer distributes XML so that each shard holds the same # of documents.

There's a Travis project associated with the above link - it shows how easy it 
is to run the software end to end.

So far my effort is all about ingesting the XML, before I move further I 
thought I'd canvass some feedback - so if anyone has any then please give it :-)

Thanks.

Reply via email to