Re: Proposal for simple LCF deployment model

Jack Krupansky Fri, 28 May 2010 07:33:17 -0700

(b) The alternative starting point should probably autocreate thedatabase,and should also autoregister all connectors. This will require a list,somewhere,of the connectors and authorities that are included, and their preferredUI
names for that installation.  This could come from the configuration
information, or from some other place.  Any ideas?

I would like to see two things: 1) A way to request LCF to "dump" allconfiguration parameters, including parameters for all output connections,repositories, jobs, et al to an "LCF config file", and 2) The ability tostart from scratch with a fresh deployment of LCF and feed it that configfile to then create all of the output connections, repository connections,and jobs to match the LCF configuration state desired.

Now, whether that config file is simple XML ala solrconfig.xml can be amatter for debate. Whether it is a separate file from the current configfile can also be a matter for debate.

But, in short, the answer to your question would be that there would be anLCF config file (not just the simple keyword/value file that LCF has forglobal configuration settings) to see the initial output connections,repository connections, et al.

Maybe this config file is a little closer to the Solr schema file. I thinkit feels that way. OTOH, the list of registered connectors, as opposed tothe user-created connections that use those connectors, seems more like Solrrequest handlers that are in solrconfig.xml, so maybe the initial"configuration" would be split into two separate files as in Solr. Or,maybe, the Solr guys have a better proposal for how they would have managedthat split in Solr if they had it to do all over again. My preference wouldbe one file for the whole configuration.

Another advantage of such a config file is that it is easier for people topost problem reports that show exactly how they set up LCF.


-- Jack Krupansky

--------------------------------------------------
From: <karl.wri...@nokia.com>
Sent: Friday, May 28, 2010 5:48 AM
To: <connectors-dev@incubator.apache.org>
Subject: Proposal for simple LCF deployment model

The current LCF standard deployment model requires a number of movingparts, which are probably necessary in some cases, but simply introducecomplexity in others. It has occurred to me that it may be possible toprovide an alternate deployment model involving Jetty, which would reducethe number of moving parts by one (by eliminating Tomcat). A simple LCFdeployment could then, in principle, look pretty much like Solr's.
In order for this to work, the following has to be true:

(1) jetty's basic JSP support must be comparable to Tomcat's.
(2) the class loader that jetty uses for webapp's must provide classisolation similar to Tomcat's. If this condition is not met, we'd need tobuild both a Tomcat and a Jetty version of each webapp.
The overall set of changes that would be required would be the following:
(a) An alternative "start" entry point would need to be coded, which wouldstart Jetty running the lcf-crawler-ui and lcf-authority-service webappsbefore bringing up the agents engine.(b) The alternative starting point should probably autocreate thedatabase, and should also autoregister all connectors. This will requirea list, somewhere, of the connectors and authorities that are included,and their preferred UI names for that installation. This could come fromthe configuration information, or from some other place. Any ideas?(c) There would need to an additional jar produced by the build process,which would be the equivalent of the solr start.jar, so as to make runningthe whole stack trivial.(d) An "LCF API" web application, which provides access to all of thecurrent LCF commands, would also be an obvious requirement to go forwardwith this model.
What are the disadvantages? Well, I think that the main problem would besecurity. This deployment model, though simple, does not control accessto LCF is any way. You'd need to introduce another moving part to dothat.
Bear in mind that this change would still not allow LCF to run using onlyone process. There are still separate RMI-based processes needed for someconnectors (Documentum and FileNet). Although these could in theory bestarted up using Java Activation, a main reason for a separate process inDocumentum's case is that DFC randomly crashes the JVM under which itruns, and thus needs to be independently restarted if and when it dies.If anyone has experience with Java Activation and wants to contributetheir time to develop infrastructure that can deal with that problem,please let me know.
Finally, there is no way around the fact that LCF requires awell-performing database, which constitutes an independent moving part ofits own. This proposal does nothing to change that at all.
Please note that I'm not proposing that the current model go away, butrather that we support both.
Thoughts?
Karl

Re: Proposal for simple LCF deployment model

Reply via email to