Re: Genral application architecture question

André Warnier Wed, 26 Sep 2007 12:52:31 -0700

The least that can be said is that people on this list are eager tohelp. Thanks for the ideas, and keep them coming.

About this "gen(e)ral application architecture question", I would justlike to narrow down the scope a bit, if it's allright for everyone.

I do already have the full-text indexing and search engine, and theapplication based on it, so I don't really need more choices there.

Also, the storage architecture for the original documents is fixed.

It is based on the files individually stored on disk, spread out in somekind of multi-level directory structure, not in a database.To put this in another way : I am grateful for the miscellaneoussuggestions in those respects, but I cannot afford right now to changethose parts of the system.

My question was thus - in my intention - centered basically the linkbetween the two : once the user has found and displayed on a web page,via the search engine, the meta-data and text of a document (storedwithin the indexing and search engine), and next to it a URL link to theoriginal document itself, how to deliver this original document asefficiently as possible.The URL link to the document contains an "abstract" path identifier (nota path) which must be translated into a real path on the documentserver, which itself is located on the same host or on a different host.The translation cannot be calculated, it needs to use a tableassociating identifiers to the path they represent, and this table is atthe base a simple flat text file located on the document server, andmust remain so for the time being.The translation is currently effected by a separate single-process"document server", and it is this document server that I am thinking ofreplacing by an Apache2/mp2 based dedicated server.

The first part of the question was thus aimed at finding out ifrewriting the document server on the base of Apache2/mod_perl2 seemed agood idea, rather than developing oneself a stand-alone forking orthreaded new document server.

The general gist of the responses seems to indicate that it is, or atleast nobody so far came in against it.

The second part of the question was to sound out what solutions existed,in a multi-process or multi-threaded document server thus based onAP2/mp2, to share the id/path translation table as much as possiblebetween document request handlers so as to avoid mutiple reads andparses of this table, which originally is in a flat text file on diskand for the time being must remain so.

I think that part has now been "forked off" to the separate thread
"Re: Sharing data between many requests".

Thanks to all.

Re: Genral application architecture question

Reply via email to