I need to build a lucene search engine that can handle very high loads
(hundreds of requests per second) via a web interface.  It will be deployed
on one or more multi-proc servers, with the index pre-generated and
available via an NFS partition.  The index is small enough to fit into RAM,
so assume linux will cache the whole thing (it seems to be now - there's no
performance difference for me between FSDirectory and RAMDirectory).  I'm
new to both Python and Lucene, so I have little experience with what the
best solutions are.

I've looked around for embeddable python web servers, or python http server
classes, and could only find the ones included in the python standard
library (like BaseHTTPServer), which frankly suck (though I'm using it for
development for now, for lack of something better).

My current thoughts:
1) As I understand it (and please correct me if I'm wrong), Python threading
probably won't cut it (especially since I'll likely end up with many python
extensions to Lucene), so I imagine I'll need a multi-process solution
2) I could run under mod_python, though I previously had high hopes of a
standalone solution for ease of deployment/administration - would be great
to start the service by just running a python script, rather than
configuring apache.

Ideas/suggestions/anecdotes?

-ofer

_______________________________________________
pylucene-dev mailing list
[email protected]
http://lists.osafoundation.org/mailman/listinfo/pylucene-dev

Reply via email to