Scott,

Have a look at MemoryIndex (in contrib/memory); it is perfect for this sort of thing. You'd index each e-mail individually into its own MemoryIndex and then run all of your queries against it, checking each one for a hit.
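
To make the flow concrete, here is a rough, stdlib-only sketch of the per-message pattern. It is a toy stand-in, not Lucene's API: the class EmailRouter, its methods, and the "all terms must appear" matching rule are made up for illustration. With the real MemoryIndex, as far as I recall, you would create one instance per e-mail, add the text with addField(fieldName, text, analyzer), and call search(query) for each parsed Query, treating a score above 0.0f as a hit.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Toy stand-in for the per-message pattern; this is NOT Lucene's API.
public class EmailRouter {
    // Hypothetical saved searches: name -> terms that must all appear.
    private final Map<String, List<String>> savedSearches = new LinkedHashMap<>();

    public void addSearch(String name, String... requiredTerms) {
        List<String> terms = new ArrayList<>();
        for (String t : requiredTerms) {
            terms.add(t.toLowerCase(Locale.ROOT));
        }
        savedSearches.put(name, terms);
    }

    // "Index" a single e-mail: lower-case it and split it into a set of terms.
    static Set<String> indexEmail(String text) {
        Set<String> terms = new HashSet<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                terms.add(token);
            }
        }
        return terms;
    }

    // Run every saved search against the one-e-mail index; return the names that hit.
    public List<String> match(String emailText) {
        Set<String> index = indexEmail(emailText);
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, List<String>> e : savedSearches.entrySet()) {
            if (index.containsAll(e.getValue())) {
                hits.add(e.getKey());
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        EmailRouter router = new EmailRouter();
        router.addSearch("billing", "invoice", "overdue");
        router.addSearch("outage", "server", "down");
        // Prints [billing]
        System.out.println(router.match("Your invoice #42 is overdue."));
    }
}
```

The point of the design is that the index only ever holds one message, so it is built and thrown away per e-mail and the cost per message grows with the number of saved searches rather than with the size of a mail archive.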

        Erik


On May 11, 2006, at 9:28 PM, Scott Smith wrote:

I'm building an application which has to provide "real-time" searching
of emails as they come in.  I have a number of search strings that I
need to apply against each email as it comes in and then do something
with the email based on which search string(s) get a hit.



My initial thought was to create a lucene index of the emails received
in the last N seconds (where N is around 5 since I don't have to be
quite real-time) in a memory directory, do my searches and then delete
the index and create a new index for emails received in the next 5
seconds.  I'm a little concerned because the number of search strings
will probably grow over time, so there is a bit of a scalability
issue, though I'm not sure there's any way around that other than doing
parallel processing on different machines.



I'm wondering if anyone has any experience doing this kind of thing and
has additional or alternative suggestions?



Scott






