Hi Scott,

I saw your email and thought of some work I have been doing recently for matching text on the fly. If you are not going to be keeping the emails for later searching then this may provide a faster and easier way of checking your email. It involves using monq which is a java api - http://www.ebi.ac.uk/Rebholz-srv/whatizit/software

http://www.ebi.ac.uk/~kirsch/JfaWiki/index.php/Main_Page

As your emails come in you can pass them through this software and any matched regular expressions will trigger java callbacks. The best bit is that no matter how many regular expressions you have (I tried 30,000) you can process 1Mb per second of data, this is because the program just runs through the Finite State Automaton that gets created one character at a time... (this is demoed graphically here - http://www-sr.informatik.uni-tuebingen.de/~buehler/AC/AC.html if you choose the Aho/Corasick algorithm)

Anyways, off topic, so if you want more advice I can email you direct and I'm sure someone else on the list can help you out with the Lucene-related answer...

Ta

steve

-----Original Message-----
From: Scott Smith [mailto:[EMAIL PROTECTED]
Sent: 12 May 2006 02:29
To: lucene-user@jakarta.apache.org
Subject: Can lucene do this?

I'm building an application which has to provide "real-time" searching of emails as they come in. I have a number of search strings that I need to apply against each email as it comes in and then do something with the email based on which search string(s) get a hit.



My initial thought was to create a lucene index of the emails received in the last N seconds (where N is around 5 since I don't have to be quite real-time) in a memory directory, do my searches and then delete the index and create a new index for emails received in the next 5
seconds.   I'm a little concerned because the number of search strings
will probably grow over time and so there is a bit of a scalability issue-though I'm not sure there's anyway around that other than doing parallel processing on different machines.



I'm wondering if anyone has any experience doing this kind of thing and has additional or alternate suggestions??



Scott

_________________________________________________________________
Are you using the latest version of MSN Messenger? Download MSN Messenger 7.5 today! http://join.msn.com/messenger/overview


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to