Information: For Indexing Emails

2006-08-22 Thread Suba Suresh
I thought I would pass it along if anyone is interested. If the emails are in mbox format "Tropo" as suggested in lucene's faq works perfectly if it is in an imap store. For archived emails stored in mbox format I used mstor.jar from mstor.sourceforge.net with "Tropo". thanks, suba suresh.

Re: indexing emails

2006-06-19 Thread John Haxby
Michael J. Prichard wrote: We are actually grabbing emails by becoming part of the SMTP stream. This part is figured out and we have archived over 600k emails into a mysql database. The problem is that since we currently store the blobs in the DB this databases are getting large and searching

Re: indexing emails

2006-06-19 Thread Michael J. Prichard
net/projects/ol2mbox), if they are 97-2002 format. -Original Message- From: Mike Streeton [mailto:[EMAIL PROTECTED] Sent: 19 June 2006 08:33 To: java-user@lucene.apache.org Subject: RE: indexing emails When you talk about indexing emails are you indexing Outlook mails? We have only foun

Re: indexing emails

2006-06-19 Thread John Haxby
Michael J. Prichard wrote: I am working on indexing emails and want to have a "to" field. I am currently putting all the emails on one line seperated w/ spaces...example: [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] Then i index that with a StandardAnalyzer as follows: d

RE: indexing emails

2006-06-19 Thread Chris Hostetter
: As far as indexing goes index each address in a separate un-tokenized : field not space delimited in a single field. It is also useful to put : the To; CC and BCC in a single field to enable you to search to email INdexing email isn't something i've had to think about a lot in my life .. but if

RE: indexing emails

2006-06-19 Thread Rob Staveley (Tom)
RE: indexing emails When you talk about indexing emails are you indexing Outlook mails? We have only found a few libraries that will do this and all require Outlook to be online at the time i.e. you cannot index PST files standalone. As far as indexing goes index each address in a separate un-tokeni

RE: indexing emails

2006-06-19 Thread Mike Streeton
When you talk about indexing emails are you indexing Outlook mails? We have only found a few libraries that will do this and all require Outlook to be online at the time i.e. you cannot index PST files standalone. As far as indexing goes index each address in a separate un-tokenized field not

Re: indexing emails

2006-06-19 Thread Michael Wechner
- From: karl wettin [mailto:[EMAIL PROTECTED] Sent: 16 June 2006 21:13 To: java-user@lucene.apache.org Subject: Re: indexing emails On Fri, 2006-06-16 at 15:20 -0400, Michael J. Prichard wrote: I am working on indexing emails and want to have a "to" field. I am currently putti

Re: indexing emails --> mutliple "to" emails, setting position same

2006-06-18 Thread Michael J. Prichard
So I have emails with multiple recipients (of course, this is very common). I currently put them all on the same string seperated by space and then tokenize them with Standard Analyzer. I was looking into SynonymAnalyzers and see that you can drop multiple tokens with the same position. Woul

Re: indexing emails

2006-06-18 Thread Michael J. Prichard
From: karl wettin [mailto:[EMAIL PROTECTED] Sent: 16 June 2006 21:13 To: java-user@lucene.apache.org Subject: Re: indexing emails On Fri, 2006-06-16 at 15:20 -0400, Michael J. Prichard wrote: I am working on indexing emails and want to have a "to" field. I am currently putting all the

RE: indexing emails

2006-06-17 Thread Rob Staveley (Tom)
d last name still need a PrefixQuery or WildcardQuery, if you want to search for last names, but it does make some queries possible which would otherwise blow up. -Original Message- From: karl wettin [mailto:[EMAIL PROTECTED] Sent: 16 June 2006 21:13 To: java-user@lucene.apache.org Subje

Re: indexing emails

2006-06-16 Thread karl wettin
On Fri, 2006-06-16 at 15:20 -0400, Michael J. Prichard wrote: > I am working on indexing emails and want to have a "to" field. I am > currently putting all the emails on one line seperated w/ spaces...example: > > [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] >

indexing emails

2006-06-16 Thread Michael J. Prichard
I am working on indexing emails and want to have a "to" field. I am currently putting all the emails on one line seperated w/ spaces...example: [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] Then i index that with a StandardAnalyzer as follows: doc.add(new Field(&qu

Re: Indexing emails from Exchange

2005-06-22 Thread Otis Gospodnetic
Peter, see this: http://www.tropo.com/techno/java/lucene/imap.html http://guests.evectors.it/zoe/ Otis --- Peter Lewis <[EMAIL PROTECTED]> wrote: > Hi all > > Has anyone got an example of how to index emails from Microsoft > Exchange using Lucene? > > TIA > > William > > >

Indexing emails from Exchange

2005-06-22 Thread Peter Lewis
Hi all Has anyone got an example of how to index emails from Microsoft Exchange using Lucene? TIA William - How much free photo storage do you get? Store your holiday snaps for FREE with Yahoo! Photos. Get Yahoo! Photos