OT: Parsing Mail Headers via JavaMail (for list thread management)

2006-02-13 Thread Katz, Dov B \(IT\)
Does anyone know of any resources for parsing mail headers to build threaded storage of mailing list traffic? I'm looking in particular for how to correlate/parse Message-ID headers, with Thread-Topic and Thread-Index Thanks in advance -Dov

Parsing Mail Headers

2003-02-21 Thread Jeff Chastain
Does anybody know of a tag or script somewhere that will parse out email headers? Michael Dinowitz mentioned CFX_ODSMime, but I am not finding much in the way of documentation for it and it looks like it does a lot more that just parse the header. Thanks -- Jeff

Re: Parsing Mail Headers

2003-02-21 Thread Jochem van Dieten
Jeff Chastain wrote: Does anybody know of a tag or script somewhere that will parse out email headers? What do you need to extract? Does it need to run under CF MX (which has a ug in CFPOP which strips all linebreaks)? Jochem

RE: Parsing Mail Headers

2003-02-21 Thread Jeff Chastain
, February 21, 2003 2:04 PM To: CF-Talk Subject: Re: Parsing Mail Headers Jeff Chastain wrote: Does anybody know of a tag or script somewhere that will parse out email headers? What do you need to extract? Does it need to run under CF MX (which has a ug in CFPOP which strips all linebreaks

Re: Parsing Mail Headers

2003-02-21 Thread Michael Dinowitz
The CFX_ODSMime tag takes all the headers and puts it into a query that you can loop over. If you want, I can explain the code sample I sent you in reference to the tag. The problem with parsing email headers is that there is a LOT of information and the format of the information is very poor.

RE: Parsing Mail Headers

2003-02-21 Thread Jeff Chastain
goes, yes I have been collecting headers for the last several days and there is very little in common with them. Thanks -- Jeff -Original Message- From: Michael Dinowitz [mailto:[EMAIL PROTECTED]] Sent: Friday, February 21, 2003 2:20 PM To: CF-Talk Subject: Re: Parsing Mail Headers

Re: Parsing Mail Headers

2003-02-21 Thread Michael Dinowitz
information. I tried parsing by linebreak, which did not work and I was not sure how accurate that would be. Thanks -- Jeff -Original Message- From: Jochem van Dieten [mailto:[EMAIL PROTECTED]] Sent: Friday, February 21, 2003 2:04 PM To: CF-Talk Subject: Re: Parsing Mail Headers

Re: Parsing Mail Headers

2003-02-21 Thread Jochem van Dieten
Jeff Chastain wrote: I am running CFMX and am specifically looking for the message ID and reply information. cfscript mID = REReplaceNoCase(headers,^.*messageid: ([^]*).*$,\1); irt = REReplaceNoCase(headers,^.*in-reply-to: ([^]*).*$,\1); etc. /cfscript I tried parsing by linebreak, which did

Re: Parsing Mail Headers

2003-02-21 Thread Michael Dinowitz
Ah. It does some other stuff with the message's body, but that's besides the point. Its just a parser tag and does nothing else. Once I get a free second (yeh right!) I'll probably write some regex patterns for parsing mail headers. RFC compliant ones that'll reject malformed mail. Michael

Re: Parsing Mail Headers

2003-02-21 Thread Jochem van Dieten
Michael Dinowitz wrote: Um, the code I sent you does that already. Otherwise, you can do a RegEx looking for the start text of Message-ID and an end text of . The regex below will get the message ID for you from a raw header Message-ID: ([^]+) The and are part of the messageid, without them

Re: Parsing Mail Headers

2003-02-21 Thread Jochem van Dieten
Michael Dinowitz wrote: Ah. It does some other stuff with the message's body, but that's besides the point. Its just a parser tag and does nothing else. Once I get a free second (yeh right!) I'll probably write some regex patterns for parsing mail headers. RFC compliant ones that'll reject

Re: Parsing Mail Headers

2003-02-21 Thread Michael Dinowitz
They are standard inside the header as a container for the message-id. When I store the message-id for a post, I remove them as unneeded. When I send a post, they are added in as they have to be used for ALL posts. Just a preference on how to store and use as each will work for storage and

Re: Parsing Mail Headers

2003-02-21 Thread Michael Dinowitz
(yeh right!) I'll probably write some regex patterns for parsing mail headers. RFC compliant ones that'll reject malformed mail. That would be nice. If you start rejecting Dave's emails because they don't have a vaild References header at least some other people stand a chance of becomming top