Does anyone know of any resources for parsing mail headers to build
threaded storage of mailing list traffic?
I'm looking in particular for how to correlate/parse Message-ID
headers, with Thread-Topic and Thread-Index
Thanks in advance
-Dov
Does anybody know of a tag or script somewhere that will parse out email
headers? Michael Dinowitz mentioned CFX_ODSMime, but I am not finding much
in the way of documentation for it and it looks like it does a lot more that
just parse the header.
Thanks
-- Jeff
Jeff Chastain wrote:
Does anybody know of a tag or script somewhere that will parse out email
headers?
What do you need to extract? Does it need to run under CF MX (which has
a ug in CFPOP which strips all linebreaks)?
Jochem
, February 21, 2003 2:04 PM
To: CF-Talk
Subject: Re: Parsing Mail Headers
Jeff Chastain wrote:
Does anybody know of a tag or script somewhere that will parse out
email headers?
What do you need to extract? Does it need to run under CF MX (which has
a ug in CFPOP which strips all linebreaks
The CFX_ODSMime tag takes all the headers and puts it into a query that you can
loop over. If you want, I can explain the code sample I sent you in reference to
the tag. The problem with parsing email headers is that there is a LOT of
information and the format of the information is very poor.
goes, yes I have been collecting headers
for the last several days and there is very little in common with them.
Thanks
-- Jeff
-Original Message-
From: Michael Dinowitz [mailto:[EMAIL PROTECTED]]
Sent: Friday, February 21, 2003 2:20 PM
To: CF-Talk
Subject: Re: Parsing Mail Headers
information. I tried parsing by linebreak, which did not work and I was not
sure how accurate that would be.
Thanks
-- Jeff
-Original Message-
From: Jochem van Dieten [mailto:[EMAIL PROTECTED]]
Sent: Friday, February 21, 2003 2:04 PM
To: CF-Talk
Subject: Re: Parsing Mail Headers
Jeff Chastain wrote:
I am running CFMX and am specifically looking for the message ID and reply
information.
cfscript
mID = REReplaceNoCase(headers,^.*messageid: ([^]*).*$,\1);
irt = REReplaceNoCase(headers,^.*in-reply-to: ([^]*).*$,\1);
etc.
/cfscript
I tried parsing by linebreak, which did
Ah. It does some other stuff with the message's body, but that's besides the
point. Its just a parser tag and does nothing else. Once I get a free second
(yeh right!) I'll probably write some regex patterns for parsing mail headers.
RFC compliant ones that'll reject malformed mail.
Michael
Michael Dinowitz wrote:
Um, the code I sent you does that already. Otherwise, you can do a RegEx looking
for the start text of Message-ID and an end text of . The regex below will get
the message ID for you from a raw header
Message-ID: ([^]+)
The and are part of the messageid, without them
Michael Dinowitz wrote:
Ah. It does some other stuff with the message's body, but that's besides the
point. Its just a parser tag and does nothing else. Once I get a free second
(yeh right!) I'll probably write some regex patterns for parsing mail headers.
RFC compliant ones that'll reject
They are standard inside the header as a container for the message-id. When I
store the message-id for a post, I remove them as unneeded. When I send a post,
they are added in as they have to be used for ALL posts. Just a preference on
how to store and use as each will work for storage and
(yeh right!) I'll probably write some regex patterns for parsing mail
headers.
RFC compliant ones that'll reject malformed mail.
That would be nice. If you start rejecting Dave's emails because they
don't have a vaild References header at least some other people stand a
chance of becomming top
13 matches
Mail list logo