Re: DKIM_VERIFIED Test

Ben Lentz Sun, 29 Oct 2006 09:41:56 -0800

I can't figure out why all my systems, all running sendmail 8.13.8, the best MTA in the world (TM), are not transmitting RFC2822 compliant email over the wire. *That's* what makes my head spin. If it were MS Exchange, I could see it.


Below is Anthony's response.

Ben Lentz wrote, On 29/10/06 1:26 AM:

However, after providing all this information to Anthony Howe, developer
of milter-spamc he's responded with:

I'm going to reject this patch on the grounds that I claim the DKIM
test in SpamAssassin is wrong. RFC 2822 line endings for ALL headers,
body lines, and the blank line separating the two are CRLF, not LF.


The problem with this line of reasoning, and I believe the reason why we
ended up with the practical solution of looking at the line endings of
the first lines and using what we find for the rest, is that RFC2822
applies only to the mail as it is sent between computers to an MTA. We
found that we could not count on the line endings conforming to RFC2822
at the time that it is sent to SpamAssassin.

To quote from RFC2822:

 This specification is intended as a definition of what message
 content format is to be passed between systems. Though some message
 systems locally store messages in this format (which eliminates the
 need for translation between formats) and others use formats that
 differ from the one specified in this standard, local storage is
 outside of the scope of this standard.

We ran into problems when we did anything other than decide on what line
ending format the message was using and then use that when we add headers.

It seems to me that milter-spamc is making the same mistake that we did,
which is to assume that it is always ok to add a header in RFC2822
format. As long as it is not acting as a filter of mail in transit to an
MTA, then it cannot rely on RFC2822. In practice, we see mail systems
that internally use RFC2822 format _except_ for using the newline
convention of the local OS, only taking care of that aspect of RFC2822
when the mail is sent out to or received from other MTAs. Absent a
standard, all we can do is figure out which newline is being used and do
the same with any headers that are added.

 -- sidney
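
The approach sidney describes (look at the line ending of the first line, then use the same one for any added header) can be sketched roughly as follows. This is a minimal illustration in Python, not SpamAssassin's or milter-spamc's actual code; the function names `detect_newline` and `add_header` are hypothetical, and falling back to LF when no newline is found is an assumption made here.

```python
def detect_newline(message: bytes) -> bytes:
    """Return the line ending used by the first line: CRLF or LF."""
    first_lf = message.find(b"\n")
    if first_lf > 0 and message[first_lf - 1:first_lf] == b"\r":
        return b"\r\n"
    # Assumption: default to LF when the first line ends in a bare LF
    # or when the message contains no newline at all.
    return b"\n"


def add_header(message: bytes, name: str, value: str) -> bytes:
    """Prepend a header using the newline convention the message already uses."""
    newline = detect_newline(message)
    header = f"{name}: {value}".encode("ascii") + newline
    return header + message


lf_mail = b"Subject: test\n\nbody line\n"
crlf_mail = b"Subject: test\r\n\r\nbody line\r\n"
print(add_header(lf_mail, "X-Spam-Flag", "NO"))
print(add_header(crlf_mail, "X-Spam-Flag", "NO"))
```

The added header ends in LF for the first message and in CRLF for the second, so neither message ends up with mixed line endings in its header section.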

Anthony wrote:

This is a variant of what I mentioned previously about Unix newlines found in saved files vs. the newlines used by RFC 2822. This is an artifact of how lines are often read in, stripping CRLF, then written to a file adding back a LF. This is a common mistake by mail app. implementors who might see it as unimportant (I'm not referring to SA dev here, just history). As an aside, Mark Crispin, author of UW-IMAP, said this caused so many problems that newer versions of his IMAP software now always save the mail to the mailbox folder using CRLF.

The SA spamd protocol document from 2.5 (the original document used when milter-spamc was first written) did not specify whether the client communications should use CRLF or just LF on the protocol's own headers; only the end of header mark. But there is no mention of how the mail headers and content should be sent to spamd, therefore the assumption has always been "as seen off the wire".

Later versions of the spamd protocol document, from 3.0 and 3.1, are a little clearer concerning the newlines used for spamc client headers, but get wishy-washy about the spamd response headers, varying between CRLF and LF. And in neither document do they state what form the mail content passed should take, i.e. "as seen off the wire" or normalised to using CRLF or LF or, hell, why not Bernstein Strings (and avoid CRLF v LF issues).

Again, I stuck with my original choice of maintaining RFC 2822 newlines, since this avoids unnecessary translation, is consistent with mail protocol standards, allows for saving the message in a form that could later be reintroduced into the mail system, and has worked with SA up until DKIM.

I would suggest that SA update the spamd protocol document to be more precise as to what it wants to see at every stage of the protocol, right down to newline format, as this would aid implementors.

It's not a mistake to assume RFC 2822 line endings, it's the standard. That other mail MUA/MTA developers have chosen to be careless with it means that we have to dumb down our products for the mistakes of others.

I've considered doing as SA suggests, using some limited look-ahead in the first body chunk to determine newline type, but the milter API is linear, such that this information comes after the headers have been given to the milter, sans CRLF or exact white space between header-colon and the value, and have already been placed in a buffer using CRLF. It gets messy having to hold that information until the body chunks arrive.

It feels inherently wrong.

I would like to know why the CRLF header separator is treated as part of the message body by SA and not the header section? I send all the message headers using CRLF and the separator as CRLF, then I send the message body chunks exactly as sendmail provided them to the milter; see the milter API doc for the xxfi_body hook:


http://www.milter.org/milter_api/xxfi_body.html

It states the body chunks _should_ have RFC 2822 CRLF newlines, though they may have arrived as LF (grr).

Doing as SA suggests, using the newlines as found in the message body, will break one day when some poorly written mail app. sends headers & separator with CRLF and a message body using LF or, worse, vice versa: headers with LF and body with CRLF.

Essentially, to avoid the newline issue, the DKIM spec and their implementations should not be signing newlines.
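
To illustrate why the newline convention matters to a signature check at all: a hash computed over the raw body bytes changes when nothing but the line endings change. The sketch below uses plain SHA-256 as a stand-in for a body hash; it is not the DKIM canonicalization algorithm, and the helper names `body_digest` and `to_crlf` are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Hash the body bytes exactly as given (stand-in for a signature body hash)."""
    return hashlib.sha256(body).hexdigest()


def to_crlf(body: bytes) -> bytes:
    """Normalise line endings to CRLF without doubling existing CRs."""
    # Collapse CRLF to LF first, then expand every LF to CRLF.
    return body.replace(b"\r\n", b"\n").replace(b"\n", b"\r\n")


lf_body = b"Hello\nWorld\n"
crlf_body = b"Hello\r\nWorld\r\n"

# Same text, different newlines: the raw digests differ.
print(body_digest(lf_body) == body_digest(crlf_body))
# After normalising both to one convention, the digests agree.
print(body_digest(to_crlf(lf_body)) == body_digest(to_crlf(crlf_body)))
```

The first comparison prints False and the second True, which is the whole problem in miniature: whoever verifies has to see the same newlines the signer hashed, or normalise to the same convention first.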

---

I would argue that SpamAssassin should correct their implementation to use two different newline types, those of the headers and separator, followed by those for the body after the header section and CRLF.
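
A rough sketch of that idea, detecting the newline type of the header section (including the separator) independently from that of the body. This is an illustration only, not SpamAssassin code; `split_message` and `newline_of` are hypothetical names, and treating the first blank line as the end of the header section is the only parsing rule assumed.

```python
import re
from typing import Optional, Tuple


def newline_of(section: bytes) -> Optional[bytes]:
    """Return the line ending of the first line, or None if there is none."""
    first_lf = section.find(b"\n")
    if first_lf < 0:
        return None
    if section[max(first_lf - 1, 0):first_lf] == b"\r":
        return b"\r\n"
    return b"\n"


def split_message(message: bytes) -> Tuple[bytes, bytes]:
    """Split at the first blank line; the separator stays with the headers."""
    match = re.search(rb"\r?\n\r?\n", message)
    if match is None:
        return message, b""
    return message[:match.end()], message[match.end():]


# Headers and separator use CRLF, the body uses bare LF.
mixed = b"Subject: test\r\nTo: a@example.com\r\n\r\nline one\nline two\n"
headers, body = split_message(mixed)
print(newline_of(headers), newline_of(body))
```

For the mixed message above this reports CRLF for the header section and LF for the body, so a header added at the top would be written with CRLF regardless of what the body uses.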

---

I'd also be wondering how SpamAssassin CLI handles DKIM on Windows, where their newlines are CRLF.


So many issues make my head spin.
