Em 06/12/11 18:27, Wietse Venema escreveu:
Leonardo Rodrigues:
      i'm facing some weird problems with a particular customer which has
2 MXs published. The first one (lower priority number) is rejecting our
emails lots of times (421 You are disconnected for policy reasons). And,
on the logs, i can see that for some periods of times, postfix only try
to delivers mail to his backup MX (higher priority number), which is
most all the time offline.
If Postfix skips an MX host, then it logs a note saying why, so
that people like you don't have to wonder why it skips the server.

You need to show evidence. grep for the PID of the smtp client and show
all records.


Here are the logs ... as i said, from 2.8.5 postfix. For not taking the logs TOO long, i have trimmed them to a single destination address. The original message was to 10 different email addresses on the the same domain and all of them are behaving just like the one i'll show. I have also obfuscated our domain and the our customer real address. But the destination domain, which is a important information, is kept.


Nov 16 15:23:36 correio postfix_notif/smtpd[26616]: C955515D4F2: client=ip68.interno[192.168.9.68], sasl_method=LOGIN, sasl_username=notifica...@ourdomain.com.br Nov 16 15:23:36 correio postfix/cleanup[26447]: C955515D4F2: message-id=<20111116172336.c955515d...@correio.ourdomain.com.br> Nov 16 15:23:36 correio postfix/qmgr[6855]: C955515D4F2: from=<notifica...@ourdomain.com.br>, size=11247, nrcpt=10 (queue active) Nov 16 15:23:36 correio postfix/error[26594]: C955515D4F2: to=<ourcusto...@azevedosette.com.br>, relay=none, delay=0.04, delays=0.02/0.01/0/0.01, dsn=4.4.2, status=deferred (delivery temporarily suspended: lost connection with mailsp.azevedosette.com.br[200.162.47.19] while receiving the initial server greeting)

Nov 16 15:38:29 correio postfix/qmgr[6855]: C955515D4F2: from=<notifica...@ourdomain.com.br>, size=11247, nrcpt=10 (queue active) Nov 16 15:38:30 correio postfix/error[29037]: C955515D4F2: to=<ourcusto...@azevedosette.com.br>, relay=none, delay=894, delays=893/0.48/0/0.03, dsn=4.4.2, status=deferred (delivery temporarily suspended: lost connection with mailsp.azevedosette.com.br[200.162.47.19] while receiving the initial server greeting)

Nov 16 15:53:29 correio postfix/qmgr[6855]: C955515D4F2: from=<notifica...@ourdomain.com.br>, size=11247, nrcpt=10 (queue active) Nov 16 15:53:32 correio postfix/error[31483]: C955515D4F2: to=<ourcusto...@azevedosette.com.br>, relay=none, delay=1796, delays=1792/3.8/0/0.03, dsn=4.4.2, status=deferred (delivery temporarily suspended: lost connection with mailsp.azevedosette.com.br[200.162.47.19] while receiving the initial server greeting)

Nov 16 16:23:29 correio postfix/qmgr[6855]: C955515D4F2: from=<notifica...@ourdomain.com.br>, size=11247, nrcpt=10 (queue active) Nov 16 16:23:30 correio postfix/error[3582]: C955515D4F2: to=<ourcusto...@azevedosette.com.br>, relay=none, delay=3593, delays=3593/0.32/0/0.04, dsn=4.4.2, status=deferred (delivery temporarily suspended: lost connection with mailsp.azevedosette.com.br[200.162.47.19] while receiving the initial server greeting)

Nov 16 17:23:29 correio postfix/qmgr[6855]: C955515D4F2: from=<notifica...@ourdomain.com.br>, size=11247, nrcpt=10 (queue active) Nov 16 17:23:30 correio postfix/error[11561]: C955515D4F2: to=<ourcusto...@azevedosette.com.br>, relay=none, delay=7194, delays=7192/1.3/0/0.03, dsn=4.4.2, status=deferred (delivery temporarily suspended: lost connection with mailsp.azevedosette.com.br[200.162.47.19] while receiving the initial server greeting)

Nov 16 19:23:29 correio postfix/qmgr[6855]: C955515D4F2: from=<notifica...@ourdomain.com.br>, size=11247, nrcpt=10 (queue active) Nov 16 19:23:30 correio postfix/error[28647]: C955515D4F2: to=<ourcusto...@azevedosette.com.br>, relay=none, delay=14394, delays=14392/1.3/0/0.05, dsn=4.4.2, status=deferred (delivery temporarily suspended: lost connection with mailsp.azevedosette.com.br[200.162.47.19] while receiving the initial server greeting)

Nov 16 21:23:29 correio postfix/qmgr[6855]: C955515D4F2: from=<notifica...@ourdomain.com.br>, size=11247, nrcpt=10 (queue active) Nov 16 21:23:29 correio postfix/error[12474]: C955515D4F2: to=<ourcusto...@azevedosette.com.br>, relay=none, delay=21593, delays=21593/0.01/0/0.03, dsn=4.4.2, status=deferred (delivery temporarily suspended: lost connection with mailsp.azevedosette.com.br[200.162.47.19] while receiving the initial server greeting)

Nov 16 23:23:29 correio postfix/qmgr[6855]: C955515D4F2: from=<notifica...@ourdomain.com.br>, size=11247, nrcpt=10 (queue active) Nov 16 23:23:29 correio postfix/error[1842]: C955515D4F2: to=<ourcusto...@azevedosette.com.br>, relay=none, delay=28793, delays=28792/0.45/0/0.03, dsn=4.4.2, status=deferred (delivery temporarily suspended: lost connection with mailsp.azevedosette.com.br[200.162.47.19] while receiving the initial server greeting)

Nov 17 01:23:29 correio postfix/qmgr[6855]: C955515D4F2: from=<notifica...@ourdomain.com.br>, size=11247, nrcpt=10 (queue active) Nov 17 01:23:33 correio postfix/error[15140]: C955515D4F2: to=<ourcusto...@azevedosette.com.br>, relay=none, delay=35996, delays=35993/3.3/0/0.02, dsn=4.4.2, status=deferred (delivery temporarily suspended: lost connection with mailsp.azevedosette.com.br[200.162.47.19] while receiving the initial server greeting)

Nov 17 03:28:29 correio postfix/qmgr[6855]: C955515D4F2: from=<notifica...@ourdomain.com.br>, size=11247, nrcpt=10 (queue active) Nov 17 03:28:31 correio postfix/error[28703]: C955515D4F2: to=<ourcusto...@azevedosette.com.br>, relay=none, delay=43494, delays=43493/1.5/0/0.03, dsn=4.4.2, status=deferred (delivery temporarily suspended: lost connection with mailsp.azevedosette.com.br[200.162.47.19] while receiving the initial server greeting)

Nov 17 05:33:29 correio postfix/qmgr[6855]: C955515D4F2: from=<notifica...@ourdomain.com.br>, size=11247, nrcpt=10 (queue active) Nov 17 05:33:31 correio postfix/smtp[9875]: C955515D4F2: to=<ourcusto...@azevedosette.com.br>, relay=mail3.azevedosette.com.br[200.198.91.178]:25, delay=50994, delays=50992/0.92/0.24/0.8, dsn=2.6.0, status=sent (250 2.6.0 <20111116172336.c955515d...@correio.ourdomain.com.br> Queued mail for delivery)
Nov 17 05:33:31 correio postfix/qmgr[6855]: C955515D4F2: removed


since this specific customer (and only this one) started complaining receiving our emails with huge delays, i have checked his MX records, from our email server, which is running a local bind instance and acting as his own nameserver (no ISPs nameservers are involved) and the results are always the same. I really believe we're not facing DNS problems nor the customer changing his MX records:

;; ANSWER SECTION:
azevedosette.com.br.    85088   IN      MX      5 mail3.azevedosette.com.br.
azevedosette.com.br. 85088 IN MX 100 mailsp.azevedosette.com.br.


the log for the specific message i posted shows that several delivery tries were made to the secondary MX (mailsp.azevedosette.com.br) but no one to the primary MX (mail3.azevedosette.com.br) but the final one. The interesting, and that's because i initially asked about some feature that would make postfix 'skip' bad MXs, is that during the timespan of those tries, i have logs for other messages for the same destination domain (different queueids) being delivered to the primary MX and several others connection being rejected by his primary MX (421 You are disconnected for policy reasons). Given this information, i can imagine both MXs were published at that moment. Some logs showing transactions to the primary MX (some successfully deliveries and some rejections) on the same timespan of the logs previously posted:



Nov 16 15:53:29 correio postfix/smtp[30078]: F07EA15D4E2: host mail3.azevedosette.com.br[200.198.91.178] said: 421 You are disconnected for policy reasons, please try again later (in reply to MAIL FROM command) Nov 16 15:53:29 correio postfix/smtp[30078]: F07EA15D4E2: lost connection with mail3.azevedosette.com.br[200.198.91.178] while sending RCPT TO Nov 16 15:53:29 correio postfix/smtp[30077]: B3B0D15D4F9: host mail3.azevedosette.com.br[200.198.91.178] said: 421 You are disconnected for policy reasons, please try again later (in reply to MAIL FROM command)

Nov 16 15:53:30 correio postfix/smtp[31464]: 0026615D4FB: to=<addr...@azevedosette.com.br>, relay=mail3.azevedosette.com.br[200.198.91.178]:25, delay=1783, delays=1782/0.04/0.2/0.77, dsn=2.6.0, status=sent (250 2.6.0 <20111116172347.0026615d...@correio.ourdomain.com.br> Queued mail for delivery)

Nov 16 16:18:30 correio postfix/smtp[3000]: 6EDF015D4BB: host mail3.azevedosette.com.br[200.198.91.178] said: 421 You are disconnected for policy reasons, please try again later (in reply to RCPT TO command) Nov 16 16:18:30 correio postfix/smtp[3000]: 6EDF015D4BB: lost connection with mail3.azevedosette.com.br[200.198.91.178] while sending RCPT TO

Nov 16 16:18:30 correio postfix/smtp[3002]: DB13515D4AD: to=<addr...@azevedosette.com.br>, relay=mail3.azevedosette.com.br[200.198.91.178]:25, delay=3373, delays=3372/0.04/0.25/0.79, dsn=2.6.0, status=sent (250 2.6.0 <20111116172217.db13515d...@correio.ourdomain.com.br> Queued mail for delivery)

Nov 16 17:18:31 correio postfix/smtp[10887]: 28B3015D4A4: to=<addr...@azevedosette.com.br>, relay=mail3.azevedosette.com.br[200.198.91.178]:25, delay=6987, delays=6984/0.05/0.22/2.4, dsn=2.6.0, status=sent (250 2.6.0 <20111116172205.28b3015d...@correio.ourdomain.com.br> Queued mail for delivery)

Nov 16 18:15:06 correio postfix/smtp[17592]: 0E00615D5B2: to=<addr...@azevedosette.com.br>, relay=mail3.azevedosette.com.br[200.198.91.178]:25, delay=8.8, delays=0.02/3.6/0.39/4.8, dsn=2.6.0, status=sent (250 2.6.0 <20111116201458.0e00615d...@correio.ourdomain.com.br> Queued mail for delivery)

Nov 16 18:21:23 correio postfix/smtp[17582]: AEC8B15D750: host mail3.azevedosette.com.br[200.198.91.178] said: 421 You are disconnected for policy reasons, please try again later (in reply to MAIL FROM command) Nov 16 18:21:23 correio postfix/smtp[17582]: AEC8B15D750: lost connection with mail3.azevedosette.com.br[200.198.91.178] while sending RCPT TO

Nov 16 21:48:32 correio postfix/smtp[14247]: 9F75015D511: to=<addr...@azevedosette.com.br>, relay=mail3.azevedosette.com.br[200.198.91.178]:25, delay=1558, delays=1555/0.01/0.34/2.6, dsn=2.6.0, status=sent (250 2.6.0 <20111116232233.9f75015d...@correio.ourdomain.com.br> Queued mail for delivery)

Nov 16 22:58:49 correio postfix/smtp[31315]: EDE2315E7D8: to=<addr...@azevedosette.com.br>, relay=mail3.azevedosette.com.br[200.198.91.178]:25, delay=1680, delays=1659/0.06/0.23/20, dsn=2.6.0, status=sent (250 2.6.0 <4ec455b88202b_68d1..fdbe810d8189@projourdomain.tmail> Queued mail for delivery)

Nov 16 23:23:30 correio postfix/smtp[1836]: 5EF7A15D4BC: to=<addr...@azevedosette.com.br>, relay=mail3.azevedosette.com.br[200.198.91.178]:25, delay=6911, delays=6910/0.06/0.3/0.86, dsn=2.6.0, status=sent (250 2.6.0 <20111116232819.5ef7a15d...@correio.ourdomain.com.br> Queued mail for delivery)

Nov 17 01:23:32 correio postfix/smtp[15118]: 612C115D4D7: to=<addr...@azevedosette.com.br>, relay=mail3.azevedosette.com.br[200.198.91.178]:25, delay=35992, delays=35989/1.4/0.21/0.81, dsn=2.6.0, status=sent (250 2.6.0 <20111116172340.612c115d...@correio.ourdomain.com.br> Queued mail for delivery)

Nov 17 01:23:32 correio postfix/smtp[15130]: ABB9A15D505: host mail3.azevedosette.com.br[200.198.91.178] said: 421 You are disconnected for policy reasons, please try again later (in reply to MAIL FROM command) Nov 17 01:23:32 correio postfix/smtp[15130]: ABB9A15D505: lost connection with mail3.azevedosette.com.br[200.198.91.178] while sending RCPT TO

Nov 17 03:23:29 correio postfix/smtp[28130]: B497B15D483: to=<addr...@azevedosette.com.br>, relay=mail3.azevedosette.com.br[200.198.91.178]:25, delay=21335, delays=21334/0.03/0.21/0.65, dsn=2.6.0, status=sent (250 2.6.0 <20111116232754.b497b15d...@correio.ourdomain.com.br> Queued mail for delivery)



I have tried grepping the PID of the smtp process with errors, as Wietse requested, and as this message on the mailing list i found on the archives also suggested

http://marc.info/?l=postfix-users&m=119393677222356&w=2

but i found nothing. In fact my logs only show smtp process logging on the very last try, which was the successfull one. All the others, which were basically timeout to the secondary MX, are logged as postfix/error and not postfix/smtp. Anyway, i tried grepping qmgr and error PID, on that time, and found absolutely nothing related.


by the way, we're already working with the IT team of this customer to identify why it's primary MX is rejecting LOTS of our connections. By now, this wasnt solved yet. I expect this to be solved soon.

Anyway, i'm having hard times trying to figure out why, for some messages, like the queueid i posted the full log, postfix is apparently not even trying to delivery to the primary MX for some large periods of time.

sorry for the huge message, but it was quite a lot information i have to post to explain the whole scenario.



--


        Atenciosamente / Sincerily,
        Leonardo Rodrigues
        Solutti Tecnologia
        http://www.solutti.com.br

        Minha armadilha de SPAM, NÃO mandem email
        gertru...@solutti.com.br
        My SPAMTRAP, do not email it




Reply via email to