SpamAssassin's URI code mangles URLs whose query string contains a
parameter named "image", i.e. the text "&image=".  For instance, this URL:

http://www.example.com/whatever?x=a&image=x.gif

shows up in the spamassassin -D output as

debug: uri found: http://www.example.com/whatever?x=a\342\204\221=x.gif
debug: uri found: http://www.example.com/whatever?x=a%e2%84%91=x.gif

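(where the \342 etc. were actual 8-bit characters in the output).  The
bytes \342\204\221 are the UTF-8 encoding of U+2111, the character named
by the HTML entity &image;, so it looks as though "&image" is being decoded
as an entity even though it has no trailing semicolon.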
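
As a quick check of where those bytes come from, here is a small Python
sketch (not SpamAssassin code, just an illustration using the Python
standard library). It looks up the HTML entity named "image" and prints its
UTF-8 bytes in the two forms seen in the debug output. The variable names
are my own.

```python
from html.entities import name2codepoint
from urllib.parse import quote

# Character named by the HTML entity "image" (U+2111)
ch = chr(name2codepoint["image"])

# UTF-8 bytes written as backslash-octal escapes, as in the first debug line
utf8 = ch.encode("utf-8")
octal = "".join("\\%03o" % b for b in utf8)

# The same bytes percent-encoded, as in the second debug line
pct = quote(ch, safe="").lower()

print("U+%04X" % ord(ch))
print(octal)
print(pct)
```

Both printed forms should match the byte sequences in the two "uri found"
lines, which is consistent with the entity-decoding explanation.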

This is SA 3.0.2 with Perl 5.8.5 on Solaris 8, with these module versions:

DBD::mysql -- 2.9004
DBI -- 1.43
DB_File -- 1.810
Digest::HMAC -- 1.01
Digest::SHA1 -- 2.10
HTML::Parser -- 3.45
HTML::Tagset -- 3.03
Mail::SPF::Query -- 1.997
Mail::SpamAssassin -- 3.000002
Net::CIDR::Lite -- 0.15
Net::DNS -- 0.48
Perl -- 5.8.5
Sys::Hostname::Long -- 1.2
URI -- 1.35
razor-agents -- ???

Here is a sample message that triggers the bug:

Date: Mon, 07 Feb 2005 10:21:19 -0500
From: Somebody <[EMAIL PROTECTED]>
Subject: Some subject
To: [EMAIL PROTECTED]
Message-id: <[EMAIL PROTECTED]>
MIME-version: 1.0
Content-type: text/html; charset=us-ascii
Content-transfer-encoding: 8BIT

<html>
<body>
<a href = "http://www.example.com/whatever?x=a&image=x.gif">click me</a>
</body>
</html>
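
To illustrate the suspected mechanism, here is a minimal Python sketch. It
is an assumption about what is happening, not a description of
SpamAssassin's or HTML::Parser's actual code. The lenient_decode function is
hypothetical: it decodes "&name" as an entity whether or not a semicolon
follows. It is compared with Python's html.unescape, which follows the HTML5
rules and only accepts a short legacy list of entities without the
semicolon.

```python
import html
import re
from html.entities import name2codepoint

URL = "http://www.example.com/whatever?x=a&image=x.gif"


def lenient_decode(text):
    """Hypothetical decoder: treats &name as an entity even without ';'."""
    def repl(match):
        name = match.group(1)
        if name in name2codepoint:
            return chr(name2codepoint[name])
        # Unknown name: leave the text untouched
        return match.group(0)

    return re.sub(r"&([A-Za-z][A-Za-z0-9]*);?", repl, text)


lenient = lenient_decode(URL)   # "&image" becomes U+2111
strict = html.unescape(URL)     # URL is left unchanged

print(lenient.encode("utf-8"))
print(strict)
```

With the lenient decoder the parameter name is replaced by the single
character U+2111, which reproduces the mangled URI from the debug output.
The strict decoder leaves the URL alone. Writing the link with "&amp;" in
place of the bare "&" would avoid the ambiguity in either case, though that
does not help with mail written by someone else.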
