Your message dated Fri, 1 Mar 2024 06:40:07 +0100
with message-id <zefqn8trbnpei...@grub.nussbaum.fr>
and subject line Re: Bug#1057878: qa.debian.org: UDD upload_history has 
truncated email addresses
has caused the Debian Bug report #1057878,
regarding UDD/upload_history: truncated email addresses
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact ow...@bugs.debian.org
immediately.)


-- 
1057878: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1057878
Debian Bug Tracking System
Contact ow...@bugs.debian.org with problems
--- Begin Message ---
Package: qa.debian.org
Severity: normal
X-Debbugs-Cc: stu...@debian.org

The 'maintainer' and 'maintainer_email' columns of the upload_history table
in UDD have truncated email addresses. Somewhere the 'maintainer' data
is being truncated and then the maintainer_email is consequently broken.

udd=> SELECT maintainer, maintainer_email FROM upload_history WHERE maintainer_email LIKE '%=' LIMIT 10;
                           maintainer                           |               maintainer_email
----------------------------------------------------------------+----------------------------------------------
 Maintainers of GStreamer packages <pkg-gstreamer-maintainers@= | pkg-gstreamer-maintainers@=
 Maintainers of GStreamer packages <pkg-gstreamer-maintainers@= | pkg-gstreamer-maintainers@=
 Zenoss Packaging Team <pkg-zenoss-t...@lists.alioth.debian.or= | pkg-zenoss-t...@lists.alioth.debian.or=
 Debian GNOME Maintainers <pkg-gnome-maintainers@lists.alioth.= | pkg-gnome-maintainers@lists.alioth.=
 Debian Perl Group <pkg-perl-maintainers@lists.alioth.debian.o= | pkg-perl-maintainers@lists.alioth.debian.o=
 Debian VoIP Team <pkg-voip-maintain...@lists.alioth.debian.or= | pkg-voip-maintain...@lists.alioth.debian.or=
 Debian Python Modules Team <python-modules-team@lists.alioth.= | python-modules-team@lists.alioth.=
 Debian Python Modules Team <python-modules-team@lists.alioth.= | python-modules-team@lists.alioth.=
 Debian Firebird Group <pkg-firebird-gene...@lists.alioth.debi= | pkg-firebird-gene...@lists.alioth.debi=
 Debian Samba Maintainers <pkg-samba-maint@lists.alioth.debian= | pkg-samba-maint@lists.alioth.debian=
(10 rows)


The input data from the d-d-c mailing list looks fine in the web archive,
but I can imagine this being due to line wrapping in the mbox files.
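
To illustrate how line wrapping could produce values like these: a trailing
'=' is how quoted-printable encoding marks a soft line break, which matches
the pattern in the rows above. That this is the actual cause is an
assumption, not something confirmed in this report. The following is a
minimal Python sketch using a made-up address and a hypothetical
maintainer_field() helper; it is not the UDD importer code.

```python
import quopri
from email.utils import parseaddr

# Hypothetical body fragment as it might sit in an mbox file when the message
# uses Content-Transfer-Encoding: quoted-printable. The trailing "=" on the
# first line is a soft line break. The address is made up for illustration.
raw = (
    b"Maintainer: Example Packaging Team <pkg-example-maintainers@lists.example.o=\n"
    b"rg>\n"
    b"Changed-By: Jane Doe <jane@example.org>\n"
)


def maintainer_field(text):
    """Return the value of the first 'Maintainer:' line in text, or None."""
    for line in text.splitlines():
        if line.startswith("Maintainer:"):
            return line.split(":", 1)[1].strip()
    return None


# Reading the field without decoding reproduces the truncated value.
naive = maintainer_field(raw.decode("ascii"))
naive_email = naive.rsplit("<", 1)[1]

# Decoding quoted-printable first removes the soft break and rejoins the line.
decoded = maintainer_field(quopri.decodestring(raw).decode("ascii"))
decoded_name, decoded_email = parseaddr(decoded)

print(naive_email)
print(decoded_email)
```

In this sketch the undecoded value ends in '=', just like the rows matched
by the LIKE '%=' query, while the decoded value parses into a complete
address.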

Looking at one specific example:

https://lists.debian.org/debian-devel-changes/2007/12/msg00466.html

udd=> SELECT maintainer, maintainer_email FROM upload_history WHERE maintainer_email LIKE '%=' AND source = 'libxml-rss-perl' AND version = '1.31-3';
                           maintainer                           |              maintainer_email
----------------------------------------------------------------+---------------------------------------------
 Debian Perl Group <pkg-perl-maintainers@lists.alioth.debian.o= | pkg-perl-maintainers@lists.alioth.debian.o=
(1 row)

This particular example is quite old, but the problem also exists in
recent uploads; at the time of writing, the most recent one is
libgetdata (0.11.0-9), which was uploaded today.

This data issue affects about 70k of the 850k rows in upload_history.

cheers
Stuart

--- End Message ---
--- Begin Message ---
On 29/02/24 at 18:04 +0100, Lucas Nussbaum wrote:
> I did some changes to the email decoding that solved most cases. We are
> down to 1162 badly processed emails (from the 70k you reported):
> 
> udd=> SELECT count(*) FROM upload_history WHERE maintainer_email LIKE '%=';
>  count
> -------
>   1162

This is now fixed, after an additional fix in the email parsing code.

udd=> SELECT count(*) FROM upload_history WHERE maintainer_email LIKE '%=';
 count
-------
     0
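
The count query above also works as a simple regression check after each
import. Below is a minimal Python sketch, assuming an in-memory SQLite
database as a stand-in for UDD's PostgreSQL instance; the reduced table
layout, the sample rows, and the count_truncated() helper are all
hypothetical.

```python
import sqlite3

# In-memory SQLite stands in for UDD here; only the columns used by the
# check are created. The rows are made-up sample data.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE upload_history (source TEXT, version TEXT, maintainer_email TEXT)"
)
conn.executemany(
    "INSERT INTO upload_history VALUES (?, ?, ?)",
    [
        ("pkg-a", "1.0-1", "team-a@lists.example.org"),
        ("pkg-b", "2.0-1", "team-b@lists.example.o="),  # truncated value
        ("pkg-c", "3.0-1", "team-c@lists.example.org"),
    ],
)


def count_truncated(connection):
    """Count rows whose maintainer_email ends in '=', as in the query above."""
    cur = connection.execute(
        "SELECT count(*) FROM upload_history WHERE maintainer_email LIKE '%='"
    )
    return cur.fetchone()[0]


before = count_truncated(conn)

# Simulate a re-import that stores the correctly decoded address.
conn.execute(
    "UPDATE upload_history SET maintainer_email = ? WHERE source = ?",
    ("team-b@lists.example.org", "pkg-b"),
)
after = count_truncated(conn)

print(before, after)
```

A count of zero after an import run corresponds to the result shown above.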

Lucas

--- End Message ---
