https://bugzilla.wikimedia.org/show_bug.cgi?id=22883

           Summary: WMF wikis need consistent MySQL data types for
                    page.page_title and image.img_name
           Product: Wikimedia
           Version: unspecified
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: Normal
         Component: Site requests
        AssignedTo: wikibugs-l@lists.wikimedia.org
        ReportedBy: b...@mzmcbride.com


This is related to https://jira.toolserver.org/browse/TS-549

Certain WMF slaves are using varchar(255) for page.page_title and
image.img_name (possibly other columns as well), while other slaves/wiki
databses are using varbinary(255). This has to do with MySQL 5 treating
varbinary as the same as varchar, from what I understand.

The issue that we've run into on the Toolserver is that comparing varchar
fields to varbinary fields is incredibly slow. For example,
enwiki_p.page.page_title is varbinary while commonswiki_p.image.img_name is
varchar, so the following query takes an incredibly long time to run:

SELECT
  page_title
FROM page
WHERE NOT EXISTS (SELECT
                    img_name
                  FROM commonswiki_p.image
                  WHERE img_name = page_title)
LIMIT 1;

The Toolserver folks are hesitant to alter the tables on their side, and it
would probably be better all around for there to be consistency among the WMF
databases and slaves.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to