One way of making multiwinner elections proportional is to have the method pass certain criteria. The most obvious of these is Droop proportionality, which is the multiwinner analog of mutual majority. However, such criteria can only say what the method should do in certain cases, not what it does in all cases. This is like Condorcet: a Condorcet method elects the CW whenever there is one, but the criterion says nothing about what happens when there isn't a CW.

So, if we're going to make better multiwinner methods, I think we need a way that's applicable to all situations, so that we can apply the same rule consistently. Another approach would be to try to devise criteria that cover increasingly more of the situations, but I'll get to that later.

If we're going to have a rule or metric for a good multiwinner election, what should it be? Well, the intuitive nature of proportional representation is that if there are some factions, and those factions all vote according to their preferences, then those factions should be represented according to their numbers. This suggests some kind of feature extraction: find underlying points in issue space, or the composition of issue space itself; then pick the candidates that most accurately represent this configuration.

That, in a sense, is what my test system does to find good multiwinner systems: it constructs n issues (of varying support among the people), then randomly assigns agrees and disagrees for each issue to each voter (and candidate) so that the support reaches the fraction in question. For instance, for an issue set so that 70% are in favor, a random 0.7 fraction of the people (voters and candidates) is assigned to be in favor of that issue as well.
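As a sketch of that generator (the function name and interface are my own, not taken from the actual test system):

```python
import random

def make_issue_profile(num_people, support_fractions, rng=random):
    # profile[issue][person] is 1 (agree) or 0 (disagree); each issue's
    # column is shuffled so that the desired fraction of people agrees.
    profile = []
    for frac in support_fractions:
        in_favor = round(frac * num_people)
        column = [1] * in_favor + [0] * (num_people - in_favor)
        rng.shuffle(column)
        profile.append(column)
    return profile

# 100 people, one issue with 70% support:
profile = make_issue_profile(100, [0.7])
```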

It may be possible to use this idea in reverse: construct some model of how voters weight disagreements, then assign n bits (for some n) so that the RMSE between their ballot scores and the predicted ballot scores (based on assigned issues to voters and candidates) is minimized. For a binary issue profile, in my simulations, I used simple "Hamming agreement". That is, if a voter and a candidate agree on five issues out of ten, the voter gives five points of ten (or 2.5 of 5 or whatnot). Obviously, this lends itself much more to cardinal than to ordinal ballots. What's important is not whether voters actually do this, but whether it's a good model - that is, whether the RMSE can be made very low by doing this. If voters vote based on personal appeal or something like that, and that can be modeled as three or four virtual "issues", then no great loss.
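The Hamming agreement model itself is only a few lines; a sketch (names are mine):

```python
def hamming_agreement(voter_issues, cand_issues, max_score=10):
    # Count the issues the voter and candidate agree on, then scale the
    # count to the ballot's score range.
    agreements = sum(1 for v, c in zip(voter_issues, cand_issues) if v == c)
    return max_score * agreements / len(voter_issues)

# Agreeing on five issues out of ten gives five points of ten:
score = hamming_agreement([1] * 5 + [0] * 5, [1] * 10)
```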

The good thing about binary issues is that once we have reconstructed them, it's simple (well, in the NP sense) to ensure proportionality. Simply pick the set of candidates so that the difference between the fractions supporting each issue among the candidates, and those fractions for the entire people, is minimized according to some error measure (probably the Sainte-Laguë metric, Gini, or RMSE).
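A brute-force sketch of that selection step, using RMSE as the error measure (function names are my own; the subset enumeration is exponential in the council size, in line with the "in the NP sense" caveat):

```python
from itertools import combinations
from math import sqrt

def fraction_support(issues, members):
    # issues[issue][person]; returns the per-issue support among `members`.
    return [sum(col[m] for m in members) / len(members) for col in issues]

def best_council(issues, voters, candidates, k):
    # Try every k-candidate subset and keep the one whose issue-support
    # fractions best match the voters' (RMSE error).
    target = fraction_support(issues, voters)

    def rmse(council):
        got = fraction_support(issues, council)
        return sqrt(sum((g - t) ** 2 for g, t in zip(got, target))
                    / len(target))

    return min(combinations(candidates, k), key=rmse)

# One issue at 50% among voters (persons 0-3); candidates are persons 4-7.
issues = [[1, 1, 0, 0, 1, 1, 0, 0]]
council = best_council(issues, [0, 1, 2, 3], [4, 5, 6, 7], 2)
```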

What's not so easy is to assign the issues in the first place. The formal problem becomes something like: "define an issue matrix n_i of dimension num_issues * (voters + candidates), where n_i[issue][person] is either 0 or 1; further define a model scoring function f(voter, cand) = SUM(k = 1..num_issues, (1 - |n_i[k][voter] - n_i[k][cand]|)); now, given a voters*candidates score matrix q, and an integer p > 0, populate a p * (voters + candidates) issue matrix so that the RMSE of the model, where the difference at a point is defined as (f(voter, cand) - q[voter][cand]), is minimized". The decision version of the problem is: "is there any way of constructing such a binary issue matrix so that the RMSE (or some other error) is below a certain value?". I think the decision version is in NP, so at worst, the problem is NP-complete; but what's worse is that if it is, it's NP-complete not just in the number of candidates, but also in the number of voters.
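The reason the decision version is in NP is that a proposed issue matrix is easy to check: compute f for every voter-candidate pair and compare the resulting RMSE to the bound. A sketch of that verifier (names are mine):

```python
from math import sqrt

def model_score(issues, voter, cand):
    # f(voter, cand) = SUM over issues of (1 - |n_i[k][voter] - n_i[k][cand]|)
    return sum(1 - abs(col[voter] - col[cand]) for col in issues)

def verify(issues, q, voters, cands, bound):
    # Given a candidate issue matrix (the witness), check whether its
    # predicted scores come within `bound` RMSE of the score matrix q.
    errs = [(model_score(issues, v, c) - q[v][c]) ** 2
            for v in voters for c in cands]
    return sqrt(sum(errs) / len(errs)) <= bound

# Two issues, person 0 a voter and person 1 a candidate; they agree on
# one issue, so the model predicts a score of exactly 1.
issues = [[1, 0], [0, 0]]
ok = verify(issues, [[0, 1], [0, 0]], [0], [1], 0.0)
```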

So that might be too hard. The idea seems sound, though: construct an opinion space and then pick proportionally from it. Could we use other feature extraction methods? There is one such function/method that can be done in polytime, namely SVD - singular value decomposition. It's used in, among other things, predicting movie ratings by most entries to the Netflix contest. However, though I have tinkered a bit with SVD, I haven't found any way of translating its result into issue space, or of getting good results in my simulations by any such SVD-driven selection. The two ways I've tried have been to pick candidates so that the sum of each row is the same for the candidates as for the population at large, and another so that a histogram over the candidates (or rather, a KDE, but it's roughly the same) is similar to one over the people, for all "issues". Neither seems to give much better results than random. Do any of you have ideas as to how to use SVD for this purpose? I'm no expert in statistics.
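For concreteness, here is roughly what SVD gives you on a score matrix, using numpy (the toy numbers and the "virtual issue" reading are mine; as said, turning these coordinates into a good selection rule is the open part):

```python
import numpy as np

# Toy score matrix q: rows are voters, columns are candidates.
q = np.array([[5., 0., 3.],
              [5., 1., 3.],
              [0., 5., 2.],
              [1., 5., 2.]])

# SVD factors q into voter and candidate coordinates; each singular
# vector can loosely be read as a virtual "issue" axis, with its
# singular value giving that axis's importance.
U, S, Vt = np.linalg.svd(q, full_matrices=False)
voter_coords = U * S   # voters' positions on the latent axes
cand_coords = Vt.T     # candidates' positions on the same axes
```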

The binary issue model might also be expanded into a continuous value model, where each "issue" is no longer a yes/no but a point along a line. The right way of reproducing that issue space would probably be, as above, to use a KDE to synthesize a probability distribution and then check the similarity of that for some candidate set against that for the people; but if we can't construct binary issue models, we probably can't construct continuous value ones either, unless there's some "discontinuity makes it more difficult" case analogous to the difference between linear and integer programming.
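A sketch of that KDE-comparison idea for a single continuous issue axis (pure Python, Gaussian kernel; the bandwidth and all names are my own choices):

```python
from math import exp, pi, sqrt

def kde(points, bandwidth=0.3):
    # Gaussian kernel density estimate over one continuous "issue" axis.
    n = len(points)
    def density(x):
        return sum(exp(-0.5 * ((x - p) / bandwidth) ** 2)
                   for p in points) / (n * bandwidth * sqrt(2 * pi))
    return density

def kde_mismatch(population, council, grid):
    # Mean squared difference between the two densities over a grid of
    # sample points; lower means a more representative council.
    f, g = kde(population), kde(council)
    return sum((f(x) - g(x)) ** 2 for x in grid) / len(grid)

# A polarized population: the balanced council matches the population's
# density far better than the one-sided council does.
population = [0.0] * 5 + [1.0] * 5
grid = [i / 10 for i in range(11)]
balanced = kde_mismatch(population, [0.0, 1.0], grid)
one_sided = kde_mismatch(population, [0.0, 0.0], grid)
```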

-

I said I was going to mention some criteria that would cover more than Droop proportionality. I'm going to use the shorthand (k,n) for an election that picks a council of k out of n candidates. Some ideas are:

Condorcet: If the election is (1, n) and there's a CW, it should be elected.

Reverse Condorcet: If the election is (n-1, n) and there's a Condorcet loser, all but the Condorcet loser should be elected.

Fragmented Condorcet: If the election is (k, n), and there's a way of dividing the ballots into k piles so that each of those piles has a CW, and all k CWs are different, then those CWs should be elected. If there are multiple such partitions, the method passes if it elects according to one of them.

The idea of Condorcet is rather simple; if the multiwinner method's any good, it should be a good single-winner method in the (1,n) election case as well.

Reverse Condorcet is inspired by something I observed in Raph Frank's 2-of-3 multiwinner diagrams. If you assign the three candidates according to primary colors (that is, red, green, and blue), then you get a diagram with Voronoi-type shapes in composite colors. See this, for instance: http://munsterhjelm.no/km/composite_multiwinner.png , which was made by overlaying "Optimal Utility (PAV)" from http://ivnryan.com/ping_yee/triang_10000_2.html

Fragmented Condorcet is something I'm more uncertain about, but it can be traced to two ideas. The first one is that of PR being like elections by faction: if there are two factions, one can imagine each faction having a separate CW, in which case, if the factions are of equal size, those two are the ones that should be elected. The other idea is that PR methods have to fail house monotonicity. Say you have something like
30: Left > Center > Right
30: Right > Center > Left
 4: Center > Left > Right
When electing one, that one should be Center, to satisfy Condorcet. When electing two, they should be Left and Right. This can be done by splitting like this:

15: Left > Center > Right
 2: Center > Left > Right

15: Right > Center > Left
 2: Center > Left > Right

which gives {Left, Right}. By similar reasoning, if everybody votes for themselves first, and the council's the same size as the electorate (direct democracy), then everybody is in it.

However, the Left-Center-Right example may also suggest that Fragmented Condorcet isn't desirable at all, since it splits the Center voters, and thus is merely a way of doing automatic gerrymandering.

What are your opinions on the criteria in general?
----
Election-Methods mailing list - see http://electorama.com/em for list info
