[Election-Methods] a strategy-free range voting variant?

Jobst Heitzig Thu, 17 Jul 2008 11:19:54 -0700

Dear folks,

some time ago we discussed shortly whether it was possible to design a
strategy-free ratings-based method, that is, a method where voters give
ratings and never have any incentive to misrepresent their "true" ratings.


If I remember right, the methods that were discussed then were only of
academic use since they were far from being efficient and often elected
bad options unwanted by most of the voters.

Several days ago, I had a new idea how range voting could be modified to

get a method both strategy-free and efficient. A bit of researchrevealed that much of it resembles the ideas in the paperhttp://mpra.ub.uni-muenchen.de/627/, but not all of it. I will firstdescribe the basic idea and then the method.


Disclaimer: All of what follows is suitable only for the case where one
can assume that voters can sincerely attribute some numerical "utility"
to all options, which is an assumption I personally don't believe to
hold generally :-)  Anyway, here's the...


Basic Idea
-----------

In order to understand the basic idea, consider a decision problem with
two options, A and B, and two voters, V1 and V2, who are able to
attribute some monetary values
  U1(A)>U1(B),
  U2(B)>U2(A)
to these options. (We will not need to assume monetary values later on,
but the idea is easier to grasp this way)

Now consider the following method: Both voters fill in a ratings ballot
for A and B, giving ratings
  R1(A)>R1(B),
  R2(B)>R2(A).
Then a coin is tossed to decide which of the two voters is the "seller"
and which is the "buyer". Let's assume throughout the following that V1
turns out to be the seller. Now the winner is determined like this: If
  R2(B)-R2(A) <= R1(A)-R1(B)
then A wins. Otherwise, that is, if
  R2(B)-R2(A) > R1(A)-R1(B),
then V2 "buys" the decision from V1: B wins but V2 pays an amount of
  ( R2(B)-R2(A) + R1(A)-R1(B) ) / 2
to V1.

If this deal happens, V2 profits from it if and only if this "price" for
getting B instead of A,
  ( R2(B)-R2(A) + R1(A)-R1(B) ) / 2,
is at most U2(B)-U2(A). Fortunately, she can ensure that the deal
happens exactly when this is fulfilled: she only needs to specify her
sincere ratings by putting R2(A)=U2(A) and R2(B)=U2(B). If she does so,
the deal happens if and only if
  U2(B)-U2(A)>R1(A)-R1(B),
which is equivalent to
  ( U2(B)-U2(A) + R1(A)-R1(B) ) / 2 < U2(B)-U2(A),
so the deal happens if and only if it is profitable for V2. Moreover, V2
can ensure this independently of V1's behaviour!

Analogously, V1 profits from the deal if the price is at least
U1(A)-U1(B), and she can also ensure that the deal happens exactly when
it is profitable for her: she specifies her sincere ratings by putting
R1(A)=U1(A) and R1(B)=U1(B), no matter what V2 does.

Of course, this is far from being a new idea so far, and it is not yet
the whole idea since it has an obvious problem: although it obviously
manages to elect the "better" option (the one with the larger total
monetary value), it encourages both the seller and the buyer to
misrepresent their ratings so that the gap between R2(B)-R2(A) and
R1(A)-R1(B) becomes as small as possible and hence their respective
profit as large as possible. In other words, this method is not at all
strategy-free.

However, there is a simple modification which makes it strategy-free!
The reason for the strategic incentives is that the ratings V1 (and
analogously V2) gives not only influence whether the deal happens but
also how much V1 profits from the deal when it happens. This is no
longer the case when we change the method so that V1's profit depends on
V2's ratings only and vice versa: If the deal happens, that is, when
  R2(B)-R2(A) > R1(A)-R1(B),
then
  B wins instead of A,
  V1 gets an amount of R2(B)-R2(A)
  but V2 only pays an amount of R1(A)-R1(B).
As before, both voters can ensure that the deal happens exactly when
they profit from it by voting sincerely. The difference is that now they
no longer have any incentive to narrow the gap between R2(B)-R2(A) and
R1(A)-R1(B) since a voter's profit is independent of her ratings!

There is just a minor problem with this: The balance of the money
transfers is positive, so where is this extra money supposed to come
from? Obviously, we cannot let V1 and V2 each pay half of the required
extra money since that would make the method identical to the original
method.


Solving the extra money problem
--------------------------------

A solution to this "extra money" problem becomes clear when we now
increase the number of voters and assume 3 instead of 2 voters. Consider
this method next: Each voter fills in a ratings ballot for the options
A,B. We draw at random one "default" option, say A, and one
"compensating" voter, say V3. The other voters (here V1,V2) are the
"deciding" voters. That option whose total ratings from the deciding
voters is maximal wins. If this is not the default option (so if it's
B), the following money transfers happen:
- Each deciding voter gets an amount equal to the total rating
difference between the winner and the default, minus her own rating
difference between the winner and the default.
- The compensating voter pays the needed excess money.
In our example, this means that if B wins then
- V1 gets R2(B)-R2(A) as above
- V2 gets R1(B)-R1(A) as above (that is, she pays R1(A)-R1(B))
- V3 pays R2(B)-R2(A)+R1(B)-R1(A).

To see that it is still optimal for everyone to specify sincere ratings,
one only needs to analyse in what situations a changing of a voter's
ratings makes a difference to her outcome. If she is amoung the deciding
voters, the case is the same as above. And if she is the compensating
voter, everything is completely independent from her ratings anyway.

By the way, note that this method is neutral, anonymous, and monotonic.

Now we are ready for the real thing...


The method RRVC
(Representative Range Voting with Compensations)
--------------------------------------------------

In order to get rid of the requirement to transfer real money, we need

to assume that not only one decision is made but that a fixed number ofN voter need to make decisions with possibly varying numbers of optionson a regular basis.

Every voter has a "voting account" whose balance at the beginning is setto a constant value, say 100.


A single decision is made like this:

1. Everybody fills in a ratings ballot in which she can use ratingsbetween 0 and the current balance of her voting account. If the latterhappens to be negative, she can use the rating 0 only. In addition, eachvoter also marks her favourite option amoung those she assigned thelargest rating to (if these are more than one).

2. Then, for each voter a die is tossed to decide to which group itbelongs: In case of 1 or 2, the voter is a "benchmark" voter, in case of3 or 4 a "deciding" voter, and in case of 5 or 6 a "compensating" voter.

3. The ballots of all "deciding" voters decide the election via rangevoting, that is, that option wins whose total rating on these ballots islargest.

4. For each option, determine the probability P(Y) of being a randomlychosen "benchmark" voter's favourite. These probabilities build the"benchmark lottery".


5. Finally, the voting accounts are adjusted like this:

a) Each deciding voter's account is increased by an amount equal to thetotal rating difference between the winner and the benchmark lotteryamoung the *other* deciding voters, minus some fixed fee F, say10*N^(1/2). (Note that the resulting adjustment may be positive ornegative.)

b) The compensating voter's accounts are decreased by the same totalamount as the deciding voter's accounts are increased, but in equalparts. (This may also be positive or negative)

With large N, both the benchmark and the deciding group are likely to bea representative sample, so the range decision of the deciding groupwill tend to equal the range decision amoung all voters, and thebenchmark lottery will tend to equal the ordinary random ballot lotterywith all voters. In particular, the method is quite efficient.Obviously, it is also neutral, monotonic, cloneproof, and anonymous(when the whole sequence of decisions is considered).

If the fee F is set so that the expected (in a reasonable model)adjustments of the individual accounts are all zero, then the method isalso quite just in the sense that voters who would favour a randomballot lottery over the actual winner get a compensation with which theycan increase their influence in subsequent decisions. Since the decidingand compensating groups are of roughly equal size, the variance of theindividual account adjustments then also tends to be equal for thedeciding and compensating voters.

The reason why we need compensating voters although no real money isused is that the "voting accounts" can be treated as virtual money ofconstant value only when their total amoung all voters remains constant.Otherwise, there would be "voting money inflation" or "deflation".

Finally, if I'm not mistaken, the method is still strategy-free forthese reasons: i) A benchmark voter's "favourite" mark does neitherinfluence the winner nor the voter's own account, so there is noincentive to misstate the favourite. ii) A deciding voter's ratings doinfluence the voter's account only by influencing the winner; the samearguments as above show that it is optimal for a deciding voter tosincerely state her true ratings. iii) A compensating voter's ratings tonot affect anything. Since you don't know in which group you end up,your optimal vote is the true ratings!



I would be very interested in your opinion on this!
Jobst
----
Election-Methods mailing list - see http://electorama.com/em for list info

[Election-Methods] a strategy-free range voting variant?

Reply via email to