Summary and Takeaways: Hanson’s “Shall We Vote on Values, But Bet on Beliefs?”

Lizka25 Aug 2021 0:43 UTC

38 points

Improving institutional decision-making Prediction markets Rethink Priorities Robin Hanson Research summary Forecasting Policy

Introduction

Robin Hanson’s “Shall We Vote on Values, But Bet on Beliefs?”^[1] is a foundational paper that outlines some ways to use prediction markets for policy and introduces the idea of futarchy, a form of government that puts prediction markets in charge of accepting and rejecting policies. Prediction markets are potentially promising for improving policymaking and institutional decision-making because they align the incentives of their participants with the main goals of the system by directly rewarding participants for accuracy. They also aggregate the information better than most other systems, like polling or voting. Prediction markets have a good empirical track record of producing accurate forecasts.

This post contains a summary of key parts of Hanson’s paper, some takeaways and thoughts on possible limitations of using prediction markets for policy, and a brief outline of the paper. I have also linked a document with an expanded outline of Hanson’s paper. Parts of this post are written in the spirit of red-teaming (for instance, in the expanded outline, I make note of some apparent logical gaps in Hanson’s paper).

If you have already read Hanson’s paper, you may wish to read the section of the post called “Key takeaways and some issues with the proposals.” If you choose to read parts of the expanded outline, you may find that sections 8, 9, and 11 are the most informative.

I read this paper as part of a project on prediction markets at Rethink Priorities, where I am interning this summer.^[2] If you find inaccuracies, misunderstandings, or misinterpretations on my part, I would appreciate it if you would let me know, as this is both useful for improving this post and for my overall project.

Proposals in the paper

In this paper, Hanson:

claims that poor information institutions^[3] are a major failing of democracy
claims that prediction markets are a better kind of information institution than most
outlines several proposals for using prediction markets (or “decision markets”) for policy decisions

What is a decision market?

Hanson views decision markets—a variation on prediction markets—as an excellent source of information, and builds his entire paper around the concept, so it is worth understanding the basic mechanism. A basic decision market is a pair of prediction (or “speculative”) markets in which each prediction market is conditional on an event, like a policy being accepted or rejected. It works as follows.

If an entity (e.g. a company) needs to make a big decision (e.g. choosing a new store location), it has different means of collecting information to inform this decision. The entity could consult experts, it could run trials, it could poll its own local employees, etc. It could also run a decision market or a prediction market to aggregate a group’s collective knowledge on the topic in a way that seems to outperform polling. To do this, the entity would host bets on whatever outcomes interest it (e.g. profits), and make these bets conditional on the different options that are available (e.g. a shortlist of locations). This involves setting up contracts that are rendered void if an option is not picked in the end.^[4] The entity can then extract information from the prices that naturally emerge for contracts that are conditional on different options, and use that information to come to a decision on the topic.^[5]

A hypothetical example of a decision market

Suppose a company is opening a new store, and wants to open it either in Arcadia or in Boston. The company can set up the market by declaring some incentives (like a fake currency) and encourage bets on the outcome of opening the store in Arcadia or in Boston (bets that the company will enforce). The company might announce that “shares of Arcadia” or “shares of Boston” are contracts that will eventually be worth N of the currency unit, where the value of N depends on the future net revenue from the corresponding store. The company will pay out Arcadia contracts at a specified time if Arcadia is chosen (in which case all trades about Boston will be reversed), and it will pay out Boston contracts if Boston is chosen (in which case Arcadia contracts will be reversed).

Now suppose two employees disagree about how much a store in Arcadia would bring in profits, and therefore about what one should pay for a share of Arcadia. Xander thinks that a share of Arcadia will be worth 80 units (ie. he thinks N will be 80), and Zoe thinks that a share will be worth 100 units. Conditional on a store being opened in Arcadia, Xander should be happy to owe the company N units if he receives, say, 95 units now. (He expects to earn 15 units off this trade, if a store is opened in Arcadia.) Similarly, Zoe should be content to exchange 95 units for a share of Arcadia. (She expects to earn 5 units off this trade.)

If Zoe buys a share of Arcadia from the company (i.e. agrees to give the company N units if a store is opened in Arcadia), she would not expect to make a profit; she expects to pay the company N = 100 units later, and if she is more optimistic about Arcadia than the average employee participating in the market, she should not expect to be able to trade her share of Arcadia for more than 100 units from another employee. However, Xander thinks that N will only be 80, so if he can exchange a share of Arcadia from the company for a promise to give the company N units later, and sell a share of Arcadia to Zoe for 95 units (a trade she is happy for, since she expects that it will be worth N = 100 units later), then he expects to make a profit of 15 units. (In the end, if a store is not opened in Arcadia, all of the Arcadia traders get reversed.)

As more people make such trades, the market price should stabilize, and shares of Arcadia or Boston should have relatively stable prices. Suppose that Xander was correct, so shares of Arcadia are worth around 80 units. Also, suppose that shares of Boston are worth around 120 units. This implies that the market—or the collective mind of the bettors—expects that the revenue from a store in Boston will be greater than revenue from a store in Arcadia. As a result, the company might choose to open a store in Boston, not Arcadia. (And all Arcadia-conditional trades get reversed.)

Small-scale proposals (decision markets for policy)

Hanson offers two small-scale proposals for using decision markets for policy— a very simple one and a more complicated one.

Let anyone create and participate in decision markets for advisory purposes. (Right now, prediction markets are currently heavily regulated and largely illegal in places like the United States; Hanson proposes to change this.) In particular, we would be able to set up a prediction market in which future outcomes of interest (e.g., future scores on some measure of welfare) are bet on, conditional on certain policies being accepted. The markets have some time to settle on prices, and then, ideally, for important policies, prices conditional on the policy being accepted will be significantly different from prices conditional on the policy being rejected. This is an indicator that (Hanson suggests) would describe the effect traders expect a certain policy to have on relevant outcomes. Policymakers can then consult this information when deciding whether to accept or reject the policy in question.
Let markets veto proposed bills. Per the proposal, an official measure of welfare is voted on and maintained by legislatures. A bill cannot become law if a market estimate of national welfare conditional on the bill becoming law is clearly lower than the market estimate of national welfare given the status quo (the bill not becoming law). (Futarchy, described below, is a larger and more complicated version of this.)

Hanson presents these as relatively ready-to-go options that can also help prepare the world for his large-scale proposal. For more on the small-scale proposals, you may also want to see sections 5 (“Corporate Governance Example”) and 6 (“Monetary Policy Example”) of the paper.

Large-scale proposal (Futarchy)

In section nine of his paper, Hanson outlines his vision of futarchy. Under futarchy, policies get adopted if and only if decision markets imply that the policy is likely to improve a measure of national welfare. What goes into the “official measure of welfare,” which can be thought of as an augmented version of GDP, is determined by national vote. (Hence the title, “Shall We Vote on Values, But Bet on Beliefs?”)

Under this system, legislatures pass bills on the creation and management of national welfare, on their own internal procedures, and on the process by which legislators are chosen. A proposed policy must clear some qualification tests and follow an agenda process. Finally, the markets (futarchy markets) decide whether to adopt the policy in the manner described below.

How futarchy markets work

A base asset (an alternative to money, like a bond or stock index fund) is defined. A second asset, the “welfare asset” (or “welfare future”), is created; the value of this asset is related to the official measure of welfare at any given time. Depending on what they want to do and their forecasts about the official measure of welfare, people can trade the welfare asset in exchange for the base asset.

An official process regularly declares national welfare (in order to resolve bets). A market is set up such that people can buy “shares of a policy” if they think that the policy would improve the measure of national welfare, or sell such shares if they think that it will not.

“Shares of a policy” are futures contracts on the welfare asset that are conditional on the policy passing. This means that if you buy a share of a policy, you are buying a contract that guarantees you will get a set amount of the welfare asset if the policy passes. If it does not, your contract will be dissolved and whatever you paid for it will be returned to you.

A valid proposal is adopted if the price of the welfare futures conditional on the policy being adopted (the price of “shares of the policy”) is clearly higher than the price of the welfare futures conditional on the policy not being adopted. If the policy is not accepted, bets that were conditional on the policy being accepted are called off. If the policy is accepted, shares of that policy become the appropriate amount of the welfare asset, which can be traded for the base asset at any time.

(The system is actually more complicated. For instance, policies that traders think will seriously harm welfare as it will be defined in the future get vetoed to provide a sanity-check. Note also that the fact that traders can also buy shares of the welfare asset conditional on the policy not happening makes the system purer and less dependent on estimates of whether a policy will be adopted.)

A simplified worked example of trading in futarchy

Trader Joan is considering whether to bet on a Policy P. Suppose that national welfare is currently evaluated at something that translates to 100 of our base asset unit (BA); if you own a “welfare asset,” you can exchange it for 100 BA. Joan believes that things are generally going downhill right now, so the welfare asset will be worth 65 BA if nothing changes. But she also believes that Policy P could really turn things around, and welfare would actually be at 140 if P were adopted. So she believes that, conditional on P being adopted, the welfare asset should be valued at around 140 BA. In theory, then, Joan should be willing to commit to buying shares of the welfare asset conditional on P (which we can also call “shares of P”) for anything below 140 BA, and to selling shares of P at anything above 140 BA.

Let’s say that Joan makes a trade with Trader Henry who thinks that Policy P will actually only bring things up to 100 and is thus willing to sell shares of P for anything above 100 BA. Let’s say that Henry sells a share of P to Joan for 130 BA.

If P is rejected, then the bet is void. If, however, P is adopted, then shares of welfare conditional on P simply transform into some amount of the welfare asset. Joan will be able to cash in her share of the welfare asset for however much it is worth at that point. If it turns out that she was right, and P’s adoption has brought welfare up to 140 BA, then Joan has earned 10 BA in the process— a reward for being right in her prediction about P’s potential.

Some time after trades on P open, the “market price” of a share of P, or the “fair value” of a share of P,^[6] should emerge. If people tend to think that P is likely to be very good, this market price should be high; they might collectively decide that paying 125 BA for P is “fair”. A similar market will be run on shares of welfare conditional on P being rejected. In this example, if Joan’s estimate was close, these shares should cost around 65 BA each.

In this (unrealistically extreme) scenario, the price of future shares of P is 125 BA, which is much higher than the price of future shares of not-P (65 BA), so this pair of markets implies that people believe that P is likely to increase the official measure of welfare. So P should be adopted, and by the main rule of futarchy, it becomes law.

Key takeaways and some issues with the proposals

Use of prediction markets for policy can be split into two different kinds:

using advisory markets as a source of information (as in the first small-scale proposal) and
hard-wiring markets to whether or not a policy gets accepted (as in futarchy, or, to a lesser extent, in the second small-scale proposal).

At their core, both of these align personal and global incentives by rewarding participants directly for making good predictions that benefit the nation or the organization. But both kinds of prediction markets for policy also have issues. Broadly, my impression is that the first system (advisory markets) has the potential to be very useful, but is not quite capable of taking full advantage of the power of markets to aggregate information. Putting decision markets directly in charge of policy decisions has some benefits, like eliminating intermediaries between the information the market provides and the decisions themselves. However, it also has more risks, some of which I note below, and many of which I will describe in a future post.

Some notes:

Next steps. Hanson considers private decision markets—decision markets run and supported by companies in order to improve their internal decision-making—as the most promising next step for using prediction and decision markets. Frequent private use of decision markets should help us identify issues in and improvements to the framework of decision markets (and prediction markets), and we would then be able to organize better decision markets for public use.
Pathways to impact. Work on decision and prediction markets impacts the world in two main ways. First, decision markets might be technical tools for improving institutional decision-making by aligning decisions with the specific goals of an institution. For instance, Hanson claims that futarchy would better align policy decisions with voters’ actual values. However, one can imagine that this would simply more efficiently execute flawed goals; if a national measure of welfare in futarchy excludes future people and non-human animals, for instance, policies that are accepted will probably often seem harmful from the points of view of longtermists and those who care about animal welfare.^[7] A second pathway to impact could be through improving decision-making within the Effective Altruism movement.
Other criticisms of the paper. Hanson presents many valuable ideas in the paper, but it does have some aspects which point to a lack of care (e.g. there are typos in the numbering of the sections) and some poor epistemic norms, like a lack of explanations for many of Hanson’s terms. One example of this is his phrase “info institutions,” which he uses to describe things like prediction markets and expert polls, but which he never actually defines. This allows him to do things like favorably compare prediction markets to “other info institutions” without being explicit about what he is considering. Occasionally, Hanson also makes strong claims that he does not explain,^[8] or cherry-picks certain examples to fit an argument.^[9] Finally, Hanson’s stated aims for the paper (making the possible benefits of large-scale use of public decision markets for policy plausible enough to encourage field tests in the form of small-scale use of private decision markets) do not quite seem to match with his conclusions in some sections, like his implication the small-scale proposals are more theoretically flawed than futarchy.

Issues with the proposals

Hanson should be commended for listing 25 objections within the body of the paper itself, although he seems to be too casually dismissive of some of them (possibly due to length constraints). Below are some issues to which Hanson’s paper gives no attention or gives less attention to than I believe is warranted:

Causality might diverge from conditionality in the case of advisory/indirect markets.^[10] Traders are sometimes rewarded for guessing at hidden info about the world—information that is revealed by the fact that a policy decision was made—instead of causal relationships between the policy and outcomes.^[11]
1. For instance, suppose a company is running a market to decide whether to keep an unpopular CEO, and they ask if, say, stocks conditional on the CEO remaining would be higher than stocks conditional on the CEO not staying. Then traders might think that, if it is the case that the CEO stayed, it is likely that the Board found out something really great about the CEO, which would increase expectations that the CEO would perform very well (and stocks would rise). So the market would seem to imply that the CEO is good for the company even if they were actually terrible.
Thin markets, or markets where there are few buyers and sellers, will be less accurate. Thin markets are often more volatile (prices shift rapidly) and less efficient or accurate than liquid markets, where there are many buyers and sellers. Policy-oriented prediction markets could become much thinner for policies that are complicated or which do not affect the interests of sufficient numbers of people or of sufficiently rich people.^[12]
1. For instance, if a policy is technical and affects only, say, the agricultural practices of a specific area, there may not be enough natural interest in it, and all but a few people may believe that it is not worth their time to learn the details of how the policy would affect welfare. As a result, the final prices would be based on very little information: the best guesses of a few traders.
Maintaining a careful and aligned measure of welfare is likely to be extremely difficult. It is hard to capture everything we value as a society (especially on different levels, like cities and states), and it would also be very difficult to avoid manipulations. Hanson notes this issue (in objections 13-15, 22-23), but does not treat it with the seriousness it deserves. Additionally, Hanson occasionally proposes modifying the measure of welfare to fix other issues, and this is an added complication.
1. A simpler measure of welfare might, for instance, prompt blind maximization of something that is not quite aligned with our values. If we try to compensate by adding everything we value, however, we may encounter issues of corruption in the measurement processes for certain parameters, encode policies in our measure of welfare (an oversimplified example of this is adding miles of roads built to the measure of welfare), or create a more messy system by attempting to solve other problems (e.g. on page 24, Hanson mentions the possibility of agreeing, by treaty, to give welfare weight to other nations’ welfare).
We would like to extract the expected consequences of proposed policies, but the market setup might diverge from expected values in various ways.
1. For instance, traders may change their bets depending on the risks they are taking (just as most people should pay less than 10,000 dollars for a coin flip that will give them either 0 or 20,000 dollars), which would push market prices away from expected values.
This system grants more power to the rich. Even though the values of the poor are, in theory, weighed in the official measure of welfare, their beliefs will not ultimately be granted as much weight as those of the rich. Additionally, someone with the means and motivation could exploit weaknesses of the system to push a set of markets in a given direction. The combined power of the market can correct this to a certain extent, but even then, it is possible that the extremely wealthy of the near future will be orders of magnitude wealthier than at present. (There are ways we can begin to address this issue, but they do not currently seem satisfying enough. I will elaborate on this more in my next post.)

Misconceptions: below are also a couple of things that might first seem like critical issues, but which do not actually pose risks to futarchy (in theory).

You cannot simply buy a bunch of shares to get a policy accepted. One might think that a rich participant in futarchy can easily tank a policy by selling shares of the policy very cheaply (to lower the price), or boost a policy by buying its shares for high prices. However, if everything works as it should, the market will largely correct for this possibility.^[13] Traders will notice that something is not right and either buy those cheap shares and re-sell them for a lot more, raising the price back up (in the first case, when the rich buyer is trying to lower the price and make the policy fail), or sell the policy’s shares to the rich bettor for large amounts and then use those profits to buy more shares of the policy for cheaper amounts that are closer to the market price, lowering the market price down (in the second case). In the end, the manipulation effort adds liquidity to the market.
Hedging does not distort market prices. Entities with interests outside of the prediction markets could hedge against a policy that they expect will harm them by buying shares of that policy (in order to have a new source of profit if the policy passes and does harm them financially). We might expect that to distort market prices. However, hedging like this only pays if hedgers buy policies that will actually be good for national welfare, since shares of the policy will only become a source of profit if adopting that policy leads to an increase in welfare—but that is exactly the kind of policy whose shares we want to encourage people in futarchy to buy. This means that hedging should actually support futarchy by adding incentives for people to trade, and thus (once again) adding liquidity.

Two outlines of the paper

This document contains an expanded outline of the paper (with some commentary and notes).

Also, below is a very sparse outline.

Part 1. Info failures^[14] of democracy (starts p3)

Makes the claim that one of the major weaknesses of democracy is its inability to integrate good information into policy choices.

Part 2. Info successes of speculative markets (p6)

Makes the claim that speculative markets consistently outperform other information institutions^[15]

Part 3. Measuring welfare (p10)

Acknowledges that agreeing on a consistent or national way of measuring welfare (to settle bets) will be difficult, but claims that with more resources, we will be able to come up with something reasonable.

Part 4. Decision market mechanics (p11)

Explains how markets can be used to estimate conditional probabilities and expected values, and sets up a system for tying this into betting on how much policies will improve national welfare.

Part 5. Corporate Governance Example (p14)

Outlines a way for a company to decide whether to keep or dump its CEO based on a market it runs.

Part 6. Monetary policy example (p16)

Outlines a way to determine interest rates with a decision market.

Part 7. The engineering of institutions (p17)

Explains some thoughts on the structure of the paper and how one should read it.

Part 8. Simple Approaches (p18)

Proposes two smaller ways prediction markets could be used for making policy decisions.

Part 9. A reference proposal (p20)

This is the main place where Hanson explains his idea of Futarchy: using a prediction market as the main mechanism by which new policies are approved or rejected.

Part 11 (sic). Welfare Possibilities (p24)

Tries to convince readers that there is some reasonable way to democratically put a number on national welfare.

Part 13 (sic). Objections (p25)

Lists 25 possible objections and responds to them.

Part 14. Conclusion

Credits

This essay is a project of Rethink Priorities.

It was written by Lizka, an intern at Rethink Priorities. Thanks to Robin Hanson for the source material and some feedback, and to Eric Neyman and Scott for conversations about prediction markets and futarchy. Thanks also to Michael Aird, Marie Buhl, and Linch Zhang for their extremely helpful feedback. Any mistakes are (still) the fault of Linch Zhang, my supervisor. If you like our work, please consider subscribing to our newsletter. You can see all our work to date here

Notes

↩︎
HANSON, ROBIN. “Shall We Vote on Values, but Bet on Beliefs?” Journal of Political Philosophy 21, no. 2 (2013): 151–78. https://doi.org/10.1111/jopp.12008.
↩︎
The project entails studying the current status of work on the possibilities of prediction markets for policy, and compiling some possible issues or open questions on them.
↩︎
“Information institution” or “info institution” is a term Hanson uses for (as far as I can tell) institutions that provide public information of various kinds; this includes markets (prices are information), “experts,” and groups that conduct polling. He does not provide a list of the “information institutions” he considers.
↩︎
More details on how to set this up (and a brief discussion on when it would or would not work particularly well) can be found in another paper by Hanson, “Decision Markets for Policy Advice”.
↩︎
In order to properly align incentives of participants and decision-makers, decision market scoring should be different from that of regular prediction markets, as explained in this paper. Also, “Prediction Markets as Decision Support Systems,” has some more discussion on past uses of decision markets.
↩︎
“Market price” and “fair value” actually diverge significantly in some contexts, but should not do so too much under Hanson’s proposal (due to the efficient-market hypothesis). (In a future post, however, I will likely examine some situations in which market price and fair value might actually be distinct.)
↩︎
This post describes two “pathways by which better institutional decisions can lead to better outcomes.” First, “an institution’s mission or stated goals can become more altruistic,” which does not clearly happen under futarchy. And second, “an institution can take steps to increase the likelihood that its decisions will lead to the outcomes it desires,” for which, Hanson argues, futarchy and decision markets are useful. The authors note that, “Because this type of work treats an institution’s goals as a given, we must be careful to avoid scenarios in which improving the technical quality of decision-making at an institution yields outcomes that are beneficial for the institution but harmful by the standards of the ‘better world’ definition above.”
↩︎
E.g. Wealth differences between nations largely result from some nations adopting worse economic policies, in turn due to “huge info institution failure in the nations that refused to believe in and adopt the clearly better policies. Some explain this as due to irrational and proud voters.” On page 6.
↩︎
One example of what seems like cherry-picking is near the bottom of page 5, where Hanson’s evidence for voters’ general “proud unwillingness to defer to expert opinion” is a complicated list of examples.
↩︎
Advisory markets are those where the differences in market prices are not hardwired to the decision of whether a policy should be accepted or not.
↩︎
This issue might be resolvable via proxy measurements or by some clever set of markets, but the issue should not be brushed off immediately. Additionally, some solutions to issues like this have been proposed, but require more work to be generalizable and careful. I will probably elaborate on this issue in my next post.
↩︎
There are promising suggestions for ways to mitigate this issue. I will go into this more in my next post.
↩︎
There has been some doubt about the extent to which markets can correct for this sort of manipulation. See this post for one example.
↩︎
This is Hanson’s term, which he does not carefully define but in which he does include inaccuracies in expert and opinion polling.
↩︎
See footnote 3, above, about Hanson’s use of the term “information institution.”

What links here?

Lizka25 Aug 2021 0:43 UTC

38 points

12 comments14 min readEA link

Improving institutional decision-making Prediction markets Rethink Priorities Robin Hanson Research summary Forecasting Policy

Marcel2 25 Aug 2021 5:11 UTC
4 points
0 ∶ 0
I was glad to see the mention of the correlation=/=causality issue. To clarify, is the following similar to what you had in mind: if you are trying to analyze the effects of some kind of sanction or other foreign policy measure, simply asking “If the USFG does X (e.g., imposes some limited sanctions), what is the probability of Z (e.g., the sanctioned country ceases its activity)?” doesn’t necessarily tell you the effects of X if traders are thinking “If the USFG has the political will to do X, it’s likely they will also do Y (e.g., impose heavier sanctions)—and Y is what will actually cause Z.” Alternatively, it might be: “If the USFG resorts to doing X, it means policymakers likely don’t have the political will to also do Y—and so doing X is a sign that they won’t do Y, which is what will actually cause Z.”

That being said, I noticed you also said “You cannot simply buy a bunch of shares to get a policy accepted”

While this may be a valid point in theory, it is a very crucial assumption and I think deserves a bit more skepticism than you provide. In reality, there can be market conditions where people are unsure of whether a policy’s shares are being bought because of (a) insider trading (or any other form of smart trading), (b) dumb trading/trading mistakes (e.g., fat finger trades), (c) price manipulation (i.e., pump and dump schemes or other things to induce dumb trading and make a profit), (d) price distortion (e.g., to do the thing you describe), etc. If a trader in the market is unsure of whether it’s because of something like (a) vs. any of the other options, then they may not be willing to risk millions of dollars to “correct” the prices (which might already be correct). This problem may be exacerbated if the markets are thin (although obvious attempts at manipulation, e.g., when there is no possibility for insider information, will probably actually improve these markets).

Ultimately, I went through a very brief hype and disillusionment cycle with the idea of futarchy (although I am still very much a proponent of crowdsourced forecasting such as prediction markets), and that is one of the reasons. I definitely think there are areas where prediction markets for policy could theoretically be tried/beneficial, but I think any such attempt would have to be very carefully implemented.
- Linch 25 Aug 2021 6:51 UTC
  6 points
  0 ∶ 0
  Parent
  I was glad to see the mention of the correlation=/=causality issue. To clarify, is the following similar to what you had in mind:...
  I can’t speak for Lizka, but I do think the generalization of what you said (whether decisionmakers make specific choices tells us hidden information about their motivations and/or abilities) is an important subset of the potential issues. However, there are other issues. A more general version is that decisionmakers’ choices may give us information about world-states that either they have access to and forecasters don’t, or (if this is a prediction about a future decision, and most predictions are about the future) information neither forecasters nor decisionmakers currently have access to, but decisionmakers are expected to get access to after the forecast’s time but before decisionmakers make said decision.
  
  An example of the latter actually happened live in a (private) covid forecasting tournament last year.
  
  I might be butchering details a little, but basically we were asked whether and how much severe lockdowns in the future will result in reduced deaths. After some consideration, a reasonable fraction of forecasters, myself pretty loud among them, concluded that given the information available to us at the time, lockdowns is most likely correlated with increased deaths, since decisionmakers in that country in the future will know stuff about the trajectory of covid that neither them nor us currently have access to, and the decisionmakers will most likely only issue lockdowns if it looks like the number of deaths would be sufficiently high.
  
  This was a ~unpaid tournament where only pride and a desire to do good was on the line, so we were pretty open with our reasoning. I can imagine a much stronger desire (and incentive) to be circumspect about our reasons in a market setting.
  
  Note that this is probably only relevant to advisory markets, and not futarchy.
  - Lizka 2 Sep 2021 0:59 UTC
    1 point
    0 ∶ 0
    Parent
    lockdowns is most likely correlated with increased deaths [since...] decisionmakers will most likely only issue lockdowns if it looks like the number of deaths would be sufficiently high
    That is a really interesting illustration of the general causality =/= conditionality issue I mention in the post (and which Harrison elaborates on), thank you!
    I agree that the generalization—the fact that a decision is made reveals currently unavailable information—is the key point, here, and Harrison’s interpretation seems like a reasonable and strong version or manifestation of the issue.
- Lizka 2 Sep 2021 1:03 UTC
  1 point
  0 ∶ 0
  Parent
  On buying a bunch of shares to get a policy accepted:
  I agree that there would be scenarios in which manipulation by the wealthy is possible (and likely would happen), and you describe them well (thank you!). I mainly wanted to clarify or push back against a misconception I personally had when I initially read the paper, which was that this system basically grants decision-power entirely to those who are rich and motivated enough. The system is less silly than I initially thought, because the manipulation that is possible is much harder and less straightforward than what one might naively think (if one is new to markets, as I was).
Nathan Young 28 Aug 2021 17:37 UTC
2 points
0 ∶ 0
A large question I have is “Why haven’t any EA orgs used futarchy?”

It seems to me that it’s easiest to implement in a small organisation. The fact that is seen nowhere (to my knowledge) suggests it can’t be as clearcut as it first seems.

Some suggestions for how it would be used if people were convinced:
- Forecast GiveWell scores, investigate charities with high ones
- Forecast fund evaluations
- Forecast community metrics under different large scale community decisions
- Set up an EA org that runs on a futarchy
In my mind, the fact that these aren’t happening despite being possible suggests there must be more flaws than those raised in this peice.

If I had to guess it’s that:
- Futarchy seems to weird
- EAs (like everyone else) are concerned that such an inflexible system would lead to unpredictable badness and so push away from creating it.
- Linch 28 Aug 2021 22:53 UTC
  13 points
  0 ∶ 0
  Parent
  Unless I’m missing something, thin markets and the difficulties of measuring value in sufficiently precise ways should be enough to mostly doom futarchy attempts in EA organizations.
  
  Advisory markets or just frequent betting seems more plausible but still hard.
  
  For example, if we try to do a market on whether or not Lizka should write this post or not, I just don’t (currently) see a way for us to have sufficiently large and sufficiently precise definitions of welfare to make welfare conditional predictions on whether or not Lizka should do this post.
  However, I can imagine some betting or a very lightweight prediction market to resolve disagreements on specific interesting proxies(eg “will this post have >50 karma”, “will any work be built on top of this in <3 years”, “will Lizka think this post is a good use of her time 2 months after publication”, “will this be complete by Y date”) , in addition to the project forecasts we currently have.
  More generally I’m skeptical that markets are an unusually efficient way to convert information into prices in smaller ecosystems. Like it’s very rare that stuff like internal hiring and conference room bookings within a company is allocated through prices, and the few exceptions I’m aware of do not, to the best of my knowledge, become unusually successful companies.
  - Lizka 2 Sep 2021 1:15 UTC
    1 point
    0 ∶ 0
    Parent
    I basically agree with Linch’s answer, and just want to add that a futarchy-like system (or even, likely, coherent use of prediction markets) would require a lot of management/organizational support (in addition to subsidization, probably, to push back against thin markets), and management/operations already seems like a current bottleneck in EA.
    (I’m also unconvinced that EA is the best place to kickstart something like using prediction markets, since people in EA are presumably already incentivized to make decisions that are likely to produce good outcomes and to share information they feel is relevant to those decisions. The strength of futarchy is (in theory) channeling private monetary/profit incentives towards common values or a kind of communal good, so it makes more sense outside of communities that are inherently allied under a common project. I might be quite wrong, though, and would be interested in possible counter-arguments.
    On a similar note, my understanding is that Hanson considers medium to large and private companies as the ideal place to kickstart the use of prediction markets, with the idea that eventually, the techniques developed as prediction markets are used and improved in that sphere can also be used for direct public benefit.)
Nathan Young 28 Aug 2021 17:26 UTC
2 points
0 ∶ 0
1. Maintaining a careful and aligned measure of welfare is likely to be extremely difficult. It is hard to capture everything we value as a society (especially on different levels, like cities and states), and it would also be very difficult to avoid manipulations. Hanson notes this issue (in objections 13-15, 22-23), but does not treat it with the seriousness it deserves. Additionally, Hanson occasionally proposes modifying the measure of welfare to fix other issues, and this is an added complication.
  A simpler measure of welfare might, for instance, prompt blind maximization of something that is not quite aligned with our values. If we try to compensate by adding everything we value, however, we may encounter issues of corruption in the measurement processes for certain parameters, encode policies in our measure of welfare (an oversimplified example of this is adding miles of roads built to the measure of welfare), or create a more messy system by attempting to solve other problems (e.g. on page 24, Hanson mentions the possibility of agreeing, by treaty, to give welfare weight to other nations’ welfare).
I’m not sure that it would be any harder than in a society without futarchy. In some ways I think it’s quite neat that Hanson acknowledges this would be the problem of the legislator and that people could vote for politicians they thought would derive a good function.

All the bad situations I can think of here apply to current societies so it seems harsh to judge futarchy by those standards.
- Lizka 2 Sep 2021 1:27 UTC
  3 points
  0 ∶ 0
  Parent
  I agree that we’re not currently good at “maximizing welfare,” but I worry that futarchy would lead to issues stemming from over-optimization of a measure that is misaligned from what we actually want. In other words, my worry is that common sense barriers would be removed under futarchy (or we would lose sight of what we actually care about after outlining an explicit welfare measure), and we would over-optimize whatever is outlined in our measure of welfare, which is never going to be perfectly aligned to our actual needs/desires/values.
  This is a version of Goodhart’s Law: “When a measure becomes a target, it ceases to be a good measure.” (Or possibly Campbell’s law, which is more specific.)
Nathan Young 28 Aug 2021 17:21 UTC
2 points
0 ∶ 0
1. Thin markets, or markets where there are few buyers and sellers, will be less accurate. Thin markets are often more volatile (prices shift rapidly) and less efficient or accurate than liquid markets, where there are many buyers and sellers. Policy-oriented prediction markets could become much thinner for policies that are complicated or which do not affect the interests of sufficient numbers of people or of sufficiently rich people.^[12]
  For instance, if a policy is technical and affects only, say, the agricultural practices of a specific area, there may not be enough natural interest in it, and all but a few people may believe that it is not worth their time to learn the details of how the policy would affect welfare. As a result, the final prices would be based on very little information: the best guesses of a few traders.
I’m not sure this argument holds up:
1. If there is an agricultural practise that will be effected the poeple in that area can bet on the market. Unlike voting for a general political campaign, they will have a big impact on the result. They will have good incentives to bet in order to change the result or to offset losses in the case it goes badly.
2. This doesn’t seem worse than status quo. Decision are already made on the basis of a few people’s opinions, often without any sense of track record that the profit and loss here would provide.
Nathan Young 28 Aug 2021 17:16 UTC
2 points
0 ∶ 0
Thanks for writing this. I enjoyed it and I’d like to look more at the paper.
1. Causality might diverge from conditionality in the case of advisory/indirect markets.^[10] Traders are sometimes rewarded for guessing at hidden info about the world—information that is revealed by the fact that a policy decision was made—instead of causal relationships between the policy and outcomes.^[11]
  For instance, suppose a company is running a market to decide whether to keep an unpopular CEO, and they ask if, say, stocks conditional on the CEO remaining would be higher than stocks conditional on the CEO not staying. Then traders might think that, if it is the case that the CEO stayed, it is likely that the Board found out something really great about the CEO, which would increase expectations that the CEO would perform very well (and stocks would rise). So the market would seem to imply that the CEO is good for the company even if they were actually terrible.
I don’t understand the point here. If the market is the only chooser, the traders would be stupid to assume that some other reason made the board choose. If the board chose based on the market and other features, then yes the market would be predicting given that choice. This seems like a restatement of the theory rather than an issue.

Am I misunderstanding?
- Lizka 2 Sep 2021 1:31 UTC
  3 points
  0 ∶ 0
  Parent
  I don’t think you are misinterpreting; this issue is applicable when the market is advisory or indirect (not hard-wired to decisions, like futarchy is—that has its own issues). There’s a longer discussion of this issue in the thread that starts with Harrison’s comment.