Bayesian Mindset

Holden KarnofskyDec 21, 2021, 7:54 PM

73 points

Click lower right to download or find on Apple Podcasts, Spotify, Stitcher, etc.

This piece is about the in-practice pros and cons of trying to think in terms of probabilities and expected value for real-world decisions, including decisions that don’t obviously lend themselves to this kind of approach.
The mindset examined here is fairly common in the “effective altruist” and “rationalist” communities, and there’s quite a bit of overlap between this mindset and that of Rationality: A-Z (aka The Sequences), although there are some differing points of emphasis.¹ If you’d like to learn more about this kind of thinking, this piece presents a ~20-minute read rather than the >1000 pages of Rationality: A-Z.

This piece is a rough attempt to capture the heart of the ideas behind rationalism, and I think a lot of the ideas and habits of these communities will make more sense if you’ve read it, though I of course wouldn’t expect everyone in those communities to think I’ve successfully done this.

If you’re already deeply familiar with this way of thinking and just want my take on the pros and cons, you might skip to Pros and Cons.

This piece is about the “Bayesian mindset,” my term for a particular way of making decisions. In a nutshell, the Bayesian mindset is trying to approximate an (unrealistic) ideal of making every decision based entirely on probabilities and values, like this:

Should I buy travel insurance for $10? I think there’s about a 1% chance I’ll use it (probability—blue), in which case it will get me a $500 airfare refund (value—red). Since 1% * $500 = $5, I should not buy it for $10.

(Two more examples below in case that’s helpful.)

The ideal here is called expected utility maximization (EUM): making decisions that get you the highest possible expected value of what you care about.² (I’ve put clarification of when I’m using “EUM” and when I’m using “Bayesian mindset” in a footnote, but it isn’t ultimately that important.³)

It’s rarely practical to literally spell out all the numbers and probabilities like this. But some people think you should do so when you can, and when you can’t, use this kind of framework as a “North Star”—an ideal that can guide many decisions even when you don’t do the whole exercise.

Others see the whole idea as much less promising.

I think it’s very useful to understand the pros and cons, and I think it’s good to have the Bayesian Mindset as one option for thinking through decisions. I think it’s especially useful for decisions that are (a) important; (b) altruistic (trying to help others, rather than yourself); (c) “unguided,” in the sense that normal rules of thumb aren’t all that helpful.

In the rest of this piece, I’m going to walk through:

The “dream” behind the Bayesian mindset.
- If we could put the practical difficulties aside and make every decision this way, we’d be able to understand disagreements and debates much better—including debates one has with oneself. In particular, we’d know which parts of these disagreements and debates are debates about how the world is (probabilities) vs. disagreements in what we care about (values).
- When debating probabilities, we could make our debates impersonal, accountable, and focused on finding the truth. Being right just means you have put the right probabilities on your predictions. Over time, it should be possible to see who has and has not made good predictions. Among other things, this would put us in a world where bad analysis had consequences.
- When disagreeing over values, by contrast, we could all have transparency about this. If someone wanted you to make a certain decision for their personal benefit, or otherwise for values you didn’t agree with, they wouldn’t get very far asking you to trust them.
The “how” of the Bayesian mindset—what kinds of practices one can use to assign reasonable probabilities and values, and (hopefully) come out with reasonable decisions.
The pros and cons of approaching decisions this way.

The dream behind the Bayesian mindset

Theoretical underpinnings

There’s a common intuition (among mathematics- and decision-theory-minded people) that the sort of decision-making outlined at the beginning of this piece—expected utility maximization (EUM) - is the most “fundamentally correct” way of making decisions.

This intuition can be grounded in a pretty large and impressive academic literature. There are a large number of different theoretical frameworks and proofs that all conclude—in one way or another—something like:

Either you’re acting like someone who’s using EUM—assigning a probability and value to each possible outcome, and making the choice best for maximizing the expected value (of whatever it is that you care about) -

or you’re making decisions that are inconsistent, self-defeating, or have something else wrong with them (or at least have some weird, unappealing property, such as “When choosing between A and B you choose A; but when choosing between A, B and C you choose B.”)⁴

You can get an intro to the academic literature at this SEP article (read up to Section 4, which is about halfway). And you can read more about the high-level intuitions at this article by Eliezer Yudkowsky (key quote in footnote).⁵

The theorems don’t say you have to actually write down your probabilities and values and maximize the expected value, like the examples at the beginning of this piece. They just say that you have to act as if that’s what you’re doing. To illustrate the difference—most people don’t write down the number of calories in each bite of food before they eat it, then stop eating once they hit a certain number. But they act as if they do (in that most people do something approximating “eat a set number of calories each day”).

In real life, people are probably not even acting as if they’re doing EUM. Instead, they’re probably just doing the “inconsistent, self-defeating, or something else wrong with it” thing constantly. And that isn’t necessarily a big deal. We can make a lot of mistakes and have a lot of imperfections and still end up somewhere good.

But it’s interesting if the “ideal” version of myself—the one who has no such imperfections—always acts as if they’re (implicitly) doing EUM. It suggests that, if I try hard enough, I might be able to translate any decision into probabilities and values that fully capture what’s at stake.

Transparent values, truth-seeking probabilities

And that translation is exciting because it could allow me to clarify disagreements and debates, both with other people and within my own head.

In the world as it is, I often have a hard time telling what a disagreement or debate is supposed to be about. For example, take this House of Representatives debate⁶ on a proposal to increase spending:

One speaker (a Democrat) says: “Frankly, I think it’s probably surprising to some to see a President … who cares deeply about the future of America, who cares about the families who are in need, who cares about those who are sick … too many Americans are suffering and in crisis.”
In “retort,” another (a Republican) says: “Today’s solutions cannot be tomorrow’s problems … I am in favor of relief … However, what we are considering here today is not relief. Rather, we’re garnishing the wages of future generations … “
In “response” to that, the Democrat says: “This is necessary … We have heard it from the American public. I think the case is clear.”

…What is the actual disagreement here? … Are these two arguing about how valuable it is to help people today, vs. keeping wages high later? Or do they disagree about whether stimulus today means lower wages tomorrow? Or something else?

Some think the disagreement comes from Republicans’ just not caring about lower-income Americans, the ones who would benefit more from a stimulus. Others think it comes from Democrats not understanding how such a stimulus can affect the future.

In an idealized version of this debate, each side would give probabilities about how stimulus will affect the economy, and explain how they value those outcomes. In order for the two sides to reach different conclusions, they’d have to be giving specific different probabilities, and/or specific different valuation methods.

Then:

Values disagreements would be transparent—explicit for all to see. If Republicans conceded that the stimulus would help low-income Americans, but said they just didn’t value this much, they’d have to own the consequences of saying this.
Meanwhile, we’d be judging probability disagreements using an “objective truth” standard, since the disagreements are just about predictions and not about values. The disagreements would be crisp and clear (one side thinks spending more would cause some specific economic problem in the future, the other side does not) - not seas of words we couldn’t interpret. We could also look back later and see which side was closer to the mark with its predictions, and over time, this could turn into extensive documentation of which side makes better predictions.
Of course, a party could lie about how its arguments break down between probabilities and values. For example, someone might say “We value low-income Americans just as much, but we have different predictions about how the stimulus will affect them,” while secretly not valuing low-income Americans. But this kind of lie would require giving non-sincere probabilities—probabilities the speaker didn’t actually believe. Over time, this would presumably lead them to have a bad track record of making predictions.

When I’m arguing with myself, I often have the same sort of confusion that I have when watching Congress.

I tend not to know much about why I decide what I decide.
I often can’t tell which of my motives are selfish vs. altruistic; which of my beliefs are based on seeking the truth vs. wishful thinking or conformity (believing what I’m “supposed to” believe); and which thoughts are coming from my “lizard brain” vs. coming from the parts of myself I respect most.
The dream behind the Bayesian mindset is that I could choose some set of values that I can really stand behind (e.g., putting a lot of value on helping people, and none on things like “feeling good about myself”), and focus only on that. Then the parts of myself driven by “bad” values would have to either quiet down, or start giving non-sincere probabilities. But over time, I could watch how accurate my probabilities are, and learn to listen to the parts of myself that make better predictions.

The bottom line:

Normal disagreements are hard to understand and unravel, and prone to people confusing and manipulating each other (and themselves).
But disagreements broken into probabilities and values could be much easier to make progress on.
Values disagreements—pure statements of what one cares about, freed of any disagreements over how the world works—are relatively straightforward to understand and judge.
Probabilities disagreements—freed of any subjectivity—could be judged entirely based on evidence, reason, and (over time) results.

By practicing and trying to separate probabilities and values when possible, perhaps we can move closer to a world in which we communicate clearly, listen open-mindedly, learn from each other, make our decisions based on the most truth-tracking interpretation of the information we have, and have true accountability for being right vs. wrong over time.

Aiming for this also has some more practical potential advantages—good habits, helpful communication methods, etc. I’ll discuss those next.

The Bayesian mindset in practice

The Bayesian mindset means looking for opportunities to do any and all of the following:

Connect opinions to anticipated observations. When you have an opinion about what action to take, what concrete outcomes or situations are you picturing as a result of taking or not taking it? (E.g., “if we pass this bill, unemployment might fall”)
Assign probabilities. How probable are the outcomes and situations you’re picturing? How does the action change them? (E.g., “The probability of unemployment falling by at least 1 percentage point in the next year is 50% if we pass the bill, 20% if we don’t”)
Assign values. How much do you value the different outcomes compared to each other? (E.g., “It would be worth $X to reduce unemployment by 1 percentage point”)

It’s often the case that just articulating some possible outcomes, probabilities and values will shed a lot of light on a decision, even if you don’t do a full expected-utility maximization (EUM) listing everything that matters.

I find all of these 3 steps to be pretty interesting exercises in their own right.

#1 - connecting opinions to anticipated observations

When you say “Policy X would be a disaster,” what kind of disaster do you have in mind? Are you expecting that the disaster would be widely recognized as such? Or are you picturing the policy doing roughly what its supporters expect, and just saying you don’t like it?

In the Bayesian mindset, the “meaning” of a statement mostly⁷ comes down to what specific, visualizable, falsifiable predictions it points to.

“Meat is bad for you” usually means something like “If you eat more meat, you’ll live less long and/or in worse health than if you eat less meat.”
“This bill is bad for America” is ambiguous and needs to be spelled out more—does it mean the bill would cause a recession? A debt crisis? Falling life expectancy?
“What we are considering here today is not relief. Rather, we’re garnishing the wages of future generations.” means [???] It’s vague, and that’s a problem.

The Bayesian mindset includes habitually going for this kind of “translation.” I find this habit interesting because:

A lot of times it sounds like two people are violently disagreeing, but they’re just talking past each other or lost in confusions over words.
- Sometimes these kinds of disagreements can disappear in a puff with rationalist taboo: one person is saying “X is bad,” the other is saying “X is good,” and they try to break down their differing “anticipated observations” and sheepishly find they just meant different things by X.
- In addition to resolving some disputes, “translating to anticipated observations” has also gotten me used to the idea that it takes a lot of work to understand what someone is actually saying. I should be slower to react judgmentally to things I hear, and quicker to ask for clarification.
And other times it sounds like someone is making profound/brilliant points, but if I try to translate to anticipated observations, I realize I can’t concretely understand what they’re saying.
- A lot of expressed beliefs are “fake beliefs”: things people say to express solidarity with some group (“America is the greatest country in the world”), to emphasize some value (“We must do this fairly”), to let the listener hear what they want to hear (“Make America great again”), or simply to sound reasonable (“we will balance costs and benefits”) or wise (“I don’t see this issue as black or white”).
- Translating to anticipated observations can sometimes “strip away the sorcery” from words and force clarity. This can include my own words: sometimes I “think I believe” something, but it turns out to be just words I was thoughtlessly repeating to myself.

A couple more notes on the connection between this idea and some core “rationality community” ideas in this footnote.⁸

#2 - assigning probabilities

Say I’ve decided to translate “This bill is bad for America” to “This bill means there will either be a debt crisis, a recession, or high (>3%) inflation within 2 years.”⁹ Can I put a probability on that?

One relatively common viewpoint would say something like: “No. In order to say something is 20% likely, you ought to have data showing that it happens about 20% of the time. Or some rigorous, experiment-backed statistical model that predicts 20%. You can’t just describe some future event, close your eyes and think about it, call it 20% likely, and have that mean anything.”

The Bayesian Mindset viewpoint says otherwise, and I think it has a lot going for it.

The classic way to come up with a forecast is to pose the following thought experiment to yourself: Imagine a ticket that is worth $100 if the thing I’m trying to forecast comes true, and $0 otherwise. What’s the most I’d be willing to pay for this ticket (call this $A)? What’s the least I’d be willing to sell this ticket for (call this $B)? A/100 and B/100 are your low- and high-end “credences” (subjective probabilities) that the forecast will come true.

For example, what is the probability that fully self-driving cars (see “level 5” here for definition) will be commercially available by 2030? If I imagine a ticket that pays out $100 if this happens and $0 if it doesn’t:

I notice that there’s no way I’d pay $80 for that ticket.
There’s also no way I’d sell that ticket for $20.
So it seems that my subjective probability is at most 80%, and at least 20%, and if I had to put a single probability on it it wouldn’t be too crazy to go with 50% (halfway in between). I could narrow it down further by actually doing some analysis, but I’ve already got a starting point.
In this case, my numbers are coming from pretty much pure intuition—though thinking about how I would spend money triggers a different sort of intuition from e.g. listening to someone ask “When are we going to have !@#$ing self-driving cars?” and answering in a way that feels good in conversation.
In this and other cases, I might want to do a bunch of research to better inform my numbers. But as I’m doing that research, I’m continually improving my probabilities—I’m not trying to hit some fixed standard of “proof” about what’s true.

Does this actually work—do numbers like this have any predictive value? I think there’s a good case they can/do:

At a minimum, you can seek to become calibrated, which means that events you assign a 30% probability to happen ~30% of the time, events you assign a 40% probability to happen ~40% of the time, etc. Calibration training seems surprisingly quick and effective—most people start off horrifically overconfident, but can relatively quickly become calibrated. This often comes along with making fewer statements like “X is going to happen, I guarantee it,” and replacing them with statements like “I guess X is about 70% likely.” This alone is an inspiring win for the Bayesian mindset.
Scott Alexander puts up a yearly predictions post on all kinds of topics from world events to his personal life, where I’d guess he’s roughly following the thought process above rather than using lots of quantitative data. He not only achieves impressive calibration, but seems (informally speaking) to have good resolution as well, which means roughly that many of his forecasts seem non-obvious. More cases like this are listed here. So it seems like it is possible to put meaningful probabilities on all sorts of things.

“The art of assigning the right probabilities” can be seen as a more tangible, testable, well-defined version of “The art of forming the most correct, reasonable beliefs possible.”

For many, this is the most exciting part of the Bayesian mindset: a concrete vision of what it means to have “reasonable beliefs,” with a number of tools available to help one improve.

There’s a nascent “science of forecasting” on what sorts of people are good at assigning probabilities and why, which you can read about in Superforecasting.
When two people disagree on a probability, they can first try sharing their evidence and moving their probabilities toward each other. (If the other person has heard all your evidence and still thinks X is less probable than you do, you should probably be questioning yourself and lowering your probability of X, to at least some degree.) If disagreement persists, they can make a bet (or “tax on bullshit”), or just record their disagreement and check back later for bragging rights. Over time, someone’s track record can be scored, and their scores could be seen as a guide to how credible they are.
More broadly, the idea of “assigning the right probabilities” is a particular vision of “what it means to have reasonable beliefs,” with some interesting properties.
- For example, it provides a specific (mathematically precise) way in which some beliefs are “more correct than others,” even when there’s very little (or very inconclusive) evidence either way,¹⁰ and specific mathematical rules for changing your beliefs based on new evidence (one video explainer is here).
- This in turn supports a particular “nonconformist truth-seeker” worldview: the only goal of one’s beliefs is to assign the best probabilities, so one should be actively watching out for social pressure and incentives, “beliefs that are fun to express,” and anything else that might interfere with a single-minded pursuit of assigning good probabilities to predictions. I see a lot of Rationality: A-Z as being about this sort of vision.¹¹

The ultimate aspiration here is that disagreements generate light (quantitative updates to probabilities, accumulation of track records) instead of heat, as we collectively build the superpower of being able to forecast the future.

#3 - valuing outcomes

The Bayesian mindset generally includes the attitude that “everything can ultimately be traded off against everything else.” If a bill would reduce suffering this year but might lead to a debt crisis in the future, it should—in theory—be possible to express both benefits and risks in the same units.¹² And if you can express benefits and risks in the same units, and put probabilities on both, then you can make any decision via EUM.

The “everything can be traded off against everything else” mentality might explain some of the fact that Bayesian-mindset enthusiasts tend to be interested in philosophy—in particular, trying to understand what one really values, e.g. by considering sometimes-bizarre thought experiments. I think this is an interesting mentality to try out.

But in practice, valuing very different outcomes against each other is daunting. It often involves trying to put numbers on things in unintuitive and sometimes complex ways—for example, valuing a human life in dollars. (For a general sense of the sort of exercise in question, see this post.)

I think the “figuring out what you value, and how much” step is the least practical part of the Bayesian mindset. It seems most useful when either:

There is luckily some straightforward way of expressing all costs and benefits in the same terms, such as in the examples in the appendix. (More on this below.)
Or it’s worth doing all of the difficult, guess-laden work to convert different benefits into the same terms, which I think can be the case for government policy and for donation recommendations.

Use cases, pros and cons of the Bayesian mindset

Use cases

Using the full process outlined above to make a decision is pretty complex and unwieldy. For most decisions, I don’t think it would be helpful: it’s too hard to list all of the different possible outcomes, all of the different values at stake, etc.

But I think it can be a useful angle when:

There’s a discrete, important decision worth serious thought and analysis.
There’s a pretty clear goal: some “unit of value” that captures most of what’s at stake. The examples in the appendix are examples of how this can be approximately the case.
For whatever reason, one isn’t confident in normal rules of thumb and intuitions.
- The Bayesian mindset might be particularly useful for avoiding scope neglect: the risk of being insensitive to differences between different large numbers, e.g. “Helping 10,000 vs. 12,000 people.”
- I think most policymaking, as well as many decisions about how to handle novel situations (such as the COVID-19 pandemic), qualify here.
Sometimes one is able to identify one or two considerations large enough to plausibly “dominate the calculation,” so one doesn’t have to consider every possible decision and every possible outcome.
- A bit of a notorious example that I have mixed feelings about (to be discussed another day): Astronomical Waste argues that “Do as much good as possible” can be approximately reduced to “Minimize existential risk.” This is because a staggering number of people could eventually live good lives¹³ if we are able to avoid an existential catastrophe.

I think the COVID-19 pandemic has been an example of where the Bayesian mindset shines, generally.

The situation is unprecedented, so normal rules of thumb aren’t reliable, and waiting to have “enough evidence” by normal public-health-expert standards is often not what we want.
Most people I know either took extremely “cautious” or extremely “carefree” attitudes, but calculating your actual probability of getting COVID-19 - and weighing it against the costs of being careful—seems a lot better (ala the examples in the appendix). (Microcovid.org was built for this purpose, by people in the rationalist community.)
EUM calculations tend to favor things that have a reasonably high probability of being very helpful (even if not “proven”) and aren’t too costly to do, such as wearing masks and taking vitamin D supplements.

Bayesian habits

A lot of the appeal of the Bayesian mindset—and, I think, a lot of the value—comes not from specific decisions it helps with, but from the habits and lenses on the world one can get from it.

One doesn’t need to do a full EUM calculation in order to generally look for opportunities to do the three things laid out above: (a) connect opinions to anticipated observations; (b) assign probabilities and keep track of how accurate they are; (c) assign values (try to quantify what one cares about).¹⁴

I’ve done a fair amount of this, while not making the Bayesian mindset my only or even primary orientation toward decision-making. I think I have realized real, practical benefits, such as:

I’ve gotten quicker at identifying “talking past each other” moments in disagreements, and ensuring that we hone in on differing anticipated observations (or values). I’ve also gotten quicker to skip over arguments and essays that sound seductive but don’t have tangible implications. (I’m sure some would think I’m wrong to do this).
Based on my experience with estimating probabilities and making bets, I almost never “rule out” a possibility if someone else is arguing for it, and conversely I never fully plan around the outcomes that seem most likely to me. I think this is one of the most robust and useful results of putting probabilities on things and seeing how it goes: one switches from a natural mode of “If A, then B” to a habitual mode of “If A, then maybe B, maybe C, maybe D.” I think this has generally made me more respectful of others’ views, in tone and in reality, and I think it has improved my decision-making as well.
I’ve spent a lot of time consuming philosophy, interrogating my own values, and trying to quantify different sorts of benefits in comparable terms. Many of the calculations I’ve done are made-up, non-robust and not worth using. But there are also many cases in which the numbers seem both clear and surprising relative to what I would have guessed—often there is one factor so large that it carries a calculation. The most obvious example of this is gaining sympathy for (though not total conviction in) the idea of focusing philanthropy on animal-inclusive or longtermist work. I think the benefits here are major for philanthropy, and a bit less compelling on other fronts.

At the same time, I think there are times when the habits built by the Bayesian mindset can be unhelpful or even lead one astray. Some examples:

De-emphasizing information that tends to be hard to capture in an EUM framework. There are a lot of ways to make decisions that don’t look at all like EUM. Intuition and convention/tradition are often important, and often capture a lot of factors that are hard to articulate (or that the speaker isn’t explicitly aware of). The Bayesian mindset can cause over-emphasis on the kinds of factors that are easy to articulate via probabilities and values.

Here are examples of views that might not play well with the Bayesian mindset:

“Person X seems really good—they’re sharp, they work hard, they deeply understand what they’re working on at the moment. I’m going to try to generally empower/support them. I have no idea where this will lead—what they’re specifically going to end up doing—I just think it will be good.”
“I see that you have many thoughtful reasons to set up your organization with an unorthodox reporting structure (for example, one person having two bosses), and you have listed out probabilities and values for why this structure is best. But this is different from how most successful organizations tend to operate, so I expect something to go wrong. I have no idea what it is or how to express it as a prediction.”¹⁵
“Solar power progress is more important than most people think; we should pay more attention to solar power progress, but I can’t say much about specific events that are going to happen or specific outcomes of specific things we might do.”

It can be extremely hard to translate ideas with this basic structure into predictions and probabilities. I think the Bayesian mindset has sometimes led me and others to put insufficient weight on these sorts of views.

Modesty probabilities. I think that using the language of probability to express uncertainty has some major advantages, but also some pathologies. In particular, the “never be too confident” idea seems great in some contexts, but bad in others. It leads to a phenomenon I call “modesty probabilities,” in which people frequently assign a 1% or 10% chance to some unlikely outcome “just because who knows,” i.e., because our brains don’t have enough reliability or precision to assign very low probabilities for certain kinds of questions.

This in turn leads to a phenomenon sometimes called “Pascal’s Mugging” (though that term has a variety of meanings), in which someone says: “X would be a huge deal if it happened, and it’d be overconfident to say it’s <1% likely, so I’m going to focus a lot on X even though I have no particular reason to think it might happen.”

It’s debatable how comfortable we should be acting on “modesty probabilities” (and in what contexts), but at the very least, “modesty probabilities” can be quite confusing. Someone might intuitively feel like X is almost impossible, but say X is 1% or 10% likely just because they don’t know how to be confident in a lower probability than that.

The wrong tool for many. I’m personally a big fan of some of the habits and frames that come with the Bayesian mindset, particularly the idea of “intense truth-seeking”: striving to make my beliefs as (predictively) accurate as possible, even if this requires me to become “weirder” or suffer other costs. But this isn’t how everyone lives, or should live.

Some people accomplish a lot of good by being overconfident.
Others, by fitting in and doing what others seem to expect them to.
Others, by being good at things like “picking the right sort of person to bet on and support,” without needing any ability to make accurate predictions (about the specifics of what supporting person X will lead to) or have much sense of what “values” they’re pursuing.

I don’t think the Bayesian mindset is likely to be helpful for these sorts of people. An analogy might be trying to strategize about winning a football game using the language of quantum mechanics—it’s not that the latter is “wrong,” but it’s an ill-suited tool for the task at hand.

Furthermore, the Bayesian mindset seems like a particularly bad tool for understanding and learning from these sorts of people.

I often see Bayesian mindset devotees asking, “Why did person X do Y? What beliefs did that reflect? If they believe A they should’ve done C, and if they believe B they should’ve done D.” And in many cases I think this is an actively bad way of understanding someone’s actions and motivations.
I think many people have impressive minds in that they act in patterns that tend to result in good things happening, and we can learn from them by understanding their patterns - but they’re not well-described as doing any sort of EUM, and they may not even be well-described as having any anticipated observations at all (which, in a Bayesian framework, sort of means they don’t have beliefs). We won’t learn from them if we insist on interpreting them through the lens of EUM.

A final high-level point is that the Bayesian mindset is essentially a psychological/social “technology” with little evidence behind it and a thin track record, so far. The theoretical underpinnings seem solid, but there’s a large gulf between those and the Bayesian mindset itself. I think we should assume, by default, that the Bayesian mindset is an early-stage idea that needs a lot of kinks worked out if it’s ever going to become a practical, useful improvement for large numbers of people making decisions (compared to how they would make decisions otherwise, using some ill-defined mix of intuition, social pressure, institutional processes and norms, etc.)

Overall, I am an enthusiastic advocate for the Bayesian mindset. I think following it has real benefits already, and I expect that as people continue to experiment with it, the set of practices for making the most of it will improve. As long as we don’t conflate “an interesting experiment in gaining certain benefits” with “the correct way to make decisions.”

Appendix: simple examples of the Bayesian mindset

Example 1 (repeated from intro). Should I buy travel insurance for $10? I think there’s about a 1% chance I’ll use it (probability—blue), in which case it will get me a $500 airfare refund (value—red). Since 1% * $500 = $5, I should not buy it for $10.

Example 2. Should I move to Portland? I think there’s about a 50% chance that I’ll like it 1x as much (the same) as where I live now; a 40% chance that I’ll like it 0.5x as much (i.e., worse); and a 10% chance I’ll like it 5x as much (better). Since 50% * 1x + 40% * 0.5x + 10% * 5x = 1.2x, I expect to like Portland 1.2x as much as where I am now. So I’ll move. (If you aren’t following the math here, see my brief explanation of expected value.)

Example 3. Should I join two friends who’ve invited me to hang out (indoors :/ ) during the COVID-19 pandemic (February 2021 as I write this draft)?

I can estimate that this would mean a 1/2000 chance of getting COVID-19.¹⁶

How bad is it to get COVID-19? I’d guess it’s about a ¹⁄₅₀₀ chance of dying and losing 50 years (18250 days) of my life; a 10% chance of some unpleasant experience as bad as losing a year (365 days) of my life; a 50% chance of losing about 2 weeks (14 days); and the remaining ~40% of time I expect it to be no big deal (call it about 0 days).
So getting COVID-19 is as bad as losing ¹⁄₅₀₀ * 18250 + 10% * 365 + 50% * 14 + ~40% * 0 =~ 80 days of my life.

So joining my friends is about as bad as a 1/2000 chance of losing 80 days, which is like losing about an hour of my life. So I should join my friends if I’d trade an hour of my life for the pleasure of the visit.

Footnotes

There will be examples of connections between specifics parts of “rationalism” and specific aspects of the Bayesian mindset throughout this piece, generally in footnotes.
Here are a few examples of particularly core posts from Rationality: A-Z that emphasize the general connection to Bayesianism: Rationality: An Introduction, What Do We Mean By “Rationality?”, A Technical Explanation of Technical Explanation. See Twelve Virtues of Rationality for a somewhat “summarizing” post; most of its content could be seen as different implications of adhering to Bayesian belief updating (as well as expected value maximization), both of which are discussed in this piece. ↩
There is some subtlety here: strictly speaking, you should maximize the expected value of something you care about linearly, such that having N times as much of it is N times as good. So for example, while it’s better to have two functioning kidneys than one, an operation that has a 50% chance of leaving you with 2 functioning kidneys is not at all equivalent—and is a lot worse—than one with a 100% chance of leaving you with 1 functioning kidney. To do EUM, you need to rate every outcome using units you care about linearly. But this should always be possible; for example, you might say that 1 functioning kidney is worth 100 “health points” to you, and 2 functioning kidneys is worth only 101 “health points,” or 1.01x as much. And now you could maximize your “expected health points” and get reasonable results, such as: you’d much rather have a 100% chance of 100 “health points” than a 50% chance of 101. This is essentially how I handle the Portland example above. ↩
Throughout this post:
- “EUM” refers to making the decision that maximizes your expected value.
- “Bayesian mindset” refers to explicitly writing down your best-guess probabilities and/or values, and using these as tools to decide what to do.
You could maximize expected value without explicitly thinking that way (for example, you could just have an intuitive judgment about what’s good to do, and it might be right); conversely, you could use the tools of the Bayesian mindset to think about expected value, but ultimately fail to maximize it. ↩
This is weird because C is an “irrelevant alternative.” Adding it to your choice set shouldn’t change how you feel about A vs. B. For example, it’s weird if you choose vanilla ice cream when the only choices are vanilla and chocolate, but choose chocolate ice cream when the choices are vanilla, chocolate and strawberry. ↩
“We have multiple spotlights all shining on the same core mathematical structure, saying dozens of different variants on, ‘If you aren’t running around in circles or stepping on your own feet or wantonly giving up things you say you want, we can see your behavior as corresponding to this shape. Conversely, if we can’t see your behavior as corresponding to this shape, you must be visibly shooting yourself in the foot.’ Expected utility is the only structure that has this great big family of discovered theorems all saying that. It has a scattering of academic competitors, because academia is academia, but the competitors don’t have anything like that mass of spotlights all pointing in the same direction.
So if we need to pick an interim answer for ‘What kind of quantitative framework should I try to put around my own decision-making, when I’m trying to check if my thoughts make sense?’ or ‘By default and barring special cases, what properties might a sufficiently advanced machine intelligence look to us like it possessed, at least approximately, if we couldn’t see it visibly running around in circles?’, then there’s pretty much one obvious candidate: Probabilities, utility functions, and expected utility.” ↩
Starts at the 11:51:55 AM timestamp. It would’ve been more natural to pick a Presidential debate as an example, but all the 2016 and 2020 debates are just too weird. ↩
Putting aside the “values” part of the equation. ↩
The idea of making beliefs pay rent is connected to this section in a fairly obvious way.
A chunk of Rationality: A-Z is about communicating with precision (e.g., 37 Ways That Words Can Be Wrong).
Prizing beliefs that are precise and “pay rent” seems (for many, including me) to lead naturally to prizing science-based, naturalistic ways of looking at the world. A chunk of Rationality: A-Z is about reconciling the desire for sacred or transcendent experiences with an intense commitment to naturalism, e.g. The Sacred Mundane and Joy in the Merely Real. ↩
The basic idea here is that if we spend too much money, and this goes badly, the main ways it would ultimately go badly would be either (a) the spending means we need to raise taxes or cut spending later to balance the budget, which hurts growth (hence the “recession” reference); (b) the spending comes from borrowing, which creates too much debt, which leads to a debt crisis later; (c) the debt gets paid off by printing money, which leads to inflation. To do a more sophisticated version of this analysis, you’d want to get finer-grained about how big these effects could be and when. ↩
See this post for a vivid (if overly aggressive) statement of this idea. ↩
For example, see:
- Conservation of Expected Evidence, which promotes the somewhat counterintuitive (but correct according to this vision) idea that one should generally be as likely to change one’s mind in one direction as another. (If you expect to learn of more evidence for X, you should just adjust your probability of X upwards now.)
- Scientific Evidence, Legal Evidence, Rational Evidence and When Science Can’t Help, which argue that well-respected standards of evidence are “not fast enough” to come to good probabilities, and sometimes a good Bayesian needs to believe things that don’t meet the “standards of evidence” for these domains.
- These two posts arguing that one should see issues neither in black-and-white terms (where one side of an argument is certain) nor as a single shade of grey (where all sides are equally indeterminate). In my experience, this is a pretty distinctive property of probability-centric reasoning: instead of saying “X will happen” or “I don’t know,” one says e.g. “There’s a 70% chance X will happen.” ↩
One can ask: “If the two choices were X outcome and Y outcome, which would be better?”, “What about X outcome vs. a 50% chance of Y outcome?”, etc. In theory, asking enough questions like this should make it possible to quantify how much “better” (or “more choice-worthy”) one outcome is than another. ↩
My post on digital people gives one example of how this could come about. ↩
In fact, some parts of the rationalist community don’t emphasize “actually writing down probabilities and values” very much at all (and Rationality: A-Z doesn’t spend much space on guidance for how to do so). Instead, they emphasize various ideas and mental habits that are inspired by the abstract idea of EUM (some of which are discussed in this piece). FWIW, I think to the extent there are people who are trying to take inspiration from the general idea of EUM, while ~never actually doing it, this is probably a mistake. I think it’s important for people who see EUM as an ideal to get some experience trying to do it in practice. ↩
I actually can say a lot about how I expect this to go wrong, but at previous points in my life, I might’ve said something like this and not been able to say much more. ↩
Hopefully by the time this piece is public, the risk will be much lower. ↩

What links here?

Holden KarnofskyDec 21, 2021, 7:54 PM

73 points

19 comments25 min readEA link

Bayesian epistemology Decision theory

jsteinhardt Dec 21, 2021, 10:54 PM
14 points
0 ∶ 0

Re: Bayesian thinking helping one to communicate more clearly. I agree that this is a benefit, but I don’t think it’s the fastest route or the one with the highest marginal value. For instance, when you write:
A lot of expressed beliefs are “fake beliefs”: things people say to express solidarity with some group (“America is the greatest country in the world”), to emphasize some value (“We must do this fairly”), to let the listener hear what they want to hear (“Make America great again”), or simply to sound reasonable (“we will balance costs and benefits”) or wise (“I don’t see this issue as black or white”).
I’m immediately reminded of Orwell’s essay Politics and the English Language. I would generally expect people to learn more about clear, truth-seeking communication from reading Orwell (and other good books on writing) than by being Bayesian. Indeed, I find many Bayesian rationalists to be highly obscurantist in practice, perhaps moreso than the average similarly-educated person, and I feel that rationalist community norms tend to reward rather than punish this, because many people are drawn to deep but difficult-to-understand truths.
I would say that the value of the rationalist project so far has been in generating important hypotheses, rather than in clear communication around those hypotheses.
rdj Dec 31, 2021, 1:33 AM
8 points
0 ∶ 0

Good write-up. Something that has been nagging at me since reading is that I’m not sure this is specifically Bayesian. It’s not incompatible with Bayesian viewpoints, yet not exclusive to them either. When I try to describe this type of thinking to people I use the term “probabilistic”. In the Venn diagram of viewpoints, the “Bayesian” circle would fit entirely within the “probabilistic” circle. A non-Bayesian, such as someone with a frequentist interpretation of statistics, would still fit all of the reasoning given. This mindset would feel more Bayesian if anything depended on informed priors, or used a Bayesian updating formula, or directly cited Bayes theorem. Maybe what we need is a word for someone who believes in probabilistic rather than certain outcomes.
- Anthony DiGiovanni Dec 31, 2021, 4:05 AM
  3 points
  0 ∶ 0
  Parent
  
  At least based on #2, it does seem fair to call it “Bayesian” in contrast to frequentist philosophy, since the following objection sounds like a classic frequentist view:
  
  One relatively common viewpoint would say something like: “No. In order to say something is 20% likely, you ought to have data showing that it happens about 20% of the time. Or some rigorous, experiment-backed statistical model that predicts 20%. You can’t just describe some future event, close your eyes and think about it, call it 20% likely, and have that mean anything.”
  
  I used to TA an introductory stats class, and when I had to teach the frequentist concepts like confidence intervals, the lesson plans would very firmly hammer in the idea that the probability of some fixed parameter of nature having a given value is either 0 or 1, we just don’t know which one. Frequentists don’t endorse assigning probabilities to deterministic and unprecedented events in the future. (In case this is useful, I wrote/ranted about this and other upshots of Bayesianism here.)
  
  Arguably #1 is also especially Bayesian: the point is that your credence in some belief should be proportional to how much more likely some anticipated observations would be given that belief, than given its negation. That’s just the likelihood ratio in Bayesian updating.
  - rdj Dec 31, 2021, 4:34 PM
    1 point
    0 ∶ 0
    Parent
    
    While I understand that frequentism is based on the ratio of events, but I didn’t think it precluded making probabilistic opinions before any data exists. Can you explain more about how that is a ramification of frequentism? I suppose a frequentist might not ever say something is 20% likely in the absence of data or a proof that the outcome is 20% likely by definition. They might instead construct a hypothesis, which could be that something is 20% likely, and say that they can’t confidently reject the hypothesis. Although I’m not sure a typical Bayesian would literally say something is 20% likely either, but rather that they think something is 20% likely.
    The example that followed in the text, to derive one person’s estimate of the likelihood of some future event happening by imagining bets, seems like a tool that would work no matter how that person came to their probability estimates. And in the example the author reached that opinion through “pretty much pure intuition”, which seems neither specifically frequentist or Bayesian. Although it does seem more Bayesian to acknowledge intuition as an acceptable prior.
    I read #1 as arguing for assigning specific meaning to claims, setting up the problem in a way that can be quantified, ‘the “meaning” of a statement mostly comes down to what specific, visualizable, falsifiable predictions it points to’. That applies to frequentist people too.
    - Holden Karnofsky Jan 3, 2022, 7:56 PM
      3 points
      0 ∶ 0
      Parent
      
      antimonyanthony’s comment is pretty much what I had in mind. I would also point to Wikipedia on Bayesian epistemology: “It is based on the idea that beliefs can be interpreted as subjective probabilities. As such, they are subject to the laws of probability theory, which act as the norms of rationality.” The key idea is that all beliefs (as opposed to values) should be expressible via probabilities, regardless of what kind of data we have for interrogating them, whether they concern deterministic events, etc.
      
      I definitely don’t mean to imply that Bayesian mindset is incompatible with using frequentist statistical tools. I was more just highlighting the “beliefs = probabilities” idea that I think is at the heart of it.
      - rdj Jan 4, 2022, 11:29 PM
        1 point
        0 ∶ 0
        Parent
        
        Thanks for that link. I did not know that this is a term used to describe this viewpoint. I would expect frequentist statisticians to also agree with “beliefs = probabilities”, and when they do so it would feel odd to be able to say they are being (or acting) Bayesian when doing so. They could agree with much of the viewpoint in that Wikipedia page.
        Maybe the way I can reconcile this is to think of “Bayesian epistemology” and “Bayesian statistics” as two concepts inspired by the same source but with different breadths. Rather than only using Bayesian as a word to highlight the specific parts of a belief system that can’t be described by general probability, in epistemology we can use Bayesian as a broader term.
AppliedDivinityStudies Dec 22, 2021, 12:31 AM
6 points
0 ∶ 0


The wrong tool for many.… Some people accomplish a lot of good by being overconfident.

But Holden, rationalists should win. If you can do good by being overconfident, then bayesian habits can and should endorse overconfidence.

Since “The Bayesian Mindset” broadly construed is all about calibrating confidence, that might sound like a contradiction, but it shouldn’t. Overconfidence is an attitude, not an epistemic state.
- NunoSempere Dec 22, 2021, 1:44 PM
  12 points
  0 ∶ 0
  Parent
  
  bayesian habits can and should endorse overconfidence
  I disagree, Bayesian habits would lead one to the self-fulfilling prophecy point.
  - Ozzie Gooen Dec 23, 2021, 7:21 AM
    7 points
    0 ∶ 0
    Parent
    
    I like the idea of the self-fulfilling prophecy point, and expect prediction markets to work that way, but am still not sure if that’s the outcome I’d actually expect.
    
    I think it’s clearly true that there are at least some situations where dramatic overconfidence (above the self-fulfilling prophesy line) would make sense.
    
    That said, these situations might be quite contrived, and in real life, the benefits of the extra accuracy in many situations might outweigh the costs in a few.
    
    Entrepreneurs clearly gain specific benefits from portraying insanely optimistic stories, but perhaps more rational ones would lose some of these benefits but gain others.
    
    One could definitely argue that you could use the bayesian mindset to decide not to use the bayesian mindset in some settings, which is already definitely the case (there are many situations where it’s just too expensive, for example). Similar to how it’s possible to use a good decision theory in order to agree to use “insane decision theory X” in “insane situation where using insane decision theory X is optimal”.
  - Charles He Dec 23, 2021, 7:53 PM
    5 points
    0 ∶ 0
    Parent
    
    This is great! What tools did you use to draw this?
    - NunoSempere Dec 24, 2021, 10:21 AM
      9 points
      0 ∶ 0
      Parent
      
      Hey, thanks, https://excalidraw.com/
- Holden Karnofsky Jan 3, 2022, 7:57 PM
  2 points
  0 ∶ 0
  Parent
  
  It might be true that the right expected utility calculation would endorse being overconfident, but “Bayesian mindset” isn’t about behaving like a theoretically ideal utility maximizer—it’s about actually writing down probabilities and values and taking action based on those. I think trying to actually make decisions this way is a very awkward fit with an overconfident attitude: even if the equation you write down says you’ll do best by feeling overconfident, that might be tough in practice.
  - AppliedDivinityStudies Jan 4, 2022, 9:29 PM
    2 points
    0 ∶ 0
    Parent
    
    The tension between overconfidence and rigorous thinking is overrated:
    Swisher: Do you take criticism to heart correctly?
    
    Elon: Yes.
    
    Swisher: Give me an example of something if you could.
    
    Elon: How do you think rockets get to orbit?
    
    Swisher: That’s a fair point.
    
    Elon: Not easily. Physics is very demanding. If you get it wrong, the rocket will blow up.
    Cars are very demanding. If you get it wrong, a car won’t work. Truth in engineering and science is extremely important.
    
    Swisher: Right. And therefore?
    
    Elon: I have a strong interest in the truth.
    Source and previous discussion.
Ben_West🔸Dec 21, 2021, 11:29 PM
6 points
0 ∶ 0

I think there is a typo in your introductory example:
Should I buy travel insurance for $10? I think there’s about a 1% chance I’ll use it (probability—blue), in which case it will get me a $500 airfare refund (value—red). Since 1% * values = $500, I should not buy it for $10.
“Values” should be $500 and “= $500″ should be “= $5”. (This is fixed in your appendix.)
- Holden Karnofsky Dec 21, 2021, 11:53 PM
  4 points
  0 ∶ 0
  Parent
  
  Fixed, thanks!
sjsjsj Dec 23, 2021, 6:28 PM
5 points
0 ∶ 0

This was really great. As someone who has been lurking around LW/EA Forum for a few years but has never found reading the Sequences the highest-return investment compared to other things I could be doing, I very much appreciate your writing it.

A thought on something which is probably not core to your post but worth considering:

You said:

The dream behind the Bayesian mindset is that I could choose some set of values that I can really stand behind (e.g., putting a lot of value on helping people, and none on things like “feeling good about myself”), and focus only on that. Then the parts of myself driven by “bad” values would have to either quiet down, or start giving non-sincere probabilities. But over time, I could watch how accurate my probabilities are, and learn to listen to the parts of myself that make better predictions.

I think it’s perhaps… not feasible, or has long-term side effects, to think that if you currently care about feeling good about yourself, you can just decide you don’t and jump immediately to ignoring that need of yours. I would predict that taking that approach is likely to result in resistance to making accurate predictions or doing the things you endorse valuing, and/ or mysterious unhappy emotions because your need to feel good about yourself is not being met.

It seems to me that it would be better to use some method like Internal Double Crux to dialogue between the part of you that wants to generate good feelings by generating skewed predictions and the part of you that wants to help people, and find a way to meet the former part’s needs that don’t require making skewed predictions. An example of such an approach could be feeling good about yourself for cultivating more effective predictions. I imagine that’s implicit in the approach you describe, but it may be more effective to hold explicit space for the part that wants to feel good about itself, rather than making it wrong for generating skewed predictions.
- kokotajlod Dec 31, 2021, 9:46 PM
  15 points
  0 ∶ 0
  Parent
  
  +1. That paragraph to me reads like: “Here’s a neat trick by which you can forcibly self-modify to care about fewer, simpler, easier-to-measure things! Yay!”
  - Holden Karnofsky Jan 3, 2022, 7:57 PM
    4 points
    0 ∶ 0
    Parent
    
    I agree with what you say here, as a general matter. I’m not sure I want to make an edit, as I really do think there are some “bad” parts of myself that I’d prefer to “expose and downweight,” but I agree that it’s easy to get carried away with that sort of thing.
    - kokotajlod Jan 4, 2022, 5:05 PM
      2 points
      0 ∶ 0
      Parent
      
      Cool. We are on the same page then. :) I also agree that there are some bad parts of myself I’d prefer to expose and downweight.