There’s also a lot of pseudo-superforecasting, like “I have 80% confidence in this”, without any evidence backing up those credences.
From a bayesian perspective there is no particular reason why you have to provide more evidence if you provide credences, and in general I think there is a lot of value in people providing credences even if they don’t provide additional evidence, if only to avoid problems of ambiguous language.
From a bayesian perspective there is no particular reason why you have to provide more evidence if you provide credences
I’m not sure I know what you mean by this.
I’d agree that you’re definitely not obligated to provide more evidence, and that your credence does fully capture how likely you think it is that X will happen.
But it seems to me that the evidence that informed your credence can also be very useful information for people, both in relation to how much they should update their own credences (as they may have info you lack regarding how relevant and valid those pieces of evidence are), and in relation to how—and how much—you might update your views (e.g., if they find out you just thought for 5 seconds and went with your gut, vs spending a year building expertise and explicit models). It also seems like sharing that evidence could help them with things like building their general models of the world or of how to make estimates.
(This isn’t an argument against giving explicit probabilities that aren’t based on much or that aren’t accompanied by explanations of what they’re based on. I’m generally, though tentatively, in favour of that. It just seems like also explaining what the probabilities are based on is often quite useful.)
(By the way, Beard et al. discuss related matters in the context of existential risk estimates, using the term “evidential reasoning”.)
This is in contrast to a frequentist perspective, or maybe something close to a “common-sense” perspective, which tends to bucket knowledge into separate categories that aren’t easily interchangeable.
Many people make a mental separation between “thinking something is true” and “thinking something is X% likely, where X is high”, with one falling into the category of lived experience, and the other falling into the category of “scientific or probabilistic assessment”. The first doesn’t require any externalizable evidence and is a fact about the mind; the second is part of a collaborative scientific process that has at its core repeatable experiments, or at least recurring frequencies (e.g., see the frequentist argument that it is meaningless to assign probabilities to one-time events).
Under some of these other non-bayesian interpretations of probability theory, an assignment of probabilities is not valid if you don’t associate it with either an experimental setup, or some recurring frequency. So under those interpretations you do have an additional obligation to provide evidence and context to your probability estimates, since otherwise they don’t really form even a locally valid statement.
Thanks for that answer. So just to check, you essentially just meant that it’s ok to provide credences without saying your evidence—i.e., you’re not obligated to provide evidence when you provide credences? Not that there’s no added value to providing your evidence alongside your credences?
If so, I definitely agree.
(And it’s not that your original statement seemed to clearly say something different, just that I wasn’t sure that that’s all it was meant to mean.)
Yep, that’s what I was implying.
From a bayesian perspective there is no particular reason why you have to provide more evidence if you provide credences
This statement is just incorrect. Sure there is: by communicating, we’re trying to update one another’s credences. You’re not going to be very successful in doing so if you provide a credence without supporting evidence. The evidence someone provides is far more important than their credence (unless you know the person is highly calibrated and precise). If you have a credence that you keep to yourself, then yes, there’s no need for supporting evidence.
Ambiguous statements are bad, 100%, but so are clear, baseless statements.
As you say, people can legitimately have credences about anything. It’s how people should think. But if you’re going to post your credence, provide some evidence so that you can update other people’s credences too.
Ambiguous statements are bad, 100%, but so are clear, baseless statements.
You seem to have switched from the claim that EAs often report their credences without articulating the evidence on which those credences rest, to the claim that EAs often lack evidence for the credences they report. The former claim is undoubtedly true, but it doesn’t necessarily describe a problematic phenomenon. (See Greg Lewis’s recent post; I’m not sure if you disagree.) The latter claim would be very worrying if true, but I don’t see reason to believe that it is. Sure, EAs sometimes lack good reasons for the views they espouse, but this is a general phenomenon unrelated to the practice of reporting credences explicitly.
You seem to have switched from the claim that EAs often report their credences without articulating the evidence on which those credences rest, to the claim that EAs often lack evidence for the credences they report.
Habryka seems to be talking about people who have evidence and are just not stating it, so we might be talking past one another. I said in my first comment “There’s also a lot of pseudo-superforecasting … without any evidence backing up those credences.” I didn’t say “without stating any evidence backing up those credences.” This is not a guess on my part. I’ve seen comments where the author says explicitly that the credence they’re giving is a first impression, and not something well thought out. It’s fine for them to have a credence, but why should anyone care what your credence is if it’s just a first impression?
See Greg Lewis’s recent post; I’m not sure if you disagree.
I completely agree with him. Imprecision should be stated and significant figures are a dumb way to do it. But if someone said “I haven’t thought about this at all, but I’m pretty sure it’s true”, is that really all that much worse than providing your uninformed prior and saying you haven’t really thought about it?
I agree that EAs put superforecasters and superforecasting techniques on a pedestal, more than is warranted.
But if someone said “I haven’t thought about this at all, but I’m pretty sure it’s true”, is that really all that much worse than providing your uninformed prior and saying you haven’t really thought about it?
Yes, I think it’s a lot worse. Consider the two statements:
I haven’t thought much about it, but I’m pretty sure (99.99%) based on a cursory read that human extinction from climate change won’t happen.
And
I haven’t thought much about it, but I’m pretty sure (80%) based on a cursory read that human extinction from climate change won’t happen.
The two statements are pretty similar in verbalized terms (and each falls under loose interpretations of what “pretty sure” means in common language), but ought to have drastically different implications for behavior!
I basically think EA and associated communities would be better off to have more precise credences, and be accountable for them. Otherwise, it’s difficult to know if you were “really” wrong, even after checking hundreds of claims!
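The behavioral gap between those two “pretty sure” statements can be made concrete with a toy expected-loss calculation. The numbers below are placeholders chosen purely for illustration, not estimates from this thread:

```python
# Toy expected-loss comparison: two credences that both read as
# "pretty sure extinction won't happen" imply very different stakes.
# VALUE_AT_STAKE is an arbitrary placeholder, not a real estimate.
VALUE_AT_STAKE = 1_000_000

for p_safe in (0.9999, 0.80):
    p_risk = 1 - p_safe
    expected_loss = p_risk * VALUE_AT_STAKE
    print(f"credence {p_safe:.2%} safe -> expected loss {expected_loss:,.0f}")
```

Under the 80% credence the expected loss is 2,000 times larger than under the 99.99% credence, so the two statements should license very different levels of mitigation effort, even though both fit under “pretty sure”.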
The two statements are pretty similar in verbalized terms (and each falls under loose interpretations of what “pretty sure” means in common language), but ought to have drastically different implications for behavior!
Yes you’re right. But I’m making a distinction between people’s own credences and their ability to update the credences of other people. As far as changing the opinion of the reader, when someone says “I haven’t thought much about it”, it should be an indicator to not update your own credence by very much at all.
I basically think EA and associated communities would be better off to have more precise credences, and be accountable for them
I fully agree. My problem is that this is not the current state of affairs for the majority of Forum users, so I have no reason to update my credences just because an uncalibrated random person says they’re 90% confident without providing any reasoning that justifies their position. All I’m asking is for people to provide a good argument along with their credence.
I agree that EAs put superforecasters and superforecasting techniques on a pedestal, more than is warranted.
I think that they should be emulated. But superforecasters have reasoning to justify their credences. They break problems down into components that they’re more confident in estimating. This is good practice. Providing a credence without any supporting argument is not.
I’m curious if you agree or disagree with this claim:
The median EA is closer to a typical superforecaster than they are to random people
With a specific operationalization like:
If asked to predict on 50 random questions on Good Judgement Open, the median commenter on the EA Forum would have a Brier score closer to that of a typical superforecaster than to that of a randomly selected English-speaking person.
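For readers unfamiliar with it, the Brier score used here is just the mean squared error between stated probabilities and binary outcomes; lower is better, and an always-50% forecaster scores 0.25. A minimal sketch, with made-up forecasts:

```python
def brier(forecasts, outcomes):
    """Mean squared error between probabilities and 0/1 outcomes."""
    return sum((f - o) ** 2 for f, o in zip(forecasts, outcomes)) / len(forecasts)

# Hypothetical forecasters on the same five resolved questions:
outcomes = [1, 0, 1, 1, 0]
calibrated_forecaster = [0.9, 0.1, 0.8, 0.7, 0.2]
coin_flipper = [0.5] * 5

print(round(brier(calibrated_forecaster, outcomes), 3))  # 0.038
print(round(brier(coin_flipper, outcomes), 3))           # 0.25
```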
It’s almost irrelevant: people should still provide the supporting arguments for their credences, otherwise evidence can get “double counted” (and there are “flow-on” effects, where the first person who updates another person’s credence has a significant effect on the overall credence of the population). For example, say I have arguments A and B supporting my 90% credence on something, and you have arguments A, B and C supporting your 80% credence on the same thing. Neither of us posts our reasoning; we just post our credences. It’s a mistake for you to then say “I’ll update my credence a few percent because FCCC might have other evidence.” For this reason, providing supporting arguments is a net benefit, irrespective of EAs’ forecasting accuracy.
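The double-counting worry can be sketched in log-odds, where independent pieces of evidence add. The strengths assigned to arguments A, B and C below are made-up numbers, purely for illustration:

```python
import math

def to_logodds(p):
    return math.log(p / (1 - p))

def to_prob(logodds):
    return 1 / (1 + math.exp(-logodds))

prior = to_logodds(0.5)   # neutral prior: log-odds 0
a, b, c = 1.0, 0.5, -0.7  # log-likelihood ratios of arguments A, B, C

mine = prior + a + b        # my credence rests on A and B
yours = prior + a + b + c   # yours rests on A, B and C

# Correct combination: each argument counted exactly once.
correct = prior + a + b + c

# Naive combination: I treat your posted credence as independent
# evidence and add your whole update on top of mine, so the shared
# arguments A and B get counted twice.
naive = mine + (yours - prior)

print(round(to_prob(correct), 3))  # 0.69
print(round(to_prob(naive), 3))    # 0.909 -- overconfident
```

Sharing the reasoning, not just the bottom-line number, is what lets the second person notice that A and B are already priced in.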
I don’t find your arguments persuasive for why people should give reasoning in addition to credences. I think posting reasoning is net-positive on the margin, and I wish more people did it, but I also acknowledge that people’s time is expensive, so I understand why they choose not to. You list reasons why giving reasoning is beneficial, but not reasons why those benefits are sufficient to justify the cost.
My question probing predictive ability of EAs earlier was an attempt to set right what I consider to be an inaccuracy in the internal impressions EAs have about the ability of superforecasters. In particular, it’s not obvious to me that we should trust the judgments of superforecasters substantially more than we trust the judgments of other EAs.
My view is that giving explicit, quantitative credences plus stating the supporting evidence is typically better than giving explicit, quantitative credences without stating the supporting evidence (at least if we ignore time costs, information hazards, etc.), which is in turn typically better than giving qualitative probability statements (e.g., “pretty sure”) without stating the supporting evidence, and often better than just saying nothing.
Does this match your view?
In other words, are you essentially just arguing that “providing supporting arguments is a net benefit”?
I ask because I had the impression that you were arguing that it’s bad for people to give explicit, quantitative credences if they aren’t also giving their supporting evidence (and that it’d be better for them to, in such cases, either use qualitative statements or just say nothing). Upon re-reading the thread, I got the sense that others may have gotten that impression too, but also I don’t see you explicitly make that argument.
Basically, yeah.
But I do think it’s a mistake to update your credence based on someone else’s credence without knowing their argument and without knowing whether they’re calibrated. We typically don’t know the latter, so I don’t know why people are giving credences without supporting arguments. It’s fine to have a credence without evidence, but why are people publicising such credences?
I do think it’s a mistake to update your credence based off someone else’s credence without knowing their argument and without knowing whether they’re calibrated.
I’d agree with a modified version of your claim, along the following lines: “You should update more based on someone’s credence if you have more reason to believe their credence will track the truth, e.g. by knowing they’ve got good evidence (even if you haven’t actually seen the evidence) or knowing they’re well-calibrated. There’ll be some cases where you have so little reason to believe their credence will track the truth that, for practical purposes, it’s essentially not worth updating.”
But your claim at least sounds like it’s instead that some people are calibrated while others aren’t (a binary distinction), and when people aren’t calibrated, you really shouldn’t update based on their credences at all (at least if you haven’t seen their arguments).
I think calibration increases in a quantitative, continuous way, rather than switching from off to on. So I think we should just update on credences more the more calibrated the person they’re from is.
Does that sound right to you?
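One simple way to treat calibration as continuous rather than binary is a weighted (linear) opinion pool, where the weight encodes how much you trust the other person’s track record. The credences and trust weights below are assumptions chosen for illustration:

```python
def pooled(own, other, trust):
    """Linear opinion pool: move toward another person's credence
    in proportion to how much you trust their calibration (0..1)."""
    return (1 - trust) * own + trust * other

own, other = 0.30, 0.80

print(round(pooled(own, other, 0.90), 3))  # trusted superforecaster: 0.75
print(round(pooled(own, other, 0.05), 3))  # unknown commenter: 0.325
```

On this picture there is no sharp “calibrated / uncalibrated” cutoff; an unknown commenter just gets a weight near zero rather than exactly zero.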
I mean, very frequently it’s useful to just know what someone’s credence is. That’s often an order of magnitude cheaper to provide, and often is itself quite a bit of evidence. This is like saying that all statements of opinions or expressions of feelings are bad, unless they are accompanied with evidence, which seems like it would massively worsen communication.
I mean, very frequently it’s useful to just know what someone’s credence is. That’s often an order of magnitude cheaper to provide, and often is itself quite a bit of evidence.
I agree, but only if they’re a reliable forecaster. A superforecaster’s credence can shift my credence significantly. It’s possible that their credence is based on many pieces of information, each of which shifts their own credence by only a percent or so. In that case, it’s not practical for them to provide all the evidence, and you are right.
But most people are poor forecasters (and sometimes they explicitly state they have no supporting evidence other than their intuition), so I see no reason to update my credence just because someone I don’t know is confident. If the credence of a random person has any value to my own credence, it’s very low.
This is like saying that all statements of opinions or expressions of feelings are bad, unless they are accompanied with evidence, which seems like it would massively worsen communication.
That would depend on the question. Sometimes we’re interested in feelings for their own sake. That’s perfectly legitimate because the actual evidence we’re wanting is the data about their feelings. But if someone’s giving their feelings about whether there are an infinite number of primes, it doesn’t update my credences at all.
I think opinions without any supporting argument worsen discourse. Imagine a group of people thoughtfully discussing evidence, then someone comes in, states their feelings without any evidence, and then leaves. That shouldn’t be taken seriously. Increasing the proportion of those people only makes it worse.
Bayesians should want higher-quality evidence. Isn’t self-reported data unreliable? And that’s when the person was there when the event happened. So what is the reference class for people providing opinions without having evidence? It’s almost certainly even more unreliable. If someone has an argument for their credence, they should usually give that argument; if they don’t have an argument, I’m not sure why they’re adding to the conversation.
I’m not saying we need to provide peer-reviewed articles. I just want to see some line of reasoning demonstrating why you came to the conclusion you made, so that everyone can examine your assumptions and inferences. If we have different credences and the set of things I’ve considered is a strict subset of yours, you might update your credence because you mistakenly think I’ve considered something you haven’t.
Yes, but unreliability doesn’t mean you should just fall back on vague words instead of explicit credences. It’s a fine critique to say that people make too many arguments without giving evidence (something I also disagree with, but that isn’t the subject of this thread), but you are concretely making the point that it’s additionally bad for them to give explicit credences! The credences only help, compared to the vague and ambiguous terms that people would use instead.
I’m not sure how you think that’s what I said. Here’s what I actually said:
A superforecaster’s credence can shift my credence significantly...
If the credence of a random person has any value to my own credence, it’s very low...
The evidence someone provides is far more important than someone’s credence (unless you know the person is highly calibrated and precise)...
[credences are] how people should think...
if you’re going to post your credence, provide some evidence so that you can update other people’s credences too.
I thought I was fairly clear about what my position is. Credences have internal value (you should generate your own credence). Superforecasters’ credences have external value (their credence should update yours). Uncalibrated random people’s credences don’t have much external value (they shouldn’t shift your credence much). And an argument for your credence should always be given.
I never said vague words are valuable, and in fact I think the opposite.
This is an empirical question. Again, what is the reference class for people providing opinions without having evidence? We could look at all of the unsupported credences on the forum and see how accurate they turned out to be. My guess is that they’re of very little value, for all the reasons I gave in previous comments.
you are concretely making the point that it’s additionally bad for them to give explicit credences!
I demonstrated a situation where a credence without evidence is harmful:
If we have different credences and the set of things I’ve considered is a strict subset of yours, you might update your credence because you mistakenly think I’ve considered something you haven’t.
The only way we can avoid such a situation is either by providing supporting arguments for our credences, or by not updating our credences in light of other people’s unsupported credences.