D0TheMath comments on Doing EA Better: Preamble, Summary, and Introduction

D0TheMath 25 Jan 2023 21:10 UTC
11 points
3 ∶ 7
The decisions which caused the FTX catastrophe, the fact that EA is counterfactually responsible for the three primary AGI labs, Anthropic being entirely run by EAs yet still doing net negative work, and the funding of mostly capabilities-oriented ML work with vague alignment justifications (and potentially similar dynamics in biotech, which are more speculative for me right now), with the creation of ~~GPT and~~^[1] RLHF as particular examples of this.
1. I recently found out that GPT was not in fact developed for alignment work. I had gotten confused by some rhetoric used by OpenAI and its employees during the earlier days, which turned out to be entirely independent of modern alignment considerations. ↩︎
- Davidmanheim 26 Jan 2023 6:19 UTC
  10 points
  5 ∶ 1
  Parent
  Strong disagree for misattributing blame and eliding the question.
  
  To the extent that “EA is counterfactually responsible for the three primary AGI labs,” you would need to claim that the ex-ante expected value of specific decisions was negative, and that those decisions were because of EA, not that it went poorly ex-post. Perhaps you can make those arguments, but you aren’t.
  
  Ditto for “The decisions which caused the FTX catastrophe”—Whose decisions, where does the blame go, and to what extent are they about EA? SBF’s decision to misappropriate funds, or fraudulently misrepresent what he did? CEA not knowing about it? OpenPhil not investigating? Goldman Sachs doing a bad job with due diligence?
  - D0TheMath 27 Jan 2023 0:32 UTC
    1 point
    0 ∶ 0
    Parent
    I agree with this, except when you tell me I was eliding the question (and, of course, when you tell me I was misattributing blame). I was giving a summary of my position, not an analysis which I think would be deep enough to convince all skeptics.
    - Davidmanheim 27 Jan 2023 9:47 UTC
      2 points
      0 ∶ 0
      Parent
      You say you agree, but I was asking questions about what you were claiming and who you were blaming.
- Robi Rahman🔸 27 Jan 2023 6:25 UTC
  8 points
  0 ∶ 0
  Parent
  EAs are counterfactually responsible for DeepMind?
- Arepo 25 Jan 2023 22:37 UTC
  3 points
  0 ∶ 0
  Parent
  Off topic, but can you clarify why you think Anthropic does net negative work?
  - D0TheMath 25 Jan 2023 23:23 UTC
    11 points
    0 ∶ 2
    Parent
    Basically, there are simple arguments around ‘they are an AGI capabilities organization, so obviously they’re bad’, and more complicated arguments around ‘but they say they want to do alignment work’, and then even more complicated arguments on those arguments going ‘well, actually it doesn’t seem like their alignment work is all that good, and their capabilities work is pushing capabilities, and still makes it difficult for AGI companies to coordinate to not build AGI, so in fact the simple arguments were correct’. Getting more into depth would require a writeup of my current picture of alignment, which I am writing, but which is difficult to convey via a quick comment.
    - Arepo 26 Jan 2023 23:11 UTC
      3 points
      0 ∶ 0
      Parent
      I upvoted and did not disagreevote this, for the record. I’ll be interested to see your writeup :)
      - D0TheMath 26 Jan 2023 23:16 UTC
        1 point
        0 ∶ 0
        Parent
        Do you disagree, assuming my writeup provides little information or context to you?
        - Arepo 26 Jan 2023 23:25 UTC
          3 points
          1 ∶ 0
          Parent
          I don’t feel qualified to say. My impression of Anthropic’s epistemics is weakly negative (see here), though I haven’t read any of their research, and my prior is relatively high AI scepticism. Not because I feel like I understand anything about the field, but because every time I do engage with some small part of the dialogue, it seems totally unconvincing (see same comment), so I have the faint suspicion many of the people worrying about AI safety (sometimes including me) are subject to some mass-Gell-Mann amnesia effect.
          - D0TheMath 26 Jan 2023 23:34 UTC
            1 point
            0 ∶ 0
            Parent
            Mass Gell-Mann amnesia effect because, say, I may look at others talking about my work or work I know closely, and say “wow! That’s wrong”, but look at others talking about work I don’t know closely and say “wow! That implies DOOM!” (like dreadfully wrong corruptions of the orthogonality thesis), and so decide to work on work that seems relevant to that DOOM?
            - Arepo 26 Jan 2023 23:37 UTC
              3 points
              0 ∶ 0
              Parent
              Yeah, basically that. Even if those same people ultimately find much more convincing (or at least less obviously flawed) arguments, I still worry about the selection effects Nuno mentioned in his thread.
- D0TheMath 25 Jan 2023 21:17 UTC
  1 point
  0 ∶ 1
  Parent
  I could list my current theories about how these problems are interrelated, but I fear such a listing would anchor me to the wrong one, and too many claims in a statement produce more discussion around minor sub-claims than major points (an example of a shallow criticism of EA discussion norms).