I also used to be pretty skeptical about the credibility of the field. I was surprised to learn about how much mainstream, credible support AI safety concerns have received:
Multiple leading AI labs have large (e.g. 30-person) teams of researchers dedicated to AI alignment.
They sometimes publish statements like, “Unaligned AGI could pose substantial risks to humanity and solving the AGI alignment problem could be so difficult that it will require all of humanity to work together.”
Key findings that are central to concerns over AI risk have been accepted (with peer review) into top ML conferences.
A top ML conference is hosting a workshop on ML safety (with a description that emphasizes “long-term and long-tail safety risks”).
Reports and declarations from some major governments have endorsed AI risk worries.
The UK’s National AI Strategy states, “The government takes the long term risk of non-aligned Artificial General Intelligence, and the unforeseeable changes that it would mean for the UK and the world, seriously.”
There are AI faculty at universities including MIT, UC Berkeley, and Cambridge who endorse AI risk worries.
To be fair, AI risk worries are far from a consensus view. But in light of the above, the idea that all respected AI researchers find AI risk laughable seems plainly mistaken. Instead, it seems clear that a significant fraction of respected AI researchers and institutions are worried. Maybe these concerns are misguided, but probably not for any reason that's obvious to anyone with basic knowledge of AI, or these worried AI experts would have noticed it themselves.
(Also, in case you haven’t seen it yet, you might find this discussion on whether there are any experts on these questions interesting.)
Thank you for these references; I’ll take a close look at them. I’ll write a new comment if I have any thoughts after going through them.
Before reading them, I want to say that I’m especially interested in research on risk estimation and AI progress forecasting. General research on possible AI risks that doesn’t assign them any probabilities is not very useful for determining whether a threat is relevant. If anyone has papers specifically on that topic, I’m very interested in reading them too.
IMO by far the most thorough estimate of AI x-risk thus far is Carlsmith’s Is Power-Seeking AI an Existential Risk? (see also the summary presentation and reviews).
(edited to add: as you might guess from my previous post, I think some level of AI skepticism is healthy and I appreciate you sharing your thoughts. I’ve become more convinced of the seriousness of AI x-risk over time; feel free to DM me if you’re interested in chatting sometime)
I would be curious to know whether your beliefs have updated in light of recent developments.
Sorry for the late reply.
My opinions are mostly the same. Recent years have seen mostly incremental improvements in AI capabilities, with no progress in areas I believe are crucial for AGI, such as considerably more efficient training algorithms and introspection. The current trend of using exponentially more compute without seeing a corresponding increase in capabilities (outside of a few exceptions such as coding[1]) demonstrates this lack of progress: algorithmic advances should enable us to achieve more with less compute, which is not what we are seeing[2].
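To make the diminishing-returns point concrete, here is a minimal sketch in Python. The power-law shape is the standard one reported in scaling-law work, but the constants below are made up purely for illustration and are not fitted to any real model:

```python
# Illustrative only: a hypothetical power-law relating loss to training
# compute. The constants a and b are made up for demonstration; they are
# not fitted to any published scaling-law results.

def loss(compute: float, a: float = 10.0, b: float = 0.05) -> float:
    """Toy scaling law: loss falls as a power law in compute."""
    return a * compute ** -b

for exp in range(20, 27, 2):  # compute budgets from 1e20 to 1e26 FLOPs
    c = 10.0 ** exp
    print(f"compute = 1e{exp} FLOPs -> loss = {loss(c):.3f}")

# Prints 1.000, 0.794, 0.631, 0.501: each 100x jump in compute buys a
# smaller absolute loss reduction, i.e. exponentially more compute is
# needed for the same absolute gain.
```

Under any curve of this shape, capabilities bought purely with compute get more expensive over time, which is why I think algorithmic progress, rather than scaling, is the thing to watch.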
There are many groups taking AI risk seriously. This reinforces my opinion that AI risk is not neglected. Since I also believe it is not tractable, it makes a poor target for interventions. I believe this to be true regardless of what probability we assign to achieving AGI in the near future.
I might write a longer follow-up post later that goes through these in more detail.
Mathematics and coding are examples of skills that can be automatically validated to some extent, enabling us to train them without a training corpus (see the sketch below for what such validation can look like). However, most skills are not like this, and we are not seeing improvements in those areas. Since one of my research areas is computational creativity, one example where progress is noticeably lacking is creative writing. Creativity even seems to have taken a step backwards in some models. This is due to the lack of suitable training material and the impossibility of automatically evaluating creative text. Human-created corpora are expensive, and we have run out of them. I believe strong creativity is one of the key areas required to achieve AGI, and we are not seeing progress there.
There are some algorithmic improvements that increase efficiency, but most of them are the kind of incremental development that gives small gains, not the breakthrough that would be required.
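To illustrate what “automatically validated” means in footnote [1]: a candidate program can be scored by running it against unit tests, with no human grader in the loop. This is a minimal hypothetical sketch (all names are mine, not from any actual training pipeline):

```python
# Sketch of automatic validation for generated code: run the candidate
# against unit tests and use the pass rate as a reward signal.
# All names here are hypothetical, for illustration only.

def candidate_sort(xs):           # stand-in for model-generated code
    return sorted(xs)

TESTS = [                         # (input, expected output) pairs
    ([3, 1, 2], [1, 2, 3]),
    ([], []),
    ([5, 5, 1], [1, 5, 5]),
]

def reward(fn) -> float:
    """Fraction of unit tests the candidate passes (0.0 to 1.0)."""
    passed = 0
    for inp, expected in TESTS:
        try:
            if fn(list(inp)) == expected:
                passed += 1
        except Exception:
            pass                  # a crashing candidate earns nothing
    return passed / len(TESTS)

print(reward(candidate_sort))     # 1.0 -- no human grading required
```

No analogous check exists for creative writing: there is no test suite that tells you whether a story is good, which is exactly the asymmetry the footnote points at.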