technicalities answers Strongest real-world examples supporting AI risk claims?

technicalities 7 Sep 2023 18:47 UTC
10 points
0 ∶ 0
Break self-improvement into four:
1. ML optimizing ML inputs: reduced data centre energy cost, reduced cost of acquiring training data, supposedly improved semiconductor designs.
2. ML aiding ML researchers. e.g. >3% of new Google code is now auto-suggested without amendment.
3. ML replacing parts of ML research. Nothing too splashy but steady progress: automatic data cleaning and feature engineering, autodiff (and symbolic differentiation!), meta-learning network components (activation functions, optimizers, …), neural architecture search.
4. Classic direct recursion. Self-play (AlphaGo) is the most striking example but it doesn’t generalise, so far. Purported examples with unclear practical significance: Algorithm Distillation and models finetuned on their own output.^[1]
See also this list
Treachery:
https://arxiv.org/abs/2102.07716
https://lukemuehlhauser.com/treacherous-turns-in-the-wild/
1. ^
  The proliferation of crappy bootleg LLaMA finetunes using GPT as training data (and collapsing when out of distribution) makes me a bit cooler about these results in hindsight.
- technicalities 8 Sep 2023 15:39 UTC
  2 points
  0 ∶ 0
  Parent
  Buckman’s examples are not central to what you want but worth reading: https://jacobbuckman.com/2022-09-07-recursively-self-improving-ai-is-already-here/
- rosehadshar 8 Sep 2023 14:41 UTC
  2 points
  0 ∶ 0
  Parent
  Thanks, really helpful!