What’s the strongest argument(s) for the orthogonality thesis, understandable to your average EA?
I don’t think the orthogonality thesis would have predicted GPT models, which become intelligent by mimicking human language, and learn about human values as a byproduct. The orthogonality thesis says that, in principle, any level of intelligence can be combined with any goal, but in practice the most intelligent systems we have are trained by mimicking human concepts.
On the other hand, after you train a language model, you can ask it or fine-tune it to pursue any goal you like. It will use human concepts that it learned from pretraining on natural language, but you can give it a new goal.
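To make that concrete, here's a minimal sketch (assuming a recent version of Hugging Face transformers and an instruction-tuned model; the model name, goals, and prompts are just placeholders, not a claim about any particular system): the same pretrained weights, with all their human-derived concepts, get pointed at two unrelated "goals" purely through the system prompt.

```python
# Minimal sketch: one pretrained chat model, two arbitrary goals supplied at prompt time.
# Assumes a recent transformers version with chat-format support in the pipeline.
from transformers import pipeline

generator = pipeline("text-generation", model="HuggingFaceTB/SmolLM2-1.7B-Instruct")

for goal in [
    "maximise the number of paperclips produced this quarter",
    "help the user plan a birthday party",
]:
    messages = [
        {"role": "system", "content": f"Your sole objective: {goal}."},
        {"role": "user", "content": "What should I do first?"},
    ]
    # The pipeline returns the conversation with the assistant's reply appended.
    reply = generator(messages, max_new_tokens=100)[0]["generated_text"][-1]["content"]
    print(goal, "->", reply)
```

Fine-tuning (e.g. RLHF or supervised fine-tuning on goal-specific data) does the same kind of thing more durably, but the point is the same either way: the concepts come from pretraining on human text, while the goal is layered on afterwards.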
The FAQ response from Stampy is quite good here:
https://ui.stampy.ai?state=6568_
This seems like a fairly weak argument for something so core to AGI risk arguments. Can we not get any empirical evidence either way? Also, all the links in the “defence of the thesis” section are broken for me.
Thanks for reporting the broken links. It looks like a problem with the way Stampy is importing the LessWrong tag. Until the Stampy page is fixed, following the links from LessWrong should work.