Final final edit: Congrats on the ARC-AGI-PUB results, really impressive :)
This will be my final response on this thread, because life is very time-consuming and I'm rapidly reaching the point where I need to dive back into the technical literature and stress-test my beliefs and intuitions again. I hope Ryan and any readers have found this exchange useful/enlightening as an example of two different perspectives having (hopefully) productive disagreement.
If you found my presentation of the scaling-skeptical position highly unconvincing, I'd recommend following the work and thoughts of Tan Zhi Xuan (find her on X here). One of my biggest updates was finding her work after she pushed back on Jacob Steinhardt here, and recently she gave a talk about her approach to Alignment. I urge readers to consider spending much more of their time listening to her than to me about AI.
I feel like this is a pretty strange way to draw the line about what counts as an "LLM solution".
I don't think so? Again, I wouldn't call CICERO an "LLM solution". Surely there'll be some amount of scaffolding which tips over into the scaffolding being the main thing and the LLM just being a component part? It's probably all blurry lines for sure, but I think it's important to separate "LLM-only systems" from "systems that include LLMs", because it's very easy to conceptually scale up the former but harder to do the latter.
Human skeptic: That wasn't humans sending someone to the moon, that was Humans + Culture + Organizations + Science sending someone to the moon! You see, humans don't exhibit real intelligence!
I mean, you use this as a reductio, but that's basically the theory of Distributed Cognition, and also linked to the ideas of "collective intelligence", though that's definitely not an area I'm an expert in by any means. It also reminds me a lot of Chalmers and Clark's thesis of the Extended Mind.[1]
Of course, I think actual LLM skeptics often don't answer "No" to the last question. They often do have something that they think is unlikely to occur with a relatively straightforward scaffold on top of an LLM (a model descended from the current LLM paradigm, perhaps trained with semi-supervised learning and RLHF).
So I can't speak for Chollet and other LLM skeptics, but again, I think LLMs+extras (or extras+LLMs) are a different beast from LLMs on their own, and possibly an important crux. Here are some things I don't think will happen in the near-ish future (on the current paradigm):
I believe an adversarial Imitation Game, where the interrogator is aware of both the AI system's LLM-based nature and its failure modes, is unlikely to be consistently beaten in the near future.[2]
Primarily-LLM models, in my view, are highly unlikely to exhibit autopoietic behaviour or develop agentic designs independently (i.e. without prompting/direction by a human controller).
I don't anticipate these models exponentially increasing the rate of scientific research or AI development.[3] They'll more likely serve as tools used by scientists and researchers themselves to frame problems, but new and novel problems will still remain difficult and be bottlenecked by the real world + Hofstadter's law.
I don't anticipate Primarily-LLM models becoming good at controlling and manoeuvring robotic bodies in the 3D world. This is especially true in a novel-test-case scenario (if someone could make a physical equivalent of ARC to test this, that'd be great).
This would be even less likely if the scaffolding remained minimal: for instance, if there's no initial sorting code explicitly stating [IF challenge == turing_test GO TO turing_test_game_module] (see the sketch after this list).
Finally, as an anti-RSI operationalisation, the idea of LLM-based models assisting in designing and constructing a Dyson Sphere within 15 years seems... particularly far-fetched to me.
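To make the scaffolding distinction concrete, here's a minimal sketch in Python of the kind of routing scaffold I have in mind. Everything in it (`call_llm`, `handle_challenge`, the module name) is a hypothetical stand-in for illustration, not any real system's API:

```python
# Illustrative sketch only: `call_llm` is a hypothetical stand-in for
# any LLM API, and none of these names refer to a real system.

def call_llm(prompt: str) -> str:
    """Placeholder for a bare LLM call (the 'LLM-only' path)."""
    raise NotImplementedError("wire up a real model API here")

def turing_test_game_module(message: str) -> str:
    """Hand-written, task-specific logic layered on top of the LLM:
    a canned persona, style constraints, maybe timing and typos."""
    prompt = (
        "Reply as a distracted human in a chat room, with informal "
        "phrasing and the occasional typo:\n" + message
    )
    return call_llm(prompt)

def handle_challenge(challenge: str, payload: str) -> str:
    """Top-level scaffold. As this dispatch table grows, the
    human-written routing, not the LLM, is doing the work of
    recognising and framing the task."""
    if challenge == "turing_test":
        return turing_test_game_module(payload)
    # ... further special-cased modules would go here ...
    return call_llm(payload)  # fallback: the bare LLM-only path
```

With an empty dispatch table this is plausibly an "LLM solution"; with dozens of hand-built task modules, the scaffold is arguably the main thing and the LLM just a component part, which is exactly the blurry line I mean above.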
I'm not sure if this reply was my best, it felt a little all-over-the-place, but we are touching on some deep or complex topics! So I'll respectfully bow out now, and thanks again for the discussion and giving me so much to think about. I really appreciate it Ryan :)
[1] Then you get into ideas like embodiment/enactivism etc.
[2] I can think of a bunch of strategies to win here, but I'm not gonna say, so it doesn't end up in GPT-5 or 6's training data!
[3] Of course, with a new breakthrough, all bets could be off, but it's also definitionally impossible to predict those, and it's not robust to draw straight lines on graphs to predict the future if you think breakthroughs will be needed. (Not saying you do this, but some other AIXR people definitely seem to.)