This is a good point; we agree, thanks! Note that you need to assume that the algorithmic progress that gives you more effective inference compute is the same progress that gives you more effective research compute. This seems pretty reasonable, but it is worth a discussion.
Although note that this argument works only with the CES-in-compute formulation. For the CES in frontier experiments, you would have the ratio $\frac{A\,K_{\text{res}}}{A\,K_{\text{train}}}$, so the $A$ cancels out.[1]
You might be able to avoid this by adding the $A$'s in a less naive fashion. You don't have to train larger models if you don't want to. So perhaps you can freeze the frontier, and then you get $\frac{A\,K_{\text{res}}}{A_{\text{frozen}}\,K_{\text{train}}}$? I need to think more about this point.
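To make the cancellation concrete, here is a sketch using the notation of the comment above, where $A$ is the common algorithmic-progress multiplier on both compute stocks and $A_{\text{frozen}}$ is the (hypothetical) multiplier fixed at the moment you freeze the frontier:

```latex
% CES in frontier experiments: research compute is measured relative to
% frontier training compute, so the shared multiplier A cancels:
\frac{A\,K_{\text{res}}}{A\,K_{\text{train}}} = \frac{K_{\text{res}}}{K_{\text{train}}}

% Freezing the frontier: fixed-capability models keep the multiplier from
% freeze time in the denominator, so progress no longer cancels:
\frac{A\,K_{\text{res}}}{A_{\text{frozen}}\,K_{\text{train}}}
  = \frac{A}{A_{\text{frozen}}}\cdot\frac{K_{\text{res}}}{K_{\text{train}}},
\qquad \text{which grows as } A \text{ grows.}
```

Under this reading, freezing the frontier restores a channel by which algorithmic progress feeds back into the research input, at least while fixed-capability models keep getting cheaper to train.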
Yep, as you say in your footnote, you can choose to freeze the frontier, so you train models of a fixed capability using less and less compute (at least for a while).
Also, updating this would change all of the intelligence explosion conditions, not just the one for $\sigma < 1$.