(comment originally posted on Twitter, Cheryl’s response here)
I’ll flag that estimating firm-level training compute with [Epoch AI’s] notable models dataset will produce big underestimates. E.g., with your methodology, OpenAI spent ~4e25 FLOP on training and 1.3e25 FLOP on research in 2023 and 2024. The latter would cost ~$30 million, but we know OpenAI spent at least $1 billion on research in 2024! (Also note they reported $1 billion on research compute after amortizing this cost on an undisclosed schedule.)
But I don’t have a great sense of how sensitive your results are to this issue.
(this raises other questions: what did OpenAI spend $3 billion in training compute on in 2024? that’s enough for 50 GPT-4 sized models. Maybe my cost accounting is quite different from OpenAI’s. A lot of that “training” compute might really be more experimental)
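The “enough for 50 GPT-4 sized models” figure can be sanity-checked with a quick back-of-the-envelope sketch. The ~2e25 FLOP figure for GPT-4’s training run is an assumption (roughly Epoch AI’s public estimate), and the price of compute is inferred from the comment’s own numbers ($30 million for 1.3e25 FLOP); neither is an official figure.

```python
# Back-of-the-envelope check, assuming GPT-4's training run was ~2e25 FLOP
# and pricing compute at the rate implied above: $30M buys 1.3e25 FLOP.
dollars_per_flop = 30e6 / 1.3e25          # implied price of compute
gpt4_flop = 2e25                          # assumed GPT-4 training compute
gpt4_cost = gpt4_flop * dollars_per_flop  # ~$46M per GPT-4-sized run
runs = 3e9 / gpt4_cost                    # how many such runs $3B buys
print(round(runs))                        # on the order of 50-70 runs
```

The exact count depends on the assumed GPT-4 FLOP figure, but any reasonable choice lands in the same ballpark as the comment’s “50.”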
Here is a fleshed-out version of Cheryl’s response. Let’s suppose actual research capital is $qK$, but we used $K$ in our estimation equation.
Then the true estimation equation is
$$\ln\frac{qK}{L} = \sigma \ln\frac{\gamma}{1-\gamma} + \sigma \ln\frac{w}{r}$$
Rearranging, we get
$$\ln\frac{K}{L} = \sigma \ln\frac{\gamma}{1-\gamma} - \ln q + \sigma \ln\frac{w}{r}$$
So if we regress $\ln(K/L)$ on a constant and $\ln(w/r)$, the coefficient on $\ln(w/r)$ is still $\sigma$, as long as $q$ is independent of $w/r$; the mismeasurement only shifts the intercept by $-\ln q$.
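This can be checked with a quick simulation. The parameter values and the data-generating process below are made up purely for illustration: we generate data from the true relation with mismeasured capital and confirm that the regression slope still recovers $\sigma$.

```python
# Minimal simulation of the point above: if true research capital is q*K
# but we regress ln(K/L) on ln(w/r), the slope still recovers sigma as
# long as q is independent of w/r; only the intercept absorbs -ln(q).
import numpy as np

rng = np.random.default_rng(0)
sigma, gamma, q = 2.0, 0.6, 5.0       # made-up "true" parameters
ln_wr = rng.normal(0.0, 1.0, 1000)    # ln(w/r), drawn independently of q
# True relation: ln(qK/L) = sigma*ln(gamma/(1-gamma)) + sigma*ln(w/r) + noise
ln_qKL = (sigma * np.log(gamma / (1 - gamma)) + sigma * ln_wr
          + rng.normal(0.0, 0.1, 1000))
ln_KL = ln_qKL - np.log(q)            # what we actually observe
slope, intercept = np.polyfit(ln_wr, ln_KL, 1)
print(slope)      # ~2.0: sigma is recovered despite the mismeasurement
print(intercept)  # ~ sigma*ln(gamma/(1-gamma)) - ln(q)
```

If $q$ were instead correlated with $w/r$, the omitted $-\ln q$ term would contaminate the slope, which is exactly the independence condition flagged above.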
Nevertheless, I think this should increase the uncertainty around our estimates, because there is clearly a lot going on behind the scenes that we might not fully understand, like how research vs. training compute is measured.