Working on various aspects of Econ + AI.
Parker_Whitfill
On 2), the condition you find makes sense, but aren’t you implicitly assuming an elasticity of substitution of 1 with Cobb-Douglas?
Yes, definitely. In general, I don’t have a great idea about what the right functional form looks like. The Cobb-Douglas case is just an example.
Yep. We are treating labor as homogeneous (no differentiation in skill, speed, etc.). I’m interested in thinking about quality differentiation a bit more.
In complete generality, you could write effective labor as
$$L_{\text{eff}} = F(H,\; C_{\text{inf}},\; C_{\text{train}}).$$
That is, effective labor is some function of the number of human researchers we have, the effective inference compute we have (quantity of AIs we can run) and the effective training compute (quality of AIs we trained).
The perfect substitution claim is that once training compute is sufficiently high, then eventually we can spend the inference compute on running some AI that substitutes for human researchers. Mathematically, for some threshold $\bar{C}_{\text{train}}$, whenever $C_{\text{train}} \ge \bar{C}_{\text{train}}$,
$$F(H,\; C_{\text{inf}},\; C_{\text{train}}) = F\!\left(H + \frac{C_{\text{inf}}}{c},\; 0,\; C_{\text{train}}\right),$$
where $c$ is the compute cost to run the system.
So you could think of our analysis as saying, once we have an AI that perfectly substitutes for AI researchers, what happens next?
Now of course, you might expect substantial recursive self-improvement even with an AI system that doesn’t perfectly substitute for human researchers. I think this is a super interesting and important question. I’m trying to think more about it, but it’s hard to make progress because it’s unclear what $F$ looks like. But let me try to gesture at a few things. Let’s fix effective training compute at some sub-human level, and let $A$ denote the quantity of (sub-human) AI labor we can run with our inference compute, so effective labor is $F(H, A)$.
At the very least, you need $F$ to be a function that goes to infinity as $A$ goes to infinity (holding $H$ fixed). For example, if there are certain tasks which must be done in AI research and these tasks can only be done by humans, then these tasks will always bottleneck progress.
If you assume, say, Cobb-Douglas, i.e.
$$F = H^{1-\alpha} A^{\alpha},$$
where $\alpha$ denotes the share of labor tasks that AI can do, then you’ll pick up another factor of $\alpha$ in the explosion condition. This captures the intuition that as the fraction of tasks an AI can do increases, the explosion condition gets easier and easier to hit.
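To make the bottleneck point concrete, here is a tiny numerical sketch (the functional forms and the value of $\alpha$ are purely illustrative assumptions, not estimates):

```python
# Compare effective labor F(H, A) as AI labor A grows, holding humans fixed.
H = 100.0       # number of human researchers (illustrative)
alpha = 0.5     # share of tasks AI can do (illustrative)

for A in [1e2, 1e4, 1e6, 1e8]:
    cobb_douglas = H ** (1 - alpha) * A ** alpha          # F keeps growing with A
    human_bottleneck = min(H / (1 - alpha), A / alpha)    # Leontief: human-only tasks cap F
    print(f"A={A:.0e}  Cobb-Douglas F={cobb_douglas:.3g}  bottlenecked F={human_bottleneck:.3g}")
```

Under Cobb-Douglas, $F$ still goes to infinity in $A$ (just with diminishing returns), whereas with tasks that only humans can do (the Leontief case above) $F$ is capped no matter how many AIs we run.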
Here is a fleshed out version of Cheryl’s response. Let’s suppose actual research capital is $q C_t$ but we just used $C_t$ in our estimation equation.
Then the true estimation equation is
$$\ln\!\left(\frac{q C_t}{L_t}\right) = \text{const} + \sigma\,\ln\!\left(\frac{w_t}{r_t}\right);$$
re-arranging we get
$$\ln\!\left(\frac{C_t}{L_t}\right) = \text{const} - \ln q + \sigma\,\ln\!\left(\frac{w_t}{r_t}\right).$$
So if we regress $\ln(C_t/L_t)$ on a constant and $\ln(w_t/r_t)$, then the coefficient on $\ln(w_t/r_t)$ is still $\sigma$, as long as $q$ is independent of $\ln(w_t/r_t)$.
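As a quick sanity check, here is a simulation with made-up numbers showing that a multiplicative measurement factor $q$ that is independent of relative prices only shifts the constant, not the coefficient of interest:

```python
import numpy as np

rng = np.random.default_rng(0)
n, sigma = 5000, 2.5                           # sigma is the "true" coefficient (made up)
log_rel_price = rng.normal(0.0, 1.0, n)        # log(w/r)
log_q = rng.normal(-1.0, 0.3, n)               # measurement factor, independent of prices

# True relation uses q*C; we only observe C, so log(C/L) = const - log q + sigma*log(w/r) + noise
log_C_over_L = 1.0 - log_q + sigma * log_rel_price + rng.normal(0.0, 0.1, n)

X = np.column_stack([np.ones(n), log_rel_price])
const_hat, sigma_hat = np.linalg.lstsq(X, log_C_over_L, rcond=None)[0]
print(const_hat, sigma_hat)   # sigma_hat stays near 2.5; the intercept absorbs E[-log q]
```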
Nevertheless, I think this should increase your uncertainty in our estimates, because there is clearly a lot going on behind the scenes that we might not fully understand, like how research vs. training compute is measured, etc.
Note that if you accept this, our estimation of $\sigma$ in the raw compute specification is wrong.
The cost-minimization problem becomes
$$\min_{L_t,\,C_t}\; w_t L_t + r_t C_t \quad \text{s.t.}\quad Y(L_t,\, A_t C_t) \ge \bar{Y}.$$
Taking FOCs and re-arranging,
$$\ln\!\left(\frac{A_t C_t}{L_t}\right) = \text{const} + \sigma\,\ln\!\left(\frac{A_t w_t}{r_t}\right).$$
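For concreteness, here is the intermediate algebra under a CES specification in labor and effective compute $A_t C_t$ (the particular normalization below is just for illustration):
$$Y_t = \left(\theta\,(A_t C_t)^{\rho} + (1-\theta)\,L_t^{\rho}\right)^{1/\rho}, \qquad \sigma = \frac{1}{1-\rho},$$
$$\frac{\partial Y_t/\partial C_t}{\partial Y_t/\partial L_t} = \frac{\theta\,A_t^{\rho}\,C_t^{\rho-1}}{(1-\theta)\,L_t^{\rho-1}} = \frac{r_t}{w_t} \;\Longrightarrow\; \frac{A_t C_t}{L_t} = \left(\frac{\theta}{1-\theta}\right)^{\sigma}\left(\frac{A_t w_t}{r_t}\right)^{\sigma}.$$
Taking logs gives the equation above; dropping the $A_t$ terms recovers the original estimation equation.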
So our previous estimation equation was missing an $A$ on the relative prices. Intuitively, we understated the degree to which compute was getting cheaper. Now $A$ is hard to observe, but let’s just assume it’s growing exponentially with an 8-month doubling time per this Epoch paper.
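For example, the imputation could look something like this (placeholder data and hypothetical column names, just to show the mechanics):

```python
import numpy as np
import pandas as pd

# Placeholder monthly panel; none of these numbers are real data.
df = pd.DataFrame({
    "months_since_start": np.arange(36),
    "log_w_over_r": np.linspace(0.0, -1.0, 36),   # log relative prices, log(w/r)
    "log_C_over_L": np.linspace(0.0, 2.0, 36),    # log compute/labor ratio
})

# Impute algorithmic progress A_t assuming an 8-month doubling time.
df["log_A"] = df["months_since_start"] / 8.0 * np.log(2.0)

# Adjusted variables from the FOC above: log(A*C/L) and log(A*w/r).
df["log_AC_over_L"] = df["log_C_over_L"] + df["log_A"]
df["log_Aw_over_r"] = df["log_w_over_r"] + df["log_A"]
print(df.tail())
```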
Imputing this guess of A, and estimating via OLS with firm fixed effects gives us with standard errors.
Note that this doesn’t change the estimation results for the frontier experiments, since the $A$ terms just cancel out.
I spent a bit of time thinking about this today.
Let’s adopt the notation in your comment and suppose that the relevant parameters are common across research sectors.
Then we can characterize when we get blow-up in $F$, depending on whether the research sectors are complements or substitutes.
The intuition for this result is that when the sectors are complements, you are bottlenecked by your slower growing sector.
If the slower growing sector is cognitive labor, then asymptotically , and we get so we have blow-up iff .
If the slower growing sector is experimental compute, then there are two cases. If experimental compute is blowing up on its own, then so is cognitive labor because by assumption cognitive labor is growing faster. If experimental compute is not blowing up on its own then asymptotically and we get . Here we get a blow-up iff .[1]
In contrast, if the sectors are substitutes, then F is approximately the fastest growing sector. You get blow-up in both sectors if either sector blows up. Therefore, you get blow-up iff at least one sector blows up on its own.
So if you accept this framing, complements vs substitutes only matters if some sectors are blowing up but not others. If all sectors have the returns to research high enough, then we get an intelligence explosion no matter what. This is an update for me, thanks!
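To illustrate the bottleneck intuition numerically (growth rates, shares, and the CES curvature below are all made up):

```python
import numpy as np

def ces(x1, x2, rho):
    # Symmetric two-sector CES aggregate; rho < 0 means complements, 0 < rho < 1 substitutes.
    return (0.5 * x1 ** rho + 0.5 * x2 ** rho) ** (1.0 / rho)

t = np.arange(61)
slow = np.exp(0.05 * t)   # slower-growing sector
fast = np.exp(0.30 * t)   # faster-growing sector

for rho, label in [(-1.0, "complements"), (0.5, "substitutes")]:
    F = ces(slow, fast, rho)
    print(f"{label}: asymptotic growth of F per period ~ {np.log(F[-1] / F[-2]):.3f}")
```

With complements the aggregate ends up growing at the slow sector's rate; with substitutes it tracks the fast sector, which is why the condition switches from needing every sector to blow up to needing any one of them.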
- ^
I’m only analyzing blow-up conditions here. You could get e.g. double exponential growth here by having and .
- ^
Also, updating this would change all the intelligence explosion conditions, not just when .
Yep, I think this gets the high-level dynamics driving the results right.
Thanks for the clarification. We updated the post accordingly.
This is a good point, we agree, thanks! Note that you need to assume that the algorithmic progress that gives you more effective inference compute is the same algorithmic progress that gives you more effective research compute. This seems pretty reasonable, but it is worth discussing.
Although note that this argument works only with the CES in compute formulation. For the CES in frontier experiments, you would have the $A$ in both the numerator and the denominator, so the $A$ cancels out.[1]
- ^
You might be able to avoid this by adding the $A$’s in a less naive fashion. You don’t have to train larger models if you don’t want to. So perhaps you can freeze the frontier, and then the $A$ no longer cancels? I need to think more about this point.
Thanks for the insightful comment.
I take your overall point to be that the static optimization problem may not be properly specified. For example, costs may not be linear in labor size because of adjustment costs to growing very quickly, or costs may not be linear in compute because of bulk discounting. Moreover, these non-linear costs may be changing over time (e.g., adjustment costs might only matter in 2021-2024 as OpenAI and Anthropic have been scaling labor aggressively). I agree that this would bias the estimate of $\sigma$. Given the data we have, there should be some way to at least partially deal with this (e.g., by adding lagged labor as a control). I’ll have to think about it more.
On some of the smaller comments:
wages/r_{research} is around 0.28 (maybe you have better data here)
The best data we have is The Information’s article that OpenAI spent $700M on salaries and $1000M on research compute in 2024, so the ratio is about 0.7 (assuming you meant the ratio of total labor spending to total research-compute spending rather than the price ratio).
The whole industry is much larger now and elasticity of substitution might not be constant; if so this is worrying because to predict whether there’s a software-only singularity we’ll need to extrapolate over more orders of magnitude of growth and the human labor → AI labor transition.
I agree. $\sigma$ might not be constant over time, which is a problem both for estimation/extrapolation and for predicting what an intelligence explosion might look like. For example, if $\sigma$ falls over time, then we may have a foom for a bit until $\sigma$ falls below 1, and then it fizzles. I’ve been thinking about writing something up about this.
Are you planning follow-up work, or is there other economic data we could theoretically collect that could give us higher confidence estimates?
Yes, although we haven’t decided yet what is most useful to follow up on. In the very short term, there is trying to accommodate non-linear pricing. Of course, data on what non-linear pricing looks like would be helpful, e.g., how Nvidia does bulk discounting.
We also may try to estimate with the data we have.
Estimating the Substitutability between Compute and Cognitive Labor in AI Research
Great paper as always, Phil.
I’m curious to hear your thoughts a bit more about whether we can salvage SWE by introducing non-standard preferences.
Minor quibble: “There is then no straightforward sense in which economic growth has historically been exponential, the central stylized fact which SWE and semi-endogenous models both seek to explain”
I agree that there is no consumption aggregate under non-homothetic preferences, but we can still say economic growth has been exponential in the sense that GDP has grown exponentially. Perhaps it is not a very meaningful number under non-homothetic preferences, as you have argued elsewhere, but it still exists. Do you have thoughts on why GDP has grown exponentially in a model without a consumption aggregate?
Parker_Whitfill’s Quick takes
People often appeal to Intelligence Explosion/Recursive Self-Improvement as a win condition for current model developers; e.g., Dario argues Recursive Self-Improvement could enshrine the US’s lead over China.
This seems non-obvious to me. For example, suppose OpenAI trains GPT 6 which trains GPT 7 which trains GPT 8. Then a fast follower could take GPT 8 and then use it to train GPT 9. In this case, the fast follower has a lead and has spent far less on R&D (since they didn’t have to develop GPT 7 or 8 themselves).
I guess people are thinking that OpenAI will be able to ban GPT 8 from helping competitors? But has anyone argued for why they would be able to do that (either legally or technically)?
Is the alignment motivation distinct from just using AI to solve general bargaining problems?
Here is a counterargument: focusing on the places where there is altruistic alpha is ‘defecting’ against other value systems. See discussion here
I roughly buy that there is more “alpha” in making the future better, because most people are not longtermist, but most people do want to avoid extinction.
Good point, but can’t this trade occur just through financial markets, without involving one-on-one trades among EAs? For example, if you have short timelines, you could take out a loan and donate it all to AI Safety.
We are still working on getting a more official version of this on arXiv, possibly with estimates for λ and ϕ.
When we do that, we’ll also upload full replication files. But I don’t want to keep anyone waiting for the data in case they have some uses for it, so see here for the main CSV we used: https://github.com/parkerwhitfill/EOS_AI