Programme Director at ML4Good.
I also have done work on China policy, AI governance, and animal advocacy in Asia.
Also interested in effective giving (mainly animal charities), economic development (and how AI will affect it), AI x Animals, wild animal welfare, cause prioritisation, and various meta-EA topics.
Good question, but I implore you not to take this post too seriously. Itâs a real phenomenon, but itâs a real stretch to claim that this applies in the implied way to cause areas like AI safety.
The model in the article is just a toy model of a world where existential threats are randomly distributed according to a power law, where genuinely high-probability threats are, by assumption, basically absent from the space of possible threats, and where thereâs no process of updating based on evidence.
A more narrow claim like: âsingle narrowly defined x-risk estimates of genuinely speculative, unresearched causes are likely to be inflatedâ might be valid, but that doesnât seem to be what titotal is implying. Heâs making a claim way beyond anything implied by the model, even if the model were a valid representation of the phenomenon. He seems to believe that almost all of todayâs concern for AI risk is all downstream of a belief cultivated within a narrow subcommunity subject to the optimiserâs curseâan extraordinary claim that requires a lot more evidence than that supplied in the article.
The claim being made is something like:
Some time in the 2000s, Eliezer Yudkowsky and friends made up some numbers for AI risk
Community dynamics (rather than the merits of the arguments) spread these numbers from Bostrom to Tegmark to Sam Harris to Elon Musk to the EA community etc.
These social dynamics had such an effect that multiple seemingly independent and unrelated people/âexperts from Geoffrey Hinton to Yoshua Bengio to Chinese academics and tech people from completely different intellectual lineages have absorbed this false belief from the cultural milieu (again, not at all based on the merits of the arguments), leading many of these people, as well as specialists across unrelated fields, to make estimates of AI x-risk that remain orders of magnitude too high
Note that this requires some pretty wild, difficult-to-justify assumptions on how this belief has spread.
The opposing narrative (which I would advocate for) is that:
AI x-risk is something that has independently been identified as a risk by multiple peopleâoften far before quantifying or ranking risks.
The spread of these beliefs was obviously affected by community dynamics, but people largely adopted somewhat independent beliefs based on the merits of rational arguments
Quantitative predictions were inherently imprecise because theyâre so dependent on messy world models, but they were grounded enough in reason and evidence, and composed of enough independent estimates, that any optimiserâs curse is massively weakened
As certain predictions came to pass, and current LLMs approach AGI in non-ideal geopolitical and competitive circumstances, this is increasingly being seen by a wider range of thinkers as a >1% x-risk
Even if there were âoptimiser curseâ risks in initial prioritisation of AI, itâs now increasingly recognised that AI will be a massive deal. AI-assisted engineered pandemic uplift work, observed cyber-capabilities, and signs of misalignment/âscheming etc. are building on the strong theoretical evidence base that AI-generated catastrophe is possible.
And on your particular question of how to act, even given the optimiserâs curse as stated in the toy model, working on the speculative thing could still be optimal. If a highly uncertain intervention seems exceptionally promising or x-risky, the value of information becomes incredibly high, because accurate or well-reasoned research will lead to this intervention being prioritised or not by far more people. If you have a research focus, itâs therefore probably more recommended to focus on a more uncertain, âhigh-risk-high-rewardâ area. You could also draw up a toy model for explore/âexploit based on the optimiserâs curse.
Finally, you donât necessarily have to âpickâ from a narrow set of pre-defined cause areas. You can also divide or merge risks, cause areas, skill-sets etc. to be more robust, precise, or coherent with your own world model (e.g. focusing on engineered pandemics because you realise this could interact with AI-related x-risk, GCBRs, and global health).