Survey on intermediate goals in AI governance
It seems that a key bottleneck for the field of longtermism-aligned AI governance is limited strategic clarity (see Muehlhauser, 2020, 2021). As one effort to increase strategic clarity, in October-November 2022, we sent a survey to 229 people we had reason to believe are knowledgeable about longtermist AI governance, receiving 107 responses. We asked about:
respondentsā ātheory of victoryā for AI risk (which we defined as the main, high-level āplanā theyād propose for how humanity could plausibly manage the development and deployment of transformative AI such that we get long-lasting good outcomes),
how theyād feel about funding going to each of 53 potential āintermediate goalsā for AI governance,[1]
what other intermediate goals theyād suggest,
how high they believe the risk of existential catastrophe from AI is, and
when they expect transformative AI (TAI) to be developed.
We hope the results will be useful to funders, policymakers, people at AI labs, researchers, field-builders, people orienting to longtermist AI governance, and perhaps other types of people. For example, the report could:
Broaden the range of options people can easily consider
Help people assess how much and in what way to focus on each potential ātheory of victoryā, āintermediate goalā, etc.
Target and improve further efforts to assess how much and in what way to focus on each potential theory of victory, intermediate goal, etc.
You can see a summary of the survey results here. Note that we will expect readers to abide by the policy articulated in āAbout sharing information from this reportā (for the reasons explained there).
Acknowledgments
This report is a project of Rethink Prioritiesāa think tank dedicated to informing decisions made by high-impact organizations and funders across various cause areas. The project was commissioned by Open Philanthropy. Full acknowledgements can be found in the linked āIntroduction & summaryā document.
If you are interested in RPās work, please visit our research database and subscribe to our newsletter.
- ^
Hereās the definition of āintermediate goalā that we stated in the survey itself:
By an intermediate goal, we mean any goal for reducing extreme AI risk thatās more specific and directly actionable than a high-level goal like āreduce existential AI accident riskā but is less specific and directly actionable than a particular intervention. In another context (global health and development), examples of potential intermediate goals could include ādevelop better/ācheaper malaria vaccinesā and āimprove literacy rates in Sub-Saharan Africaā.
- 12 tenĀtaĀtive ideas for US AI policy (Luke Muehlhauser) by Apr 19, 2023, 9:05 PM; 117 points) (
- Why exĀpeĀrienced proĀfesĀsionĀals fail to land high-imĀpact roles (FBB #5) by Apr 10, 2025, 12:44 PM; 94 points) (
- AI policy ideas: ReadĀing list by Apr 17, 2023, 7:00 PM; 60 points) (
- How ReĀthink PriĀoriĀtiesā ReĀsearch could inĀform your grantmaking by Oct 4, 2023, 6:24 PM; 59 points) (
- āRisk AwareĀness MoĀmentsā (Rams): A conĀcept for thinkĀing about AI govĀerĀnance interventions by Apr 14, 2023, 5:40 PM; 53 points) (
- AnĀnouncĀing the AIPoliĀcyIdeas.com Database by Jun 23, 2023, 4:09 PM; 50 points) (
- SlowĀing AI: ReadĀing list by Apr 17, 2023, 2:30 PM; 45 points) (LessWrong;
- EA OrĀgaĀniĀzaĀtion UpĀdates: April 2023 by Apr 13, 2023, 6:48 PM; 41 points) (
- ReĀthink PriĀoriĀties is hiring a ComĀpute GoverĀnance ReĀsearcher or ReĀsearch Assistant by Jun 7, 2023, 1:22 PM; 36 points) (
- EA & LW FoĀrum Weekly SumĀmary (13th ā 19th March 2023) by Mar 20, 2023, 4:18 AM; 31 points) (
- Ideas for AI labs: ReadĀing list by Apr 24, 2023, 7:00 PM; 28 points) (
- ReĀsearch proĀject idea: InĀterĀmeĀdiĀate goals for nuĀclear risk reduction by Apr 15, 2023, 2:25 PM; 24 points) (
- AI policy ideas: ReadĀing list by Apr 17, 2023, 7:00 PM; 24 points) (LessWrong;
- Please help me sense-check my asĀsumpĀtions about the needs of the AI Safety comĀmuĀnity and reĀlated caĀreer plans by Mar 27, 2023, 8:11 AM; 23 points) (
- Part 3: A ProĀposed ApĀproach for AI Safety MoveĀment BuildĀing: ProĀjects, ProĀfesĀsions, Skills, and Ideas for the FuĀture [long post][bounty for feedĀback] by Mar 22, 2023, 12:54 AM; 22 points) (
- InĀciĀdent reĀportĀing for AI safety by Jul 19, 2023, 5:00 PM; 22 points) (LessWrong;
- InĀciĀdent reĀportĀing for AI safety by Jul 19, 2023, 5:00 PM; 18 points) (
- A ProĀposed ApĀproach for AI Safety MoveĀment BuildĀing: ProĀjects, ProĀfesĀsions, Skills, and Ideas for the FuĀture [long post][bounty for feedĀback] by Mar 22, 2023, 1:11 AM; 14 points) (LessWrong;
- EA & LW FoĀrum Weekly SumĀmary (13th ā 19th March 2023) by Mar 20, 2023, 4:18 AM; 13 points) (LessWrong;
- Why ExĀpeĀrienced ProĀfesĀsionĀals Fail to Land High-ImĀpact Roles (FBB #5) by Apr 10, 2025, 12:46 PM; 12 points) (LessWrong;
- InĀsights from an exĀpert surĀvey about inĀterĀmeĀdiĀate goals in AI governance by Mar 17, 2023, 2:59 PM; 11 points) (
- Ideas for AI labs: ReadĀing list by Apr 24, 2023, 7:00 PM; 11 points) (LessWrong;
...and while I hopefully have your attention: My team is currently hiring for a Research Manager! If you might be interested in managing one or more researchers working on a diverse set of issues relevant to mitigating extreme risks from the development and deployment of AI, please check out the job ad!
The application form should take <2 hours. The deadline is the end of the day on March 21. The role is remote and weāre able to hire in most countries.
People with a wide range of backgrounds could turn out to be the best fit for the role. As such, if youāre interested, please donāt rule yourself out due to thinking youāre not qualified unless you at least read the job ad first!
Can I ask whether there is a specific reason that you do not put the summary of the findings in this post, but only let people request access to a google drive folder?
I just browsed through it, their reasons for not doing so is also described in a section in the report.
Yeah, the āAbout sharing information from this reportā section attempts to explain this. Also, for what itās worth, I approved all access requests, generally within 24 hours.
That said, FYI Iāve now switched to the folder being viewable by anyone with the link, rather than requiring requesting access, though we still have the policies in āAbout sharing information from this reportā. (This switch was partly because my sense of the risks vs benefits has changed, and partly because we apparently hit the max number of people who can be individually shared on a folder.)