AI safety resources and materials

TagLast edit: Oct 5, 2022, 2:10 PM by Lizka

AI safety resources and materials include syllabi and other educational content related to AI safety.

Related entries

Teaching materials | collections and resources | research summary | AI risk | AI safety

List of AI safety newsletters and other resources

LizkaMay 1, 2023, 5:24 PM

49 points

5 comments4 min readEA link

How to pursue a career in technical AI alignment

Charlie Rogers-SmithJun 4, 2022, 9:36 PM

265 points

9 comments39 min readEA link

List of AI safety courses and resources

Daniel del CastilloSep 6, 2021, 2:26 PM

51 points

8 comments1 min readEA link

Cost-effectiveness of student programs for AI safety research

Center for AI SafetyJul 10, 2023, 5:23 PM

53 points

7 comments15 min readEA link

All AGI Safety questions welcome (especially basic ones) [April 2023]

StevenKaasApr 8, 2023, 4:21 AM

111 points

173 comments1 min readEA link

Modeling the impact of AI safety field-building programs

Center for AI SafetyJul 10, 2023, 5:22 PM

83 points

0 comments7 min readEA link

MATS AI Safety Strategy Curriculum v2

DanielFilanOct 7, 2024, 11:01 PM

29 points

1 comment1 min readEA link

[Linkpost] AI Alignment, Explained in 5 Points (updated)

Daniel_EthApr 18, 2023, 8:09 AM

31 points

2 comments1 min readEA link

(medium.com)

Announcing aisafety.training

JJ HepburnJan 17, 2023, 1:55 AM

110 points

4 comments1 min readEA link

Cost-effectiveness of professional field-building programs for AI safety research

Center for AI SafetyJul 10, 2023, 5:26 PM

38 points

2 comments18 min readEA link

The Importance of AI Alignment, explained in 5 points

Daniel_EthFeb 11, 2023, 2:56 AM

50 points

4 comments13 min readEA link

AI Safety Arguments: An Interactive Guide

Lukas Trötzmüller🔸Feb 1, 2023, 7:21 PM

32 points

5 comments3 min readEA link

Onboarding students to EA/AIS in 4 days with an intensive fellowship

gergoDec 5, 2023, 10:07 AM

17 points

0 comments4 min readEA link

AI Safety University Organizing: Early Takeaways from Thirteen Groups

Agustín Covarrubias 🔸Oct 2, 2024, 2:39 PM

46 points

3 comments9 min readEA link

Distribution Shifts and The Importance of AI Safety

Leon LangSep 29, 2022, 10:38 PM

7 points

0 comments1 min readEA link

[Question] How to create curriculum for self-study towards AI alignment work?

OIUJHKDFSJan 7, 2023, 7:53 PM

10 points

5 comments1 min readEA link

AI Risk Intro 1: Advanced AI Might Be Very Bad

L Rudolf LSep 11, 2022, 10:57 AM

22 points

0 comments30 min readEA link

Poster Session on AI Safety

Neil CrawfordNov 12, 2022, 3:50 AM

8 points

0 comments4 min readEA link

How to become an AI safety researcher

peterbarnettApr 12, 2022, 11:33 AM

113 points

15 comments14 min readEA link

Thread: Reflections on the AGI Safety Fundamentals course?

CliffordMay 18, 2023, 1:11 PM

27 points

7 comments1 min readEA link

(My suggestions) On Beginner Steps in AI Alignment

Joseph BloomSep 22, 2022, 3:32 PM

37 points

3 comments9 min readEA link

AGISF adaptation for in-person groups

Sam MarksJan 17, 2023, 6:33 PM

30 points

0 comments3 min readEA link

(www.lesswrong.com)

Levelling Up in AI Safety Research Engineering

GabeMSep 2, 2022, 4:59 AM

166 points

21 comments17 min readEA link

Resources that (I think) new alignment researchers should know about

AkashOct 28, 2022, 10:13 PM

20 points

2 comments1 min readEA link

$20K in Bounties for AI Safety Public Materials

TW123Aug 5, 2022, 2:57 AM

45 points

11 comments6 min readEA link

What are some other introductions to AI safety?

Vishakha AgrawalFeb 17, 2025, 11:48 AM

9 points

0 comments7 min readEA link

(aisafety.info)

What are the “no free lunch” theorems?

Vishakha AgrawalFeb 4, 2025, 2:02 AM

3 points

0 comments1 min readEA link

(aisafety.info)

Top AI safety newsletters, books, podcasts, etc – new AISafety.com resource

Bryce RobertsonMar 4, 2025, 5:01 PM

9 points

0 comments1 min readEA link

Overview: AI Safety Outreach Grassroots Orgs

SeverinMay 12, 2025, 2:38 PM

11 points

0 comments1 min readEA link

How I switched careers from software engineer to AI policy operations

Lucie Philippon 🔸Apr 13, 2025, 6:41 AM

12 points

1 comment5 min readEA link

(www.lesswrong.com)

AI Safety Info Distillation Fellowship

robertskmilesFeb 17, 2023, 4:16 PM

80 points

1 comment1 min readEA link

We are sharing a new website template for AI Safety groups!

AIS HungaryMar 13, 2024, 4:40 PM

11 points

2 comments1 min readEA link

All AGI Safety questions welcome (especially basic ones) [May 2023]

StevenKaasMay 8, 2023, 10:30 PM

19 points

11 comments1 min readEA link

AI is advancing fast

Vishakha AgrawalApr 23, 2025, 11:04 AM

2 points

2 comments2 min readEA link

(aisafety.info)

AI Safety Fundamentals: An Informal Cohort Starting Soon! (cross-posted to lesswrong.com)

TiagoJun 4, 2023, 6:21 PM

6 points

0 comments1 min readEA link

(www.lesswrong.com)

Concrete Steps to Get Started in Transformer Mechanistic Interpretability

Neel NandaDec 26, 2022, 1:00 PM

18 points

0 comments12 min readEA link

Project idea: Aggregation of events for our new era

Julian NalenzJan 26, 2025, 6:16 PM

8 points

1 comment3 min readEA link

AI Safety & Entrepreneurship v1.0

Chris LeongApr 26, 2025, 2:37 PM

27 points

0 comments1 min readEA link

An audio version of the alignment problem from a deep learning perspective by Richard Ngo Et Al

MiguelFeb 3, 2023, 7:32 PM

18 points

0 comments1 min readEA link

(www.whitehatstoic.com)

Map of AI Safety v2

Bryce RobertsonApr 15, 2025, 1:04 PM

59 points

6 comments1 min readEA link

Learning as much Deep Learning math as I could in 24 hours

PhosphorousJan 8, 2023, 2:19 AM

58 points

6 comments7 min readEA link

My attempt at explaining the case for AI risk in a straightforward way

JulianHazellMar 25, 2023, 4:32 PM

25 points

7 comments18 min readEA link

(muddyclothes.substack.com)

AI’s goals may not match ours

Vishakha AgrawalMay 28, 2025, 12:07 PM

2 points

0 comments3 min readEA link

What AI Safety Materials Do ML Researchers Find Compelling?

Vael GatesDec 28, 2022, 2:03 AM

130 points

12 comments1 min readEA link

Summary of 80k’s AI problem profile

JakubKJan 1, 2023, 7:48 AM

19 points

0 comments5 min readEA link

(www.lesswrong.com)

[Question] Idea: Repository for AI Safety Presentations

EitanJan 6, 2025, 1:04 PM

14 points

3 comments1 min readEA link

Seeking input on a list of AI books for broader audience

Darren McKeeFeb 27, 2023, 10:40 PM

49 points

14 comments5 min readEA link

AI Safety For Dummies (Like Me)

Madhav MalhotraAug 24, 2022, 8:26 PM

22 points

7 comments20 min readEA link

[Question] Best introductory overviews of AGI safety?

JakubKDec 13, 2022, 7:04 PM

21 points

8 comments2 min readEA link

(www.lesswrong.com)

Map of all 40 copyright suits v. AI in U.S.

RemmeltMar 26, 2025, 7:57 AM

16 points

0 comments1 min readEA link

(chatgptiseatingtheworld.com)

AI may attain human level soon

Vishakha AgrawalApr 23, 2025, 11:10 AM

2 points

1 comment2 min readEA link

(aisafety.info)

New AI risk intro from Vox [link post]

JakubKDec 21, 2022, 5:50 AM

7 points

1 comment2 min readEA link

(www.vox.com)

AI Safety Executive Summary

Sean OsierSep 6, 2022, 8:26 AM

20 points

2 comments5 min readEA link

(seanosier.notion.site)

[Question] What should I read about defining AI “hallucination?”

James-Hartree-LawJan 23, 2025, 1:00 AM

2 points

0 comments1 min readEA link

Uncontrollable AI as an Existential Risk

Karl von WendtOct 9, 2022, 10:37 AM

28 points

0 comments1 min readEA link

An A.I. Safety Presentation at RIT

Nicholas KrossMar 27, 2023, 11:49 PM

5 points

0 comments1 min readEA link

There should be a public adversarial collaboration on AI x-risk

pradyuprasadJan 23, 2023, 4:09 AM

56 points

5 comments2 min readEA link

Announcement: Learning Theory Online Course

YegregJan 28, 2025, 8:32 AM

5 points

0 comments3 min readEA link

(www.lesswrong.com)

Human-level is not the limit

Vishakha AgrawalApr 23, 2025, 11:16 AM

3 points

0 comments2 min readEA link

(aisafety.info)

EA Netherlands’ guide to AI safety careers

James HerbertJan 16, 2025, 5:22 PM

25 points

0 comments1 min readEA link

(effectiefaltruisme.nl)

AI may pursue goals

Vishakha AgrawalMay 28, 2025, 12:04 PM

2 points

0 comments1 min readEA link

Big list of AI safety videos

JakubKJan 9, 2023, 6:09 AM

9 points

0 comments1 min readEA link

(docs.google.com)

China x AI Reference List

Saad SiddiquiMar 13, 2024, 6:57 PM

61 points

3 comments3 min readEA link

(docs.google.com)

AI Risk Intro 2: Solving The Problem

L Rudolf LSep 24, 2022, 9:33 AM

11 points

0 comments28 min readEA link

(www.perfectlynormal.co.uk)

The road from human-level to superintelligent AI may be short

Vishakha AgrawalApr 23, 2025, 11:19 AM

3 points

0 comments2 min readEA link

(aisafety.info)

What are the differences between AGI, transformative AI, and superintelligence?

Vishakha AgrawalJan 23, 2025, 10:11 AM

12 points

0 comments3 min readEA link

(aisafety.info)

How do fictional stories illustrate AI misalignment?

Vishakha AgrawalJan 15, 2025, 6:16 AM

4 points

0 comments2 min readEA link

(aisafety.info)

Power-Seeking AI and Existential Risk

antoniofrancaibOct 11, 2022, 9:47 PM

10 points

0 comments1 min readEA link

Problems of people new to AI safety and my project ideas to mitigate them

Igor IvanovMar 3, 2023, 5:35 PM

20 points

0 comments7 min readEA link

My experience building mathematical ML skills with a course from UIUC

Naoya OkamotoJun 9, 2024, 11:41 AM

2 points

0 comments10 min readEA link

ENAIS has launched a newsletter for AIS fieldbuilders

gergoNov 22, 2024, 10:45 AM

25 points

0 comments1 min readEA link

“AI Risk Discussions” website: Exploring interviews from 97 AI Researchers

Vael GatesFeb 2, 2023, 1:00 AM

46 points

1 comment1 min readEA link

Let’s talk about uncontrollable AI

Karl von WendtOct 9, 2022, 10:37 AM

12 points

2 comments1 min readEA link

Highly Opinionated Advice on How to Write ML Papers

Neel NandaMay 12, 2025, 1:59 AM

22 points

0 comments1 min readEA link

Resources I send to AI researchers about AI safety

Vael GatesJan 11, 2023, 1:24 AM

43 points

0 comments1 min readEA link

What are some good podcasts about AI safety?

Vishakha AgrawalFeb 17, 2025, 10:32 AM

8 points

1 comment1 min readEA link

(aisafety.info)

No comments.

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer