
AI safety resources and materials

Tag | Last edit: Oct 5, 2022, 2:10 PM by Lizka

AI safety resources and materials include syllabi and other educational content related to AI safety.

Related entries

Teaching materials | collections and resources | research summary | AI risk | AI safety

List of AI safety newsletters and other resources

Lizka | May 1, 2023, 5:24 PM
49 points
5 comments | 4 min read | EA link

List of AI safety courses and resources

Daniel del Castillo | Sep 6, 2021, 2:26 PM
51 points
8 comments | 1 min read | EA link

How to pursue a career in technical AI alignment

Charlie Rogers-Smith | Jun 4, 2022, 9:36 PM
265 points
9 comments | 39 min read | EA link

Cost-effectiveness of student programs for AI safety research

Center for AI Safety | Jul 10, 2023, 5:23 PM
53 points
7 comments | 15 min read | EA link

All AGI Safety questions welcome (especially basic ones) [April 2023]

StevenKaas | Apr 8, 2023, 4:21 AM
111 points
173 comments | 1 min read | EA link

Cost-effectiveness of professional field-building programs for AI safety research

Center for AI Safety | Jul 10, 2023, 5:26 PM
38 points
2 comments | 18 min read | EA link

Modeling the impact of AI safety field-building programs

Center for AI Safety | Jul 10, 2023, 5:22 PM
83 points
0 comments | 7 min read | EA link

[Linkpost] AI Alignment, Explained in 5 Points (updated)

Daniel_Eth | Apr 18, 2023, 8:09 AM
31 points
2 comments | 1 min read | EA link
(medium.com)

Announcing aisafety.training

JJ Hepburn | Jan 17, 2023, 1:55 AM
110 points
4 comments | 1 min read | EA link

MATS AI Safety Strategy Curriculum v2

DanielFilan | Oct 7, 2024, 11:01 PM
29 points
1 comment | 1 min read | EA link

AI Safety Arguments: An Interactive Guide

Lukas Trötzmüller🔸 | Feb 1, 2023, 7:21 PM
32 points
5 comments | 3 min read | EA link

Onboarding students to EA/AIS in 4 days with an intensive fellowship

gergo | Dec 5, 2023, 10:07 AM
17 points
0 comments | 4 min read | EA link

The Importance of AI Alignment, explained in 5 points

Daniel_Eth | Feb 11, 2023, 2:56 AM
50 points
4 comments | 13 min read | EA link

$20K in Bounties for AI Safety Public Materials

TW123 | Aug 5, 2022, 2:57 AM
45 points
11 comments | 6 min read | EA link

How to become an AI safety researcher

peterbarnett | Apr 12, 2022, 11:33 AM
113 points
15 comments | 14 min read | EA link

(My suggestions) On Beginner Steps in AI Alignment

Joseph Bloom | Sep 22, 2022, 3:32 PM
37 points
3 comments | 9 min read | EA link

Levelling Up in AI Safety Research Engineering

GabeM | Sep 2, 2022, 4:59 AM
165 points
21 comments | 17 min read | EA link

AI Safety University Organizing: Early Takeaways from Thirteen Groups

Agustín Covarrubias 🔸 | Oct 2, 2024, 2:39 PM
46 points
3 comments | 9 min read | EA link

Distribution Shifts and The Importance of AI Safety

Leon Lang | Sep 29, 2022, 10:38 PM
7 points
0 comments | 1 min read | EA link

AGISF adaptation for in-person groups

Sam Marks | Jan 17, 2023, 6:33 PM
30 points
0 comments | 3 min read | EA link
(www.lesswrong.com)

Resources that (I think) new alignment researchers should know about

Akash | Oct 28, 2022, 10:13 PM
20 points
2 comments | 1 min read | EA link

Poster Session on AI Safety

Neil Crawford | Nov 12, 2022, 3:50 AM
8 points
0 comments | 4 min read | EA link

Thread: Reflections on the AGI Safety Fundamentals course?

Clifford | May 18, 2023, 1:11 PM
27 points
7 comments | 1 min read | EA link

AI Risk Intro 1: Advanced AI Might Be Very Bad

L Rudolf L | Sep 11, 2022, 10:57 AM
22 points
0 comments | 30 min read | EA link

[Question] How to create curriculum for self-study towards AI alignment work?

OIUJHKDFS | Jan 7, 2023, 7:53 PM
10 points
5 comments | 1 min read | EA link

An A.I. Safety Presentation at RIT

Nicholas / Heather Kross | Mar 27, 2023, 11:49 PM
5 points
0 comments | 1 min read | EA link

All AGI Safety questions welcome (especially basic ones) [May 2023]

StevenKaas | May 8, 2023, 10:30 PM
19 points
11 comments | 1 min read | EA link

AI Safety Fundamentals: An Informal Cohort Starting Soon! (cross-posted to lesswrong.com)

Tiago | Jun 4, 2023, 6:21 PM
6 points
0 comments | 1 min read | EA link
(www.lesswrong.com)

ENAIS has launched a newsletter for AIS fieldbuilders

gergo | Nov 22, 2024, 10:45 AM
25 points
0 comments | 1 min read | EA link

China x AI Reference List

Saad Siddiqui | Mar 13, 2024, 6:57 PM
61 points
3 comments | 3 min read | EA link
(docs.google.com)

We are sharing a new website template for AI Safety groups!

AIS Hungary | Mar 13, 2024, 4:40 PM
10 points
2 comments | 1 min read | EA link

What are some other introductions to AI safety?

Vishakha Agrawal | Feb 17, 2025, 11:48 AM
9 points
0 comments | 7 min read | EA link
(aisafety.info)

What are some good podcasts about AI safety?

Vishakha Agrawal | Feb 17, 2025, 10:32 AM
8 points
1 comment | 1 min read | EA link
(aisafety.info)

[Question] Idea: Repository for AI Safety Presentations

Eitan | Jan 6, 2025, 1:04 PM
14 points
3 comments | 1 min read | EA link

Map of all 40 copyright suits v. AI in U.S.

Remmelt | Mar 26, 2025, 7:57 AM
15 points
0 comments | 1 min read | EA link
(chatgptiseatingtheworld.com)

How do fictional stories illustrate AI misalignment?

Vishakha Agrawal | Jan 15, 2025, 6:16 AM
4 points
0 comments | 2 min read | EA link
(aisafety.info)

EA Netherlands’ guide to AI safety careers

James Herbert | Jan 16, 2025, 5:22 PM
25 points
0 comments | 1 min read | EA link
(effectiefaltruisme.nl)

[Question] What should I read about defining AI “hallucination?”

James-Hartree-Law | Jan 23, 2025, 1:00 AM
2 points
0 comments | 1 min read | EA link

What are the differences between AGI, transformative AI, and superintelligence?

Vishakha Agrawal | Jan 23, 2025, 10:11 AM
12 points
0 comments | 3 min read | EA link
(aisafety.info)

Project idea: Aggregation of events for our new era

Julian Nalenz | Jan 26, 2025, 6:16 PM
8 points
1 comment | 3 min read | EA link

Announcement: Learning Theory Online Course

Yegreg | Jan 28, 2025, 8:32 AM
5 points
0 comments | 3 min read | EA link
(www.lesswrong.com)

Top AI safety newsletters, books, podcasts, etc – new AISafety.com resource

Bryce Robertson | Mar 4, 2025, 5:01 PM
9 points
0 comments | 1 min read | EA link

Let’s talk about uncontrollable AI

Karl von Wendt | Oct 9, 2022, 10:37 AM
12 points
2 comments | 1 min read | EA link

New AI risk intro from Vox [link post]

JakubK | Dec 21, 2022, 5:50 AM
7 points
1 comment | 2 min read | EA link
(www.vox.com)

[Question] Best introductory overviews of AGI safety?

JakubK | Dec 13, 2022, 7:04 PM
21 points
8 comments | 2 min read | EA link
(www.lesswrong.com)

Power-Seeking AI and Existential Risk

antoniofrancaib | Oct 11, 2022, 9:47 PM
10 points
0 comments | 1 min read | EA link

AI Risk Intro 2: Solving The Problem

L Rudolf L | Sep 24, 2022, 9:33 AM
11 points
0 comments | 28 min read | EA link
(www.perfectlynormal.co.uk)

AI Safety Executive Summary

Sean Osier | Sep 6, 2022, 8:26 AM
20 points
2 comments | 5 min read | EA link
(seanosier.notion.site)

Uncontrollable AI as an Existential Risk

Karl von Wendt | Oct 9, 2022, 10:37 AM
28 points
0 comments | 1 min read | EA link

AI Safety For Dummies (Like Me)

Madhav Malhotra | Aug 24, 2022, 8:26 PM
22 points
7 comments | 20 min read | EA link

What AI Safety Materials Do ML Researchers Find Compelling?

Vael Gates | Dec 28, 2022, 2:03 AM
130 points
12 comments | 1 min read | EA link

Concrete Steps to Get Started in Transformer Mechanistic Interpretability

Neel Nanda | Dec 26, 2022, 1:00 PM
18 points
0 comments | 12 min read | EA link

Summary of 80k’s AI problem profile

JakubK | Jan 1, 2023, 7:48 AM
19 points
0 comments | 5 min read | EA link
(www.lesswrong.com)

Big list of AI safety videos

JakubK | Jan 9, 2023, 6:09 AM
9 points
0 comments | 1 min read | EA link
(docs.google.com)

Learning as much Deep Learning math as I could in 24 hours

Phosphorous | Jan 8, 2023, 2:19 AM
58 points
6 comments | 7 min read | EA link

Resources I send to AI researchers about AI safety

Vael Gates | Jan 11, 2023, 1:24 AM
43 points
0 comments | 1 min read | EA link

My experience building mathematical ML skills with a course from UIUC

Naoya Okamoto | Jun 9, 2024, 11:41 AM
2 points
0 comments | 10 min read | EA link

There should be a public adversarial collaboration on AI x-risk

pradyuprasad | Jan 23, 2023, 4:09 AM
56 points
5 comments | 2 min read | EA link

“AI Risk Discussions” website: Exploring interviews from 97 AI Researchers

Vael Gates | Feb 2, 2023, 1:00 AM
46 points
1 comment | 1 min read | EA link

An audio version of the alignment problem from a deep learning perspective by Richard Ngo Et Al

Miguel | Feb 3, 2023, 7:32 PM
18 points
0 comments | 1 min read | EA link
(www.whitehatstoic.com)

AI Safety Info Distillation Fellowship

robertskmiles | Feb 17, 2023, 4:16 PM
80 points
1 comment | 1 min read | EA link

What are the “no free lunch” theorems?

Vishakha Agrawal | Feb 4, 2025, 2:02 AM
3 points
0 comments | 1 min read | EA link
(aisafety.info)

Seeking input on a list of AI books for broader audience

Darren McKee | Feb 27, 2023, 10:40 PM
49 points
14 comments | 5 min read | EA link

Problems of people new to AI safety and my project ideas to mitigate them

Igor Ivanov | Mar 3, 2023, 5:35 PM
19 points
0 comments | 7 min read | EA link

My attempt at explaining the case for AI risk in a straightforward way

JulianHazell | Mar 25, 2023, 4:32 PM
25 points
7 comments | 18 min read | EA link
(muddyclothes.substack.com)