
Public communication on AI safety

Last edit: Nov 27, 2023, 10:13 PM by Sarah Cheng

The public communication on AI safety tag covers depictions of AI safety in the media, as well as meta-discussions about how best to represent the topic when engaging with journalists, publishing for a broad audience, or otherwise conveying AI safety to different audiences.

Related entries

AI Safety | Building the field of AI safety | AI governance | Slowing down AI

Analogy Bank for AI Safety

utilistrut · Jan 29, 2024, 2:35 AM
14 points
5 comments · 1 min read · EA link

“Near Midnight in Suicide City”

Greg_Colbourn · Dec 6, 2024, 7:54 PM
5 points
0 comments · 1 min read · EA link
(www.youtube.com)

Careless talk on US-China AI competition? (and criticism of CAIS coverage)

Oliver Sourbut · Sep 20, 2023, 12:46 PM
52 points
19 comments · 1 min read · EA link
(www.oliversourbut.net)

Talking publicly about AI risk

Jan_Kulveit · Apr 24, 2023, 9:19 AM
152 points
13 comments · 1 min read · EA link

If trying to communicate about AI risks, make it vivid

Michael Noetel 🔸 · May 27, 2024, 12:59 AM
19 points
2 comments · 2 min read · EA link

New blog: Planned Obsolescence

Ajeya · Mar 27, 2023, 7:46 PM
198 points
9 comments · 1 min read · EA link
(www.planned-obsolescence.org)

Why some people disagree with the CAIS statement on AI

David_Moss · Aug 15, 2023, 1:39 PM
144 points
15 comments · 16 min read · EA link

AI Safety Action Plan—A report commissioned by the US State Department

Agustín Covarrubias 🔸 · Mar 11, 2024, 10:13 PM
25 points
1 comment · 1 min read · EA link
(www.gladstone.ai)

Articles about recent OpenAI departures

bruce · May 17, 2024, 5:38 PM
126 points
12 comments · 1 min read · EA link
(www.vox.com)

Worrisome misunderstanding of the core issues with AI transition

Roman Leventov · Jan 18, 2024, 10:05 AM
4 points
3 comments · 1 min read · EA link

My Proven AI Safety Explanation (as a computing student)

Mica White · Feb 6, 2024, 3:58 AM
8 points
4 comments · 6 min read · EA link

New voluntary commitments (AI Seoul Summit)

Zach Stein-Perlman · May 21, 2024, 11:00 AM
12 points
1 comment · 1 min read · EA link
(www.gov.uk)

[Linkpost] Statement from Scarlett Johansson on OpenAI’s use of the “Sky” voice, that was shockingly similar to her own voice.

Linch · May 20, 2024, 11:50 PM
46 points
8 comments · 1 min read · EA link
(variety.com)

World’s first major law for artificial intelligence gets final EU green light

Dane Valerie · May 24, 2024, 2:57 PM
3 points
1 comment · 2 min read · EA link
(www.cnbc.com)

US public opinion of AI policy and risk

Jamie E · May 12, 2023, 1:22 PM
111 points
7 comments · 15 min read · EA link

Pausing AI Developments Isn’t Enough. We Need to Shut it All Down

EliezerYudkowsky · Apr 9, 2023, 3:53 PM
50 points
3 comments · 1 min read · EA link

Survey of 2,778 AI authors: six parts in pictures

Katja_Grace · Jan 6, 2024, 4:43 AM
176 points
10 comments · 1 min read · EA link

Is fear productive when communicating AI x-risk? [Study results]

Johanna Roniger · Jan 22, 2024, 5:38 AM
73 points
10 comments · 5 min read · EA link

Keep Making AI Safety News

Gil · Mar 31, 2023, 8:11 PM
67 points
4 comments · 1 min read · EA link

Spreading messages to help with the most important century

Holden Karnofsky · Jan 25, 2023, 8:35 PM
128 points
21 comments · 18 min read · EA link
(www.cold-takes.com)

AI Alignment in The New Yorker

Eleni_A · May 17, 2023, 9:19 PM
23 points
0 comments · 1 min read · EA link
(www.newyorker.com)

I designed an AI safety course (for a philosophy department)

Eleni_A · Sep 23, 2023, 9:56 PM
27 points
3 comments · 2 min read · EA link

Jan Leike: “I’m excited to join @AnthropicAI to continue the superalignment mission!”

defun 🔸 · May 28, 2024, 6:08 PM
35 points
11 comments · 1 min read · EA link
(x.com)

My cover story in Jacobin on AI capitalism and the x-risk debates

Garrison · Feb 12, 2024, 11:34 PM
154 points
10 comments · 6 min read · EA link
(jacobin.com)

US public perception of CAIS statement and the risk of extinction

Jamie E · Jun 22, 2023, 4:39 PM
126 points
4 comments · 9 min read · EA link

xAI raises $6B

andzuck · Jun 5, 2024, 3:26 PM
18 points
1 comment · 1 min read · EA link
(x.ai)

A short conversation I had with Google Gemini on the dangers of unregulated LLM API use, while mildly drunk in an airport.

EvanMcCormick · Dec 17, 2024, 12:25 PM
1 point
0 comments · 8 min read · EA link

AI Risk is like Terminator; Stop Saying it’s Not

skluug · Mar 8, 2022, 7:17 PM
191 points
43 comments · 10 min read · EA link
(skluug.substack.com)

The Overton Window widens: Examples of AI risk in the media

Akash · Mar 23, 2023, 5:10 PM
112 points
11 comments · 1 min read · EA link

Keep Chasing AI Safety Press Coverage

Gil · Apr 4, 2023, 8:40 PM
106 points
16 comments · 5 min read · EA link

FLI open letter: Pause giant AI experiments

Zach Stein-Perlman · Mar 29, 2023, 4:04 AM
220 points
38 comments · 1 min read · EA link

Max Tegmark’s new Time article on how we’re in a Don’t Look Up scenario [Linkpost]

Jonas Hallgren · Apr 25, 2023, 3:47 PM
41 points
0 comments · 1 min read · EA link

Claude 3.5 Sonnet

Zach Stein-Perlman · Jun 20, 2024, 6:00 PM
31 points
0 comments · 1 min read · EA link
(www.anthropic.com)

FT: We must slow down the race to God-like AI

Angelina Li · Apr 24, 2023, 11:57 AM
33 points
2 comments · 2 min read · EA link
(www.ft.com)

Let’s think about slowing down AI

Katja_Grace · Dec 23, 2022, 7:56 PM
334 points
9 comments · 1 min read · EA link

[Linkpost] OpenAI leaders call for regulation of “superintelligence” to reduce existential risk.

Lowe Lundin · May 25, 2023, 2:14 PM
5 points
0 comments · 1 min read · EA link

AI Safety Newsletter #3: AI policy proposals and a new challenger approaches

Oliver Z · Apr 25, 2023, 4:15 PM
35 points
1 comment · 4 min read · EA link
(newsletter.safe.ai)

Pausing AI Developments Isn’t Enough. We Need to Shut it All Down by Eliezer Yudkowsky

jacquesthibs · Mar 29, 2023, 11:30 PM
211 points
75 comments · 3 min read · EA link
(time.com)

Organizing a debate with experts and MPs to raise AI xrisk awareness: a possible blueprint

Otto · Apr 19, 2023, 10:50 AM
75 points
1 comment · 4 min read · EA link

[US] NTIA: AI Accountability Policy Request for Comment

Kyle J. Lucchese · Apr 13, 2023, 4:12 PM
47 points
4 comments · 1 min read · EA link
(ntia.gov)

Tarbell Fellowship 2024 - Applications Open (AI Journalism)

Cillian_ · Sep 28, 2023, 10:38 AM
58 points
1 comment · 3 min read · EA link

How bad a future do ML researchers expect?

Katja_Grace · Mar 13, 2023, 5:47 AM
165 points
20 comments · 1 min read · EA link

News: Spanish AI image outcry + US AI workforce “regulation”

Benevolent_Rain · Sep 26, 2023, 7:43 AM
9 points
0 comments · 1 min read · EA link

INTERVIEW: StakeOut.AI w/ Dr. Peter Park

Jacob-Haimes · Mar 5, 2024, 6:04 PM
21 points
7 comments · 1 min read · EA link
(into-ai-safety.github.io)

[Linkpost] 538 Politics Podcast on AI risk & politics

jackva · Apr 11, 2023, 5:03 PM
64 points
5 comments · 1 min read · EA link
(fivethirtyeight.com)

Short review of our TensorTrust-based AI safety university outreach event

Milan Weibel🔹 · Sep 22, 2024, 2:54 PM
15 points
0 comments · 2 min read · EA link

A transcript of the TED talk by Eliezer Yudkowsky

MikhailSamin · Jul 12, 2023, 12:12 PM
39 points
0 comments · 1 min read · EA link

Against most, but not all, AI risk analogies

Matthew_Barnett · Jan 14, 2024, 7:13 PM
43 points
9 comments · 1 min read · EA link

Did Bengio and Tegmark lose a debate about AI x-risk against LeCun and Mitchell?

Karl von Wendt · Jun 25, 2023, 4:59 PM
80 points
24 comments · 1 min read · EA link

Excerpts from “Majority Leader Schumer Delivers Remarks To Launch SAFE Innovation Framework For Artificial Intelligence At CSIS”

Chris Leong · Jul 21, 2023, 11:15 PM
19 points
0 comments · 1 min read · EA link
(www.democrats.senate.gov)

An EA used deceptive messaging to advance her project; we need mechanisms to avoid deontologically dubious plans

MikhailSamin · Feb 13, 2024, 11:11 PM
22 points
39 comments · 5 min read · EA link

AI policy ideas: Reading list

Zach Stein-Perlman · Apr 17, 2023, 7:00 PM
60 points
3 comments · 1 min read · EA link

The Best Argument is not a Simple English Yud Essay

Jonathan Bostock · Sep 19, 2024, 3:29 PM
74 points
3 comments · 5 min read · EA link
(www.lesswrong.com)

Announcing New Beginner-friendly Book on AI Safety and Risk

Darren McKee · Nov 25, 2023, 3:57 PM
114 points
9 comments · 1 min read · EA link

Branding AI Safety Groups: A Field Guide

Agustín Covarrubias 🔸 · May 13, 2024, 5:17 PM
44 points
6 comments · 1 min read · EA link

The Cruel Trade-Off Between AI Misuse and AI X-risk Concerns

simeon_c · Apr 22, 2023, 1:49 PM
21 points
17 comments · 1 min read · EA link

Sam Altman’s Chip Ambitions Undercut OpenAI’s Safety Strategy

Garrison · Feb 10, 2024, 7:52 PM
286 points
20 comments · 3 min read · EA link
(garrisonlovely.substack.com)

[linkpost] “What Are Reasonable AI Fears?” by Robin Hanson, 2023-04-23

Arjun Panickssery · Apr 14, 2023, 11:26 PM
41 points
3 comments · 4 min read · EA link
(quillette.com)

[Linkpost] ‘The Godfather of A.I.’ Leaves Google and Warns of Danger Ahead

imp4rtial 🔸 · May 1, 2023, 7:54 PM
43 points
3 comments · 3 min read · EA link
(www.nytimes.com)

AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks

Center for AI Safety · Oct 31, 2023, 7:24 PM
21 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

President Biden Issues Executive Order on Safe, Secure, and Trustworthy Artificial Intelligence

Tristan Williams · Oct 30, 2023, 11:15 AM
143 points
8 comments · 3 min read · EA link
(www.whitehouse.gov)

AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities

Center for AI Safety · Aug 29, 2023, 3:03 PM
12 points
0 comments · 8 min read · EA link
(newsletter.safe.ai)

The Bletchley Declaration on AI Safety

Hauke Hillebrandt · Nov 1, 2023, 11:44 AM
60 points
3 comments · 4 min read · EA link
(www.gov.uk)

Announcing Superintelligence Imagined: A creative contest on the risks of superintelligence

TaylorJns · Jun 12, 2024, 3:20 PM
17 points
0 comments · 1 min read · EA link

Biden-Harris Administration Announces First-Ever Consortium Dedicated to AI Safety

ben.smith · Feb 9, 2024, 6:40 AM
15 points
1 comment · 1 min read · EA link
(www.nist.gov)

Disrupting malicious uses of AI by state-affiliated threat actors

Agustín Covarrubias 🔸 · Feb 14, 2024, 9:28 PM
22 points
1 comment · 1 min read · EA link
(openai.com)

Introducing StakeOut.AI

Harry Luk · Feb 17, 2024, 12:21 AM
52 points
6 comments · 9 min read · EA link

My article in The Nation — California’s AI Safety Bill Is a Mask-Off Moment for the Industry

Garrison · Aug 15, 2024, 7:25 PM
134 points
0 comments · 1 min read · EA link
(www.thenation.com)

Proposing the Conditional AI Safety Treaty (linkpost TIME)

Otto · Nov 15, 2024, 1:56 PM
12 points
6 comments · 3 min read · EA link
(time.com)

Demis Hassabis — Google DeepMind: The Podcast

Zach Stein-Perlman · Aug 16, 2024, 12:00 AM
22 points
2 comments · 1 min read · EA link
(www.youtube.com)

Anthropic Announces new S.O.T.A. Claude 3

Joseph Miller · Mar 4, 2024, 7:02 PM
10 points
5 comments · 1 min read · EA link
(twitter.com)

Claude Doesn’t Want to Die

Garrison · Mar 5, 2024, 6:00 AM
22 points
14 comments · 10 min read · EA link
(garrisonlovely.substack.com)

AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory Markets

Center for AI Safety · Mar 7, 2024, 4:37 PM
15 points
2 comments · 8 min read · EA link
(newsletter.safe.ai)

OpenAI o1

Zach Stein-Perlman · Sep 12, 2024, 6:54 PM
38 points
0 comments · 1 min read · EA link

OpenAI: Preparedness framework

Zach Stein-Perlman · Dec 18, 2023, 6:30 PM
24 points
0 comments · 1 min read · EA link
(openai.com)

OpenAI announces new members to board of directors

Will Howard🔹 · Mar 9, 2024, 11:27 AM
47 points
12 comments · 2 min read · EA link
(openai.com)

Among the A.I. Doomsayers—The New Yorker

Agustín Covarrubias 🔸 · Mar 11, 2024, 9:12 PM
66 points
0 comments · 1 min read · EA link
(www.newyorker.com)

Cybersecurity and AI: The Evolving Security Landscape

Center for AI Safety · Mar 14, 2024, 8:14 PM
9 points
0 comments · 12 min read · EA link
(www.safe.ai)

INTERVIEW: Round 2 - StakeOut.AI w/ Dr. Peter Park

Jacob-Haimes · Mar 18, 2024, 9:26 PM
8 points
0 comments · 1 min read · EA link
(into-ai-safety.github.io)

Some thoughts from a University AI Debate

Charlie Harrison · Mar 20, 2024, 5:03 PM
25 points
2 comments · 1 min read · EA link

Podcast: Interview series featuring Dr. Peter Park

Jacob-Haimes · Mar 26, 2024, 12:35 AM
1 point
0 comments · 2 min read · EA link
(into-ai-safety.github.io)

AISN #28: Center for AI Safety 2023 Year in Review

Center for AI Safety · Dec 23, 2023, 9:31 PM
17 points
1 comment · 5 min read · EA link
(newsletter.safe.ai)

AI safety advocates should consider providing gentle pushback following the events at OpenAI

I_machinegun_Kelly · Dec 22, 2023, 9:05 PM
86 points
5 comments · 3 min read · EA link
(www.lesswrong.com)

NYT is suing OpenAI & Microsoft for alleged copyright infringement; some quick thoughts

MikhailSamin · Dec 28, 2023, 6:37 PM
29 points
0 comments · 1 min read · EA link

AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety

Center for AI Safety · Jan 4, 2024, 4:03 PM
5 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

#176 – The final push for AGI, understanding OpenAI’s leadership drama, and red-teaming frontier models (Nathan Labenz on the 80,000 Hours Podcast)

80000_Hours · Jan 4, 2024, 4:00 PM
15 points
0 comments · 22 min read · EA link

U.S. Commerce Secretary Gina Raimondo Announces Expansion of U.S. AI Safety Institute Leadership Team [and Paul Christiano update]

Phib · Apr 16, 2024, 5:10 PM
116 points
8 comments · 1 min read · EA link
(www.commerce.gov)

AI Safety Newsletter #42: Newsom Vetoes SB 1047 Plus, OpenAI’s o1, and AI Governance Summary

Center for AI Safety · Oct 1, 2024, 8:33 PM
10 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

£1 million prize for the most cutting-edge AI solution for public good [link post]

rileyharris · Jan 17, 2024, 2:36 PM
8 points
0 comments · 2 min read · EA link
(manchesterprize.org)

I read every major AI lab’s safety plan so you don’t have to

sarahhw · Dec 16, 2024, 2:12 PM
65 points
2 comments · 11 min read · EA link
(longerramblings.substack.com)

AISN #35: Lobbying on AI Regulation Plus, New Models from OpenAI and Google, and Legal Regimes for Training on Copyrighted Data

Center for AI Safety · May 16, 2024, 2:26 PM
14 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

The Failed Strategy of Artificial Intelligence Doomers

yhoiseth · Feb 5, 2025, 7:34 PM
12 points
2 comments · 1 min read · EA link
(letter.palladiummag.com)

Mitigating extreme AI risks amid rapid progress [Linkpost]

Akash · May 21, 2024, 8:04 PM
36 points
1 comment · 1 min read · EA link

Publication of the International Scientific Report on the Safety of Advanced AI (Interim Report)

James Herbert · May 21, 2024, 9:58 PM
11 points
2 comments · 2 min read · EA link
(www.gov.uk)

Helen Toner (ex-OpenAI board member): “We learned about ChatGPT on Twitter.”

defun 🔸 · May 29, 2024, 7:40 AM
123 points
13 comments · 1 min read · EA link
(x.com)

The U.S. and China Need an AI Incidents Hotline

christian.r · Jun 3, 2024, 6:46 PM
25 points
0 comments · 1 min read · EA link
(www.lawfaremedia.org)

Anthropic rewrote its RSP

Zach Stein-Perlman · Oct 15, 2024, 2:30 PM
32 points
1 comment · 1 min read · EA link

Is principled mass-outreach possible, for AGI X-risk?

Nicholas / Heather Kross · Jan 21, 2024, 5:45 PM
12 points
2 comments · 1 min read · EA link

AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes

Center for AI Safety · Jan 24, 2024, 7:38 PM
7 points
1 comment · 6 min read · EA link
(newsletter.safe.ai)

AI Safety: Why We Need to Keep Our Smart Machines in Check

adityaraj@eanita · Dec 17, 2024, 12:29 PM
1 point
0 comments · 2 min read · EA link
(medium.com)

Executive Director for AIS France—Expression of interest

gergo · Dec 19, 2024, 8:11 AM
33 points
0 comments · 4 min read · EA link

Frontier AI systems have surpassed the self-replicating red line

Greg_Colbourn · Dec 10, 2024, 4:33 PM
25 points
14 comments · 1 min read · EA link
(github.com)

It is time to start war gaming for AGI

yanni kyriacos · Oct 17, 2024, 5:14 AM
14 points
4 comments · 1 min read · EA link

OpenAI defected, but we can take honest actions

Remmelt · Oct 21, 2024, 8:41 AM
19 points
1 comment · 2 min read · EA link

Miles Brundage resigned from OpenAI, and his AGI readiness team was disbanded

Garrison · Oct 23, 2024, 11:42 PM
57 points
4 comments · 7 min read · EA link
(garrisonlovely.substack.com)

Finishing The SB-1047 Documentary In 6 Weeks

Michaël Trazzi · Oct 28, 2024, 8:26 PM
67 points
0 comments · 4 min read · EA link

Why Did Elon Musk Just Offer to Buy Control of OpenAI for $100 Billion?

Garrison · Feb 11, 2025, 12:20 AM
137 points
2 comments · 6 min read · EA link
(garrisonlovely.substack.com)

o3 is not being released to the public. First they are only giving access to external safety testers. You can apply to get early access to do safety testing

Kat Woods · Dec 20, 2024, 6:30 PM
13 points
0 comments · 1 min read · EA link
(openai.com)

We are in a New Paradigm of AI Progress—OpenAI’s o3 model makes huge gains on the toughest AI benchmarks in the world

Garrison · Dec 22, 2024, 9:45 PM
26 points
0 comments · 4 min read · EA link
(garrisonlovely.substack.com)

A better “Statement on AI Risk?” [Crosspost]

Knight Lee · Dec 30, 2024, 7:36 AM
4 points
0 comments · 3 min read · EA link

Chinese Researchers Crack ChatGPT: Replicating OpenAI’s Advanced AI Model

Evan_Gaensbauer · Jan 5, 2025, 3:50 AM
1 point
0 comments · 1 min read · EA link
(www.geeky-gadgets.com)

AI Lab Retaliation: A Survival Guide

Jay Ready · Jan 4, 2025, 11:05 PM
6 points
1 comment · 12 min read · EA link
(morelightinai.substack.com)

Altman on the board, AGI, and superintelligence

OscarD🔸 · Jan 6, 2025, 2:37 PM
20 points
1 comment · 1 min read · EA link
(blog.samaltman.com)

Tarbell Fellowship 2025 - Applications Open (AI Journalism)

Tarbell Center for AI Journalism · Jan 8, 2025, 3:25 PM
62 points
0 comments · 1 min read · EA link

Are AI safetyists crying wolf?

sarahhw · Jan 8, 2025, 8:54 PM
60 points
21 comments · 16 min read · EA link
(longerramblings.substack.com)

Is AI Hitting a Wall or Moving Faster Than Ever?

Garrison · Jan 9, 2025, 10:18 PM
35 points
3 comments · 5 min read · EA link
(garrisonlovely.substack.com)

The Compendium, A full argument about extinction risk from AGI

adamShimi · Oct 31, 2024, 12:02 PM
9 points
1 comment · 2 min read · EA link
(www.thecompendium.ai)

Exploring AI Safety through “Escape Experiment”: A Short Film on Superintelligence Risks

Gaetan_Selle · Nov 10, 2024, 4:42 AM
4 points
0 comments · 2 min read · EA link

The Game Board has been Flipped: Now is a good time to rethink what you’re doing

LintzA · Jan 28, 2025, 9:20 PM
351 points
60 comments · 13 min read · EA link

PSA: Saying “1 in 5” Is Better Than “20%” When Informing about risks publicly

Blanka · Jan 30, 2025, 7:03 PM
17 points
1 comment · 1 min read · EA link

China Hawks are Manufacturing an AI Arms Race

Garrison · Nov 20, 2024, 6:17 PM
95 points
3 comments · 5 min read · EA link
(garrisonlovely.substack.com)

OpenAI’s CBRN tests seem unclear

Luca Righetti 🔸 · Nov 21, 2024, 5:26 PM
82 points
3 comments · 7 min read · EA link

[Question] Seeking Tangible Examples of AI Catastrophes

clifford.banes · Nov 25, 2024, 7:55 AM
9 points
2 comments · 1 min read · EA link

OpenAI’s o1 tried to avoid being shut down, and lied about it, in evals

Greg_Colbourn · Dec 6, 2024, 3:25 PM
23 points
9 comments · 1 min read · EA link
(www.transformernews.ai)

Executive Director for AIS Brussels—Expression of interest

gergo · Dec 19, 2024, 9:15 AM
28 points
0 comments · 4 min read · EA link

Terminology suggestion: standardize terms for probability ranges

Egg Syntax · Aug 30, 2024, 4:05 PM
2 points
0 comments · 1 min read · EA link

AI Safety Newsletter #41: The Next Generation of Compute Scale Plus, Ranking Models by Susceptibility to Jailbreaking, and Machine Ethics

Center for AI Safety · Sep 11, 2024, 7:11 PM
12 points
0 comments · 5 min read · EA link
(newsletter.safe.ai)

Meta: Frontier AI Framework

Zach Stein-Perlman · Feb 3, 2025, 10:00 PM
23 points
0 comments · 1 min read · EA link
(ai.meta.com)

Anthropic is being sued for copying books to train Claude

Remmelt · Aug 31, 2024, 2:57 AM
3 points
0 comments · 1 min read · EA link
(fingfx.thomsonreuters.com)

Unions for AI safety?

dEAsign · Sep 24, 2023, 12:13 AM
7 points
12 comments · 2 min read · EA link

[Congressional Hearing] Oversight of A.I.: Legislating on Artificial Intelligence

Tristan Williams · Nov 1, 2023, 6:15 PM
5 points
1 comment · 7 min read · EA link
(www.judiciary.senate.gov)

Amazon to invest up to $4 billion in Anthropic

Davis_Kingsley · Sep 25, 2023, 2:55 PM
38 points
34 comments · 1 min read · EA link
(twitter.com)

Announcing #AISummitTalks featuring Professor Stuart Russell and many others

Otto · Oct 24, 2023, 10:16 AM
9 points
1 comment · 1 min read · EA link

Go Mobilize? Lessons from GM Protests for Pausing AI

Charlie Harrison · Oct 24, 2023, 3:01 PM
48 points
11 comments · 31 min read · EA link

The Dissolution of AI Safety

Roko · Dec 12, 2024, 10:46 AM
−7 points
0 comments · 1 min read · EA link
(www.transhumanaxiology.com)

[Linkpost] NY Times Feature on Anthropic

Garrison · Jul 12, 2023, 7:30 PM
34 points
3 comments · 5 min read · EA link
(www.nytimes.com)

Sam Altman fired from OpenAI

Larks · Nov 17, 2023, 9:07 PM
133 points
90 comments · 1 min read · EA link
(openai.com)

Thoughts on yesterday’s UN Security Council meeting on AI

Greg_Colbourn · Jul 19, 2023, 4:46 PM
31 points
2 comments · 1 min read · EA link

AI Impacts Quarterly Newsletter, Apr-Jun 2023

Harlan · Jul 18, 2023, 6:01 PM
4 points
0 comments · 3 min read · EA link
(blog.aiimpacts.org)

AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer

Center for AI Safety · Jul 25, 2023, 4:45 PM
7 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

[Crosspost] An AI Pause Is Humanity’s Best Bet For Preventing Extinction (TIME)

Otto · Jul 24, 2023, 10:18 AM
36 points
3 comments · 7 min read · EA link
(time.com)

Linkpost: 7 A.I. Companies Agree to Safeguards After Pressure From the White House

MHR🔸 · Jul 21, 2023, 1:23 PM
61 points
4 comments · 1 min read · EA link
(www.nytimes.com)

[link post] AI Should Be Terrified of Humans

BrianK · Jul 24, 2023, 11:13 AM
28 points
0 comments · 1 min read · EA link
(time.com)

[Linkpost] Eric Schwitzgebel: AI systems must not confuse users about their sentience or moral status

🔸Zachary Brown · Aug 18, 2023, 5:21 PM
6 points
0 comments · 2 min read · EA link
(www.sciencedirect.com)

AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight

Center for AI Safety · Aug 1, 2023, 3:24 PM
15 points
0 comments · 8 min read · EA link

Eliciting responses to Marc Andreessen’s “Why AI Will Save the World”

Coleman · Jul 17, 2023, 7:58 PM
2 points
2 comments · 1 min read · EA link
(a16z.com)

Frontier Model Forum

Zach Stein-Perlman · Jul 26, 2023, 2:30 PM
40 points
7 comments · 1 min read · EA link
(blog.google)

Asterisk Magazine Issue 03: AI

Alejandro Ortega · Jul 24, 2023, 3:53 PM
34 points
3 comments · 1 min read · EA link
(asteriskmag.com)

AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga, And A New AI Bill From Senators Thune And Klobuchar

Center for AI Safety · Dec 7, 2023, 3:57 PM
10 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

Gavin Newsom vetoes SB 1047

Larks · Sep 30, 2024, 12:06 AM
39 points
14 comments · 1 min read · EA link
(www.wsj.com)

The costs of caution

Kelsey Piper · May 1, 2023, 8:04 PM
112 points
17 comments · 4 min read · EA link

AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks

Center for AI Safety · May 2, 2023, 4:51 PM
35 points
2 comments · 5 min read · EA link
(newsletter.safe.ai)

AI Safety Newsletter #1 [CAIS Linkpost]

Akash · Apr 10, 2023, 8:18 PM
38 points
0 comments · 1 min read · EA link

AI Safety Newsletter #2: ChaosGPT, Natural Selection, and AI Safety in the Media

Oliver Z · Apr 18, 2023, 6:36 PM
56 points
1 comment · 4 min read · EA link
(newsletter.safe.ai)

My choice of AI misalignment introduction for a general audience

Bill · May 3, 2023, 12:15 AM
7 points
2 comments · 1 min read · EA link
(youtu.be)

AI X-risk in the News: How Effective are Recent Media Items and How is Awareness Changing? Our New Survey Results.

Otto · May 4, 2023, 2:04 PM
49 points
1 comment · 9 min read · EA link

[Link Post: New York Times] White House Unveils Initiatives to Reduce Risks of A.I.

Rockwell · May 4, 2023, 2:04 PM
50 points
1 comment · 2 min read · EA link

An Update On The Campaign For AI Safety Dot Org

yanni kyriacos · May 5, 2023, 12:19 AM
26 points
4 comments · 1 min read · EA link

AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models

Center for AI Safety · May 9, 2023, 3:26 PM
60 points
0 comments · 4 min read · EA link
(newsletter.safe.ai)

AI-Risk in the State of the European Union Address

Sam Bogerd · Sep 13, 2023, 1:27 PM
25 points
0 comments · 3 min read · EA link
(state-of-the-union.ec.europa.eu)

The International PauseAI Protest: Activism under uncertainty

Joseph Miller · Oct 12, 2023, 5:36 PM
129 points
3 comments · 4 min read · EA link

AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control

Center for AI Safety · May 16, 2023, 3:14 PM
32 points
1 comment · 6 min read · EA link
(newsletter.safe.ai)

Efficacy of AI Activism: Have We Ever Said No?

Charlie Harrison · Oct 27, 2023, 4:52 PM
78 points
25 comments · 20 min read · EA link

Sam Altman / Open AI Discussion Thread

John Salter · Nov 20, 2023, 9:21 AM
40 points
36 comments · 1 min read · EA link

Ilya: The AI scientist shaping the world

David Varga · Nov 20, 2023, 12:43 PM
6 points
1 comment · 4 min read · EA link

Former Israeli Prime Minister Speaks About AI X-Risk

Yonatan Cale · May 20, 2023, 12:09 PM
73 points
6 comments · 1 min read · EA link

Possible OpenAI’s Q* breakthrough and DeepMind’s AlphaGo-type systems plus LLMs

Burnydelic · Nov 23, 2023, 7:02 AM
13 points
4 comments · 2 min read · EA link

[Linkpost] “Governance of superintelligence” by OpenAI

Daniel_Eth · May 22, 2023, 8:15 PM
51 points
6 comments · 2 min read · EA link
(openai.com)

OpenAI board received letter warning of powerful AI

JordanStone · Nov 23, 2023, 12:16 AM
26 points
2 comments · 1 min read · EA link
(www.reuters.com)

[Question] Would an Anthropic/OpenAI merger be good for AI safety?

M · Nov 22, 2023, 8:21 PM
6 points
1 comment · 1 min read · EA link

Rishi Sunak mentions “existential threats” in talk with OpenAI, DeepMind, Anthropic CEOs

Arjun Panickssery · May 24, 2023, 9:06 PM
44 points
2 comments · 1 min read · EA link

Tim Cook was asked about extinction risks from AI

Saul Munn · Jun 6, 2023, 6:46 PM
8 points
1 comment · 1 min read · EA link

Could AI accelerate economic growth?

Tom_Davidson · Jun 7, 2023, 7:07 PM
28 points
0 comments · 6 min read · EA link

On DeepMind and Trying to Fairly Hear Out Both AI Doomers and Doubters (Rohin Shah on The 80,000 Hours Podcast)

80000_Hours · Jun 12, 2023, 12:53 PM
28 points
1 comment · 15 min read · EA link

UK government to host first global summit on AI Safety

DavidNash · Jun 8, 2023, 1:24 PM
78 points
1 comment · 5 min read · EA link
(www.gov.uk)

Linkpost: Dwarkesh Patel interviewing Carl Shulman

Stefan_Schubert · Jun 14, 2023, 3:30 PM
110 points
5 comments · 1 min read · EA link
(podcastaddict.com)

Google DeepMind releases Gemini

Yarrow · Dec 6, 2023, 5:39 PM
21 points
7 comments · 1 min read · EA link
(deepmind.google)

Communication by existential risk organizations: State of the field and suggestions for improvement

Existential Risk Communication Project · Aug 13, 2024, 7:06 AM
10 points
3 comments · 13 min read · EA link

The UK AI Safety Summit tomorrow

SebastianSchmidt · Oct 31, 2023, 7:09 PM
17 points
2 comments · 2 min read · EA link