AI Safety Newsletter
Center for AI Safety · May 16, 2023, 3:22 PM

- AI Safety Newsletter #1 [CAIS Linkpost] (Akash, Apr 10, 2023, 8:18 PM) · 38 points · 0 comments · 1 min read · EA link
- AI Safety Newsletter #2: ChaosGPT, Natural Selection, and AI Safety in the Media (Oliver Z, Apr 18, 2023, 6:36 PM) · 56 points · 1 comment · 4 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #3: AI policy proposals and a new challenger approaches (Oliver Z, Apr 25, 2023, 4:15 PM) · 35 points · 1 comment · 4 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks (Center for AI Safety, May 2, 2023, 4:51 PM) · 35 points · 2 comments · 5 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models (Center for AI Safety, May 9, 2023, 3:26 PM) · 60 points · 0 comments · 4 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control (Center for AI Safety, May 16, 2023, 3:14 PM) · 32 points · 1 comment · 6 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AI (Center for AI Safety, May 23, 2023, 9:42 PM) · 23 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI (Center for AI Safety, May 30, 2023, 11:44 AM) · 16 points · 3 comments · 6 min read · EA link (newsletter.safe.ai)
- AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level? (Center for AI Safety, Jun 6, 2023, 3:56 PM) · 12 points · 2 comments · 7 min read · EA link (newsletter.safe.ai)
- AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence (Center for AI Safety, Jun 27, 2023, 3:25 PM) · 30 points · 3 comments · 7 min read · EA link (newsletter.safe.ai)
- AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave (Center for AI Safety, Jul 5, 2023, 3:33 PM) · 25 points · 0 comments · 9 min read · EA link (newsletter.safe.ai)
- AISN #14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use (Center for AI Safety, Jul 12, 2023, 4:58 PM) · 26 points · 0 comments · 4 min read · EA link (newsletter.safe.ai)
- AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer (Center for AI Safety, Jul 25, 2023, 4:45 PM) · 7 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight (Center for AI Safety, Aug 1, 2023, 3:24 PM) · 15 points · 0 comments · 8 min read · EA link
- AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety (Center for AI Safety, Aug 8, 2023, 3:52 PM) · 12 points · 0 comments · 5 min read · EA link (newsletter.safe.ai)
- AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities (Center for AI Safety, Aug 29, 2023, 3:03 PM) · 12 points · 0 comments · 8 min read · EA link (newsletter.safe.ai)
- AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy (Center for AI Safety, Sep 5, 2023, 2:59 PM) · 13 points · 0 comments · 5 min read · EA link (newsletter.safe.ai)
- AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and Laws (Center for AI Safety, Sep 19, 2023, 2:43 PM) · 15 points · 1 comment · 5 min read · EA link (newsletter.safe.ai)
- AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering (Center for AI Safety, Oct 4, 2023, 5:10 PM) · 7 points · 0 comments · 5 min read · EA link (newsletter.safe.ai)
- AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AI (Center for AI Safety, Oct 18, 2023, 5:03 PM) · 16 points · 1 comment · 6 min read · EA link (newsletter.safe.ai)
- AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks (Center for AI Safety, Oct 31, 2023, 7:24 PM) · 21 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AISN #26: National Institutions for AI Safety, Results From the UK Summit, and New Releases From OpenAI and xAI (Center for AI Safety, Nov 15, 2023, 4:03 PM) · 11 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga, And A New AI Bill From Senators Thune And Klobuchar (Center for AI Safety, Dec 7, 2023, 3:57 PM) · 10 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AISN #28: Center for AI Safety 2023 Year in Review (Center for AI Safety, Dec 23, 2023, 9:31 PM) · 17 points · 1 comment · 5 min read · EA link (newsletter.safe.ai)
- AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety (Center for AI Safety, Jan 4, 2024, 4:03 PM) · 5 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes (Center for AI Safety, Jan 24, 2024, 7:38 PM) · 7 points · 1 comment · 6 min read · EA link (newsletter.safe.ai)
- AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance and The EU AI Office (Center for AI Safety, Feb 21, 2024, 9:55 PM) · 27 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory Markets (Center for AI Safety, Mar 7, 2024, 4:37 PM) · 15 points · 2 comments · 8 min read · EA link (newsletter.safe.ai)
- AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI (Center for AI Safety, Apr 12, 2024, 4:11 PM) · 19 points · 0 comments · 9 min read · EA link (newsletter.safe.ai)
- AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary Commitments to UK AI Safety Institute, and New AI Policy Proposals in the US Senate (Center for AI Safety, May 2, 2024, 4:12 PM) · 21 points · 5 comments · 8 min read · EA link (newsletter.safe.ai)
- AISN #35: Lobbying on AI Regulation Plus, New Models from OpenAI and Google, and Legal Regimes for Training on Copyrighted Data (Center for AI Safety, May 16, 2024, 2:26 PM) · 14 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AISN #36: Voluntary Commitments are Insufficient Plus, a Senate AI Policy Roadmap, and Chapter 1: An Overview of Catastrophic Risks (Center for AI Safety, May 30, 2024, 6:23 PM) · 6 points · 0 comments · 5 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #37: US Launches Antitrust Investigations Plus, recent criticisms of OpenAI and Anthropic, and a summary of Situational Awareness (Center for AI Safety, Jun 18, 2024, 6:08 PM) · 15 points · 0 comments · 5 min read · EA link (newsletter.safe.ai)
- AISN #38: Supreme Court Decision Could Limit Federal Ability to Regulate AI Plus, “Circuit Breakers” for AI systems, and updates on China’s AI industry (Center for AI Safety, Jul 9, 2024, 7:29 PM) · 8 points · 0 comments · 5 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #39: Implications of a Trump Administration for AI Policy Plus, Safety Engineering (Center for AI Safety, Jul 29, 2024, 5:48 PM) · 6 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip Production, and Do AI Safety Benchmarks Actually Measure Safety? (Center for AI Safety, Aug 21, 2024, 6:10 PM) · 17 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #41: The Next Generation of Compute Scale Plus, Ranking Models by Susceptibility to Jailbreaking, and Machine Ethics (Center for AI Safety, Sep 11, 2024, 7:11 PM) · 12 points · 0 comments · 5 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #42: Newsom Vetoes SB 1047 Plus, OpenAI’s o1, and AI Governance Summary (Center for AI Safety, Oct 1, 2024, 8:33 PM) · 10 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AI Safety Newsletter #43: White House Issues First National Security Memo on AI Plus, AI and Job Displacement, and AI Takes Over the Nobels (Center for AI Safety, Oct 28, 2024, 4:02 PM) · 6 points · 0 comments · 6 min read · EA link (newsletter.safe.ai)
- AISN #44: The Trump Circle on AI Safety Plus, Chinese researchers used Llama to create a military tool for the PLA, a Google AI system discovered a zero-day cybersecurity vulnerability, and Complex Systems (Center for AI Safety, Nov 19, 2024, 4:36 PM) · 11 points · 0 comments · 5 min read · EA link (newsletter.safe.ai)