Archive
About
Search
Log In
Home
All
Wiki
Shortform
Recent
Comments
RSS
Center for AI Safety
Karma:
1,008
All
Posts
Comments
New
Top
Old
AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks
Center for AI Safety
2 May 2023 16:51 UTC
35
points
2
comments
5
min read
EA
link
(newsletter.safe.ai)
AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models
Center for AI Safety
9 May 2023 15:26 UTC
60
points
0
comments
4
min read
EA
link
(newsletter.safe.ai)
AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control
Center for AI Safety
16 May 2023 15:14 UTC
32
points
1
comment
6
min read
EA
link
(newsletter.safe.ai)
AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AI
Center for AI Safety
23 May 2023 21:42 UTC
23
points
0
comments
6
min read
EA
link
(newsletter.safe.ai)
Statement on AI Extinction—Signed by AGI Labs, Top Academics, and Many Other Notable Figures
Center for AI Safety
30 May 2023 9:06 UTC
427
points
28
comments
1
min read
EA
link
(www.safe.ai)
AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI
Center for AI Safety
30 May 2023 11:44 UTC
16
points
3
comments
6
min read
EA
link
(newsletter.safe.ai)
AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level?
Center for AI Safety
6 Jun 2023 15:56 UTC
12
points
2
comments
7
min read
EA
link
(newsletter.safe.ai)
AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence
Center for AI Safety
27 Jun 2023 15:25 UTC
30
points
3
comments
7
min read
EA
link
(newsletter.safe.ai)
Catastrophic Risks from AI #5: Rogue AIs
Center for AI Safety
27 Jun 2023 22:06 UTC
16
points
1
comment
1
min read
EA
link
Catastrophic Risks from AI #6: Discussion and FAQ
Center for AI Safety
27 Jun 2023 23:23 UTC
10
points
0
comments
1
min read
EA
link
AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave
Center for AI Safety
5 Jul 2023 15:33 UTC
25
points
0
comments
9
min read
EA
link
(newsletter.safe.ai)
Modeling the impact of AI safety field-building programs
Center for AI Safety
10 Jul 2023 17:22 UTC
81
points
0
comments
7
min read
EA
link
Cost-effectiveness of student programs for AI safety research
Center for AI Safety
10 Jul 2023 17:23 UTC
53
points
6
comments
15
min read
EA
link
Cost-effectiveness of professional field-building programs for AI safety research
Center for AI Safety
10 Jul 2023 17:26 UTC
36
points
2
comments
18
min read
EA
link
AISN#14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use
Center for AI Safety
12 Jul 2023 16:58 UTC
26
points
0
comments
4
min read
EA
link
(newsletter.safe.ai)
AISN#15: China and the US take action to regulate AI, results from a tournament forecasting AI risk, updates on xAI’s plan, and Meta releases its open-source and commercially available Llama 2
Center for AI Safety
19 Jul 2023 1:40 UTC
5
points
0
comments
6
min read
EA
link
(newsletter.safe.ai)
AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer
Center for AI Safety
25 Jul 2023 16:45 UTC
7
points
0
comments
6
min read
EA
link
(newsletter.safe.ai)
AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight
Center for AI Safety
1 Aug 2023 15:24 UTC
15
points
0
comments
8
min read
EA
link
AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety
Center for AI Safety
8 Aug 2023 15:52 UTC
12
points
0
comments
5
min read
EA
link
(newsletter.safe.ai)
An Overview of Catastrophic AI Risks
Center for AI Safety
15 Aug 2023 21:52 UTC
37
points
1
comment
13
min read
EA
link
(www.safe.ai)
Back to top