aogara

Karma: 3,158

Research Engineering Intern at the Center for AI Safety. Helping to write the AI Safety Newsletter. Studying CS and Economics at the University of Southern California, and running an AI safety club there.

AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary Commitments to UK AI Safety Institute, and New AI Policy Proposals in the US Senate

Center for AI Safety · 2 May 2024 16:12 UTC
21 points
5 comments · 8 min read · EA link
(newsletter.safe.ai)

AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI

Center for AI Safety · 12 Apr 2024 16:11 UTC
19 points
0 comments · 9 min read · EA link
(newsletter.safe.ai)

AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory Markets

Center for AI Safety · 7 Mar 2024 16:37 UTC
15 points
2 comments · 8 min read · EA link
(newsletter.safe.ai)

AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance and The EU AI Office

Center for AI Safety · 21 Feb 2024 21:55 UTC
27 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes

Center for AI Safety · 24 Jan 2024 19:38 UTC
7 points
1 comment · 6 min read · EA link
(newsletter.safe.ai)

AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety

Center for AI Safety · 4 Jan 2024 16:03 UTC
5 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

AISN #28: Center for AI Safety 2023 Year in Review

Center for AI Safety · 23 Dec 2023 21:31 UTC
17 points
1 comment · 5 min read · EA link
(newsletter.safe.ai)

AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga, And A New AI Bill From Senators Thune And Klobuchar

Center for AI Safety · 7 Dec 2023 15:57 UTC
10 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

AISN #26: National Institutions for AI Safety, Results From the UK Summit, and New Releases From OpenAI and xAI

Center for AI Safety · 15 Nov 2023 16:03 UTC
11 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks

Center for AI Safety · 31 Oct 2023 19:24 UTC
21 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AI

Center for AI Safety · 18 Oct 2023 17:03 UTC
16 points
1 comment · 6 min read · EA link
(newsletter.safe.ai)

AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering

Center for AI Safety · 4 Oct 2023 17:10 UTC
7 points
0 comments · 5 min read · EA link
(newsletter.safe.ai)

AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and Laws

Center for AI Safety · 19 Sep 2023 14:43 UTC
15 points
1 comment · 5 min read · EA link
(newsletter.safe.ai)

MLSN: #10 Adversarial Attacks Against Language and Vision Models, Improving LLM Honesty, and Tracing the Influence of LLM Training Data

Center for AI Safety · 13 Sep 2023 18:02 UTC
7 points
0 comments · 5 min read · EA link
(newsletter.mlsafety.org)

AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy

Center for AI Safety · 5 Sep 2023 14:59 UTC
13 points
0 comments · 5 min read · EA link
(newsletter.safe.ai)

AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities

Center for AI Safety · 29 Aug 2023 15:03 UTC
12 points
0 comments · 8 min read · EA link
(newsletter.safe.ai)

AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety

Center for AI Safety · 8 Aug 2023 15:52 UTC
12 points
0 comments · 5 min read · EA link
(newsletter.safe.ai)

AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight

Center for AI Safety · 1 Aug 2023 15:24 UTC
15 points
0 comments · 8 min read · EA link

AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer

Center for AI Safety · 25 Jul 2023 16:45 UTC
7 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave

Center for AI Safety · 5 Jul 2023 15:33 UTC
25 points
0 comments · 9 min read · EA link
(newsletter.safe.ai)