RSS

Dan H

Karma: 1,214

https://​​danhendrycks.com

AISN #34: New Mili­tary AI Sys­tems Plus, AI Labs Fail to Uphold Vol­un­tary Com­mit­ments to UK AI Safety In­sti­tute, and New AI Policy Pro­pos­als in the US Senate

Center for AI SafetyMay 2, 2024, 4:12 PM
21 points
5 comments8 min readEA link
(newsletter.safe.ai)

AISN #33: Re­assess­ing AI and Biorisk Plus, Con­soli­da­tion in the Cor­po­rate AI Land­scape, and Na­tional In­vest­ments in AI

Center for AI SafetyApr 12, 2024, 4:11 PM
19 points
0 comments9 min readEA link
(newsletter.safe.ai)

AISN #32: Mea­sur­ing and Re­duc­ing Hazardous Knowl­edge in LLMs Plus, Fore­cast­ing the Fu­ture with LLMs, and Reg­u­la­tory Markets

Center for AI SafetyMar 7, 2024, 4:37 PM
15 points
2 comments8 min readEA link
(newsletter.safe.ai)

AISN #31: A New AI Policy Bill in Cal­ifor­nia Plus, Prece­dents for AI Gover­nance and The EU AI Office

Center for AI SafetyFeb 21, 2024, 9:55 PM
27 points
0 comments6 min readEA link
(newsletter.safe.ai)

AISN #30: In­vest­ments in Com­pute and Mili­tary AI Plus, Ja­pan and Sin­ga­pore’s Na­tional AI Safety Institutes

Center for AI SafetyJan 24, 2024, 7:38 PM
7 points
1 comment6 min readEA link
(newsletter.safe.ai)

AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copy­right In­fringe­ment, and Con­gres­sional Ques­tions about Re­search Stan­dards in AI Safety

Center for AI SafetyJan 4, 2024, 4:03 PM
5 points
0 comments6 min readEA link
(newsletter.safe.ai)

AISN #28: Cen­ter for AI Safety 2023 Year in Review

Center for AI SafetyDec 23, 2023, 9:31 PM
17 points
1 comment5 min readEA link
(newsletter.safe.ai)

AISN #27: Defen­sive Ac­cel­er­a­tionism, A Ret­ro­spec­tive On The OpenAI Board Saga, And A New AI Bill From Se­na­tors Thune And Klobuchar

Center for AI SafetyDec 7, 2023, 3:57 PM
10 points
0 comments6 min readEA link
(newsletter.safe.ai)

AISN #26: Na­tional In­sti­tu­tions for AI Safety, Re­sults From the UK Sum­mit, and New Re­leases From OpenAI and xAI

Center for AI SafetyNov 15, 2023, 4:03 PM
11 points
0 comments6 min readEA link
(newsletter.safe.ai)

AISN #25: White House Ex­ec­u­tive Order on AI, UK AI Safety Sum­mit, and Progress on Vol­un­tary Eval­u­a­tions of AI Risks

Center for AI SafetyOct 31, 2023, 7:24 PM
21 points
0 comments6 min readEA link
(newsletter.safe.ai)

AISN #24: Kiss­inger Urges US-China Co­op­er­a­tion on AI, China’s New AI Law, US Ex­port Con­trols, In­ter­na­tional In­sti­tu­tions, and Open Source AI

Center for AI SafetyOct 18, 2023, 5:03 PM
16 points
1 comment6 min readEA link
(newsletter.safe.ai)

AISN #23: New OpenAI Models, News from An­thropic, and Rep­re­sen­ta­tion Engineering

Center for AI SafetyOct 4, 2023, 5:10 PM
7 points
0 comments5 min readEA link
(newsletter.safe.ai)

AISN #22: The Land­scape of US AI Leg­is­la­tion - Hear­ings, Frame­works, Bills, and Laws

Center for AI SafetySep 19, 2023, 2:43 PM
15 points
1 comment5 min readEA link
(newsletter.safe.ai)

MLSN: #10 Ad­ver­sar­ial At­tacks Against Lan­guage and Vi­sion Models, Im­prov­ing LLM Hon­esty, and Trac­ing the In­fluence of LLM Train­ing Data

Center for AI SafetySep 13, 2023, 6:02 PM
7 points
0 comments5 min readEA link
(newsletter.mlsafety.org)

AISN #21: Google Deep­Mind’s GPT-4 Com­peti­tor, Mili­tary In­vest­ments in Au­tonomous Drones, The UK AI Safety Sum­mit, and Case Stud­ies in AI Policy

Center for AI SafetySep 5, 2023, 2:59 PM
13 points
0 comments5 min readEA link
(newsletter.safe.ai)

AISN #20: LLM Pro­lifer­a­tion, AI De­cep­tion, and Con­tin­u­ing Drivers of AI Capabilities

Center for AI SafetyAug 29, 2023, 3:03 PM
12 points
0 comments8 min readEA link
(newsletter.safe.ai)

An Overview of Catas­trophic AI Risks

Center for AI SafetyAug 15, 2023, 9:52 PM
37 points
1 comment13 min readEA link
(www.safe.ai)

AISN #18: Challenges of Re­in­force­ment Learn­ing from Hu­man Feed­back, Microsoft’s Se­cu­rity Breach, and Con­cep­tual Re­search on AI Safety

Center for AI SafetyAug 8, 2023, 3:52 PM
12 points
0 comments5 min readEA link
(newsletter.safe.ai)