Dan H

Karma: 1,214

https://danhendrycks.com

Dan H May 3, 2024, 5:16 AM
4 points
0 ∶ 1
in reply to: Zach Stein-Perlman’s comment on: Introducing AI Lab Watch
I mean Google does basic things like use Yubikeys where other places don’t even reliably do that. Unclear what a good checklist would look like, but maybe one could be created.

AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary Commitments to UK AI Safety Institute, and New AI Policy Proposals in the US Senate

Center for AI SafetyMay 2, 2024, 4:12 PM

21 points

5 comments8 min readEA link

(newsletter.safe.ai)

Dan H May 2, 2024, 6:45 AM
13 points
5 ∶ 0
on: Introducing AI Lab Watch
To my understanding, Google has better infosec than OpenAI and Anthropic. They have much more experience protecting assets.

AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI

Center for AI SafetyApr 12, 2024, 4:11 PM

19 points

0 comments9 min readEA link

(newsletter.safe.ai)

AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory Markets

Center for AI SafetyMar 7, 2024, 4:37 PM

15 points

2 comments8 min readEA link

(newsletter.safe.ai)

AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance and The EU AI Office

Center for AI SafetyFeb 21, 2024, 9:55 PM

27 points

0 comments6 min readEA link

(newsletter.safe.ai)

AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes

Center for AI SafetyJan 24, 2024, 7:38 PM

7 points

1 comment6 min readEA link

(newsletter.safe.ai)

AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety

Center for AI SafetyJan 4, 2024, 4:03 PM

5 points

0 comments6 min readEA link

(newsletter.safe.ai)

AISN #28: Center for AI Safety 2023 Year in Review

Center for AI SafetyDec 23, 2023, 9:31 PM

17 points

1 comment5 min readEA link

(newsletter.safe.ai)

AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga, And A New AI Bill From Senators Thune And Klobuchar

Center for AI SafetyDec 7, 2023, 3:57 PM

10 points

0 comments6 min readEA link

(newsletter.safe.ai)

AISN #26: National Institutions for AI Safety, Results From the UK Summit, and New Releases From OpenAI and xAI

Center for AI SafetyNov 15, 2023, 4:03 PM

11 points

0 comments6 min readEA link

(newsletter.safe.ai)

AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks

Center for AI SafetyOct 31, 2023, 7:24 PM

21 points

0 comments6 min readEA link

(newsletter.safe.ai)

AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AI

Center for AI SafetyOct 18, 2023, 5:03 PM

16 points

1 comment6 min readEA link

(newsletter.safe.ai)

AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering

Center for AI SafetyOct 4, 2023, 5:10 PM

7 points

0 comments5 min readEA link

(newsletter.safe.ai)

AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and Laws

Center for AI SafetySep 19, 2023, 2:43 PM

15 points

1 comment5 min readEA link

(newsletter.safe.ai)

MLSN: #10 Adversarial Attacks Against Language and Vision Models, Improving LLM Honesty, and Tracing the Influence of LLM Training Data

Center for AI SafetySep 13, 2023, 6:02 PM

7 points

0 comments5 min readEA link

(newsletter.mlsafety.org)

AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy

Center for AI SafetySep 5, 2023, 2:59 PM

13 points

0 comments5 min readEA link

(newsletter.safe.ai)

AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities

Center for AI SafetyAug 29, 2023, 3:03 PM

12 points

0 comments8 min readEA link

(newsletter.safe.ai)

An Overview of Catastrophic AI Risks

Center for AI SafetyAug 15, 2023, 9:52 PM

37 points

1 comment13 min readEA link

(www.safe.ai)

AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety

Center for AI SafetyAug 8, 2023, 3:52 PM

12 points

0 comments5 min readEA link

(newsletter.safe.ai)