Zach Stein-Perlman

Karma: 5,371

AI strategy & governance. ailabwatch.org.

Meta: Frontier AI Framework

Zach Stein-Perlman, Feb 3, 2025, 10:00 PM
23 points
0 comments, 1 min read
(ai.meta.com)

o3

Zach Stein-Perlman, Dec 20, 2024, 9:00 PM
84 points
5 comments, 1 min read

The current state of RSPs

Zach Stein-Perlman, Nov 4, 2024, 4:00 PM
19 points
1 comment, 1 min read

IAPS: Mapping Technical Safety Research at AI Companies

Zach Stein-Perlman, Oct 24, 2024, 8:30 PM
24 points
0 comments, 1 min read
(www.iaps.ai)

What AI companies should do: Some rough ideas

Zach Stein-Perlman, Oct 21, 2024, 2:00 PM
14 points
1 comment, 1 min read

Anthropic rewrote its RSP

Zach Stein-Perlman, Oct 15, 2024, 2:30 PM
32 points
1 comment, 1 min read

Model evals for dangerous capabilities

Zach Stein-Perlman, Sep 23, 2024, 11:00 AM
19 points
0 comments, 1 min read

OpenAI o1

Zach Stein-Perlman, Sep 12, 2024, 6:54 PM
38 points
0 comments, 1 min read

Demis Hassabis — Google DeepMind: The Podcast

Zach Stein-Perlman, Aug 16, 2024, 12:00 AM
22 points
2 comments, 1 min read
(www.youtube.com)

Claude 3.5 Sonnet

Zach Stein-Perlman, Jun 20, 2024, 6:00 PM
31 points
0 comments, 1 min read
(www.anthropic.com)

AI companies’ commitments

Zach Stein-Perlman, May 31, 2024, 12:00 AM
9 points
0 comments, 1 min read

Maybe Anthropic’s Long-Term Benefit Trust is powerless

Zach Stein-Perlman, May 27, 2024, 1:00 PM
134 points
21 comments, 1 min read

AI companies aren’t really using external evaluators

Zach Stein-Perlman, May 26, 2024, 7:05 PM
88 points
4 comments, 1 min read

New voluntary commitments (AI Seoul Summit)

Zach Stein-Perlman, May 21, 2024, 11:00 AM
12 points
1 comment, 1 min read
(www.gov.uk)

DeepMind’s “Frontier Safety Framework” is weak and unambitious

Zach Stein-Perlman, May 18, 2024, 3:00 AM
54 points
1 comment, 1 min read

DeepMind: Frontier Safety Framework

Zach Stein-Perlman, May 17, 2024, 5:30 PM
23 points
0 comments, 1 min read
(deepmind.google)

Introducing AI Lab Watch

Zach Stein-Perlman, Apr 30, 2024, 5:00 PM
128 points
23 comments, 1 min read
(ailabwatch.org)

Staged release

Zach Stein-Perlman, Apr 20, 2024, 1:00 AM
16 points
0 comments, 1 min read

DeepMind: Evaluating Frontier Models for Dangerous Capabilities

Zach Stein-Perlman, Mar 21, 2024, 11:00 PM
28 points
0 comments, 1 min read
(arxiv.org)

OpenAI: Preparedness framework

Zach Stein-Perlman, Dec 18, 2023, 6:30 PM
24 points
0 comments, 1 min read
(openai.com)