RSS

Teun van der Weij

Karma: 189

[Paper] AI Sand­bag­ging: Lan­guage Models can Strate­gi­cally Un­der­perform on Evaluations

Teun van der WeijJun 13, 2024, 10:04 AM
22 points
2 comments1 min readEA link
(arxiv.org)

List of pro­jects that seem im­pact­ful for AI Governance

JaimeRVJan 14, 2024, 4:52 PM
35 points
2 comments13 min readEA link

Beyond Hu­mans: Why All Sen­tient Be­ings Mat­ter in Ex­is­ten­tial Risk

Teun van der WeijMay 31, 2023, 9:21 PM
12 points
0 comments13 min readEA link

An­nounc­ing the Euro­pean Net­work for AI Safety (ENAIS)

Esben KranMar 22, 2023, 5:57 PM
124 points
3 comments3 min readEA link

Teun_Van_Der_Weij’s Quick takes

Teun van der WeijJun 23, 2022, 1:52 AM
1 point
1 comment1 min readEA link