Archive
About
Search
Log In
Home
All
Wiki
Shortform
Recent
Comments
RSS
Teun van der Weij
Karma:
189
All
Posts
Comments
New
Top
Old
[Paper] AI Sandbagging: Language Models can Strategically Underperform on Evaluations
Teun van der Weij
13 Jun 2024 10:04 UTC
22
points
2
comments
1
min read
EA
link
(arxiv.org)
List of projects that seem impactful for AI Governance
JaimeRV
14 Jan 2024 16:52 UTC
35
points
2
comments
13
min read
EA
link
Beyond Humans: Why All Sentient Beings Matter in Existential Risk
Teun van der Weij
31 May 2023 21:21 UTC
12
points
0
comments
13
min read
EA
link
Announcing the European Network for AI Safety (ENAIS)
Esben Kran
22 Mar 2023 17:57 UTC
124
points
3
comments
3
min read
EA
link
Teun_Van_Der_Weij’s Quick takes
Teun van der Weij
23 Jun 2022 1:52 UTC
1
point
1
comment
1
min read
EA
link
Back to top