Archive
About
Search
Log In
Home
All
Wiki
Shortform
Recent
Comments
RSS
Teun van der Weij
Karma:
196
All
Posts
Comments
New
Top
Old
How to mitigate sandbagging
Teun van der Weij
23 Mar 2025 17:19 UTC
3
points
0
comments
8
min read
EA
link
The Elicitation Game: Evaluating capability elicitation techniques
Teun van der Weij
27 Feb 2025 20:33 UTC
3
points
0
comments
2
min read
EA
link
[Paper] AI Sandbagging: Language Models can Strategically Underperform on Evaluations
Teun van der Weij
13 Jun 2024 10:04 UTC
24
points
2
comments
2
min read
EA
link
(arxiv.org)
List of projects that seem impactful for AI Governance
JaimeRV
14 Jan 2024 16:52 UTC
40
points
2
comments
13
min read
EA
link
Beyond Humans: Why All Sentient Beings Matter in Existential Risk
Teun van der Weij
31 May 2023 21:21 UTC
12
points
0
comments
13
min read
EA
link
Announcing the European Network for AI Safety (ENAIS)
Esben Kran
22 Mar 2023 17:57 UTC
124
points
3
comments
3
min read
EA
link
Teun_Van_Der_Weij’s Quick takes
Teun van der Weij
23 Jun 2022 1:52 UTC
1
point
1
comment
EA
link
Back to top