Teun van der Weij comments on
[Paper] AI Sandbagging: Language Models can Strategically Underperform on Evaluations
Teun van der Weij
14 Jun 2024 8:55 UTC
1 point
Ha, you’re clearly right. We will fix it.