Archive
About
Search
Log In
Home
All
Wiki
Shortform
Recent
Comments
RSS
Mantas Mazeika
Karma:
75
All
Posts
Comments
New
Top
Old
A Benchmark for Measuring Honesty in AI Systems
Mantas Mazeika
4 Mar 2025 17:44 UTC
29
points
0
comments
2
min read
EA
link
(www.mask-benchmark.ai)
AI forecasting bots incoming
Center for AI Safety
9 Sep 2024 19:55 UTC
−2
points
6
comments
4
min read
EA
link
(www.safe.ai)
An Overview of Catastrophic AI Risks
Center for AI Safety
15 Aug 2023 21:52 UTC
37
points
1
comment
13
min read
EA
link
(www.safe.ai)
Introducing the ML Safety Scholars Program
TW123
4 May 2022 13:14 UTC
157
points
42
comments
3
min read
EA
link
Back to top