RSS

Mantas Mazeika

Karma: 75

A Bench­mark for Mea­sur­ing Hon­esty in AI Systems

Mantas Mazeika4 Mar 2025 17:44 UTC
29 points
0 comments2 min readEA link
(www.mask-benchmark.ai)

AI fore­cast­ing bots incoming

Center for AI Safety9 Sep 2024 19:55 UTC
−2 points
6 comments4 min readEA link
(www.safe.ai)

An Overview of Catas­trophic AI Risks

Center for AI Safety15 Aug 2023 21:52 UTC
37 points
1 comment13 min readEA link
(www.safe.ai)

In­tro­duc­ing the ML Safety Schol­ars Program

TW1234 May 2022 13:14 UTC
157 points
42 comments3 min readEA link