RSS

Mantas Mazeika

Karma: 68

A Bench­mark for Mea­sur­ing Hon­esty in AI Systems

Mantas MazeikaMar 4, 2025, 5:44 PM
22 points
0 comments2 min readEA link
(www.mask-benchmark.ai)

AI fore­cast­ing bots incoming

Center for AI SafetySep 9, 2024, 7:55 PM
−2 points
6 comments4 min readEA link
(www.safe.ai)

An Overview of Catas­trophic AI Risks

Center for AI SafetyAug 15, 2023, 9:52 PM
37 points
1 comment13 min readEA link
(www.safe.ai)

In­tro­duc­ing the ML Safety Schol­ars Program

TW123May 4, 2022, 1:14 PM
157 points
42 comments3 min readEA link