RSS

TheMcDouglas

Karma: 378

A Selec­tion of Ran­domly Selected SAE Features

TheMcDouglas1 Apr 2024 9:09 UTC
25 points
2 comments1 min readEA link

AI Align­ment Re­search Eng­ineer Ac­cel­er­a­tor (ARENA): call for applicants

TheMcDouglas7 Nov 2023 9:43 UTC
46 points
3 comments10 min readEA link

ARENA 2.0 - Im­pact Report

TheMcDouglas26 Sep 2023 17:13 UTC
17 points
0 comments13 min readEA link

An Anal­ogy for Un­der­stand­ing Transformers

TheMcDouglas13 May 2023 12:20 UTC
7 points
0 comments1 min readEA link

AI Align­ment Re­search Eng­ineer Ac­cel­er­a­tor (ARENA): call for applicants

TheMcDouglas17 Apr 2023 20:30 UTC
41 points
2 comments1 min readEA link

AI Risk In­tro 2: Solv­ing The Problem

L Rudolf L24 Sep 2022 9:33 UTC
11 points
0 comments28 min readEA link
(www.perfectlynormal.co.uk)

AI Risk In­tro 1: Ad­vanced AI Might Be Very Bad

L Rudolf L11 Sep 2022 10:57 UTC
22 points
0 comments30 min readEA link

Join ASAP (AI Safety Ac­countabil­ity Pro­gramme)

TheMcDouglas10 Sep 2022 11:15 UTC
54 points
20 comments3 min readEA link

In­tro­duc­ing the Ex­is­ten­tial Risks In­tro­duc­tory Course (ERIC)

Nandini Shiralkar19 Aug 2022 15:57 UTC
57 points
14 comments7 min readEA link

An­nounc­ing the Distil­la­tion for Align­ment Practicum (DAP)

Jonas Hallgren18 Aug 2022 22:31 UTC
20 points
2 comments3 min readEA link

MIRI Con­ver­sa­tions: Tech­nol­ogy Fore­cast­ing & Grad­u­al­ism (Distil­la­tion)

TheMcDouglas13 Jul 2022 10:45 UTC
27 points
9 comments19 min readEA link

Skil­ling-up in ML Eng­ineer­ing for Align­ment: re­quest for comments

TheMcDouglas24 Apr 2022 6:40 UTC
8 points
0 comments1 min readEA link

How I use Anki: ex­pand­ing the scope of SRS

TheMcDouglas12 Apr 2022 8:40 UTC
17 points
2 comments17 min readEA link

Where’s WALY?

TheMcDouglas2 Apr 2022 2:03 UTC
29 points
1 comment1 min readEA link