RSS

Neel Nanda

Karma: 3,796

I lead the DeepMind mechanistic interpretability team

Con­crete open prob­lems in mechanis­tic in­ter­pretabil­ity: a tech­ni­cal overview

Neel Nanda6 Jul 2023 11:35 UTC
26 points
1 comment29 min readEA link

Con­crete Steps to Get Started in Trans­former Mechanis­tic Interpretability

Neel Nanda26 Dec 2022 13:00 UTC
18 points
0 comments12 min readEA link

A Bare­bones Guide to Mechanis­tic In­ter­pretabil­ity Prerequisites

Neel Nanda29 Nov 2022 18:43 UTC
54 points
1 comment3 min readEA link
(neelnanda.io)

An Ex­tremely Opinionated An­no­tated List of My Favourite Mechanis­tic In­ter­pretabil­ity Papers

Neel Nanda18 Oct 2022 21:23 UTC
19 points
0 comments12 min readEA link
(www.neelnanda.io)

Con­crete Ad­vice for Form­ing In­side Views on AI Safety

Neel Nanda17 Aug 2022 23:26 UTC
58 points
4 comments9 min readEA link
(www.alignmentforum.org)

Things That Make Me En­joy Giv­ing Ca­reer Advice

Neel Nanda17 Jun 2022 20:49 UTC
34 points
3 comments8 min readEA link

How I Formed My Own Views About AI Safety

Neel Nanda27 Feb 2022 18:52 UTC
130 points
12 comments13 min readEA link
(www.neelnanda.io)

Sim­plify EA Pitches to “Holy Shit, X-Risk”

Neel Nanda11 Feb 2022 1:57 UTC
184 points
78 comments10 min readEA link
(www.neelnanda.io)

My Overview of the AI Align­ment Land­scape: A Bird’s Eye View

Neel Nanda15 Dec 2021 23:46 UTC
45 points
15 comments16 min readEA link
(www.alignmentforum.org)

Op­ti­mi­sa­tion-fo­cused in­tro­duc­tion to EA pod­cast episode

Neel Nanda15 Jan 2021 9:59 UTC
8 points
1 comment1 min readEA link
(art19.com)

Ret­ro­spec­tive on Teach­ing Ra­tion­al­ity Workshops

Neel Nanda3 Jan 2021 17:15 UTC
41 points
9 comments30 min readEA link

Lo­cal Group Event Idea: EA Com­mu­nity Talks

Neel Nanda20 Dec 2020 17:12 UTC
26 points
4 comments5 min readEA link

Make a Public Com­mit­ment to Writ­ing EA Fo­rum Posts

Neel Nanda18 Nov 2020 18:23 UTC
21 points
11 comments1 min readEA link

Helping each other be­come more effective

Neel Nanda30 Oct 2020 21:33 UTC
10 points
0 comments10 min readEA link
(www.neelnanda.io)

What al­tru­ism means to me

Neel Nanda15 Aug 2020 8:25 UTC
14 points
0 comments7 min readEA link
(www.neelnanda.io)

The world is full of wasted motion

Neel Nanda5 Aug 2020 20:41 UTC
21 points
2 comments11 min readEA link
(www.neelnanda.io)