RSS

𝕮𝖎𝖓𝖊𝖗𝖆

Karma: 2,214

Theoretical Computer Science Msc student at the University of [Redacted] in the United Kingdom.

I’m an aspiring alignment theorist; my research vibes are descriptive formal theories of intelligent systems (and their safety properties) with a bias towards constructive theories.

I think it’s important that our theories of intelligent systems remain rooted in the characteristics of real world intelligent systems; we cannot develop adequate theory from the null string as input.

Beren’s “De­con­fus­ing Direct vs Amor­tised Op­ti­mi­sa­tion”

𝕮𝖎𝖓𝖊𝖗𝖆7 Apr 2023 8:57 UTC
9 points
0 comments1 min readEA link

Orthog­o­nal­ity is Expensive

𝕮𝖎𝖓𝖊𝖗𝖆3 Apr 2023 1:57 UTC
18 points
4 comments1 min readEA link

“Dangers of AI and the End of Hu­man Civ­i­liza­tion” Yud­kowsky on Lex Fridman

𝕮𝖎𝖓𝖊𝖗𝖆30 Mar 2023 15:44 UTC
28 points
0 comments1 min readEA link

Sparks of Ar­tifi­cial Gen­eral In­tel­li­gence: Early ex­per­i­ments with GPT-4 | Microsoft Research

𝕮𝖎𝖓𝖊𝖗𝖆23 Mar 2023 5:45 UTC
15 points
0 comments1 min readEA link

Google in­vests $300mn in ar­tifi­cial in­tel­li­gence start-up An­thropic | FT

𝕮𝖎𝖓𝖊𝖗𝖆3 Feb 2023 19:43 UTC
155 points
5 comments1 min readEA link
(www.ft.com)

AI Risk Man­age­ment Frame­work | NIST

𝕮𝖎𝖓𝖊𝖗𝖆26 Jan 2023 15:27 UTC
50 points
0 comments1 min readEA link

Hereti­cal Thoughts on AI | Eli Dourado

𝕮𝖎𝖓𝖊𝖗𝖆19 Jan 2023 16:11 UTC
138 points
15 comments1 min readEA link

My Thoughts on Bostrom’s “Apol­ogy for an Old Email”

𝕮𝖎𝖓𝖊𝖗𝖆12 Jan 2023 21:35 UTC
155 points
35 comments1 min readEA link

[Ru­mour] Microsoft to in­vest $10B in OpenAI, will re­ceive 75% of prof­its un­til they re­coup in­vest­ment: GPT would be in­te­grated with Office

𝕮𝖎𝖓𝖊𝖗𝖆10 Jan 2023 23:43 UTC
25 points
2 comments1 min readEA link

[Dis­cus­sion] How Broad is the Hu­man Cog­ni­tive Spec­trum?

𝕮𝖎𝖓𝖊𝖗𝖆7 Jan 2023 0:59 UTC
16 points
1 comment1 min readEA link

Why The Fo­cus on Ex­pected Utility Max­imisers?

𝕮𝖎𝖓𝖊𝖗𝖆27 Dec 2022 15:51 UTC
11 points
1 comment1 min readEA link

Against Agents as an Ap­proach to Aligned Trans­for­ma­tive AI

𝕮𝖎𝖓𝖊𝖗𝖆27 Dec 2022 0:47 UTC
4 points
0 comments1 min readEA link

The Limit of Lan­guage Models

𝕮𝖎𝖓𝖊𝖗𝖆26 Dec 2022 11:17 UTC
10 points
0 comments1 min readEA link

[DISC] Are Values Ro­bust?

𝕮𝖎𝖓𝖊𝖗𝖆21 Dec 2022 1:13 UTC
4 points
0 comments1 min readEA link

Si­mu­la­tors and Mindcrime

𝕮𝖎𝖓𝖊𝖗𝖆9 Dec 2022 15:20 UTC
1 point
0 comments1 min readEA link

Why I’m Scep­ti­cal of Foom

𝕮𝖎𝖓𝖊𝖗𝖆8 Dec 2022 10:01 UTC
22 points
7 comments1 min readEA link

“Far Co­or­di­na­tion”

𝕮𝖎𝖓𝖊𝖗𝖆23 Nov 2022 17:14 UTC
5 points
0 comments1 min readEA link

In Defence of Tem­po­ral Dis­count­ing in Longter­mist Ethics

𝕮𝖎𝖓𝖊𝖗𝖆13 Nov 2022 21:30 UTC
17 points
5 comments3 min readEA link

X-risk Miti­ga­tion Does Ac­tu­ally Re­quire Longter­mism

𝕮𝖎𝖓𝖊𝖗𝖆13 Nov 2022 19:40 UTC
35 points
6 comments1 min readEA link

So, I Want to Be a “Think­fluencer”

𝕮𝖎𝖓𝖊𝖗𝖆15 Aug 2022 6:05 UTC
17 points
7 comments5 min readEA link