𝕮𝖎𝖓𝖊𝖗𝖆

Karma: 2,241

Theoretical Computer Science MSc student at the University of [Redacted] in the United Kingdom.

I’m an aspiring alignment theorist; my research vibes are descriptive formal theories of intelligent systems (and their safety properties) with a bias towards constructive theories.

I think it’s important that our theories of intelligent systems remain rooted in the characteristics of real-world intelligent systems; we cannot develop adequate theory from the null string as input.

Beren’s “Deconfusing Direct vs Amortised Optimisation”

𝕮𝖎𝖓𝖊𝖗𝖆 7 Apr 2023 8:57 UTC

9 points

0 comments 3 min read EA link

Orthogonality is Expensive

𝕮𝖎𝖓𝖊𝖗𝖆 3 Apr 2023 1:57 UTC

18 points

4 comments 1 min read EA link

(www.beren.io)

“Dangers of AI and the End of Human Civilization” Yudkowsky on Lex Fridman

𝕮𝖎𝖓𝖊𝖗𝖆 30 Mar 2023 15:44 UTC

28 points

0 comments 1 min read EA link

(www.youtube.com)

Sparks of Artificial General Intelligence: Early experiments with GPT-4 | Microsoft Research

𝕮𝖎𝖓𝖊𝖗𝖆 23 Mar 2023 5:45 UTC

15 points

0 comments 1 min read EA link

(arxiv.org)

Google invests $300mn in artificial intelligence start-up Anthropic | FT

𝕮𝖎𝖓𝖊𝖗𝖆 3 Feb 2023 19:43 UTC

155 points

5 comments 1 min read EA link

(www.ft.com)

AI Risk Management Framework | NIST

𝕮𝖎𝖓𝖊𝖗𝖆 26 Jan 2023 15:27 UTC

50 points

0 comments 2 min read EA link

(www.nist.gov)

“Heretical Thoughts on AI” by Eli Dourado

𝕮𝖎𝖓𝖊𝖗𝖆 19 Jan 2023 16:11 UTC

142 points

15 comments 3 min read EA link

(www.elidourado.com)

My Thoughts on Bostrom’s “Apology for an Old Email”

𝕮𝖎𝖓𝖊𝖗𝖆 12 Jan 2023 21:35 UTC

156 points

35 comments 1 min read EA link

Microsoft Plans to Invest $10B in OpenAI; $3B Invested to Date | Fortune

𝕮𝖎𝖓𝖊𝖗𝖆 10 Jan 2023 23:43 UTC

25 points

2 comments 2 min read EA link

(fortune.com)

[Question] [Discussion] How Broad is the Human Cognitive Spectrum?

𝕮𝖎𝖓𝖊𝖗𝖆 7 Jan 2023 0:59 UTC

16 points

1 comment 2 min read EA link

[Question] Why The Focus on Expected Utility Maximisers?

𝕮𝖎𝖓𝖊𝖗𝖆 27 Dec 2022 15:51 UTC

11 points

1 comment 3 min read EA link

Against Agents as an Approach to Aligned Transformative AI

𝕮𝖎𝖓𝖊𝖗𝖆 27 Dec 2022 0:47 UTC

4 points

0 comments 2 min read EA link

The Limit of Language Models

𝕮𝖎𝖓𝖊𝖗𝖆 26 Dec 2022 11:17 UTC

10 points

0 comments 4 min read EA link

[Question] [DISC] Are Values Robust?

𝕮𝖎𝖓𝖊𝖗𝖆 21 Dec 2022 1:13 UTC

4 points

0 comments 2 min read EA link

Why I’m Sceptical of Foom

𝕮𝖎𝖓𝖊𝖗𝖆 8 Dec 2022 10:01 UTC

22 points

7 comments 3 min read EA link

“Far Coordination”

𝕮𝖎𝖓𝖊𝖗𝖆 23 Nov 2022 17:14 UTC

5 points

0 comments 9 min read EA link

In Defence of Temporal Discounting in Longtermist Ethics

𝕮𝖎𝖓𝖊𝖗𝖆 13 Nov 2022 21:30 UTC

17 points

5 comments 3 min read EA link

X-risk Mitigation Does Actually Require Longtermism

𝕮𝖎𝖓𝖊𝖗𝖆 13 Nov 2022 19:40 UTC

35 points

6 comments 1 min read EA link

So, I Want to Be a “Thinkfluencer”

𝕮𝖎𝖓𝖊𝖗𝖆 15 Aug 2022 6:05 UTC

17 points

7 comments 6 min read EA link

Are “Bad People” Really Unwelcome in EA?

𝕮𝖎𝖓𝖊𝖗𝖆 9 Aug 2022 11:32 UTC

73 points

36 comments 4 min read EA link