Here’s my attempt to reflect on the topic: https://forum.effectivealtruism.org/posts/PWKWEFJMpHzFC6Qvu/alignment-is-hard-communicating-that-is-harder
Eleni_A
Talking EA to my philosophy friends
Why did I misunderstand utilitarianism so badly?
[Question] Slowing down AI progress?
[Question] AI risks: the most convincing argument
“Normal accidents” and AI systems
Deception as the optimal: mesa-optimizers and inner alignment
Alignment’s phlogiston
Who ordered alignment’s apple?
Alignment is hard. Communicating that, might be harder
But what are *your* core values?
An Epistemological Account of Intuitions in Science
Three scenarios of pseudo-alignment
A New York Times article on AI risk
It’s (not) how you use it
I don’t think it’s restricted to agentic technologies; my model covers all technologies that involve risk. My toy example is that even producing a knife requires the designer to think about its dangers in advance and propose precautions.
There is no royal road to alignment
Both Redwood and Anthropic run labs and do empirical work. Here is another example of experimental work: https://twitter.com/Karolis_Ram/status/1540301041769529346
Five types of people on AI risks:
1. Wants AGI as soon as possible and ignores safety.
2. Wants AGI, but primarily cares about alignment.
3. Doesn’t understand AGI, or doesn’t think it will happen in her lifetime; thinks about robots that might take people’s jobs.
4. Understands AGI, but thinks the timelines are long enough not to worry about it right now.
5. Doesn’t worry about AGI; thinks being locked into our choices and “normal accidents” are both more important/riskier/scarier.