Exploring how cognitive science can improve AI safety, governance and prioritization.
I’d be excited to intern for any research project.
Always happy to chat!
The core idea sounds very interesting: increasing rationality likely has generalizable effects, so having a measure could help evaluate broader social-outreach causes.
Defining intelligence could be an AI-complete problem, but I think the problem is complicated enough even as a simple factor analysis (i.e. even without knowing exactly what we're measuring :). Estimating impact once we know the increase in some measure of rationality seems like the easier part of the problem: for example, if we know how much promoting long-termist thinking increases support for AI regulation, we're only a few steps from a QALY estimate. The harder part for people starting out in social outreach might be estimating how many people their specific intervention can get on board with thinking more long-termistically.
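The "few steps to the QALY" chain can be sketched as a back-of-the-envelope multiplication. Every number below is a hypothetical placeholder chosen only to show the shape of the estimate, not a measured value:

```python
# Hypothetical Fermi estimate of the outreach -> attitude -> policy -> QALY chain.
# All inputs are illustrative assumptions.

people_reached = 100_000            # audience of the outreach intervention
attitude_shift = 0.02               # fraction who start thinking more long-termistically
support_conversion = 0.5            # of those, fraction whose support for AI regulation rises
qaly_per_supporter = 0.1            # assumed expected QALYs from one marginal supporter

expected_qalys = (people_reached
                  * attitude_shift
                  * support_conversion
                  * qaly_per_supporter)

print(f"Expected impact: {expected_qalys:.0f} QALYs")  # prints "Expected impact: 100 QALYs"
```

The point of writing it out is that the first factor (how many people an intervention reaches and shifts) is exactly the one the post identifies as hardest to estimate; the rest is multiplication.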
So I think it might be very useful to compile a list of all attempts to calculate the impact of various social-outreach strategies, so that anyone considering a new one can find reference points; the hardest estimates here also seem to be the most important (e.g. the probability that Robert Wright would decrease over-suspicion between powers). My intuition is that differences in attitudes are something intuition could predict quite well, so the wisdom of the crowd could work well here.
The best source I found when I recently searched for attempts to put changing society into numbers is this article by The Sentience Institute.
Also, this post adds some evidence-based intervention suggestions to your list.
When coming up with a similar project,* I thought the first step should be to conduct exploratory interviews with EAs to reveal their hypotheses about the psychological factors that go into one's decision to take AI safety seriously. My guess is that ideological orientation would explain the most variance.

*which I most likely won't realize (98 %)
Edit: My project has been accepted for the CHERI summer research program, so I’ll keep you posted!
I’d love to see a deeper inquiry into which problems EAs face are most effectively reduced by which interventions. The suggestion that there’s a lack of “skilled therapists used to working with intelligent, introspective clients” is a significant novel consideration for me as an aspiring psychologist, and this kind of hybrid research could help me calibrate my intuitions.
I got access to Bing Chat. It seems:
- It only searches through archived versions of websites (it doesn’t retrieve today’s news articles, and it accessed an older version of my Wikipedia user page)
- When archiving, it only stores the content visible without any interaction with the website (tested on Reddit “see spoiler” buttons, which reveal new content in the page code; it could retrieve info from posts that gained less attention but weren’t hidden behind a spoiler button)
I.e. it’s still in a box of sorts, unless it’s much more intelligent than it pretends.
Edit: A recent ACX post argues text-predicting oracles might be safer, as their ability to form goals is very limited, but it provides two models of how even they could be dangerous: by simulating an agent, or via a human who decides to take bad advice like “run the paperclip maximizer code”. Scott implies that expecting it to spontaneously form goals is extreme, linking a post by Veedrac. The best argument there seems to be that it only has memory equivalent to 10 human seconds. I find this convincing for current models, but it also seems limiting for the intelligence of these systems, so I’m afraid that for future models, the incentives point toward removing this safety valve.
If Big Tech finds these kinds of salaries cost-effective to solve their problems, I would consider it a strong argument in favor of this project.
I imagine Elon Musk could like this project given that he believes in small effective teams of geniuses.
I’d say “polymaths” is a good label for people I’d expect to make progress like Yudkowsky, Bostrom, Hanson and von Neumann.
Edit: This may be fame-selection (engineers don’t often get credit, particularly in teams) or self-selection (interest in math+society).
The Manhattan and Enigma projects seem like examples where this kind of strategy just worked out. Some considerations that come to mind:
There could be selection effects.
From what I can find, members of these teams weren’t lured in by a lot of money. However, the salience of the AI threat in society is tiny compared to that of WWII, and large incentives could compensate for that.
I’ve read that money can sometimes decrease the intrinsic motivation that drives exploration and invention; however, these findings are being rebutted by newer studies. Apart from that, my guess is that getting those teams together is the key part, and if a lot of money can facilitate that, great.
A wild idea that might help in case a similar phenomenon operates in the sub-population of geniuses, and which could make this project more appealing to donors: restrict a portion of these salaries so that the recipients could only spend them on socially beneficial uses.
Suggestion: Integrated search in LessWrong, EA Forum, Alignment Forum and perhaps Progress Forum posts.
Update: I’m pleased to learn Yudkowsky seems to have suggested a similar agenda in a recent interview with Dwarkesh Patel (timestamp) as his greatest source of predictable hope about AI. It’s a rather fragmented bit but the gist is: Perhaps people doing RLHF get a better grasp on what to aim for by studying where “niceness” comes from in humans. He’s inspired by the idea that “consciousness is when the mask eats the shoggoth” and suggests, “maybe with the right bootstrapping you can let that happen on purpose”.
I see a very important point here: human intelligence isn’t misaligned with evolution in a random direction; it is misaligned in the direction of maximizing positive qualia. Therefore, it seems very likely that consciousness played a causal role in the evolution of human moral alignment, and such a causal role should be possible to study.
Thanks, I’ve changed it up
Yes, OpenAI’s domain name is in the list because they have a blog
My intention was to make any content published by OpenAI accessible
Recently, I made RatSearch for googling within EA-adjacent sites. Now, you can try the GPT bot version! (ChatGPT Plus required)
The bot is instructed to interpret what you want to know in relation to EA and search for it on the Forums. If it fails, it searches through the whole web, while prioritizing the orgs listed by EA News.
Cons: ChatGPT uses Bing, which isn’t entirely reliable when it comes to indexing less-visited sites.
Pros: It’s fun for brainstorming EA connections/perspectives, even when you just type a raw phrase like “public transport” or “particle physics”
Neutral: I have yet to experiment with whether it works better when you explicitly limit the search using the site: operator (try AltruSearch 2). It seems better at digging deeper within the EA ecosystem; AltruSearch 1 seems better at digging wider.
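For anyone who wants to replicate the site-restricted approach manually, the query can be assembled like this. The domain list below is a small illustrative subset, not the full list the bot is instructed with:

```python
from urllib.parse import quote_plus

# Illustrative subset of EA-adjacent domains (the bot's actual list differs).
SITES = [
    "forum.effectivealtruism.org",
    "lesswrong.com",
    "alignmentforum.org",
]

def build_query(phrase: str) -> str:
    """Restrict a search phrase to the listed domains via the site: operator."""
    restriction = " OR ".join(f"site:{s}" for s in SITES)
    return f"{phrase} ({restriction})"

def bing_url(phrase: str) -> str:
    """URL-encode the restricted query for Bing's search endpoint."""
    return "https://www.bing.com/search?q=" + quote_plus(build_query(phrase))

print(build_query("public transport"))
# public transport (site:forum.effectivealtruism.org OR site:lesswrong.com OR site:alignmentforum.org)
```

Pasting the output of `build_query` into any engine that supports the `site:` operator gives roughly the same restriction the bot applies.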
Update (12/8): The link now redirects to an updated version with very different instructions. You can still access the older version here.
I recently made RatSearch for this purpose. You can also try the GPT bot version (more information here).
Sorry, I don’t have any experience with that.
What can an EA academic do to improve the incentives in the research side of academia? To help reward quality or even positive impact?