Trying to make sure the development of powerful artificial intelligence goes well.
RomanHauksson
I made a similar deck a few months ago, and there might be some overlap: https://github.com/RomanHN/CFAR_jargon
I would like to emphasize that when we discuss community norms in EA, we should remember that the ultimate goal of this community is to improve the world and humanity’s future as much as possible, not to make our own lives as enjoyable as possible. Increasing the wellbeing of EAs is instrumentally useful for productivity and for attracting more people to make sacrifices like “donate tens of thousands of dollars” or “change your career plan to work on this problem”, but ultimately the point isn’t to create a jolly in-group of ambitious nerds. For example, if the meshing of polyamorous and professional relationships causes less qualified candidates to earn positions in EA organizations, it may be net negative, even if those relationships make people really happy.
It would be great to have data about the gaps between the professional skills EAs are training up in and the skills EA organizations find most useful and neglected. I’ve heard that there’s a shortage of information security expertise within the AI safety field, but it would be nice to see data backing this up before I commit to self-studying cybersecurity. Maybe someone could survey EA organization managers about which skills they’re looking for and which roles they’ve had a hard time filling, alongside a survey of early-career EAs about which skills they have and are learning. Running the survey regularly would also let us observe trends.
The high success rate almost makes me think CE should be incubating even more ambitious, riskier projects, with the expectation of a lower success rate but higher overall EV. I’m very uncertain about this intuition, though, and would be interested to hear what CE thinks.
“Small World”: website that shows you which city your friends are in
Yeah, it’s not perfect… I’d like to be able to silently block people too, in case I no longer want to hang out with them. But hey, it’s open source, maybe we can improve it.
Does anyone have advice on how I can use language models to improve my nonfiction writing? I’m interested both in making a specific piece of text better and in learning how to write better over the long term. Maybe a tool like Grammarly but more advanced? It would critique the writing I have so far, ask questions, suggest wording, point out which sentences are especially well-written, et cetera.
giving the alignment research community an edge
epistemic status: shower thought
On whether advancements in humanity’s understanding of AI alignment will be fast enough compared to advancements in its understanding of how to create AGI, many factors stack in favor of AGI: more organizations are working on it, there’s a direct financial incentive to do so, people tend to be more excited about the prospect of AGI than cautious about misalignment, et cetera. But one factor that gives me a bit of hope (besides the idea that alignment might turn out to be easier to figure out than AGI) is that alignment researchers tend to be cooperative while AGI researchers tend to be competitive. Alignment researchers are motivated to save the world, not make a buck, so if their discoveries are helpful for alignment, they’ll go public, and if they’re helpful for alignment but also maybe capabilities, they’ll go public only to other alignment researchers. Meanwhile, each company trying to create AGI only has their own cutting-edge research to work with – they tend to keep to themselves, while we’re more united.
I’m curious about the ways that the alignment research community could augment this dynamic. One way could be restricting access to helpful information to only other alignment researchers, namely 1) discoveries that might be helpful for alignment but also AGI and 2) knowledge related to AI-assisted research and development. I get the impression this is already a norm, but the community might benefit from more formal and overt methods for doing this. For example, tammy created a “locked post” feature on her website that gives her control over who can decrypt certain posts of hers that relate to capabilities. Along the same vein, maybe the AI Alignment Forum could add a feature that works similarly to Twitter Circle, where access to posts could be restricted to trusted members of a group:
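To make the idea concrete, here’s a minimal sketch of what circle-style visibility could look like as a data structure. All names here are hypothetical illustrations, not how the Alignment Forum (or tammy’s site) actually implements anything; a real feature would also need encryption at rest, not just an access check.

```python
# Hypothetical sketch of "circle"-restricted post visibility.
# A post with no circle is public; otherwise only the author and
# the named circle members can view it.

def can_view(post: dict, username: str) -> bool:
    circle = post.get("circle")  # None means the post is public
    if circle is None or username == post["author"]:
        return True
    return username in circle

post = {
    "author": "tammy",
    "title": "Capabilities-adjacent notes",
    "circle": {"alice", "bob"},  # trusted alignment researchers
}
```

A check like this only gates the UI; keeping the post bodies encrypted until a circle member requests them would matter for the hacking concerns discussed below.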
Twitter Circle is a way to send Tweets to select people, and share your thoughts with a smaller crowd. You choose who’s in your Twitter Circle, and only the individuals you’ve added can reply to and interact with the Tweets you share in the circle.
Of course, then the forum developer team would have to up their security, since nation-state actors (for example) would be incentivized to hack the forum to learn all the latest AGI-related discoveries those alignment people are trying to keep to themselves. Another worry is that moles could network their way deep into the alignment community to gain access to privileged information and then pass it on to some company or nation (there might even be moles today, without formal methods of restricting information). I’m sure there’s pre-existing literature on how to mitigate these risks.
Those with more knowledge about AI strategy, feel free to pick apart these thoughts; I only felt comfortable sharing them in the shortform because I feel like there’s a lot about this subject that I’m missing. Perhaps this has been written about before.
DiscordChatExporter is a tool that enables you to download an archive of all the messages in a server or channel.
Besides reading the Cyborgism post, I admit I have not searched around yet; my apologies.
I think it’s important to give the audience some sort of analogy that they’re already familiar with, such as evolution producing humans, humans introducing invasive species in new environments, and viruses. These are all examples of “agents in complex environments which aren’t malicious or Machiavellian, but disrupt the original group of agents anyway”.
I believe these analogies are not object-level enough to be arguments for AI X-risk in themselves, but I think they’re a good way to help people quickly understand the danger of a superintelligent, goal-directed agent.
I plan to do some self-studying in my free time over the summer, on topics I would describe as “most useful to know in the pursuit of making the technological singularity go well”. Obviously, this includes technical topics within AI alignment, but I’ve been itching to learn a broad range of subjects to make better decisions about, for example, which position I should work in to have the most counterfactual impact or which research agendas are most promising. I believe this is important because I aim to eventually attempt something really ambitious like founding an organization, which would require especially good judgement and generalist knowledge. What advice do you have on prioritizing topics to self-study, and in how much depth? Any other thoughts or resources about my endeavor? I would be super grateful to have a call with you if this is something you’ve thought a lot about (Calendly link). More context: I’m an undergraduate sophomore studying Computer Science.
So far, my ordered list includes:
Productivity
Learning itself
Rationality and decision making
Epistemology
Philosophy of science
Political theory, game theory, mechanism design, artificial intelligence, philosophy of mind, analytic philosophy, forecasting, economics, neuroscience, history, psychology...
...and it’s at this point that I realize I’ve set my sights too high and I need to reach out for advice on how to prioritize subjects to learn!
Suggestion: use a well-designed voting system such as STAR voting, approval voting, or quadratic voting.
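For anyone unfamiliar with how STAR (Score Then Automatic Runoff) voting works, here’s a minimal sketch. Candidate names and ballots are made up for illustration; real STAR rules also specify tie-breaking procedures that this sketch glosses over (ties here just fall back to the higher score total).

```python
# Sketch of STAR voting: each ballot scores every candidate 0-5.
# The two highest-scoring candidates advance to an automatic runoff,
# decided by which finalist each ballot scored higher.

from collections import defaultdict

def star_winner(ballots):
    totals = defaultdict(int)
    for ballot in ballots:
        for candidate, score in ballot.items():
            totals[candidate] += score
    # Top two by total score advance to the runoff.
    a, b = sorted(totals, key=totals.get, reverse=True)[:2]
    prefers_a = sum(1 for bal in ballots if bal.get(a, 0) > bal.get(b, 0))
    prefers_b = sum(1 for bal in ballots if bal.get(b, 0) > bal.get(a, 0))
    return a if prefers_a >= prefers_b else b

ballots = [
    {"X": 5, "Y": 4, "Z": 0},
    {"X": 0, "Y": 5, "Z": 1},
    {"X": 2, "Y": 3, "Z": 5},
]
# Totals: X=7, Y=12, Z=6 -> finalists Y and X; Y wins the runoff 2-1.
```

The runoff step is what distinguishes STAR from plain score voting: it rewards honest scoring, since your ballot still counts as a full vote between the two finalists.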
Where did you copy the quote from?
Rational Animations is probably the YouTube channel the report is referring to, in case anyone’s curious.
Can we set up a torrent link for this?
I can look into how to set up a torrent link tomorrow and let you know how it goes!
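For what it’s worth, a .torrent file is just a bencoded dictionary, so it can even be built by hand; here’s a sketch in Python. The tracker URL is a placeholder, and in practice a client’s “create torrent” feature (or a tool like mktorrent) is the easier route; this just shows the format.

```python
# Minimal sketch of building a .torrent metainfo file.
# The announce URL below is a placeholder, not a real tracker.

import hashlib

def bencode(x):
    """Encode ints, bytes/str, lists, and dicts in BitTorrent's bencoding."""
    if isinstance(x, int):
        return b"i%de" % x
    if isinstance(x, str):
        x = x.encode()
    if isinstance(x, bytes):
        return b"%d:%s" % (len(x), x)
    if isinstance(x, list):
        return b"l" + b"".join(bencode(e) for e in x) + b"e"
    if isinstance(x, dict):
        return b"d" + b"".join(
            bencode(k) + bencode(v) for k, v in sorted(x.items())
        ) + b"e"
    raise TypeError(f"cannot bencode {type(x)}")

def make_torrent(name, data, piece_len=262144,
                 announce="http://tracker.example/announce"):
    # "pieces" is the concatenation of the SHA-1 hash of each piece.
    pieces = b"".join(
        hashlib.sha1(data[i:i + piece_len]).digest()
        for i in range(0, len(data), piece_len)
    )
    return bencode({
        "announce": announce,
        "info": {
            "name": name,
            "length": len(data),
            "piece length": piece_len,
            "pieces": pieces,
        },
    })
```

The file would still need to be seeded from somewhere, so a plain download mirror may be less hassle unless bandwidth is actually a bottleneck.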
Here are a couple of excerpts from relevant comments on the Astral Codex Ten post about the tournament. From the anecdotes, it seems as though this tournament had some flaws in execution, namely that the “superforecasters” weren’t all that. But I want to see more context if anyone has it.
I signed up for this tournament (I think? My emails related to a Hybrid Forecasting-Persuasion tournament that at the very least shares many authors), was selected, and partially participated. I found this tournament from it being referenced on ACX and am not an academic, superforecaster, or in any way involved or qualified whatsoever. I got the Stage 1 email on June 15.
I participated and AIUI got counted as a superforecaster, but I’m really not. There was one guy in my group (I don’t know what happened in other groups) who said X-risk can’t happen unless God decides to end the world. And in general the discourse was barely above “normal Internet person” level, and only about a third of us even participated in said discourse. Like I said, haven’t read the full paper so there might have been some technique to fix this, but overall I wasn’t impressed.
80,000 Hours had an article with advice for new college students, and a section towards the end touches on your question.
Make sure to check out OpenPhil’s undergraduate scholarship if you haven’t yet.
Congratulations on the launch! This is huge. I have to ask, though: why is the ebook version not free? I would assume that if you wanted to promote longtermism to a broad audience, you would make the book as accessible as possible. Maybe charging for a copy actually increases the number of people who end up reading it? For example, it would rank higher on bestseller lists, attracting more eyes. Or perhaps the reason is simply to raise funds for EA?