David_Kristoffersson (CEO of Convergence)
Excellent points, Carl. (And Stefan's as well.) We would love to see follow-up posts exploring nuances like these, and I've added them to the Convergence list of topics worth elaborating on.
Sounds like you got some pretty great engagement out of this experiment! Great work! This exact kind of project, and the space of related ideas, seems well worth exploring further.
The five people we decided to reject were given feedback on their translations as well as their motivation letters. We also provided them with two simple calls to action: (1) read our blog and join our newsletter, and (2) follow our FB page and attend our public events. To our knowledge, none of these five people has so far taken either of these actions.
Semi-general comment regarding rejections: I think, overall, rejection is a sensitive matter. If we want rejected applicants (to stipends, jobs, projects, …) to try again or to maintain their interest in the specific project and in EA overall, we need to take a lot of care. I'm concerned, for example, that the difficulty of getting jobs at EA orgs, and the experience of being rejected from them, discourages many people from engaging more closely with EA. Perhaps just being sympathetic and encouraging enough will do a lot of good. Perhaps there's more we could do.
Variant of Korthon’s comment:
I never look at the “forum favorites” section. It seems like it’s looked the same forever, and it takes up a lot of screen real estate without being of any use to me!
Vision of Earth fellows Kyle Laskowski and Ben Harack had a poster session on this topic at EA Global San Francisco 2019: https://www.visionofearth.org/wp-content/uploads/2019/07/Vision-of-Earth-Asteroid-Manipulation-Poster.pdf
They were also working on a paper on the topic.
Thank you for this article, Michael! I like seeing the different mainline definitions of existential risk and catastrophe alongside each other, and having some common misunderstandings clarified.
Just a minor comment:
That said, at least to me, it seems that “destruction of humanity’s longterm potential” could be read as meaning the complete destruction. So I’d personally be inclined to tweak Ord’s definitions to:
An existential catastrophe is the destruction of the vast majority of humanity’s long-term potential.
An existential risk is a risk that threatens the destruction of the vast majority of humanity’s long-term potential.[4]
Ord was presumably going for brevity in his book, and I think his definition succeeds quite well at that! I don’t think adding four words to Ord’s nice, short definition would generally be worth it. There are other details that could be expanded on as well (like how we can mostly consider the definition in Bostrom 2012 to be a more expanded version). Expanding helps when discussing a particular point, though.
I think this is an excellent initiative, thank you, Michael! (Disclaimer: Michael and I work together on Convergence.)
An assortment of thoughts:
More estimates of x-risks, and more careful ones, seem clearly very high value to me, given how much the likelihood of risks and events affects priorities and how much the quality of the estimates affects our communication about these matters.
More estimates should generally increase our common knowledge of the risks, and, individually, people who think about how to make these estimates will reach a deeper understanding of the questions.
Breaking down the causes behind one’s estimates is generally valuable: it allows one to improve the estimates and one’s understanding of the underlying causation, and to discuss them in more detail.
More estimates can be bad if low-quality estimates somehow swamp out better ones.
Estimates building on new (compared to earlier estimates) sources of information are especially interesting. Independent data sources increase our overall knowledge.
I see space for someone to write an intro post on how to do this type of estimate better. (Scott Alexander’s old posts here might be interesting.)
This kind of complexity tells me that we should more often talk about risk percentages in terms of the different scenarios they are associated with: e.g., the “current trajectory” framing Ord uses, but also possibly better trajectories (where society acts more wisely) and possibly worse ones (where society makes major mistakes), and what the probabilities are under each. (See the sketch below.)
We can’t entirely disentangle talking about future risks and possibilities from the different possible choices of society, since these choices are what shapes the future. What we do affects these choices.
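To illustrate the kind of decomposition I have in mind, here is a minimal sketch in my own notation, with purely stylized trajectory labels (nothing here is from Ord): the overall risk estimate is just a probability-weighted average over the possible societal trajectories.

```latex
% Minimal sketch (my notation, illustrative trajectory labels):
% the headline x-risk number as a mixture over societal trajectories.
\[
P(\text{catastrophe})
  = P(T_{\text{wiser}})\,P(\text{catastrophe} \mid T_{\text{wiser}})
  + P(T_{\text{current}})\,P(\text{catastrophe} \mid T_{\text{current}})
  + P(T_{\text{worse}})\,P(\text{catastrophe} \mid T_{\text{worse}})
\]
```

Quoting the per-trajectory conditionals alongside the headline number would make it much easier to see where disagreements about an estimate actually lie.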
(Also, maybe you should edit the original post to include the quote you included here or parts of it.)
Causal diagrams of the paths to existential catastrophe
Happy to see you found it useful, Adam! Yes, general technological development corresponding to scaling of the vector is exactly the kind of intuition it’s meant to carry.
But beyond the trajectories (and maybe specific distances), are you planning on representing the other elements you mention? Like the uncertainty or the speed along trajectories?
Thanks for your comment. Yes; the other elements, like uncertainty, would definitely be part of further work on the trajectories model.
I think that if I could unilaterally and definitively decide on the terms, I’d go with “differential technological development” (so keep that one the same), “differential intellectual development”, and “differential development”. I.e., I’d skip the word “progress”, because we’re really talking about something more like “lasting changes”, without the positive connotations.
I agree, “development” seems like a superior word for reducing ambiguity. But as you say, this is a summary post, so it might not be the best place to suggest switching up terms.
Here are two long-form alternatives to “differential progress”/“differential development”: “differential societal development” and “differential civilizational development”.
State Space of X-Risk Trajectories
Thanks Tobias, I think you make a really good point! You’re definitely right that there are some in the cause area who don’t think the technological transformation is likely.
I don’t think you’ve established that the ‘technological transformation’ is essential.
What I wanted to say with this post is that it’s essential to the view of a large majority in the cause area. The article is not really meant to do a thorough job of arguing that it should be essential to people’s views.
It’s possible I’m wrong about the size of the majority; but this was definitely my impression.
You may believe that shaping AI / the technological transformation would offer far more leverage than other interventions, but some will disagree with that, which is a strong reason to not include this in the definition.
I personally believe in something like a mix of shaping technology with other interventions, such as the ones you mentioned (“moral circle expansion, improving international cooperation, improving political processes (e.g. trying to empower future people, voting reform, reducing polarisation”).
Four components of strategy research
The long-term future is especially popular among EAs living in Oxford, which is not surprising given the Global Priorities Institute’s focus on longtermism.
Even more than that, The Future of Humanity Institute has been in Oxford since 2005!
I’m not arguing “AI will definitely go well by default, so no one should work on it”. I’m arguing “Longtermists currently overestimate the magnitude of AI risk”.
Thanks for the clarification Rohin!
I also agree overall with reallyeli.
I’m sympathetic to many of the points, but I’m somewhat puzzled by the framing that you chose in this letter.
Why AI risk might be solved without additional intervention from longtermists
This title sends me the message that longtermists should care less about AI risk.
Though the people in the “conversations” all support AI safety research. And, in Rohin’s own words:
Overall, it feels like there’s around 90% chance that AI would not cause x-risk without additional intervention by longtermists.
A 10% chance of existential risk from AI sounds like a problem of catastrophic proportions to me. It implies that we need many more resources spent on existential risk reduction, though perhaps not strictly on technical AI safety; perhaps more marginal resources should be directed to strategy-oriented research instead.
Good point: ‘x-risk’ is short, and ‘reduction’ should be, or should quickly become, implicit. It will work well in many circumstances. For example, “I work with x-risk” works just as “I work with/in global poverty” works. Though some objections that occur to me in the moment are: “the cause of x-risk” feels clumsy, “letter, dash, and then a word” is an odd construct, and it’s a bit negatively oriented.
Thank you for your thoughtful comment!
All work is future oriented
Indeed. You don’t tend to employ the word ‘future’ or emphasize it for most work, though.
One alternative could be ‘full future’, signifying that it encompasses both the near and long term.
I think there should be space for new and more specific terms. ‘Long term’ has strengths, but it’s overloaded with many meanings. ‘Existential risk reduction’ is specific but quite a mouthful; something shorter would be great. I’m working on another article in which I will offer one new alternative.
Metaculus: Will quantum computing “supremacy” be achieved by 2025? [prediction closed on Jun 1, 2018.]
While I find it plausible that it will happen, I’m not personally convinced that quantum computers will be practically very useful, due to the difficulties in scaling them up.