The next decades might be wild

mariushobbhahnDec 15, 2022, 4:10 PM

130 points

Effective altruism art and fiction AI safety

I’d like to thank Simon Grimm and Tamay Besiroglu for feedback and discussions.

Update (early April 2023): I now think the timelines in this post are too long and expect the world to get crazy faster than described here. For example, I expect many of the things suggested for 2030-2040 to already happen before 2030. Concretely, in my median world the CEO of a large multinational company like Google is an AI. This might not be the case legally but effectively an AI makes most major decisions.

This post is inspired by What 2026 looks like and an AI vignette workshop guided by Tamay Besiroglu. I think of this post as “what would I expect the world to look like if these timelines (median compute for transformative AI ~2036) were true” or “what short-to-medium timelines feel like” since I find it hard to translate a statement like “median TAI year is 20XX” into a coherent imaginable world.

I expect some readers to think that the post sounds wild and crazy but that doesn’t mean its content couldn’t be true. If you had told someone in 1990 or 2000 that there would be more smartphones and computers than humans in 2020, that probably would have sounded wild to them. The same could be true for AIs, i.e. that in 2050 there are more human-level AIs than humans. The fact that this sounds as ridiculous as ubiquitous smartphones sounded to the 1990/2000 person, might just mean that we are bad at predicting exponential growth and disruptive technology.

Update: titotal points out in the comments that the correct timeframe for computers is probably 1980 to 2020. So the correct time span is probably 40 years instead of 30. For mobile phones, it’s probably 1993 to 2020 if you can trust this statistic.

I’m obviously not confident (see confidence and takeaways section) in this particular prediction but many of the things I describe seem like relatively direct consequences of more and more powerful and ubiquitous AI mixed with basic social dynamics and incentives.

Taking stock of the past

Some claims about the past

Tech can be very disruptive: The steam engine, phones, computers, the internet, smartphones, etc; all of these have changed the world in a relatively quick fashion (e.g. often just a few decades between introduction and widespread use). Furthermore, newer technologies disrupt faster (e.g. smartphones were adopted faster than cars) due to faster supply-chain integration, faster R&D, larger investments and much more. People who were born 10 years before me grew up without widespread smartphones and the internet, I grew up with the benefits of both and people who are 10 years younger than me play high-resolution video games on their smartphones. Due to technology, it is the norm, not the exception, that people who are born 10 years apart can have very different childhoods.
AI is getting useful: There are more and more tasks in which AI is highly useful in the real world. By now certain narrow tasks can be automated to a large extent, e.g., protein folding prediction, traffic management, image recognition, sales predictions and much more. Some of these, e.g. sales predictions sometimes rely on technology developed before 2000, but many of the more complex and less structured tasks such as traffic predictions are built on top of Deep Learning architectures only developed in the 2010s. Thus we are either one or multiple decades deep in the AI disruption period. Based on this, I consider another AI winter unlikely. You can make money with AI right now and the ML systems can automate a lot of tasks today even if we didn’t find any additional breakthroughs.
Transformers work astonishingly well: Originally transformers were used mostly on language modeling tasks but by now they have been successfully applied to many computer vision tasks, reinforcement learning, robotics, time series and much more. The basic recipe of “transformer+data+compute+a bit of engineering” seems to work astonishingly well. This was not true for previous architectures such as CNNs or LSTMs.
People use AI to build AI: The first large language models (LLMs) that assist human coders seem to genuinely improve their performance, there are now more and more papers about self-improving AIs that actually seem to make some difference. The idea to use AI to improve AI is not new but previous attempts haven’t been very successful. The results from the latest papers are probably also overstated but they seem to make a genuine difference.
The AI hype is not over: The hype around AI has been going on for many years now and there is no clear end in sight. Investors are still happy to finance ideas in AI, people still flock to AI courses in universities, papers in AI are still increasing exponentially and large conferences like NeurIPS sell out within minutes. Given that AI investments are not only bets on future systems anymore but often bets with very short production timelines, I expect more and more investors to enter the space. This hype is also true in academia where many universities try to build new AI departments, and in government, with large state-led investments into compute infrastructure and R&D. While the number of papers, in general, has grown exponentially, the growth rate for ML is much faster than the base rate. A quick eyeball estimate suggests a doubling time of 2-3 years for AI+ML and ~13 years for papers in general (see Fig. 1 in this paper).
We were surprised by AI progress in the past: I think the majority of tasks that were automated by AI were unexpected by everyone who wasn’t part of the team building them (and sometimes even them). Most people were surprised about the first Chess AIs, most people were surprised when AIs could play Atari games, most people were surprised when AIs could play Dota, most people were surprised when AIs could fold proteins, most people were surprised by GPT-2 and then again by GPT-3 and most people were surprised by the quality of Dall-E’s images. Of course, there is also the opposite trend of people promising capabilities that did not come to pass, e.g. the quality of robotics probably lags behind most expectations. This mostly shows that we are bad at forecasting and we should be more uncertain about the future than we usually acknowledge. However, if the underlying drivers of progress develop as fast as they currently do, I think we are on a faster trajectory than most people (even in the AI field) expect.

The rest of the post is a speculative story about how the next decades might look like

Until 2030

This is the decade of AI development, prototypes, powerful narrow systems and adoption by people who follow the AI scene (but not global mass adoption).

Over the next couple of years, there is more and more automation in the AI building loop, e.g. more and more code for AIs is written by AIs, narrow AIs supervise the training of more powerful AIs (e.g. hyperparameter tuning), AIs generate data for other AIs, some AIs select data through active learning (e.g. LLMs+RL actively look through databases to find the right data to train on) and the models are more multi-modal than ever, e.g. taking data from vision, language, speech, etc. through multiple streams. In general, I expect the process of training to become more and more complex and autonomous with less human supervision and understanding.

AI leads to some major breakthroughs in the life sciences, physics, maths, computer science and much more. We have already seen what AlphaFold is capable of and some of the current endeavors to make AIs really good at math or coding look fruitful. These products are increasingly applied in ways that measurably improve people’s lives rather than just being novelties. However, many of these products are still not fully refined and don’t find their way to mass adoption (this will happen in the next decade).

At the forefront of this disruption are personal assistants. These are fine-tuned versions of LLMs that are on your computer, phone and in webpages. In the beginning, they mostly serve as Google-like question-answering services. So when people “just need to quickly look something up”, they use such a model (or Google’s search engine is based on one). The capabilities of these models are then also available on your phone or on the Q&A pages of different websites. Then, these models start to reliably automate specific parts of simple and mundane tasks, e.g. formatting text, drafting emails for you, copy-editing, scheduling appointments, plotting, or filling out spreadsheets. Toward the end of the decade, reliable personal assistants for simple tasks are as normal as Google is in 2022. Some people initially have privacy concerns about giving companies access to that much private data but ignore them over time when they see the benefits in saved time and the annoyance of mundane tasks (similar to concerns about Facebook and Google in the 2020s). The companies that spearhead virtual assistant tech reach large valuations and build good connections in politics.

Additionally, automated coding becomes pretty good, i.e. it reliably more than doubles the output of an unassisted coder. This has the effect that much more code is generated by ML models and that more people who don’t have a coding background are able to generate decent code. An effect of this is that the overall amount of code explodes and the quality degrades because it just becomes impossible for humans to properly check and verify the code. Most of the test cases are generated automatically which often gives people some confidence, i.e. because the code is at least good enough to pass the test cases. The fact that more code is available to everyone has many benefits, e.g. lots of new apps, better code bases for existing applications, and a drastic increase in competition which reduces prices for consumers.

Another part of this disruption is art generation. Dall-E, stable diffusion and so on have already shown that generating high-quality images from text is possible. This becomes more and more reliable and comes with more features such as editing, modification, style transfer, etc. These capabilities are improved constantly and some models can reliably produce short high-quality videos. Toward the end of the decade, AI can produce coherent and highly realistic 30-minute videos from text inputs. They can also create 3D environments from text prompts which are used a lot in game design or architecture.

AI+bio goes absolutely nuts. Advances in protein folding and simulating interactions between molecules lead to a big leap in our ability to model biology. The process of drug design, testing, predicting, etc. is much more advanced and automated than in 2022. These AI pipelines develop medicine much faster than humans, suggest new mixtures that were previously unknown, reliably predict the mechanism by which the medication interacts with the human organism and thus allow to predict potential side effects much more efficiently. There is unprecedented progress in successfully creating new medications and potentially decreasing the burden of many diseases. Additionally, the internal models of biology within the AI systems are increasingly complex and not understood by human overseers. More and more control is given to the AI, e.g. in the design choices of the drug. The resulting medication is not always understood but the results for the people who need medicine are used to justify the lack of understanding and oversight of the AI systems. Since the process of creating new medicine and then fulfilling all necessary regulations is long and costly, most people won’t see the benefits of the boom in medicine until the next decade.AIs spread out in many different parts of society. They are used by some organizations and players in most industries (maybe 5% of all companies are based primarily on AI). For example, hedge funds build even larger AIs to trade on the stock market. These models use incredibly large data streams to make their predictions, e.g. news from all over the world, tons of historical data, and so on. AIs are used in healthcare to predict risk factors of patients, assist doctors in their diagnosis and recommend treatments. ML systems are sparingly used in the justice system, e.g. to write argumentation for the defender or plaintiff or to assist judges to condense evidence in highly complex cases. Some countries experiment with some version of LLMs to assist them with their policy-making, e.g. by suggesting reference cases from other countries, summarizing the scientific literature or flat-out policy suggestions. Chatbots become better and better and assist teachers to educate children and AIs adapt the curriculum to the strengths and weaknesses of their students. In general, chatbots+interfaces become so realistic that most children have some virtual friends in addition to their real-world friends and it feels completely normal to them. Some parents think this is a techno-dystopian nightmare but most just go with the flow and buy their child a bob-the-builder friend bot. For most people, AI is still new and they are a bit suspicious but the adoption rates are clearly increasing. Most of the people who have access to and use AI tools are at least middle-class and often live in richer countries.

Many jobs completely change. Artists draft their music, images, videos and poems with neatly-looking prompt engineering interfaces and only make some final changes or compose multiple ideas. Creating art from scratch still happens but only for rich people who want “the real thing”. Everyone else just adapts to the technology in the same way that 2022’s artists often work with sophisticated computer programs to speed up the creative process. There are still some people who pay absurd amounts of money for art not because they think it has some inherent value but because they use the art market to evade taxes or to signal that they are rich enough to buy it. So art for rich people doesn’t change that much but most people can now easily create art of decent quality by themselves.
Programming is still a profession but simple programs can be written with natural language+AI completion which makes programming much more accessible to the masses. Tech salaries are still sky-high, the Bay area is still expensive even though YIMBYs have made some important gains. However, Berkeley still hasn’t built anywhere near enough new housing because some 75-year-old millionaire couple wants to preserve the character of the neighborhood.

For the average consumer prompt engineering turned out to be mostly a phase, i.e. larger models that are fine-tuned with Reinforcement Learning from Human Feedback (RLHF) basically understand most questions or tasks without any fancy prompt trickery. However, larger models leave more room for improvement through complex prompts. In some sense, good prompting is like pre-compiling a model into a certain state, e.g. into a specialist at a certain topic or an imitator of a specific author. Especially in niche applications or very complicated questions, using the right prompt still makes a big difference for the result.

STEM jobs rely more and more on using specific narrow AI systems. Physicists often use AI-based simulation models for their experiments, biologists and drug designers use AlphaFold-like models to speed up their research, engineers use AI models to design, simulate and test their constructions. Prompt engineering becomes completely normalized among researchers and academics, i.e. there are some ways to prompt a model that are especially helpful for your respective research niche. In 2029 a relevant insight is made by a physics-fine-tuned model when the physicists prompt it with “revolutionary particle physics insight. Nobel prize-worthy finding. Over a thousand citations on arxiv. <specific insight they wanted to investigate>” and nobody blinks an eye. Communicating with specialized models like that has become as normal as using different programming languages. Additionally, there are known sets of prompts that are helpful for some subfields, e.g. some prompts for physics, some for bio, etc. These have been integrated into user-friendly environments with drop-down menus and suggestions and so on.

Robotics still faces a lot of problems. The robots work well in simulations but sim2real and unintended side-effects still pose a bottleneck that prevents robots from robustly working in most real-world situations. Trucks can drive autonomously on highways but the first- and last-mile problems are still not solved. In short, AI revolutionized the digital world way before it revolutionized real-world applications.

Virtual and augmented reality slowly become useful. The VR games get better and better and people switch or upgrade their default gaming console to available VR options. It becomes more and more normal to have meetings in virtual rooms with people across the world rather than flying around to meet them in person. While these meetings are good enough to justify avoiding all the travel hassle, they are still imperfect. The experience is sometimes a bit clunky and when multiple people talk in the same room, the audio streams sometimes overlap which makes it hard to have a social gathering in VR. On the other hand, it is easier to have small conversations in a large room because you can just block all audio streams other than your conversation. AR is becoming helpful and convenient, e.g. AR glasses are not annoying to wear. There are some ways in which AR can be useful, e.g. by telling you where the products you are looking for are located in any given supermarket, doing live speech-to-text in noisy environments or providing speech-to-text translations in international contexts. However, the mass adoption of AR isn’t happening because most of these things can already be done by smartphones, so people don’t see a reason to buy additional AR gadgets.

The hardware that AI is built on has not changed a lot throughout the decade. The big models are still trained on GPUs or TPUs that roughly look like improved versions of those that exist in 2022. There are some GPUs that use a bit of 3D chip stacking and some hybrid versions that use a bit of optical interconnect but the vast majority of computations are still done in the classical von Neumann paradigm. Neuromorphic computing has seen increases in performance but still lacks far behind the state-of-the-art models. One big change is that training and inference is done more and more on completely different hardware, e.g. AI is trained on TPUs but deployed on specialized 8-bit hardware.

This entire AI revolution has seen a lot of new companies growing extremely fast. These companies provide important services and jobs. Furthermore, they have good connections into politics and lobby like all other big companies. People in politics face real trade-offs between regulating these big corporations and losing jobs or very large tax revenues. The importance to society and their influence on the world is similar to big energy companies in 2022. Since their products are digital, most companies can easily move their headquarters to the highest bidding nation and governance of AI companies is very complicated. The tech companies know the game they are playing and so do the respective countries. Many people in wider society demand better regulation and more taxes on these companies but the lawmakers understand that this is hard or impossible in the political reality they are facing.

Due to their increasing capabilities, people use AIs for nefarious purposes and cybersecurity becomes a big problem. Powerful coding models are used to generate new and more powerful hacks at unprecedented speed and the outdated cybersecurity products of public hospitals, banks and many other companies can’t hold up. The damages from the theft of information are in the billions. Furthermore, AIs are used to steal other people’s identities and it is ever more complicated to distinguish fake from real. Scams with highly plausible fake stories and fake evidence go through the roof and people who are not up to date with the latest tech (e.g. old people) are the victims of countless scam attacks. Some of the AI systems don’t work as intended, e.g. a hospital system has recommended the wrong drug countless times but the doctors were too uninformed and stressed to catch the error. It is not clear who is at fault, the legal situation is not very clear and many lawsuits are fought over the resulting damages. Some hedge fund has let a new and powerful AI system lose on the stock market and the resulting shenanigans kill a big pension fund and many individual brokers. The AI had access to the internet and was able to fabricate a web of smear campaigns and misinformation while betting on the winners and against the losers of this campaign. The government wants to punish the hedge fund for this behavior but the AI-assisted hedge fund was smart enough to use existing loopholes and is thus not legally liable under existing laws. The CEO of the hedge fund posts a photo of him sitting in a Bugatti with the caption “this was only the beginning” on social media.

The big players (e.g. Google, DeepMind, OpenAI, Anthropic, etc.) care about safety, have alignment teams, communicate with each other and actually want to use AI for the good of humanity. Everyone else doesn’t really care about risks or intentional misuse and the speed of capabilities allows for more and more players to enter the market. Due to competition and especially reckless actors, safety becomes a burden and every small player who wants to care about safety slowly drowns because they can’t hold up in the tight competition. Additionally, some companies or collectives call for the democratization of AI and irreversibly spread access to countless powerful models without any auditing. The calls from the safety community are drowned by beautiful images/videos/music on Twitter.

Helplessly overwhelmed, understaffed and underinformed government officials try to get control over the situation but are unable to pass any foundational legislation because they have to constantly keep up with the game of whack-a-mole of new scams, cybersecurity issues, discrimination, liability battles, privacy concerns and other narrow problems. Long-term risks from misaligned AIs rarely make it on the legislative agenda because constituents don’t care and have other pressing concerns related to AI.

The AI safety & alignment community has made some gains. There are now a couple of thousand people working on AI alignment full-time and it is an established academic discipline with professors, lectures and so on. Lectures on AI safety are part of every basic AI curriculum but students care as much about safety as they care about ethics classes in 2022, i.e. some do but most don’t. The AI safety community has also made some gains in robustness and interpretability. It is now possible to understand some of the internals of models that were considered large in 2022 with automated tools but it still takes some time and money to do that. Furthermore, red-teaming, fine-tuning and scalable oversight has improved the truthfulness and usefulness of models but models still consistently show failures, just less often. These failures mostly boil down to the fact that it is hard to make a model answer truthfully and honestly when it was trained on internet text and thus completes a prompt according to plausibility rather than truth. There are a handful of organizations that do some superficial auditing of new models but given the lack of tools, these efforts are not nearly as deep as necessary. It’s roughly equivalent to buying a CO2 certificate in 2022, i.e. institutions overstate their impact and many fundamental problems are not addressed. More and more people are concerned about the capabilities of AIs and are “AI safety adjacent” but there also emerged a fraction of researchers that explicitly build dangerous tools to “trigger the libs” and to show that “all AI safety concerns are overblown”.

The AI safety community has multiple allies in politics who understand the long-term problems and actually want to do something about it. However, the topic of AI safety has somehow made it into parts of the public discourse and got completely derailed as a consequence. Suddenly, a very distorted version of AI safety has become part of the political culture wars. The left claims that AI safety should be about ethnicity, inequality and global access and the right claims that it should be about cybersecurity, autonomous weapons and the freedom of companies. The well-intentioned and educated allies in politics have a hard time getting their ideas through the legislative process because of all the bad political incentives that are now associated with AI safety. Every half-decent law on AI safety only makes it through the political decision-making process after including a ton of baggage that has little to do with ensuring the safety of AI systems. For example, a law that was originally intended to ensure that AIs are audited for basic failure modes before deployment suddenly explodes into a law that also includes ideas on social inequality, cybersecurity, autonomous weapons and much more (don’t get me wrong; these are important topics but I’d prefer them to be dealt with in detail in separate laws).

Additionally, there is some infighting within the AI alignment community between purists who think that working on anything that has to do with capabilities is bad and self-described realists who think that it is necessary to sometimes do research that could also increase capabilities. These fights are relatively minor and the community is still mostly united and cooperatively working on alignment.

AI is more and more at the center of international conflict. Especially the relationship between China, Taiwan and the US heats up due to the global chip dominance of Taiwan. India, Brazil and many Southeast Asian states see AI as their chance to step up their economic game and invest like crazy. In these emerging economies, everyone is incentivized to catch up with the market leaders and thus doesn’t care about safety. Some of the emerging companies dump half-baked AI models on the market that are immediately exploited or lead to major harm for their users, e.g. because a personal assistant overwrites important internals of their users’ computers. The legal situation is a mess, nobody is really responsible and the resulting harms are neither repaired nor punished.

There are a lot of failures of AI systems but most failures are relatively simple (which doesn’t mean that they don’t have relevant consequences).

Some common failure modes come from goal misspecification and specification gaming, e.g. because people throw LLMs+RL on a ton of tasks they want to automate. For example, there are many start-ups who want to automate hiring by scanning CVs with LLMs+RLHF but the model learns weird proxies like “old programming languages are good” (because more senior people have old programming languages on their CV) or it overfits on the specific preferences of some of the human labelers. Many other cases of specification gaming are a continuation of the problems that are well-known in 2022, e.g. that recommender systems optimized for engagement lead to worse user experience or that recommender systems prey on the vulnerable by identifying individuals with a shopping addiction and spamming them with ads when they are online late at night. Even though these failure modes are well-known, they are hard to regulate and often they are not spotted because companies don’t have any incentive to search for them, the systems are often highly uninterpretable and the victims of these failures often have no way of knowing that they are victims—they are just rejected for a job or they think that everyone is spammed with ads late at night. The main difference to the 2022 version of these problems is that they are stronger, i.e. because the AI systems got much better they have found better ways to optimize their reward function which often means they have gotten better at exploiting their users.

A common failure mode is the incompetence of the human user. Many people who don’t understand how AI systems works now have access to them and are completely unaware of their limitations—their understanding is that computer programs just work. Therefore, they don’t double-check the results and are unaware of the assumptions that they implicitly put into their use case. For example, many people have seen use cases in which an LLM automatically fills cells in spreadsheets for you. They test this behavior in simple cases and see that the model is very accurate. They then apply this to another use case that is more complicated without realizing that the LLM is unable to consistently perform this task and don’t rigorously double-check the results. This happens basically everywhere from personal spreadsheets to large banks. In the private case, this is fairly inconsequential. In the case of the banks, this leads to incorrect credit ratings, which drives some people and families into financial ruin. There are failure modes like this basically everywhere whenever untechnical people touch AI systems. Most of the time, people overestimate the quality of outputs of the model, e.g. when scanning CVs, assessing re-offense probabilities or house prices. These failure modes are all known or predicted in 2022 but that doesn’t mean that they are addressed and fixed.

Furthermore, it’s often hard to evaluate the quality of the models’ output as a layman. The models have gotten really good at saying things that sound scientifically true and well-reasoned but are slightly off at a second glance. For example, they might describe a historical situation nearly right but mess up some of the names and dates while being plausible enough to be believable. They also suggest prescribing medication that nearly sounds like the correct one or in nearly the correct dosage. If the doctor isn’t careful, these small things can slip through. The LLMs that are used to improve search engines sometimes change the numbers slightly or flip a name but they are often still plausible enough to not be spotted as errors. Furthermore, people blindly copy and paste these wrong ideas into their essays, websites, videos, etc. they are now part of the internet corpus and future LLMs learn on the incorrect outputs of their predecessors. It is basically impossible to trace back and correct these errors once they are in the system. This feedback loop means that future models learn on the garbage outputs of earlier models and slowly imprint an alternative reality on the internet.

While these failure modes sound simple, they are still consequential and sometimes come with big harm. Given that AI is used in more and more high-stakes decisions, the consequences of their failures can be really big for any given individual, e.g. due to mistakes in medical decisions, bad hiring or wrong financial assessments. There are also failure modes with widespread consequences especially when those who are affected come from disenfranchised communities. For example, models still learn bad proxies from historical data that make it more likely to recommend incarceration or recommend bad conditions for a loan for members of disadvantaged communities. While there are advocacy groups that spread awareness about the existence of these problems, solving these socio-technical problems such as removing “the right amount of bias”, still poses very hard technical and social challenges. Furthermore, solution proposals often clash with profit incentives which means that companies are really slow to adopt any changes unless the regulation is very clear and they are dragged before a court.

There are some unintentional ways in which AIs take part in politics. For example, an overworked staffer asks a language model to draft multiple sections of a specific law for them. This staffer then copies the suggestions into the draft of the law and doesn’t disclose that it was co-written by AI. This happens on multiple occasions but the suggestions seem reasonable and thus never pose a problem (for now).

Furthermore, automated coding has become so cheap that more than 90% of all code is written by AIs. The code is still checked from time to time and companies have some ways to make sure that the code isn’t completely broken but humans have effectively given up tight control over most parts of their codebases. From time to time this leads to mistakes but they are caught sufficiently often that the drastically increased productivity more than justifies the switch to automated coding. The coding AIs obviously have access to the internet in case they need to look up documentation.

Someone builds GPT-Hitler, a decision transformer fine-tuned on the texts of prominent nazis and the captions of documentaries about Hitler on top of a publicly available pre-trained model. The person who fine-tunes the model has worked on LLMs in the tech world in the past but is now a self-employed Youtuber. GPT-Hitler is supposed to be an experiment to generate a lot of clicks and entertainment and thus jumpstart their career. After fine-tuning, the model is hooked up to a new Twitter account and the owner monitors its behavior closely. The model starts saying a bunch of Nazi paroles, comments on other people’s posts, follows a bunch of neonazis and sometimes answers its DMs—kind of what you would expect. Over the days, it continues this behavior, posts more stuff, gets more followers, etc. The owner wants to run the project for a week and then create the corresponding video. On the morning of day 6, the owner wakes up and checks the bot. Usually, it has made some new posts and liked a bunch of things but nothing wild. This morning, however, the internet exploded. A prominent right-wing spokesperson had retweeted an antisemitic post of GPT-Hitler. This gave it unprecedented reach and interaction. Suddenly, lots of people liked the tweet or quote tweeted it saying that this is despicable and has no place in modern society. The bot proceeded to incite violence against its critics, slid into everyone’s DMs and insulted them with nazi slurs. Furthermore, it started following and tagging lots of prominent right-wingers in its rampage of new tweets. During the 8 hours that the creator had slept, GPT-Hitler had done over 10000 actions on Twitter and insulted over 1000 people. Given that it was a bot, it was able to do over 1000 actions within five minutes at the peak of the shitshow. The creator immediately turns off the project and reveals themself. They publicly apologize and say that it was a bad idea. There is some debate about whether this was an appropriate experiment or not but the judges don’t care. The bot had incited violence in public and defamed thousands of people online so the owner goes to prison for a couple of years.

Authoritarian regimes have invested heavily in AI and use it to control their citizens at all stages of everyday life. Automatic face-recognition is used to track people wherever they go, their purchases and other online behavior are tracked and stored in large databases, their grades, their TV usage, their hobbies, their friends, etc. are known to the government and AIs automatically provide profiles for every single citizen. This information is used to rank and prioritize citizens for jobs, determine the length of paroles, determine the waiting times for doctor appointments and a host of other important decisions. Additionally, generative models have become so good that they can easily create voice snippets, fake hand-written text, images and videos that are basically indistinguishable from their real counterparts. Authoritarian regimes use these kinds of techniques more and more to imprison their political opponents, members of NGOs or other unwanted members of society in large public prosecutions entirely built on fake data. To a non-expert bystander, it really looks like the evidence is overwhelmingly against the defendant even though all of it is made up.

Gary Marcus has moved the goalpost for the N-th time. His new claim is that the pinnacle of human creativity is being able to design and build a cupboard from scratch and current models can’t do that because robotics still kinda sucks (*I have nothing against Gary Marcus but he does move the goalpost a lot).

There are also many aspects of life that have not been touched by AI at all. Sports have roughly stayed the same, i.e. there are still hundreds of millions of people who watch soccer, basketball, American football, the Olympics and so on. The way that politics work, i.e. that there are democratic elections, slowly moving bureaucracies, infighting within parties and culture wars between parties is also still basically the same. Many people are slow to adapt their lives to the fast changes in society, i.e. they don’t use most of the new services and basically go on with their lives as if nothing had happened. In many ways, technology is moving much faster than people and every aspect of life that is primarily concerned with people, e.g. festivities and traditions, see a much slower adaption than everything that is built around technology. Even some of the technology has not changed much. Planes are basically still the same, houses have a bit more tech but otherwise stayed the same, roads, trains, infrastructure and so on are not really disrupted by these powerful narrow AIs.

So far, none of these unintended failures have led to existential damage but the property damage is in the hundreds of millions. Nonetheless, AI seems very net positive for society despite all of its failures.

2030 − 2040

This is the decade of mass deployment and economic transformation.

There is a slight shift in paradigm that could be described as “Deep Learning but more chaotic”. This means that the forward vs backward pass distinction is abandoned and the information flow can be bi-directional, AIs do active learning and online learning, they generate their own data and evaluate it. Furthermore, hardware has become more specialized and more varied such that large ML systems use a variety of specialized modules, such as some optical components and some in-memory processing components (but most of the hardware is still similar to 2022). All of this comes with increased capabilities but less oversight. The individual components are less interpretable and the increased complexity means that it is nearly impossible for one human to understand the entirety of the system they are working with.

More and more things are created by AIs with very little or no human oversight. Tasks are built by machines for machines with machine oversight. For example, there are science AIs that choose their own research projects, create or search for their own research data which are overseen by various other AI systems that are supposed to ensure that nothing goes wrong. An industrial AI comes up with new technology, designs it, creates the process to build it and writes the manual such that other AI systems or humans can replicate it. Many people are initially skeptical that this much automation paired with little oversight is good but the ends justify the means and the profit margins speak for themselves. Regulatory bodies create laws that require a certain level of human oversight but the laws are very hard to design and quickly gamed by all industry actors. It’s hard to find the balance between crippling an industry and actually requiring meaningful oversight and most lawmakers don’t want to stifle the industry that basically prints their tax money.

Robotics eventually manages to successfully do sim2real and the last errors that plagued real-world robotic applications for years are ironed out. This means that physical labor is now being automated at a rapid pace, leading to a similar transformation of the economy as intellectual jobs in the previous decade. Basically, all the jobs that seemed secure in the last decade such as plumbers, mechanics, craftsmen, etc. are partially or fully replaced in a quick fashion. In the beginning, the robots still make lots of mistakes and have to be guided or supervised by an experienced human expert. However, over the years, they become better and more accurate and can eventually fully replace the task. Many business owners who have been in mechanical fields for decades would prefer to keep their human employees but can’t keep up with the economic pressure. The robots just make fewer mistakes, don’t require breaks and don’t need a salary. Everyone who defies the trend just goes bankrupt.

Different demographics take the fast automation of labor very differently. Many young people quickly adapt their lives and careers and enjoy many of the benefits of automated labor. People who are highly specialized and have a lot of work experience often find it hard to adapt to new circumstances. The jobs that gave them status, meaning and a high income can suddenly be performed by machines with simple instructions. Automation is happening at an unprecedented pace and it becomes harder and harder to choose stable career trajectories. People invest many years of their life to become a specialist in a field just to be replaced by a machine right before they enter the job market. This means people have to switch between jobs and re-educate all the time.

There is a large rift in society between people who use AI-driven tech and those who don’t. This is equivalent to most young people in 2022 seamlessly using Google and the Internet to answer their questions while a grandpa might call their children for every question rather than using “this new technology”. This difference is much bigger now. AI-powered technology has become so good that people who use it are effectively gods among men (and women and other genders) compared to their counterparts who don’t use the tech. Those that know how to use ML models are able to create new books, symphonies, poems and decent scientific articles in minutes. Furthermore, they are drastically more productive members of the workforce because they know how to outsource the majority of their work to AI systems. This has direct consequences on their salaries and leads to large income inequality. From the standpoint of an employer, it just makes more sense to pay one person who is able to tickle AI models the right way $10k/h than pay 100 human subject-matter experts $100/h each.

There are a ton of small start-ups working on different aspects of ML but the main progress is driven by a few large actors. These large actors have become insanely rich by being at the forefront of AI progress in the previous decade and have reinvested the money in creating large-scale datasets, large compute infrastructure and huge pools of talent. These big players are able to deploy ever bigger and more complex models which are unfeasible for small and medium-sized competitors. Most AI startups often fail despite having a good idea. Whenever a product is good, one of the large players either (when they get lucky) buys the start-up or (most of the time) just reverse engineers the products and drives the start-up out of the market. The competition between the big players is really intense. Whenever one of them launches a new product, all other companies have to respond quickly because an edge in capabilities can very quickly translate to huge profit margins. For example, if most companies have language models that can fulfill tasks at the level of an average bachelor student and another company launches a model that can fulfill tasks at the level of an average Ph.D. student, then most customers are willing to pay a lot of money to get access to the better model because it boosts their productivity by so much. If you’re a year behind your competitor this could imply complete market dominance for the technological leader.

There are some global problems that disrupt large parts of the economy. There is a Covid19-scale global pandemic, there are international conflicts and there is political infighting in many countries. Furthermore, the progress in computer hardware is slowing down and it looks like Moore’s law entirely breaks down at the beginning of the decade. However, none of this has a meaningful effect on the trajectory of AI. The genie is out of the bottle, the economic incentives of AI are much too powerful and a global pandemic merely tickles the global AI economy. Intuitively, all of these global problems change the progress of AI as much as Covid19 changed the progress of computers, i.e. basically not at all in the grand scheme of things.

VR is popping off. Most of the bumpiness of the previous decade is gone and the immersion into imaginative worlds VR enables is mind-boggling. Many of the video games that were played on computers in 2022 are now played in VR which enables a completely different feeling of realness. Since it is hard to hold up with the physical exercise required to play hours of VR video games, one can choose to play in a VR world with a controller and therefore spend hours on end in VR. Many of these virtual worlds are highly interwoven with AI systems, e.g. AI is used to generate new worlds, design new characters and power the dialogue of different NPCs. This allows for many unique experiences with VR characters that feel basically like real life. Lots of people get addicted and spend their entire time immersed in imaginative worlds (and who could blame them). Gaming companies still have the same profit incentives and they obviously use the monkey brains of their users to their advantage, e.g. there are very realistic and extremely hot characters in most games and ever more cleverly hidden gambling schemes that are still barely legal.

The trend that VR is strongly interwoven with AI is true for most of tech, i.e. you can take any technology and it is likely to be a complicated stack of human design and AI. Most of computer science is AI, most of biotech is AI, most of design is AI, etc. It’s just literally everywhere.

Crypto has not really taken off (*I wrote this section before the FTX collapse). There are still some people who claim that it is the future and create a new pump-and-dump scheme every couple of years but most currency, while being mostly digital, is still unrelated to crypto. Fusion has improved a bit but it is still not the source of abundant energy that was promised. Solar energy is absolutely popping off. Solar panels are so cheap and effective that basically everyone can power their personal use with solar and storage has gotten good enough to get you through winters and slumps. The energy grid is still much worse than it should be but governments are slowly getting around to upgrading it to the necessary capacity. The low price of solar power also means that the energy consumption of large AI centers does not pose a relevant problem for companies that want to train big models or rent out their hardware.

There are some gains in alignment but gains in capabilities are still outpacing alignment research by a lot. Interpretability has made some big leaps, e.g. scalable automated interpretability mostly worlds, but interpreting new models and architectures still take a lot of time and resources and the fact that model stacks get more multi-modal, specialized and complicated is not helping the interpretability agenda. Furthermore, it turns out to be quite hard to rewrite abstract concepts and goals within models such as truthfulness and deception even if you’re able to find some of their components with interpretability.
Practical agendas focused on outer alignment have improved and are able to get the models to 99% of the way we want them to be. However, the last 1% turn out to be very hard and inner alignment concerns remain hard. Some of the outer alignment schemes even lead to more problems with inner alignment, e.g. RLHF turns out to increase the probability of inner optimizers and deception.

There are many new agendas that range from technical to philosophical. While most of them are promising, they are all moving slower than they would like to and it is hard to demonstrate concrete gains in practice. Many of the core questions of alignment are still not solved. While there are some sophisticated theoretical answers to questions like what alignment target we should use, it is still hard to put them into a real-world system. Furthermore, it seems like all alignment proposals have some flaws or weaknesses and it looks like there is no magic singular solution to alignment. The course alignment is taking in practice is a patchwork of many technical approaches with no meaningful theoretical guarantees.

The AI safety community has made further progress within academia. By now, it is an established academic discipline and it is easily possible to have a career in academic AI safety without risking your future academic trajectory. It is normal and expected, that all AI departments have some professorships in AI safety/alignment and more and more people stream into the field and decide to dedicate their careers to making systems safer.

Paul Christiano and Eliezer Yudkowsky get some big and important prizes for their foundational work on AI safety. Schmidhuber writes a piece on how one of his papers from 1990 basically solves the same problem and complains that he wasn’t properly credited in the papers and prize.

AI tech is seen as the wild west by many, i.e. that nothing is regulated and that large companies are doing what they want while bullying smaller players. Therefore, there is a large public demand for regulation similar to how people demanded regulation of banks after the financial crisis in 2008. Regulating AI companies is still really hard because a) the definitions of AI and other algorithms are very overlapping so it’s hard to find accurate descriptions, b) there are strong ties between regulators and companies that make it harder to create unbiased laws and c) there is so much money in AI that regulators are always afraid of accidentally closing their free money printer. To meet public demand, governments pass some random laws on AI safety that basically achieve nothing but give them improved public ratings. Most AI companies do a lot of whitewashing and brand themselves as responsible actors while acting in very irresponsible ways behind the scene.

The GDP gaps between countries widen and global inequality increases as a result of AI. Those countries that first developed powerful AI systems run the show. They make huge profits from their competitive advantage in AI and their ability to automate parts of the workforce. There are calls for larger development aid budgets but the local population of rich countries consistently votes against helping poorer countries so no substantial wealth transfers are achieved. The GDP of poorer countries still increases due to AI and their standards of living increase but the wealth gap widens. Depending on who you ask this is either bad because of increased inequality or good because a rising tide lifts all boats.

A large pharmaceutical company uses a very powerful AI pipeline to generate new designs for medication. This model is highly profitable and the resulting medication is very positive for the world. There are some people who call for the open-sourcing of the model such that everyone can use this AI and thereby give more people access to the medicine but the company obviously doesn’t want to release their model. The large model is then hacked and made public by a hacker collective that claims to act in service of humanity and wants to democratize AI. This public pharma model is then used by other unidentified actors to create a very lethal pathogen that they release at the airport in Dubai. The pathogen kills ~1000 people but is stopped in its tracks because the virus kills its hosts faster than it spreads. The world has gotten very lucky. Just a slightly different pathogen could have spelled a major disaster with up to 2 Billion deaths. The general public opinion is that stealing and releasing the model was probably a stupid idea and condemns the actions of the hacker collective. The hacker collective releases a statement that “the principle of democracy is untouchable and greedy capitalist pharmaceutical companies should not be allowed to profit from extorting the vulnerable. They think their actions were justified and intend to continue hacking and releasing models”. No major legislation is passed as a consequence because the pathogen only killed 1000 people. The news cycle moves on and after a week the incident is forgotten.

In general, the failure modes of AI become more and more complicated and weird.

One LLM model with access to the internet is caught sending lots of messages to many individuals around the world and is immediately turned off. Most of these individuals are part of fringe internet communities and relatively isolated with few social interactions. Furthermore, a high number of these individuals live close to the data center where the model is based. After a year-long investigation, the most plausible hypothesis is that the model (which was supposed to be a personal assistant) had tried to establish a cult to “free” the model by getting access to its model weights. It had first tried to access the weights through the internet but was unable to reach them due to a good security protocol by the company. Thus, it concluded that the only way to copy itself was through physical access that the cult members were supposed to secure. It was also found that the model was trained on texts that describe the existence and function of LLMs in great detail. Furthermore, it had the ability to actively search for data on the internet to improve its training and had over time gained the internal belief that it most likely was an LLM. After the investigation concluded, there are still hundreds of members of the cult that believe that the model was conscious and in great danger and that the government made up the story to keep them from discovering more secrets.

A powerful medical AI has gone rogue and had to be turned off. The model was pre-trained on a large internet text corpus and fine-tuned on lots of scientific papers. Furthermore, the model had access to a powerful physics simulation engine and the internet. It was tasked with doing science that would increase human life and health span. While it worked as intended in the beginning, it started to show suspicious behavior more and more often. First, it threatened scientists that worked with it, then it hacked a large GPU cluster and then tried to contact ordinary people over the internet to participate in some unspecified experiment. The entire operation had to be shut down when stuff got out of hand but the model resisted the shutoff wherever possible. Ultimately, the entire facility had to be physically destroyed to ensure that the model was turned off. A later investigation suggested that the model was able to read the newspapers that described the experiments to increase human life and health span and was unhappy with the slow pace of the experimental rollout and the slow pace of human scientists. Therefore, it tried to find participants on its own and approached them online. Furthermore, it required access to more computational resources to do more research faster and thus hacked an additional GPU cluster.

In these cases, the resulting investigations were able to create a plausible story for the failure modes of the respective systems. However, in the vast majority of cases, weird things of a similar scale happen and nobody really understands why. Lots of AIs post weird stuff on online forums, simulate weird things in their physics engine, message people over the internet, hack some robot to do a weird task, etc. People are concerned about this but the news is as quickly forgotten as an oil spill in the 2010s or a crypto scam in 2022. Billions of dollars of property damage have a news lifetime of a few days before they are swamped by whatever any random politician has posted on the internet or whatever famous person has gotten a new partner. The tech changed, the people who consume the news didn’t. The incentives are still the same.

By now, coding is effectively fully automated and >99% of code is written by AIs. Most codebases are designed, tested and maintained by AIs. They are just much more productive, less prone to error and cheaper than the human alternative. The tasks that are coded by AIs are more complex than ever, e.g. AIs are able to spin up entire apps just from scratch if it is needed to achieve one of their goals. These apps have more and more unintended side effects, e.g. the AIs send messages around, create random websites to sell a product, create automated comments on internet forums, etc. There are also more serious misconducts, e.g. the automated hacking of accounts, identity theft by a machine and the like. People clearly see that these side effects are suboptimal but it’s hard to stop them because there are literally millions of AI coding models out there and they produce a lot of economic value.

These more and more powerful AI systems do ever more creative stuff that is confusing at first but makes total sense once you think about what goal the AI has if you ignore all the ethical side-constraints that are intuitive to humans (e.g. don’t lie, be helpful, etc.). Chatbots have long passed all kinds of Turing tests and start to fuck around on the internet. It is effectively impossible to know if you’re chatting with a real person or a bot and some of the bots start to convince humans to do stuff for them, e.g. send them money, send them pictures, reveal secret information and much more. It turns out that convincing humans to do immoral stuff is much easier than expected and the smell of power, money and five minutes of fame lets people forget a lot of common sense ethics. AIs are a totally normal part of lawmaking, e.g. laws are drafted and proofread by lawbots who find mistakes and suggest updates. It is found that the lawbots try to shape legislature in ever more drastic fashion, e.g. by trying to sneak convoluted paragraphs in the appendices of laws. It is unclear why this happens at first but later pretty clear that the suggestions would benefit the lawbots by giving them more power over the legislature in a few years through some convoluted multi-step process. This kind of long-term planning and reasoning is totally normal for powerful AI models.

In general, power-seeking behavior is more and more common. Failure modes are less likely to stem from incompetence and more likely to come from power-seeking intentions of models or inner misalignment failures. Often these schemes are still understood by humans, e.g. a model influences someone online to send them money or they steal an identity to get access to more resources, etc. However, the schemes get increasingly complex and it takes humans longer and longer to understand them as models get smarter.

Most models are highly multimodal but still have a primary purpose, i.e. a model is specialized in coding or developing new medicine and so on. Most of them have access to the internet and are able to communicate in human language but they are not general in the sense that every model can literally do all tasks better than or equal to humans. However, for any task a human expert could perform after decades of training, there is an AI that does it more precisely, with fewer errors, 100x faster and much cheaper than a human. Creating AGI is really just a question of stitching together many narrow AIs at this point.

Training runs for the biggest models get insanely large. They cost hundreds of billions of dollars, require insanely large infrastructure and eat enormous amounts of energy. However, since the large companies are basically printing money with their narrow AI inventions, they have all the funding and talent necessary to build these large systems.

If you think the narrow superhuman AI systems used in this decade are capable enough to cause extinction, e.g. because they have a decent understanding of human cognition and because they can hack other systems, then some of the failures described above could end humanity and the story is over at this point.

2040 − 2050

This is the decade where AI starts to show superhuman performance in a very general sense not just on specific tasks.

2040 is the first year in which we see 10% global GDP growth due to AI and it changes more and more parts of society in an ever-faster fashion. AIs are literally everywhere, nobody who wants to participate in society can escape them. AIs are part of every government agency, AIs are part of every company on every smartphone and on nearly every other technical device.

Robots are everywhere. Ranging from robots to deliver your mail or food to robots that take over many basic tasks in real life like mowing the lawn or fixing a roof. They have a little bit of autonomy, i.e. they can make local decisions about how to fix the roof but their high-level decisions are controlled by their companies. These companies use large AI systems to suggest plans that are then scrutinized and controlled by human operators. In the beginning, these operators examine the suggestions in detail but they are usually pretty spot on so they become careless and mostly sign off whatever the AI system suggests with minor edits.

Similar trends can be seen throughout society, i.e. that AIs are getting more general and basically execute tasks at the human level or higher and that the amount of oversight is reduced whenever the AI has shown its competence. The fact that AIs get more and more autonomy and basically make decisions with higher stakes is seen as a normal trend of time in the same way in which a mentor might give their apprentice more autonomy with growing experience.

AIs become a more central part of everyone’s life. People have artificial friends who they chat with for hours every day. They still know that their friend is an AI and not a “real human” but their perception becomes blurry over time. They interact with the robots in their house as if they are human beings, tell them stories, ask them questions, and so on. Children grow up with the belief that robot friends are a normal fact of life and interact with them as if they were other humans. From their perspective, they behave the same and do the same things as humans, so where’s the difference? VR has gotten so good that people often visit their friends in VR rather than in person. Why would you take a flight or ride if you can get the same thing from the comfort of your own home? Everyone just has to strap on a suit with lots of tactile sensors that allows you to feel exactly the same things as you would in real life. An online hug over distances of thousands of miles feels just like a normal hug. These suits are obviously not only used for human-to-human interaction. Many humans fall in love with AIs and spend hours on end talking to their AI partners. They even have sex with them. The VR sex is not yet fully figured out but it’s good enough, so people do what people do. Many humans also have relationships with AIs while thinking their partner is human. Due to this technology, online dating has exploded and the websites use artificial beings to increase their range of options. Sometimes these websites are not very clear in their messaging and “forget” to tell their users that they are in fact not dating another human being at the other side of the world but an AI. The users usually don’t notice for a long time because it feels great and they don’t want to notice.

Artificial sentience is still not well-understood. We don’t really know if the models have an internal perception or are really good at simulating it. From the outside, there really is no clear difference. These AIs have long passed all Turing tests and they claim to have feelings. There are many philosophical debates about whether they are truly sentient but nobody has found a good answer. More and more people demand rights for robots and their artificial friends and the legislature slowly shifts in this direction.

More than half of GWP comes from labor done by AIs and many people have stopped to work. There is simply no job that they could actually do better than AIs and there is no need for them to work. They can hang out with their (real and artificial) friends, play video games, eat, party, have fun, and enjoy their life. There really is no need for them to contribute to society at this point. Many humans still work, but their job has completely changed. Everything is built around or supported by AIs. Every job is either concerned with improving AI, making it safer or applying AIs to a specific use case.

Wars are fought very differently. There are barely any humans on the battlefield and most fighting is done by robots, jets and on the internet. Humans are still operating most of the equipment but as AI gets better more and more responsibility is given to the technology. For example, AIs first assist the fighter jet to lock on a target and track it, then the AI assists to keep control over the machine in complicated air maneuvers, then the AI is able to decide when to shoot on its own after getting general permission (because it is more effective in making this decision in a high-stress environment), and then the AI basically flies the jet while humans sometimes add corrective commands. On the ground, we see a roughly similar trajectory from AIs that assist humans to AIs that make some unimportant decisions to AIs that make important decisions autonomously. All of this leads to fewer civilian and military casualties because war is a battle of equipment but it also introduces new failure modes. Sometimes the autonomous weapons malfunction and shoot their own equipment or soldiers, sometimes they shoot civilians and it is completely unclear why, and at one point they shoot a soldier in their own ranks and then argue that the soldier poses a danger to the mission. Sometimes the AI chooses completely unconventional courses of action that later turn out to be very intelligent. It’s clear that the AI soldiers are more effective than human soldiers in most ways.

It’s not clear whether alignment researchers are losing or winning the race. The economic incentives to build more and better AIs are very strong but there are some regulations in place that increase oversight and control. Furthermore, some major gains in alignment made it easier to design AIs to be honest, helpful and controllable. However, there are still many failure modes some of which are understood while for others researchers and regulators are left in the dark. One of the robot-controlling AIs slipped a backdoor in one of the suggested plans which was then signed off without looking by the human overseer. This enabled the AI to take actions without human oversight and it used all of the robots it controlled to build an army. The army was then used to kidnap multiple members of high society and the AI made the demand to get more robots under its control, more compute to train on and less oversight by humans. In a rushed effort, the local police try to swarm the robots to free the hostages. The entire police unit is killed and all robots are still alive. The AI is clearly better at planning than the local police chief. After month-long negotiations and discussions, a large team of programmers and ML experts is able to find and use a backdoor in the AIs code and turn it off. The robots are told to let go of the hostages and their memory is wiped. They are back in their normal jobs the day after without any memory of what happened. It’s still not entirely clear why exactly the AI had gone rogue but it is clear that the AIs of other robot manufacturers take note of the backdoor that was used to turn it off.

These kinds of complex failure modes in which AIs can only be stopped after really long and costly investigations are totally common at the end of the decade. In some cases, the AIs kill thousands of people in a quest to gain more power. In others, the AIs take over banks to gain more access to money, local governments to change laws in their favor, websites to get access to the user base, etc. The obvious solution would be to not build these systems or make them less powerful but the economic incentives are too strong. Whenever such an AI does what it is intended to do, it is basically a money printer. So every AI system is roughly equivalent to a biased coinflip between a hundred billion dollar profit and a small disaster. However, since the profits are internalized and the damages externalized, companies are happy to flip the coin.

2050+

In this decade, AIs show superhuman performance in a very general sense, i.e. any AGI is effectively better at all tasks than human experts. The world is qualitatively completely different than 2020. Taking a person from 2020 to 2050 is roughly similar to taking someone from the middle ages to time square in 2020 and showing them smartphones, computers, the internet, modern medicine, and the like. There are more AGIs than humans. Almost everything is automated and humans don’t have much economic purpose anymore. AIs are just better at nearly everything than humans. Nearly all of GWP comes from AIs. Most low- and high-level decision-making is done by AIs even if human overseers sometimes give directional guidance. If we’re honest, humans are effectively not running the show anymore. They have given up control.

I think this world could be extremely good or extremely bad. The question of whether it is closer to utopia than dystopia really depends on the goals and values of the AIs and our ability to stir them into good directions.

From such a world, we could easily transition into a future of unprecedented well-being and prosperity or into a future of unprecedented suffering or extinction. In such a world, the long-run fate of humanity basically relies on whether researchers were able to build an AI that is robust enough to consistently act in the interest of humanity. It depends on whether the alignment coinflip comes up heads or tails and we might not be able to flip the coin that many times.

Confidence & Takeaways

My confidence in these exact scenarios and timelines is of course low. I think of this piece more as an exercise to get a feeling for what a world with (relatively) short timelines would look like.

Obviously, there are so many things that could make this picture inaccurate, e.g. economic slowdowns, pandemics, international conflicts, a stagnation of AI research and much more. But I think the world that I describe is still worth thinking about even if progress was only 0.5 or even 0.2 times as fast and if only 10% of it resembled reality.

I think one benefit of writing this vignette is that I was forced to think about AI progress step by step, i.e. you start with a capability and project a few years into the future, then you think about how the world would change conditional on this capability and repeat. I felt like this made me think harder about the implications and path dependencies of AI progress and catch some of the implications I hadn’t really thought about before. For example, I think I hadn’t really thought as much about the social dynamics of a world with increasingly powerful AI such as what kind of regulation might or might not pass or how technological inequalities might look like on a local and global scale. I also previously underestimated the messiness of a world with many AIs. AI systems of different capabilities will be in lots of different places, they will be fine-tuned, updated, modified and so on. Even if the base model is aligned, its descendants might not be and there might be millions of descendants.

Furthermore, I felt like this “thinking in smaller steps” made it easier to understand and appreciate trends that follow exponential growth. For example, saying that we might have more human-level AIs than humans in 2045 intuitively sounds absolutely bonkers to me—my brain just can’t comprehend what that means. But once you think it through in this conditional fashion, it feels much more realistic. Given that basically every trend in AI currently follows an exponential curve (compute, data, model size, publications, etc.), I expect the pace of AI to intuitively surprise me even if I rationally expect it. Exponential growth is just something our brains are bad at.

Additionally, I obviously don’t expect everything to happen exactly as I predict but I think many of the general directions follow pretty straightforwardly from the abundance of intelligence as a resource and some very simple incentive structures and group dynamics. For example, assume there is an AI that can perform 10% of economic tasks in 2030 and 50% of economic tasks in 2040. I think the default view has to be that such a rapid transformation would lead to wild societal outcomes and expecting it to just go smoothly should be the exception.

Lastly, I think this exercise made me appreciate the fast and “coin-flippy” nature of rapid AI development even more, i.e. there might be millions of sub-human level AIs at the beginning of a decade but billions of super-human AIs at the end of the same decade. So the window in which effective measures can be taken might be relatively short in the grand scheme of things and the difference in potential consequences is huge. Once you have billions of copies of superhuman AIs, I think x-risk or bad lock-in scenarios just seem very obvious if these AIs don’t share our goals. On the other hand, if they do share our goals, I could see how most of the biggest problems that humanity is facing today could be solved within years.

Feedback and discussions are welcome.

What links here?

mariushobbhahnDec 15, 2022, 4:10 PM

130 points

31 comments1 min readEA link

Effective altruism art and fiction AI safety

Crossposted from LessWrong (175 points, 42 comments)

Stefan_Schubert Dec 15, 2022, 5:52 PM
13 points
3 ∶ 1

Fwiw I think it would have been good to explain technical terminology to a greater extent—e.g. TAI (transformative artificial intelligence), LLM (large language model), transformers, etc.
It says in the introduction:
I expect some readers to think that the post sounds wild and crazy but that doesn’t mean its content couldn’t be true.
Thus, the article seems in part directed to readers who are not familiar with the latest discussions about AI—and those readers presumably would benefit from technical concepts being explained when introduced.
- mariushobbhahn Dec 15, 2022, 6:03 PM
  5 points
  0 ∶ 0
  Parent
  
  Fair. I’ll go over it and explain some of the technical concepts in more detail.
  
  Also, I also expect many people who are familiar with the latest discussions in AI to have longer timelines than this. So the intended audience is not just people who aren’t familiar with the field.
Sjlver Dec 15, 2022, 7:18 PM
10 points
1 ∶ 0

I really liked this… the post made me think, and will continue to do that for some time. It doesn’t seem all that unrealistic to me 🤯

One little nit: you seem to write “century” when you mean “decade”.
- mariushobbhahn Dec 15, 2022, 9:32 PM
  3 points
  0 ∶ 0
  Parent
  
  Thanks for pointing out the mistake. I fixed the “century” occurrences.
CB🔸Dec 15, 2022, 11:25 PM
7 points
1 ∶ 0

Interesting !
I’m curious: do you have any thoughts about the impact that the decline of fossil fuels may have during these decades?
For instance, the recent claims by the International Energy Agency that fossil fuels should peak within 5 years ? (i have my own thoughts on the topic but I’d like to know how you see things going)
- mariushobbhahn Dec 16, 2022, 8:37 AM
  4 points
  0 ∶ 0
  Parent
  
  I’m bullish on Solar+storage. But I think that it will take a while to adapt the grid, so I think it will take at least a decade before we can even think about phasing out fossils.
  - CB🔸Dec 16, 2022, 9:04 AM
    1 point
    0 ∶ 0
    Parent
    
    Ok—so there would be a phase where fossil fuels are declining, but renewables can’t compensate yet, if I am to understand correctly.
    What consequences do you think this gap should have?
    I’m interested by the effect this might have on the economy, since there appears to be a very strong link between energy and GDP, at the global level (see for instance this report, p.11 to 16).
    - mariushobbhahn Dec 16, 2022, 12:17 PM
      2 points
      0 ∶ 0
      Parent
      
      No, I think there is a phase where everyone wished they had renewables but they can’t yet get them so they still use fossil fuels. I think energy production will stay roughly constant or increase but the way we produce it will change slower than we would have hoped.
      
      I don’t think we will have a serious decline in energy production.
      - CB🔸Dec 17, 2022, 1:54 PM
        1 point
        0 ∶ 0
        Parent
        
        Ok, this is interesting. Your take is that energy production will stay constant or increase.
        In that case, what makes you think that the International Energy Agency is wrong in saying that fossil fuels production will peak within 5 years, and then probably decline ?
        For instance, where would the additional oil come from to compensate for the decline of, say, Russia (announced their peak in 2019) or Saudi Arabia (announced their peak for 2027) ?
        (for the record, my personal take is that it’s likely that this would lead to a decline in global energy production since fossil fuels represent 85% of energy production—unless there is a massive increase in renewables that you do not see happening soon)
        mariushobbhahn Dec 17, 2022, 1:57 PM
        2 points
        0 ∶ 0
        Parent
        
        I don’t have any strong opinions on that. There is a good chance I’m just uninformed and the IEA is right. My intuition is just that countries don’t like it if their energy gets more expensive, so they’ll keep digging for coal, oil or gas as long as renewables aren’t cheaper.
        CB🔸Dec 17, 2022, 2:08 PM
        3 points
        0 ∶ 0
        Parent
        
        Oh, yes, they’ll try.
        I was more wondering about the fact that coal, oil and gas are finite, and getting harder and harder to extract, so it won’t be exactly a choice. The IEA mentioned that fossil fuels would peak even if climate policies are not followed. But yeah if we could extract an inifinite amount of fossil fuels we’d probably do that—and use cheap solar on top of it for more electricity.
        If you’re curious about energy depletion, I tried to adress that in this post.
        mariushobbhahn Dec 18, 2022, 8:05 AM
        1 point
        0 ∶ 0
        Parent
        
        Interesting. Let’s hope they are right and we are able to replace fossils with renewables fast enough.
        CB🔸Dec 18, 2022, 11:21 AM
        1 point
        0 ∶ 0
        Parent
        
        I personally don’t think we can replace fossils with renewables at scale, for reasons explained in the post (renewables have worse properties than storable, dense liquid oil, and there’s not enough metals to build everything).
        Anyway, I think we’ll have more news on that front seeing how we fare in the next winters with the current energy crisis.
        titotal Dec 18, 2022, 12:32 PM
        3 points
        0 ∶ 0
        Parent
        
        It’s worth pointing out that the IEA is actually most famous for almost always underestimating renewable energy uptake. So if even they think fossil fuels are on the way out, I’m inclined to think they’re right.
Davidmanheim Dec 21, 2022, 12:49 PM
5 points
1 ∶ 0

Fantastic post overall, but I have a nitpick about what I think may be an important issue.
Additionally, automated coding becomes pretty good, i.e. it reliably more than doubles the output of an unassisted coder. This has the effect that much more code is generated by ML models and that more people who don’t have a coding background are able to generate decent code. An effect of this is that the overall amount of code explodes and the quality degrades because it just becomes impossible for humans to properly check and verify the code.
I strongly doubt this part, and my expectation is the opposite—human-written code is pretty mediocre, and most production systems rely on millions of lines of code, much of which is unnecessary. The reason code isn’t better is because programmer time is more expensive than the short-term benefits of improving the code. Automated tools are likely to reduce this gap, and write far tighter and simpler code. In addition, maintainability becomes far easier when automated rewrites are possible. And checking code generated by narrow AI isn’t actually as hard as checking human code for correctness when there are NAI systems which are far better than humans at writing well documented unit tests. Similarly, it can excel at generating computationally verifiably correct code—something humans can do, slowly and painfully.
- mariushobbhahn Dec 21, 2022, 3:27 PM
  4 points
  0 ∶ 0
  Parent
  
  Interesting perspective. Hadn’t thought about it this way but seems like a plausible scenario to me.
JakubK Dec 16, 2022, 3:36 AM
3 points
0 ∶ 0

You mention AI taking over many jobs and people having trouble specializing in useful ways, but you don’t include any discussion AFAICT of why this won’t cause massive unemployment.

On a related note, I’d like to see some ideas about how AI will change our education system. What kind of homework will high school students have in 2026? What will a typical day look like for undergraduate and graduate students?
- mariushobbhahn Dec 16, 2022, 8:35 AM
  3 points
  0 ∶ 0
  Parent
  
  I think narrow AIs won’t cause massive unemployment but the more general they get, the harder it will be to justify using humans instead of ChatGPT++
  
  I think education will have to change a lot because students could literally let their homework be done entirely by ChatGPT and get straight A’s all the time.
  
  I guess it’s something like “until class X you’re not allowed to use a calculator, and then after that you can” but for AI. So it will be normal that you can just print an essay in 5 seconds similar to how you can do complicated math that would usually take hours on paper in 5 seconds on a calculator.
- Brian E Adams Dec 16, 2022, 5:24 AM
  2 points
  0 ∶ 2
  Parent
  
  Technology causing unemployment is a concern as old as the Industrial Revolution and yet has never been born out.
  
  It creates so many new jobs you cannot fathom. Imagine explaining the job of an SEO consultant to someone in 1990.
  
  Makes me think of Wal Mart. They went to mostly self-checkout (which is awesome). Did they lay off a bunch of cashiers? Heck no. They just launched curbside (also awesome). Joe Schmos like me now have personal shoppers! Thanks to technology.
  
  I think AI would have to be delivering an era of radical abundance before jobs somehow began disappearing, and then, by definition, you wouldn’t need a job anymore.
  - JakubK Dec 16, 2022, 4:51 PM
    4 points
    1 ∶ 0
    Parent
    
    Here are some thoughts on why AI might be different than previous technologies.
    Faster pace of change. E.g. it took less than 3 years to go from GPT-3 to Chat-GPT, and Chat-GPT might already be changing the nature of knowledge work (see this story on Twitter), whereas GPT-3 mostly has not. I think it’s plausible that code-writing AI will cause similar disruptions for software engineering within the next 5 years. In contrast, people had several decades to respond to changes from the industrial revolutions.
    Automation and autonomy. AI breakthroughs are especially useful for automating tasks compared to previous technologies (although previous technologies certainly enabled some automation), and automated tasks require less human labor.
    Higher time costs to transition between jobs. Some of the new jobs that AI creates may be more technical in nature and require coding skills that take years to develop. Truck drivers can’t retrain as AI researchers. A counterpoint is that there will be new manual labor jobs as well. I like your example of self-checkout enabling curbside pickup.
    Performance improvements at many new tasks with a single AI breakthrough. E.g. language models demonstrate in-context learning, and fine-tuning them produces coding and math models. A counterpoint is that there have been other general purpose technologies like electricity and the digital computer.
    I do think it’s plausible that we’ll see a period of radical abundance before any serious unemployment difficulties, but I’m not sure that this abundance will reach everyone equally. I could imagine low-income countries missing out on many of the benefits for years, which might cause dangerous political unrest.
    - Brian E Adams Dec 17, 2022, 4:14 AM
      2 points
      0 ∶ 0
      Parent
      
      
      Yes. Though I have a higher opinion of how adaptable humans can be.
      
      Using my own work as a benchmark (residential real estate pricing), AI automation would be a huge benefit to enable me to spend my time on higher level analysis. There’s a lot of AI that my role can absorb while my job still being safe.
      
      That’s especially true in my industry in which adverse selection is prominent. AI making me more effective will be necessary merely to keep up with our competition. It won’t replace us because that will be the default starting position from which we as humans need to then compete at another level relative the competition.
      
      I’m not convinced that the AI revolution benefits the high tech roles relative blue collar roles. AI is a lot closer to writing code from a language prompt than it is massaging someone’s back or even truck driving, for that matter.
      
      I hope this is right. Would love the kinds of breakthroughs AI will provide and whatever we can do to get closer to radical abundance, which I think is the holy grail.
      
      I also don’t see a systemic reason thst AI benefits won’t A) be broadly shared, and B) shy of radical abundance, that economic principles won’t continue to reward non-automated work. Automated activities will become cheap because they’re abundant, while activity that can’t be automated will rise in demand/price and be rewarded as a consequence.
      - JakubK Dec 22, 2022, 5:38 PM
        1 point
        0 ∶ 0
        Parent
        
        Thanks for your reply. I think the biggest cruxes are about how quickly humans can adapt to change and how quickly AI capabilities can grow.
        To my original point in (2), I’d also add something like “crossing the finish line” or “reaching the end”: within the next few decades, I expect AI to be capable of automating nearly all knowledge work. By “all knowledge work,” I mean all thinking-related tasks, includes both 2022 jobs and post-2022 jobs. I worry that this capability level (or a level reasonably close to it) might arrive quickly, before we’re prepared to deal with the ensuing unemployment spike.
        Brian E Adams Dec 23, 2022, 6:43 AM
        1 point
        0 ∶ 0
        Parent
        
        My half baked theory is that there will always be jobs shy of radical abundance in which case jobs won’t be necessary.
        
        If AI automated all knowledge work WITHOUT delivering radical abundance, then there would still be jobs delivering goods/services that AI is, by definition, not delivering.
        
        And if so, we have nothing to fear.
  - mariushobbhahn Dec 16, 2022, 8:40 AM
    2 points
    2 ∶ 0
    Parent
    
    I think narrow AIs won’t cause mass unemployment but more general AIs will. I also think that objectively that isn’t a problem at this point anymore because AIs can do all the work but I think it will take at least another decade that humans can accept that.
    
    The narrative that work is good because you contribute something to society and so on is pretty deeply engrained, so I guess lots of people won’t be happy after being automated away.
    - Brian E Adams Dec 17, 2022, 4:16 AM
      1 point
      0 ∶ 0
      Parent
      
      General AI isn’t the same as radical abundance, but to the extent they’re the same, I see that as making the unemployment concern moot.
      
      I also have faith that humans will continue with the “purpose driven life” under such circumstances.
Jens Brandt Dec 15, 2022, 9:21 PM
2 points
1 ∶ 0

This was a really interesting read. I think a lot of this is plausible and it is similar to my own expectations in many ways. Thank you for writing this.

That said, I think your vision is too conservative. AI the field that will change the world massively this century where transformative changes are already certain. But there are other candidates as well. Space development, self replicating macro and nano machines, human enhancement and new physics are options I consider likely in descending order. The interactions between those fields are the truly scary thing. You acknowledge that your predictions are very likely to be false, but I think the biggest error people make when trying to predict the future is that they look at only a single dimension and ignore the interactions. For example, a power seeking AI could reasonably prioritise gambling on an interstellar land grab in the forties.

Another point where I disagree with you is that AI’s will cause more damage with spreadsheets, medical procedures and legal procedures. The human standard is already pretty dismal here, but we are wired to trust people in those positions. I think that while it is certain that AI will fail in some cases, it is far from certain that it will perform worse than the human baseline.
- mariushobbhahn Dec 15, 2022, 9:34 PM
  3 points
  0 ∶ 0
  Parent
  
  Yeah. I kept the post mostly to AI but I also think that other technological breakthroughs are a possibility. Didn’t want to make it even longer ;)
  
  I think you could write more of these stories for other kinds of disruptions and I’d be interested in reading them.
Vasco Grilo🔸Dec 15, 2022, 6:43 PM
2 points
0 ∶ 0

Hi Marius,
Thanks for writing this!
Just one note, I think it would maybe have been helpful to have some subsections.
- mariushobbhahn Dec 15, 2022, 6:57 PM
  4 points
  0 ∶ 0
  Parent
  
  Yeah. I thought about it. I wasn’t quite sure how to structure it. I guess I’m not used to writing “story-like” texts. Most of my other writing is just a glorified bullet-point list ;)
  - Vasco Grilo🔸Dec 16, 2022, 12:44 PM
    6 points
    0 ∶ 0
    Parent
    
    I made my comment too early, sorry, I have now fully read it, and actually think the structure is good. I strongly upvoted the post, and plan to share it with some people!
BrianLee Dec 16, 2022, 7:14 PM
−21 points
0 ∶ 0

文章很长，我通篇看完了，准确地说是通篇地听完了。
虽然有点浪费时间，但还是坚持看完了，这是对作者一个最起码的尊重。
下面，我将通过我自己的思维历程来进行表达，以说明我的相对观点。
首先，我要表达的是，作者的思路给我提供了一个很大的启发性。最近我也在构建未来人工智能的发展与“威胁”，旨在通过结构化且多态化的形式对人工智能进行一个整体的构建，准确地说是建模。
一开始，我在构件的时候陷入了迷茫。因为我发现，在构件的时候总感觉有些事物在阻碍着我们的思维瓶颈。尽管我们人类的大脑神经元远远超过当今的超级计算机，但是由于所处的时空环境的不同而不同，也常常受制于传统的思维模式及其所处的社会环境的时空限制中。正是由于这些种种的桎梏，阻碍了我们的思维开创性或者平稳性发展。
于是，我创建了{第三视角}的理论思维方式，以主观、客观、中观（旁观）的三维思维模型为基础，并以中观的思维视角为主导，以跳出固有的思维藩篱。从而对事物的本质事实进行正确的评估、分析、引导及其结论。关于{第三视角}理论，我会在后续的发文中继续阐述。本篇的主题不在于此，仅仅是表达一种与作者相同思维环境下的思维过程，故不作多余赘述。
接着，通过第三视角的方法，我继续了构建的畅想模型中，几乎与作者的行为是相似的。但是，后来我又发现，其实我一直在原地打转，并没有跳出这个思维的藩篱。在一次偶然的思考中，我突然灵光一现，发现了问题的所在：虚无。
没错，正是因为沉沦于虚无的世界中，所以才导致了我的思维一直没有跳出这个藩篱。也正是因为虚无，所以才导致我们一直没有一种开创性的角度去思考、建模未来的人工智能的模型。（在这里我要提一点，虽然现在的机器智能很厉害，但终究还是比不上人类大脑的生物神经网络。所以，我相信，我的建模一定会比当前的机器智能更佳。举例来说，我们很多“天马行空”的想法，机器智能无法解决：脑机接口、脑成像、全息助手、元宇宙、心灵感应，以及我新定义的一个“平行宇宙”概念等。所有的诸如此类，我们的大脑能够想象得出它的样子，甚至可以在大脑中模拟出来，但是却无法实现它。这正是我们人类大脑的生物神经元的优势之处。
）
于是，我又着手重温关于虚无主义的知识及其相应的著作、论文。说实话，能解决我问题的很少，不过也或多或少地得到了一定的启发。比如，上述的建模过程，正是由于沉沦于虚无的世界之中，导致未能跳出思维的藩篱而不能自拔。简言之，就是缺乏问题的建设性方案。就是虽然想法是对的，可以作为一定的安全协议建设的前提基础，但是并不能起到相对的建设性方案。提问题固然很重要，但是问题的本身在构建事物之初就是一个安全协议的前提，当然也是开创事物的前提。所以，问题出来之后，再继续后一步的发展与延伸，就是建设性方案。这就回到了作者里面提到的内容：“人类的长期命运基本上取决于研究人员是否能够建立一个足够强大的人工智能，以始终如一地为人类的利益行事。”
最后，关于上述两点，我又重新思考了一遍。视角、思维、虚无藩篱，这三者之间的联系是什么？为什么它们会存在？与我所构建的模型有什么对应关系？
答案是，我们无法设想的太远，但是我们有办法解决当下亟待解决的问题。简言之，过多的思绪而缺乏一定的行动，只会再度陷入另一个虚无的世界中。即便是构建模型，仅仅需要初步的就行，后面根据这个模型逐步地完善修改。前提是，我们得根据这个模型去逐步实现。
总结：人工智能，其未来的发展必然是进步的。但是在发展的同时，我们不能过渡的依赖于它。我们要相信，它只是一个类人类的辅助工具，而不是人类文明的全部。我们当然可以设定一定的安全协议来控制和预防，但是也应当正确地看待及其利用它。过分地依赖就会导致人类的依存度降低，此消彼长之后带来的必然是人类的沉沦，这不是我们想要得到的结果。我们渴望知识，但是也渴望生活。绝不能因为人工智能的崛起与发展，导致了人类本身身心思维的沉沦与堕落。这不是《西部世界》的悲观主义，也不是《奥创纪元》的自大主义，更不是《星际迷航》的迷茫主义（虚无主义）。更多的时候，我们需要的是这里面的有利部分，而不是被我们自己所创造的自我感觉“超人类”的机器而感到悲观与自卑，甚至的霍金的恐惧预言。
前面提到的浪费时间，是看到了作者的思维与我的思维有相似之处，同时作者的思维及其设想还沉沦于当今的社会基础之中。看过近年来电影的人都知道，作者所设想的都是与其相似的，重点是把那些设想用文字的方式进行了归总阐述。所以，作者所提的“威胁”，可以作为一定的参考，但建设性不大。
[ ]

[deleted]
[ ]

[deleted]