I’ll post some extracts from the commitments made at the Seoul Summit. I can’t promise that this will be a particularly good summary, as I was originally just writing this for myself, but maybe it’s helpful until someone publishes something more polished:
Frontier AI Safety Commitments, AI Seoul Summit 2024
The major AI companies have agreed to Frontier AI Safety Commitments. In particular, they will publish a safety framework focused on severe risks: “internal and external red-teaming of frontier AI models and systems for severe and novel threats; to work toward information sharing; to invest in cybersecurity and insider threat safeguards to protect proprietary and unreleased model weights; to incentivize third-party discovery and reporting of issues and vulnerabilities; to develop and deploy mechanisms that enable users to understand if audio or visual content is AI-generated; to publicly report model or system capabilities, limitations, and domains of appropriate and inappropriate use; to prioritize research on societal risks posed by frontier AI models and systems; and to develop and deploy frontier AI models and systems to help address the world’s greatest challenges”
“Risk assessments should consider model capabilities and the context in which they are developed and deployed”—I’d argue that the deployment context should take into account whether the model is open or closed source/weights, as open-source/weight models can be subsequently modified.
“They should also be accompanied by an explanation of how thresholds were decided upon, and by specific examples of situations where the models or systems would pose intolerable risk.”—always great to make policy concrete
“In the extreme, organisations commit not to develop or deploy a model or system at all, if mitigations cannot be applied to keep risks below the thresholds.”—Very important that, when this is applied, the ability to iterate on open-source/weight models is taken into account.
Seoul Declaration for safe, innovative and inclusive AI by participants attending the Leaders’ Session
Signed by Australia, Canada, the European Union, France, Germany, Italy, Japan, the Republic of Korea, the Republic of Singapore, the United Kingdom, and the United States of America.
“We support existing and ongoing efforts of the participants to this Declaration to create or expand AI safety institutes, research programmes and/or other relevant institutions including supervisory bodies, and we strive to promote cooperation on safety research and to share best practices by nurturing networks between these organizations”—I guess we should now go full-throttle and push for the creation of national AI Safety Institutes.
“We recognise the importance of interoperability between AI governance frameworks”—useful for arguing we should copy things that have been implemented overseas.
“We recognize the particular responsibility of organizations developing and deploying frontier AI, and, in this regard, note the Frontier AI Safety Commitments.”—Important as Frontier AI needs to be treated as different from regular AI.
Seoul Statement of Intent toward International Cooperation on AI Safety Science
Signed by the same countries.
“We commend the collective work to create or expand public and/or government-backed institutions, including AI Safety Institutes, that facilitate AI safety research, testing, and/or developing guidance to advance AI safety for commercially and publicly available AI systems”—similar to what we listed above, but more specifically focused on AI Safety Institutes, which is great.
“We acknowledge the need for a reliable, interdisciplinary, and reproducible body of evidence to inform policy efforts related to AI safety”—Really good! We don’t just want AIS Institutes to run current evaluation techniques on a bunch of models, but to be actively contributing to the development of AI safety as a science.
“We articulate our shared ambition to develop an international network among key partners to accelerate the advancement of the science of AI safety”—very important for them to share research with one another.
Seoul Ministerial Statement for advancing AI safety, innovation and inclusivity
Signed by: Australia, Canada, Chile, France, Germany, India, Indonesia, Israel, Italy, Japan, Kenya, Mexico, the Netherlands, Nigeria, New Zealand, the Philippines, the Republic of Korea, Rwanda, the Kingdom of Saudi Arabia, the Republic of Singapore, Spain, Switzerland, Türkiye, Ukraine, the United Arab Emirates, the United Kingdom, the United States of America, and the representative of the European Union
“It is imperative to guard against the full spectrum of AI risks, including risks posed by the deployment and use of current and frontier AI models or systems and those that may be designed, developed, deployed and used in future”—considering future risks is a very basic, but core principle
“Interpretability and explainability”—happy to see interpretability explicitly listed
“Identifying thresholds at which the risks posed by the design, development, deployment and use of frontier AI models or systems would be severe without appropriate mitigations”—important work, but could backfire if done poorly
“Criteria for assessing the risks posed by frontier AI models or systems may include consideration of capabilities, limitations and propensities, implemented safeguards, including robustness against malicious adversarial attacks and manipulation, foreseeable uses and misuses, deployment contexts, including the broader system into which an AI model may be integrated, reach, and other relevant risk factors.”—sensible, we need to ensure that the risks of open-sourcing and open-weight models are considered in terms of the ‘deployment context’ and ‘foreseeable uses and misuses’
“Assessing the risk posed by the design, development, deployment and use of frontier AI models or systems may involve defining and measuring model or system capabilities that could pose severe risks,”—very pleased to see a focus beyond just deployment
“We further recognise that such severe risks could be posed by the potential model or system capability or propensity to evade human oversight, including through safeguard circumvention, manipulation and deception, or autonomous replication and adaptation conducted without explicit human approval or permission. We note the importance of gathering further empirical data with regard to the risks from frontier AI models or systems with highly advanced agentic capabilities, at the same time as we acknowledge the necessity of preventing the misuse or misalignment of such models or systems, including by working with organisations developing and deploying frontier AI to implement appropriate safeguards, such as the capacity for meaningful human oversight”—this is massive. There was a real risk that these issues were going to be ignored, but this is now seeming less likely.
“We affirm the unique role of AI safety institutes and other relevant institutions to enhance international cooperation on AI risk management and increase global understanding in the realm of AI safety and security.”—“Unique role”, this is even better!
“We acknowledge the need to advance the science of AI safety and gather more empirical data with regard to certain risks, at the same time as we recognise the need to translate our collective understanding into empirically grounded, proactive measures with regard to capabilities that could result in severe risks. We plan to collaborate with the private sector, civil society and academia, to identify thresholds at which the level of risk posed by the design, development, deployment and use of frontier AI models or systems would be severe absent appropriate mitigations, and to define frontier AI model or system capabilities that could pose severe risks, with the ambition of developing proposals for consideration in advance of the AI Action Summit in France”—even better than the above because it commits to a specific action and timeline.
One underrated factor in whether to engage in community-building[1] is how likely you are to move to a hub.
I suspect that in most cases people can achieve more when they are part of a group, rather than when they are by themselves. Let’s assume that your local community doesn’t already provide what you need. Let’s further assume that an online community isn’t sufficient for your needs either.
Then you have two main options:
• If there’s already a hub that provides the community that you need, then you could move there
• You could try to build up the local community
There are a lot of advantages to the former. It can be quicker than trying to build up a community yourself, and being in the hub will probably lead to you having more direct impact than you could have even if you managed to build up your local community quite a bit. So while either option could end up being more impactful, there are a lot of reasons why it might make sense for people who are willing to move to just focus on figuring out how to set themselves up in a hub as soon as possible.
However, there are some people who are just not going to move to a hub, because they’re too rooted in their current location. My suspicion is that more of these people should be focusing on building up the community.
Since there are fewer opportunities outside of a hub, the opportunity cost is lower, but more importantly, someone who is planning to stay in the same location over the longer term is likely to capture more of the value from their own community-building efforts.
Obviously, this doesn’t apply to everyone and there are definitely people who can have far more impact through direct work, even whilst outside of a hub, than through community building. I would just like to see more people who are planning to stay put pick up this option.
One of the vague ideas spinning around in my head is that maybe, in addition to EA (a fairly open, loosely co-ordinated, big-tent movement with several different cause areas), there would also be value in a more selective, tightly co-ordinated, narrow movement focusing just on the long-term future. Interestingly, this would be an accurate description of some EA orgs, with the key difference being that these orgs tend to rely on paid staff rather than volunteers. I don’t have a solid idea of how this would work, but just thought I’d put this out there...
Oh, I would’ve sworn that was already the case (with the understanding that, as you say, there is less volunteering involved, because with the “inner” movement being smaller, more selective, and with tighter/more personal relationships, there is much less friction in the movement of money, either in the form of employment contracts or grants).
There is a world that needs to be saved. Saving the world is a team sport. All we can do is to contribute our part of the puzzle, whatever that may be and no matter how small, and trust in our companions to handle the rest. There is honor in that, no matter how things turn out in the end.
Zachary Robinson recently stated that CEA would choose to emphasize a principles-first approach to EA. Here are my thoughts on the kinds of strategic decisions that naturally synergise with this high-level strategy:
Growth strategy: Less focus on fast growth, more focus on attracting value-aligned talent:
Eternal September effects make it hard to both grow fast and maintain high-fidelity transmission of EA principles.
Recruiting from audiences that are capable of engaging in nuanced discussions of what these principles imply
Local EA groups: More emphasis on making events attractive for long-term members to attend vs. recruiting new members:
Greater focus on advertising events in ways that bring repeat customers vs. maximising throughput
Community investment: If the aim is to build a relatively small, high-talent community instead of a mass movement, then it makes sense to shift some amount of resources from outreach to improving the effectiveness of the community.
More emphasis on epistemics improves our ability to pick the right goals and achieve those goals intelligently (rather than throwing people at the problem)
Upskilling programs such as the Introductory/Advanced Fellowship or the Precipice reading group are helpful here
It may also make sense to start some new programs or orgs focused on topics like epistemics, leadership training or conflict resolution (I like how there’s an EA mental health org, whose name I forget, that is running training at scale)
Mentorship programs may also help with improving the effectiveness of individuals
The community can be more effective with fewer people if we make progress on long-standing community issues such as the lack of a low-cost EA hub or the limited support for people trying to establish themselves in major hubs like San Francisco or London:
It also makes more sense for the community to fix its own issues now that EA is less on the frontlines for AI Safety/x-risk (please comment if you’d like me to explain this in more detail)
Question: How can the EA community recursively self-improve?
Weirdness: A principles-first approach suggests that the community should be more tolerant of weirdness than if we were pursuing a fast-growth strategy:
It also suggests, on the margin, more focus on being a community rather than a professional group, given that professional groups experience strong pressure to make themselves seem more respectable.
It also suggests that avoiding jargon to maximise accessibility is less of a priority
Forum debates: Running debates like this to move the community’s understanding of different cause areas forward becomes more important for the principles-first approach
Additional comments:
Some of these proposals make more sense in light of the rise of cause-specific groups (many groups now focus exclusively on AI safety, Effective Animal Advocates have their own conference, Giving What We Can is doing its own movement-building for those focused on donations, particularly donations to global poverty):
If a particular cause area wants a higher rate of growth, then cause-specific groups can pursue this objective.
Similarly, cause-specific groups can choose to be more professional or more focused on developing respectability.
A lower-growth strategy makes more sense given the pummelling EA has taken in the public relations realm:
Growth would be more challenging these days
Attempting to grow really fast is more likely to spark backlash now
Recruiting top-notch folks and developing the knowledge and skills of members of the community will improve the impression that folks form about EA
A lower-growth strategy makes more sense given the reduction in available EA funding:
When there was more funding available, it made more sense to bring in lots of people so that we could rapidly proliferate projects and orgs
We also had more funding to support people who joined, so the marginal benefit from adding people was greater
There are many people in EA who either don’t have the skills to directly work on high-priority areas or wouldn’t enjoy having a career in these areas. Some of these people want to directly do things rather than just earn to give:
A greater focus on improving the community would mean that there would be more things for these folks to do.
I’m not really focused on animal rights nor do I spend much time thinking about it, so take this comment with a grain of salt.
However, if I wanted to make the future go well for animals, I’d be offering free vegan meals in the Bay Area or running a Bay Area conference on how to ensure that the transition to advanced AI systems goes well for animals.
Reality check: Sorry for being harsh, but you’re not going to end factory farming before the transition to advanced AI technologies. Max 1-2% chance of that happening. So the best thing to do is to ensure that this transition goes well for animals and not just humans.
I’m confused about the theory of impact for “free vegan meals in the Bay Area” idea. A few recipients might work in AI, but I don’t see the link between eating a vegan meal offered for free and making more animal-friendly AI development choices.
I think I posted in one of the threads that I have no knowledge of what private evidence Nonlinear may have, but I just realised that I actually do. I don’t think it’s a big enough deal for me to go back and try to track down the actual comments and edit them, but I thought it was good practice to note this on shortform nonetheless.
I suspect that it could be impactful to study, say, a master’s in AI or computer science even if you don’t really need it. University provides one of the best opportunities to meet and deeply connect with people in a particular field, and I’d be surprised if you couldn’t persuade at least a couple of people of the importance of AI safety without really trying. On the other hand, if you went in with the intention of networking as much as possible, I think you could have much more success.
Maybe EA should try to find a compromise on the unpaid internship issue? For example, unpaid internships of up to a maximum of 2 days/week being considered acceptable within the community?
This would provide additional opportunities for people to skill up, whilst ensuring that these opportunities would still be broadly accessible.
You say “find a compromise” as if this is a big and contentious issue, but I… don’t really see it coming up a lot? I know Kat Woods has recently posted elsewhere about how lots of unpaid internships are being suppressed because random bystanders on the internet object to them, but I just don’t actually see that happening. I would imagine that often management capacity is more of a bottleneck than pay anyway?
Someone needs to be doing mass outreach about AI Safety to techies in the Bay Area.
I’m generally more of a fan of niche outreach over mass outreach, but Bay Area tech culture influences how AI is developed. If SB 1047 is defeated, I wouldn’t be surprised if the lack of such outreach ended up being a decisive factor.
There are now enough prominent supporters of AI safety, and AI is hot enough, that public lectures or debates could draw a big crowd. Even though a lot of people have been exposed to these ideas before, there’s something about in-person events that makes ideas seem real.
EA needs more communications projects.
Unfortunately, the EA Communications Fellowship and the EA Blog prize shut down[1]. Any new project needs to be adapted to the new funding environment.
If someone wanted to start something in this vein, what I’d suggest would be something along the lines of AI Safety Camp. People would apply with a project to be project leads and then folk could apply to these projects. Projects would likely run over a few months, part-time remote[2].
Something like this would be relatively cheap as it would be possible for someone to run this on a volunteer basis, but it might also make sense for there to be a paid organiser at a certain point.
Likely due to the collapse of FTX
Despite the name, AI Safety Camp is now remote.
I’m pretty bullish on having these kinds of debates. While EA is doing well at having an impact in the world, the forum has started to feel intellectually stagnant in some ways. And I guess I feel that these debates provide a way to move the community forward intellectually. That’s something I’ve been feeling has been missing for a while.
Let Manifest be Manifest.
Having a space that is intellectually edgy, but not edge-lord maxing, seems extremely valuable. Especially given how controversial some EA ideas were early on (and how controversial wild animal welfare and AI welfare still are).
In fact, I’d go further and suggest that it would be great if they were to set up their own forum. This would allow us to nudge certain discussions into an adjacent, not-explicitly EA space instead of discussing it here.
Certain topics are a poor fit for the forum because they rate high on controversy + low-but-non-zero on relevance to EA. It’s frustrating having these discussions on the forum as they may turn some people off, but at the same time declaring them off-topic risks being intellectually stifling. Sometimes things turn out to be more important than you thought when you dive into the details. So I guess I’d really love to see another non-EA space end up being the first port of call for such discussions, with the hope that only the highest quality and most relevant ideas would make it over to the EA Forum.
Although I have mixed feelings on the proposal, I’m voting insightful because I appreciate that you are looking toward an actual solution that at least most “sides” might be willing to live with. That seems more insightful than what the Forum’s standard response soon ends up as: rehashing fairly well-worn talking points every time an issue like this comes up.
Considering how much skepticism there is in EA about forecasting being a high priority cause area anyway, this seems like an ok idea :)
Manifold already has a highly active Discord, where they can discuss all the Manifold-specific issues. This did not prevent the EA Forum from discussing the topic, and I doubt it would be much different if Manifold had a proper forum instead of a Discord.
It might seem low on importance for EA to you, but I suspect some people who are upset about Manifest inviting right-wing people do not consider it low-importance.
Oh, I wasn’t referring to redirecting the discussions about Manifest onto a new forum. More discussions about pro-natalism or genetic engineering to improve welfare. To be clear, I was suggesting a forum associated with Manifest rather than one more narrowly associated with Manifold.
I’d love to see the EA forum add a section titled “Get Involved” or something similar.
There is the groups directory, but it’s only one of many ways that folks can get more involved, from EAGx conferences to Virtual Programs, 80,000 Hours content/courses, and donating.
Thanks for the suggestion Chris! I’d be really excited for the Forum (or for EA.org) to have a nice page like that, and I think others at CEA agree. We did a quick experiment in the past by adding the “Take action” sidebar link that goes to the Opportunities to take action topic page, and the link got very few clicks. We try not to add clutter to the site without good reason so we removed that link for logged in users (it’s still visible for logged out users since they’re more likely to get value from it). Since then we’ve generally deprioritized it. I would like us to pick it back up at some point, though first we’d need to decide where it should live (EA.org or here) and what it should look like, design-wise.
For now, I recommend people make updates to the Opportunities to take action wiki text to help keep it up-to-date! I’ve done so myself a couple times but I think it would be better as a team effort. :)
Have the forum team considered running an online event to collaborate on improving wikis? I think wikis are a deeply underrated forum feature and a fantastic way for people who aren’t new but aren’t working in EA to directly contribute to the EA project.
I wrote a quick take a while ago about how it’s probably too hard for people to edit wikis atm—I actually can’t link to it but here are my quick takes: Gemma Paterson’s Quick takes — EA Forum (effectivealtruism.org)
I’m glad that you like the wiki! ^^ I agree that it’s a nice way for people in the community to contribute.
I believe no one on the team has focused on the wiki in a while, and I think before we invest time into it we should have a more specific vision for it. But I do like the idea of collaborative wiki editing events, so thanks for the nudge! I’ll have a chat with @Toby Tremlett🔹 to see what he thinks. For reference, we do have a Wiki FAQ page, which is a good starting point for people who want to contribute.
About your specific suggestion, thank you for surfacing it and including detailed context — that’s quite helpful. I agree that ideally people could contribute to the wiki with lower karma. I’ll check if we can lower the minimum at least. Any more substantive changes (like making a “draft” change and getting it approved by someone else) would take more technical work, so I’m not sure when we would prioritize it.
(It looks like your link to a specific quick take did work, but if you think there’s a bug then let me know!)
Ah glad the link worked. Not sure why it looked like it didn’t.
Let me know if you do end up interested in doing an editing event—happy to host an in person coworking session for it in London.
Interesting. I still think it could be valuable even with relatively few clicks. You might only need someone to click on it once.
Yeah I agree, it does feel like a thing that should exist, like there’s some obvious value to it even though I got some evidence that there was low demand for it on the Forum. I think it would be faster to add to EA.org instead so perhaps we should just add a static page there.
I like that we have a list in the wiki, so that people in the EA community can help us keep the info up-to-date by editing it, but practically speaking people don’t spend much time doing that.
A list like that could be added to the EA Handbook, which is linked on the forum sidebar
If we run any more anonymous surveys, we should encourage people to pause and consider whether they are contributing productively or just venting. I’d still be in favour of sharing all the responses, but I have enough faith in my fellow EAs to believe that some would take this to heart.
I’ll post some extracts from the commitments made at the Seoul Summit. I can’t promise that this will be a particularly good summary (I was originally just writing this for myself), but maybe it’s helpful until someone publishes something that’s more polished:
Frontier AI Safety Commitments, AI Seoul Summit 2024
The major AI companies have agreed to Frontier AI Safety Commitments. In particular, they will publish a safety framework focused on severe risks: “internal and external red-teaming of frontier AI models and systems for severe and novel threats; to work toward information sharing; to invest in cybersecurity and insider threat safeguards to protect proprietary and unreleased model weights; to incentivize third-party discovery and reporting of issues and vulnerabilities; to develop and deploy mechanisms that enable users to understand if audio or visual content is AI-generated; to publicly report model or system capabilities, limitations, and domains of appropriate and inappropriate use; to prioritize research on societal risks posed by frontier AI models and systems; and to develop and deploy frontier AI models and systems to help address the world’s greatest challenges”
“Risk assessments should consider model capabilities and the context in which they are developed and deployed”—I’d argue that the context in which it is deployed should take into account whether it is open or closed source/weights, as open-source/weights can be subsequently modified.
“They should also be accompanied by an explanation of how thresholds were decided upon, and by specific examples of situations where the models or systems would pose intolerable risk.”—always great to make policy concrete
“In the extreme, organisations commit not to develop or deploy a model or system at all, if mitigations cannot be applied to keep risks below the thresholds.”—Very important that when this is applied the ability to iterate on open-source/weight models is taken into account
https://www.gov.uk/government/publications/frontier-ai-safety-commitments-ai-seoul-summit-2024/frontier-ai-safety-commitments-ai-seoul-summit-2024
Seoul Declaration for safe, innovative and inclusive AI by participants attending the Leaders’ Session
Signed by Australia, Canada, the European Union, France, Germany, Italy, Japan, the Republic of Korea, the Republic of Singapore, the United Kingdom, and the United States of America.
“We support existing and ongoing efforts of the participants to this Declaration to create or expand AI safety institutes, research programmes and/or other relevant institutions including supervisory bodies, and we strive to promote cooperation on safety research and to share best practices by nurturing networks between these organizations”—guess we should now go full-throttle and push for the creation of national AI Safety institutes
“We recognise the importance of interoperability between AI governance frameworks”—useful for arguing we should copy things that have been implemented overseas.
“We recognize the particular responsibility of organizations developing and deploying frontier AI, and, in this regard, note the Frontier AI Safety Commitments.”—Important as Frontier AI needs to be treated as different from regular AI.
https://www.gov.uk/government/publications/seoul-declaration-for-safe-innovative-and-inclusive-ai-ai-seoul-summit-2024/seoul-declaration-for-safe-innovative-and-inclusive-ai-by-participants-attending-the-leaders-session-ai-seoul-summit-21-may-2024
Seoul Statement of Intent toward International Cooperation on AI Safety Science
Signed by the same countries.
“We commend the collective work to create or expand public and/or government-backed institutions, including AI Safety Institutes, that facilitate AI safety research, testing, and/or developing guidance to advance AI safety for commercially and publicly available AI systems”—similar to what we listed above, but more specifically focused on AI Safety Institutes, which is great.
“We acknowledge the need for a reliable, interdisciplinary, and reproducible body of evidence to inform policy efforts related to AI safety”—Really good! We don’t just want AIS Institutes to run current evaluation techniques on a bunch of models, but to be actively contributing to the development of AI safety as a science.
“We articulate our shared ambition to develop an international network among key partners to accelerate the advancement of the science of AI safety”—very important for them to share research among each other
https://www.gov.uk/government/publications/seoul-declaration-for-safe-innovative-and-inclusive-ai-ai-seoul-summit-2024/seoul-statement-of-intent-toward-international-cooperation-on-ai-safety-science-ai-seoul-summit-2024-annex
Seoul Ministerial Statement for advancing AI safety, innovation and inclusivity
Signed by: Australia, Canada, Chile, France, Germany, India, Indonesia, Israel, Italy, Japan, Kenya, Mexico, the Netherlands, Nigeria, New Zealand, the Philippines, the Republic of Korea, Rwanda, the Kingdom of Saudi Arabia, the Republic of Singapore, Spain, Switzerland, Türkiye, Ukraine, the United Arab Emirates, the United Kingdom, the United States of America, and the representative of the European Union
“It is imperative to guard against the full spectrum of AI risks, including risks posed by the deployment and use of current and frontier AI models or systems and those that may be designed, developed, deployed and used in future”—considering future risks is a very basic, but core principle
“Interpretability and explainability”—Happy to see interpretability explicitly listed
“Identifying thresholds at which the risks posed by the design, development, deployment and use of frontier AI models or systems would be severe without appropriate mitigations”—important work, but could backfire if done poorly
“Criteria for assessing the risks posed by frontier AI models or systems may include consideration of capabilities, limitations and propensities, implemented safeguards, including robustness against malicious adversarial attacks and manipulation, foreseeable uses and misuses, deployment contexts, including the broader system into which an AI model may be integrated, reach, and other relevant risk factors.”—sensible, we need to ensure that the risks of open-sourcing and open-weight models are considered in terms of the ‘deployment context’ and ‘foreseeable uses and misuses’
“Assessing the risk posed by the design, development, deployment and use of frontier AI models or systems may involve defining and measuring model or system capabilities that could pose severe risks,”—very pleased to see a focus beyond just deployment
“We further recognise that such severe risks could be posed by the potential model or system capability or propensity to evade human oversight, including through safeguard circumvention, manipulation and deception, or autonomous replication and adaptation conducted without explicit human approval or permission. We note the importance of gathering further empirical data with regard to the risks from frontier AI models or systems with highly advanced agentic capabilities, at the same time as we acknowledge the necessity of preventing the misuse or misalignment of such models or systems, including by working with organisations developing and deploying frontier AI to implement appropriate safeguards, such as the capacity for meaningful human oversight”—this is massive. There was a real risk that these issues were going to be ignored, but this is now seeming less likely.
“We affirm the unique role of AI safety institutes and other relevant institutions to enhance international cooperation on AI risk management and increase global understanding in the realm of AI safety and security.”—“Unique role”, this is even better!
“We acknowledge the need to advance the science of AI safety and gather more empirical data with regard to certain risks, at the same time as we recognise the need to translate our collective understanding into empirically grounded, proactive measures with regard to capabilities that could result in severe risks. We plan to collaborate with the private sector, civil society and academia, to identify thresholds at which the level of risk posed by the design, development, deployment and use of frontier AI models or systems would be severe absent appropriate mitigations, and to define frontier AI model or system capabilities that could pose severe risks, with the ambition of developing proposals for consideration in advance of the AI Action Summit in France”—even better than above because it commits to a specific action and timeline
https://www.gov.uk/government/publications/seoul-ministerial-statement-for-advancing-ai-safety-innovation-and-inclusivity-ai-seoul-summit-2024
To Community Build or Not
One underrated factor in whether to engage in community-building[1] is how likely you are to move to a hub.
I suspect that in most cases people can achieve more when they are part of a group, rather than when they are by themselves. Let’s assume that your local community doesn’t already provide what you need. Let’s further assume that an online community isn’t sufficient for your needs either:
Then you have two main options:
• If there’s already a hub that provides the community that you need, then you could move there
• You could try to build up the local community
There are a lot of advantages to the former. It can be quicker than trying to build up a community yourself, and being in the hub will probably lead to you having more direct impact than you could have even if you managed to build up your local community quite a bit. So while either option could end up being more impactful, there are a lot of reasons why it might make sense for people who are willing to move to just focus on figuring out how to set themselves up in a hub as soon as possible.
However, there are some people who are just not going to move to a hub, because they’re too rooted in their current location. My suspicion is that more of these people should be focusing on building up the community.
Since there are fewer opportunities outside of the hub, the opportunity cost is lower. More importantly, someone who is planning to stay in the same location over the longer term is likely to capture more of the value from their own community-building efforts.
Obviously, this doesn’t apply to everyone and there are definitely people who can have far more impact through direct work, even whilst outside of a hub, than through community building. I would just like to see more people who are planning to stay put pick up this option.
Here I’m using community-building in a broad sense.
One of the vague ideas spinning around in my head is that maybe, in addition to EA, which is a fairly open, loosely co-ordinated, big-tent movement with several different cause areas, there would also be value in a more selective, tightly co-ordinated, narrow movement focusing just on the long-term future. Interestingly, this would be an accurate description of some EA orgs, with the key difference being that these orgs tend to rely on paid staff rather than volunteers. I don’t have a solid idea of how this would work, but just thought I’d put this out there...
Oh, I would’ve sworn that was already the case (with the understanding that, as you say, there is less volunteering involved, because with the “inner” movement being smaller, more selective, and with tighter/more personal relationships, there is much less friction in the movement of money, either in the form of employment contracts or grants).
Someone really needs to make Asterisk meetup groups a thing.
There is a world that needs to be saved. Saving the world is a team sport. All we can do is to contribute our part of the puzzle, whatever that may be and no matter how small, and trust in our companions to handle the rest. There is honor in that, no matter how things turn out in the end.
hear hear 👏🏼👏🏼
What could principles-first EA look like?
Zachary Robinson recently stated that CEA would choose to emphasize a principles-first approach to EA. Here are my thoughts on the kinds of strategic decisions that naturally synergize with this high-level strategy:
Growth strategy: Less focused on fast growth, more focus on attracting value-aligned talent:
Eternal September effects make it hard to both grow fast and maintain high-fidelity transmission of EA principles.
Recruiting from audiences that are capable of engaging in nuanced discussions of what these principles imply
Local EA groups: More emphasis on making events attractive for long-term members to attend vs. recruiting new members:
Greater focus on advertising events in ways that bring repeat customers vs. maximising throughput
Community investment: If the aim is to build a relatively small, high-talent community instead of a mass movement, then it makes sense to shift some amount of resources from outreach to improving the effectiveness of the community.
More emphasis on epistemics improves our ability to pick the right goals and achieve those goals intelligently (rather than throwing people at the problem)
Upskilling programs such as the Introductory/Advanced Fellowship or the Precipice reading group are helpful here
It may also make sense to start some new programs or orgs focused on topics like epistemics, leadership training or conflict resolution (I like how there’s an EA mental health org—I forget the name—which is running training at scale)
Mentorship programs may also help with improving the effectiveness of individuals
The community can be more effective with fewer people if we make progress on long-standing community issues such as the lack of a low-cost EA hub or the limited support for people trying to establish themselves in major hubs like San Francisco or London:
It also makes more sense for the community to fix its own issues now that EA is less on the frontlines for AI Safety/x-risk (please comment if you’d like me to explain this in more detail)
Question: How can the EA community recursively self-improve?
Weirdness: A principles-first approach suggests that the community should be more tolerant of weirdness than if we were pursuing a fast-growth strategy:
It also suggests, on the margin, more focus on being a community rather than a professional group, given that professional groups experience strong pressure to make themselves seem more respectable.
It also suggests that avoiding jargon to maximise accessibility is less of a priority
Forum debates: Running debates like this to move the community’s understanding of different cause areas forward becomes more important for the principles-first approach
Additional comments:
Some of these proposals make more sense in light of the rise of cause-specific groups (many groups now focus exclusively on AI safety; Effective Animal Advocates have their own conference; Giving What We Can is doing its own movement-building for those focused on donations, particularly donations to global poverty):
If a particular cause area wants a higher rate of growth, then cause-specific groups can pursue this objective.
Similarly, cause-specific groups can choose to be more professional or more focused on developing respectability.
A lower-growth strategy makes more sense given the pummelling EA has taken in the public relations realm:
Growth would be more challenging these days
Attempting to grow really fast is more likely to spark backlash now
Recruiting top-notch folks and developing the knowledge and skills of members of the community will improve the impression that folks form about EA
A lower-growth strategy makes more sense given the reduction in available EA funding:
When there was more funding available, it made more sense to bring in lots of people so that we could rapidly proliferate projects and orgs
We also had more funding to support people who joined, so the marginal benefit from adding people was greater
There are many people in EA who either don’t have the skills to directly work on high-priority areas or wouldn’t enjoy having a career in these areas. Some of these people want to directly do things rather than just Earn to Give:
A greater focus on improving the community would mean that there would be more things for these folks to do.
I’m not really focused on animal rights nor do I spend much time thinking about it, so take this comment with a grain of salt.
However, if I wanted to make the future go well for animals, I’d be offering free vegan meals in the Bay Area or running a conference there on how to ensure that the transition to advanced AI systems goes well for animals.
Reality check: Sorry for being harsh, but you’re not going to end factory farming before the transition to advanced AI technologies. Max 1-2% chance of that happening. So the best thing to do is to ensure that this goes well for animals and not just humans.
Anyway, that concludes my hot-take.
There is an AI, Animals, & Digital Minds conference that’s being planned in the Bay Area for earlyish 2025! Updates will be announced in the AI & Animals newsletter.
I’m confused about the theory of impact for “free vegan meals in the Bay Area” idea. A few recipients might work in AI, but I don’t see the link between eating a vegan meal offered for free and making more animal-friendly AI development choices.
Presumably you’d be doing outreach at the same time to influence values.
I think I posted in one of the threads that I have no knowledge of what private evidence Nonlinear may have, but I just realised that I actually do. I don’t think it’s a big enough deal for me to go back and try to track down the actual comments and edit them, but I thought it was good practice to note this on short form nonetheless.
I suspect that it could be impactful to study, say, a master’s in AI or computer science even if you don’t really need it. University provides one of the best opportunities to meet and deeply connect with people in a particular field, and I’d be surprised if you couldn’t persuade at least a couple of people of the importance of AI safety without really trying. On the other hand, if you went in with the intention of networking as much as possible, I think you could have much more success.
Is anyone doing broad AI Safety outreach to techies in the Bay Area?
It seems very important to have a group doing this given how much opinions within Bay Area tech influence how AI is developed.
If SB 1047 doesn’t pass, this ball being dropped may be partially to blame.
Maybe EA should try to find a compromise on the unpaid internship issue? For example, unpaid internships of up to 2 days/week being considered acceptable within the community?
This would provide additional opportunities for people to skill up, whilst ensuring that these opportunities would still be broadly accessible.
(In countries where this is legally allowed)
You say “find a compromise” as if this is a big and contentious issue, but I… don’t really see it coming up a lot? I know Kat Woods has recently posted elsewhere about how lots of unpaid internships are being suppressed because random bystanders on the internet object to them, but I just don’t actually see that happening. I would imagine that often management capacity is more of a bottleneck than pay anyway?
Someone needs to be doing mass outreach about AI Safety to techies in the Bay Area.
I’m generally more of a fan of niche outreach over mass outreach, but Bay Area tech culture influences how AI is developed. If SB 1047 is defeated, I wouldn’t be surprised if the lack of such outreach ended up being a decisive factor.
There’s now enough prominent supporters of AI Safety and AI is hot enough that public lectures or debates could draw a big crowd. Even though a lot of people have been exposed to these ideas before, there’s something about in-person events that make ideas seem real.