mic

Karma: 2,028

mic Apr 26, 2022, 7:15 PM
3 points
0 ∶ 0
on: [$20K In Prizes] AI Safety Arguments Competition
Companies and governments will find it strategically valuable to develop advanced AIs which are able to execute creative plans in pursuit of a goal achieving real-world outcomes. Current large language models have a rich understanding of the world which generalizes to other domains, and reinforcement learning agents already achieve superhuman performance at various games. With further advancements in AI research and compute, we are likely to see the development of human-level AI this century. But for a wide variety of goals, it is often valuable to pursue instrumental goals such as acquiring resources, self-preservation, seeking power, and eliminating opposition. By default, we should expect that highly capable agents will have these unsafe instrumental objectives.

The vast majority of actors would not want to develop unsafe systems. However, there are reasons to think that alignment will be hard with modern deep learning systems, and difficulties with making large language models safe provide empirical support of this claim. Misaligned AI may seem acceptably safe and only have catastrophic consequences with further advancements in AI capabilities, and it may be unclear in advance whether a model is dangerous. In the heat of an AI race between companies or governments, proper care may not be taken to make sure that the systems being developed behave as intended.

(This is technically two paragraphs haha. You could merge them into one paragraph, but note that the second paragraph is mostly by Joshua Clymer.)

mic Apr 18, 2022, 10:39 AM
12 points
0 ∶ 1
on: How I failed to form views on AI safety
But I am a bit at loss on why people in the AI safety field think it is possible to build safe AI systems in the first place. I guess as long as it is not proven that the properties of safe AI systems are contradictory with each other, you could assume it is theoretically possible. When it comes to ML, the best performance in practice is sadly often worse than the theoretical best.
To me, this belief that AI safety is hard or impossible would imply that AI x-risk is quite high. Then, I’d think that AI safety is very important but unfortunately intractable. Would you agree? Or maybe I misunderstood what you were trying to say.
I agree that x-risk from AI misuse is quite underexplored.
For what it’s worth, AI safety and governance researchers do assign significant probability to x-risk from AI misuse. AI Governance Week 3 — Effective Altruism Cambridge comments:
For context on the field’s current perspectives on these questions, a 2020 survey of AI safety and governance researchers (Clarke et al., 2021) found that, on average [1], researchers currently guess there is: [2]
A 10% chance of existential catastrophe from misaligned, influence-seeking AI [3]
A 6% chance of existential catastrophe from AI-exacerbated war or AI misuse
A 7% chance of existential catastrophe from “other scenarios”

mic Apr 16, 2022, 5:37 PM
10 points
0 ∶ 0
on: What is a neutral life like?
Relevant: happiness—How happy are people relative to neutral (as measured by experience sampling)? - Psychology & Neuroscience Stack Exchange

mic Apr 15, 2022, 2:06 AM
13 points
0 ∶ 0
in reply to: Linch’s comment on: Free-spending EA might be a big problem for optics and epistemics
For what it’s worth, even though I prioritize longtermist causes, reading
Maybe it depends on the cause area but the price I’m willing to pay to attract/retain people who can work on meta/longtermist things is just so high that it doesn’t seem worth factoring in things like a few hundred pounds wasted on food.
made me fairly uncomfortable, even though I don’t disagree with the substance of the comment, as well as
2) All misallocations of money within EA community building is lower than misallocations of money caused by donations that were wasted by donating to less effective cause areas (for context, Open Phil spent ~200M in criminal justice reform, more than all of their EA CB spending to date), and

mic Apr 13, 2022, 6:32 AM
11 points
0 ∶ 0
on: Free-spending EA might be a big problem for optics and epistemics
Free food and free conferences are things that are somewhat standard among various non-EA university groups. It’s easy to object to whether they’re an effective use of money, but I don’t think they’re excessive except under the EA lens of maximizing cost-effectiveness. I think if we reframe EA universities groups as being about empowering students to tackle pressing global issues through their careers, and avoid mentioning effective donations and free food in the same breath, then it’s less confusing why there is free stuff being offered. (Besides apparently being more appealing to students, I also genuinely think high-impact careers should be the focus of EA university groups.)
I’m in favor of making EA events and accommodation feel less fancy.
There are other expenses that I’d be more concerned about from an optics perspective about than free food and conferences.
You find out that if you build a longtermist group in your university, EA orgs will pay you for your time, fly you to conferences and hubs around the world and give you all the resources you could possibly make use of. This is basically the best deal that any student society can currently offer. Given this, how much time are you going to spend critically evaluating the core claims of longtermism?
It’s worth noting that these perks are available for new EA groups in general, not even particularly longtermist EA groups. That said, I think there are plenty of additional perks to being a longtermist (career advising from 80,000 Hours, grants from the Long-Term Future Fund or the FTX Future Fund to work on projects, etc.) that you might want to be one even if you’re intellectually unsure about it. I think another incentive pushing university organizers in favor of a longtermist direction is: it doesn’t make sense to be spending this much money on free food and conferences from a neartermist perspective, at least in my opinion.

mic Apr 7, 2022, 2:55 AM
2 points
0 ∶ 0
in reply to: Nikola’s comment on: Nikola’s Shortform
I’ve considered this before and I’m not sure I agree. If I’m at a +10 utility for the next 10 years and afterwards will be at +1,000,000 utility for the following 5,000 years, I might just feel like skipping ahead to be feeling +1,000,000 utility, simply from being impatient about getting to feel even better.

mic Apr 7, 2022, 1:24 AM
2 points
0 ∶ 0
in reply to: tlevin’s comment on: University Groups Should Do More Retreats
Got it, I’m surprised by how little time it took to organize HEA’s spring retreat. What programming was involved?

mic Apr 6, 2022, 10:50 PM
2 points
0 ∶ 0
on: University Groups Should Do More Retreats
For me, the main value of retreats/conferences has been forming lots connections, but I haven’t become significantly more motivated to be more productive, impactful, or ambitious. I have a couple questions which I think would be helpful for organizers to decide whether they should be running more retreats:
- How many hours does it take to organize a retreat?
- To what extent can the value of a retreat be ⁸⁰⁄₂₀′d with a series of 1-on-1s? (Perhaps while taking a walk through a scenic part of campus) Would that save organizer time?
- Do you have estimates as to how many participants have significant plan changes after a retreat?

mic Apr 6, 2022, 10:02 PM
4 points
0 ∶ 0
in reply to: Daniel H’s comment on: Intro to AI/ML Reading Group at EA Georgetown!
My experience with EA at Georgia Tech is that a relatively small proportion of people who complete our intro program participate in follow-up programs, so I think it’s valuable to have content you think is important in your initial program instead of hoping that they’ll learn it in a later program. I think plenty of Georgetown students would be interested in signing up for an AI policy/governance program, even if it includes lots of x-risk content.

mic Apr 6, 2022, 6:58 AM
5 points
0 ∶ 0
on: The Vultures Are Circling
As a community that values good epidemics
good epistemics?
Thanks for posting about this; I had no idea this was happening to a significant extent.

mic Apr 6, 2022, 6:38 AM
2 points
0 ∶ 0
on: Intro to AI/ML Reading Group at EA Georgetown!
To the extent that the program is meant to provide an introduction to “catastrophic and existential risk reduction in the context of AI/ML”, I think it should include some more readings on the alignment problem, existential risk from misaligned AI, transformative AI or superintelligence. I think the AI Safety Fundamentals AI Governance Program has some good readings for this.

mic Apr 5, 2022, 4:43 PM
18 points
0 ∶ 0
on: Where is the Social Justice in EA?
I think lack of diversity in EA is largely due to founder effects, and EA is working on this. There’s an emerging effort to have EA outreach in more global south countries like India, the Philippines, and Mexico, and local EA community-builders are working hard on that.
For what it’s worth, it seems to me that EA university groups have more racial and gender diversity than the broader EA movement, which I think is because they reach a broader base of people, compared to the type of people who randomly stumble across EA on the internet.
The EA community is exclusive by level of education. I have seen much written about how EA considers itself merit-based, however, to be recognized for epistemic merit, one would need to have at least a post-secondary education to achieve a reputable job.
I’m not sure I agree. I think many EA orgs are anti-credentialist enough that you wouldn’t need any university education if you have the skills, which could be built through, e.g., doing independent work funded by the Long-Term Future Fund. Actually, I think dropping out of college to do EA work is even more badass. Compared to academia, a good number of AI alignment researchers don’t have graduate degrees, though it is true that many EA leaders have graduate degrees.

mic Mar 31, 2022, 12:18 AM
15 points
0 ∶ 0
on: Announcing What We Owe The Future
Any chance the price could be reduced to lower the barrier to pre-ordering? It costs $30 on Amazon US for a hard copy, which is a lot to ask for.
What links here?
- Charles He's comment on What are great marketing ideas to encourage pre-orders of What We Owe The Future? by abier (Mar 31, 2022, 5:21 AM; 2 points)

mic Mar 30, 2022, 1:24 PM
3 points
0 ∶ 0
on: AI safety starter pack
you can usually find small online projects throughout the year
Where?

mic Mar 20, 2022, 8:05 PM
7 points
0 ∶ 0
on: $100 bounty for the best ideas to red team
Red team: Argue that moral circle expansion is or is not an effective lever for improving the long-term future. Subpoint: challenge the claim that ending factory farming effectively promotes moral circle expansion to wild animals or digital sentient beings.
Related:
- Why I prioritize moral circle expansion over artificial intelligence alignment—EA Forum and the comments on the post

mic Mar 20, 2022, 7:58 PM
5 points
0 ∶ 0
on: $100 bounty for the best ideas to red team
Red team: Certain types of prosaic AI alignment (e.g., arguably InstructGPT) promote the illusion of safety without genuinely reducing existential risk from AI, or are capabilities research disguised as safety research. (A claim that I’ve heard from EleutherAI, rather indelicately phrased, and would like to see investigated)

mic Mar 20, 2022, 7:50 PM
8 points
0 ∶ 0
on: $100 bounty for the best ideas to red team
Red team: Is existential security likely, assuming that we avoid existential catastrophe for a century or two?
Some reasons that I have to doubt that existential security is the default outcome we should expect:
- Even superintelligent aligned AI might be flawed and fail catastrophically eventually
- Vulnerable world hypothesis
- Society is fairly unstable
- Unregulated expansion throughout the galaxy may reduce extinction risk but may increase s-risks, and may not be desirable

mic Mar 20, 2022, 7:45 PM
34 points
0 ∶ 0
on: $100 bounty for the best ideas to red team
Red team: Is the expected value of extinction risk reduction positive?
Relevant articles:
- The expected value of extinction risk reduction is positive—EA Forum
- A longtermist critique of “The expected value of extinction risk reduction is positive”—EA Forum

mic Mar 20, 2022, 7:38 PM
3 points
0 ∶ 0
in reply to: QubitSwarm99’s comment on: $100 bounty for the best ideas to red team
I helped start WikiProject Effective Altruism a few months ago, but I think that the items on our WikiProject’s to-do list are not as valuable as, say, organizing a local EA group at a top university, or writing a useful post on the EA Forum. One tricky thing about Wikipedia is that you have to be objective, so while someone might read an article on effective altruism and be like “wow, this is a cool idea”, you can’t engage them further. I also think that the articles are already pretty decent.

mic Mar 20, 2022, 4:29 PM
5 points
0 ∶ 0
on: Career Advice: Philosophy + Programming → AI Safety
Richard Ngo recently wrote a post on careers in AI safety.

I think you could divide AI safety careers into six categories. I’ve written some quick tentative thoughts on how you could get started, but I’m not an expert in this for sure.
- Software engineering: infrastructure, building environments, etc.
  - Do LeetCode/NeetCode and other interview prep and get referrals to try to get a really good entry-level software engineering job. Work in software engineering for a few years, try to get really good at engineering (e.g., being able to dive into a large, unfamiliar codebase and submit a significant pull request within a few weeks). Maybe learn in-demand skills like parallel computing, data engineering, information security, etc. Then, try to get into a software engineering role at Anthropic, Redwood Research, etc. Anthropic is generally looking for fairly experienced engineers, as they aren’t able to provide enough mentorship at this stage for new engineers.
- ML implementation: converting a research idea into a working model.
  - Take an ML course (you can apply for a grant from the Long-Term Future Fund if necessary), especially in deep natural language processing or reinforcement learning, reproduce some ML papers, maybe do a master’s in ML if you want, apply for ML jobs at Redwood or Anthropic.
- ML research direction: coming up with good ideas, designing experiments.
  - Maybe do a PhD in machine learning, apply to CHAI or DeepMind or OpenAI? But I’ve heard that a PhD takes way too long and many AI safety orgs aren’t that credentialist. I have no idea what I’m talking about here.
- Theory research: building good abstractions, mathematical reasoning.
  - Go through the AGI Safety Fundamentals technical alignment program or dive deep into alignment research that seems interesting to you. Think about the Eliciting Latent Knowledge problem and Richard Ngo’s Alignment research exercises, and maybe apply for a grant from the Long-Term Future Fund to do independent research.
- AI policy
  - I’m not that familiar with this, but I think you could start with the AGI Safety Fundamentals governance program
- Non-technical roles in AI safety orgs such as Redwood Research. I’m also personally excited about AI safety field-building at top universities, something like EA movement-building at universities, based on the experience of EA at Georgia Tech, OxAI Safety Hub, EA NYU, and AI Safety @ MIT this semester.
Again, check out Richard Ngo’s post on careers in AI safety, and apply for relevant internships/residencies. AI jobs that aren’t related to safety can still be helpful for gaining experience so you can transition to safety work.