Sebastian_Farquhar

Karma: 240

Changes in funding in the AI safety field

Sebastian_Farquhar 3 Feb 2017 13:09 UTC

34 points

10 comments · 7 min read · EA link

Sebastian_Farquhar 11 Jan 2022 16:50 UTC
20 points
0 ∶ 0
on: What is the role of Bayesian ML for AI alignment/safety?
I began my PhD with a focus on Bayesian deep learning with exactly the same reasoning as you. I also share your doubts about the relevance of BDL to long-term safety. I have two clusters of thoughts: some reasons why BDL might be worth pursuing regardless, and alternative approaches.

Considerations about BDL and important safety research:
- Don’t overfit to recent trends. LLMs are very remarkable. Before them, DRL was very remarkable. I don’t know what will be remarkable next. My hunch is that we won’t get AGI by just doing more of what we are doing now. (People I respect disagree with that, and I am uncertain. Also, note I don’t say we couldn’t get AGI that way.)
- Bayesian inference is powerful and general. The original motivation is still real. It is tempered by your (in my view, correct) observation that existing methods for approximate inference have big flaws. My view is that probability still describes the correct way to update given evidence and so it contains deep truths about reliable information processing. That means that understanding approximate Bayesian inference is still a useful guide for anyone trying to automatically process information correctly (and being aware of the necessary assumptions). And an awful lot of failure modes for AGI involve dangerous mistaken generalization. Also note that statements like “simple non-Bayesian techniques such as ensembles” are controversial, and there’s considerable debate about whether ensembles are working because they perform approximate integration. Andrew Gordon Wilson has written a lot about this, and I tentatively agree with much of it.
- Your PhD is not your career. As Mark points out, a PhD is just the first step. You’ll learn how to do research. You really won’t start getting that good at it until a few years in, by which point you’ll write up the thesis and start working on something different. You’re not even supposed to just keep doing your thesis as you continue your research. The main thing is to have a great research role model, and I think Phillip is quite good (by reputation, I don’t know him personally).
- BDL teaches valuable skills. Honestly, I just think statistics is super important for understanding modern deep learning, and it gives you a valuable lens to reason about why things are working. There are other specialisms that can develop valuable skills. But I’d be nervous about trading the opportunity to develop deep familiarity with the stats for practical experience on current SoTA systems (because stats will stay true and important, but SoTA won’t stay SoTA). (People I respect disagree with that, and I am uncertain.)
Big picture, I think intellectual diversity among AGI safety researchers is good, Bayesian inference is important and fundamental, and lots of people glom on to whatever the latest hot thing is (currently LLMs), leading to rapid saturation.
So what is interesting to work on? I’m currently thinking about two main things:
- I don’t think that exact alignment is possible, in ways that are similar to how exact Bayesian inference is generally not possible. So I’m working on trying to learn from the ways in which approximate inference is well/poorly defined to get insights for how alignment can be well/poorly defined and approximated. (Here I agree 100% with Mark that most of what is hard in AGI safety remains framing the problem correctly.)
- I think a huge problem for AGI-esque systems is about to be hunting for dangerous failures. There’s a lot of BDL work on ‘actively’ finding informative data, but mostly for small-data in low-dimensions. I’m much more interested in huge data, high-dimensions, which creates whole new problems (e.g., you can’t just compute a score function for each possible datapoint). (Note that this is almost exactly the opposite to Mark’s point below! But I don’t exactly disagree with him, it’s just that lots of things are worth trying.)
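To make the "you can’t just compute a score function for each possible datapoint" point concrete, here is a minimal sketch of scoring only a random subsample of a very large pool. It is an illustration under stated assumptions, not a method from the comment: the acquisition score (entropy of the ensemble-averaged prediction), the function names, and the toy two-member "ensemble" are all hypothetical.

```python
import math
import random


def predictive_entropy(member_probs):
    """Entropy of the ensemble-averaged class distribution for one input."""
    n_members = len(member_probs)
    n_classes = len(member_probs[0])
    mean = [sum(p[c] for p in member_probs) / n_members for c in range(n_classes)]
    return -sum(q * math.log(q) for q in mean if q > 0)


def select_from_subsample(pool_size, predict, budget, k, seed=0):
    """Score only `budget` randomly drawn pool indices; return the k highest-scoring."""
    rng = random.Random(seed)
    # Sampling from a range object never materialises the full pool in memory.
    candidates = rng.sample(range(pool_size), budget)
    scored = [(predictive_entropy(predict(i)), i) for i in candidates]
    scored.sort(reverse=True)
    return [i for _, i in scored[:k]]


def toy_predict(i):
    """Hypothetical stand-in for a model ensemble: members disagree on odd indices."""
    if i % 2:
        return [[0.9, 0.1], [0.1, 0.9]]
    return [[0.9, 0.1], [0.9, 0.1]]


# A pool of a billion points, of which only 1,000 are ever scored.
chosen = select_from_subsample(pool_size=10**9, predict=toy_predict, budget=1000, k=5)
```

The cost of selection here depends on `budget` rather than on the pool size. That is the only point of the sketch: uniform subsampling is the crudest possible workaround and will usually miss rare dangerous failures.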
There are other things that are important, and I agree that OOD detection is also important (and I’m working on a conceptual paper on this, rather than a detection method specifically). If you’d like to speak about any of this stuff I’m happy to talk. You can reach me at sebastian.farquhar@cs.ox.ac.uk

New UK aid strategy – prioritising research and crisis response

Sebastian_Farquhar 2 Dec 2015 19:38 UTC

17 points

4 comments · 4 min read · EA link

Effective Altruism is a Big Tent

Sebastian_Farquhar 24 Aug 2015 11:52 UTC

13 points

6 comments · 4 min read · EA link

We are Seb Farquhar and Owen Cotton-Barratt from the Global Priorities Project, AUsA!

Sebastian_Farquhar 17 Mar 2015 15:59 UTC

11 points

31 comments · 1 min read · EA link

CEA is launching a winter fundraising round

Sebastian_Farquhar 9 Dec 2015 16:39 UTC

11 points

54 comments · 1 min read · EA link

Sebastian_Farquhar 25 Feb 2016 10:19 UTC
11 points
0 ∶ 0
on: Should you start your own project now rather than later?
Often (in EA in particular) the largest cost to a failed started project isn’t to you, but is a hard-to-see counterfactual impact.

Imagine I believe that building a synth bio safety field is incredibly important. Without a real background in synth bio, I go about building the field, but because I lack context and subtle field knowledge, I screw it up after having reached out to almost all the key players. They would now be conditioned to think that synth bio safety is something that is pursued by naive outsiders who don’t understand synth bio. This makes it harder for future efforts to proceed. It makes it harder for them to raise funds. It makes it harder for them to build a team.

The worst case is that you start a project, fail, but don’t quit. This can block the space, and stop better projects from entering it.

These can be worked around, but it seems that many of your assumptions are conditional on not having these sorts of large negative counterfactual impacts. While that may work out, it seems overconfident to assume a 0% chance of this, especially if the career capital building steps are actually relevant domain knowledge building.
What links here?
- Should you start your own project now rather than later? by tyleralterman (25 Feb 2016 2:22 UTC; 5 points)

Why averting a DALY through deworming may be better than through malaria nets

Sebastian_Farquhar 16 Mar 2015 13:51 UTC

8 points

9 comments · 1 min read · EA link

Sebastian_Farquhar 10 Dec 2015 16:15 UTC
7 points
0 ∶ 0
in reply to: Denise_Melchin’s comment on: CEA is launching a winter fundraising round
It’s a good question, and one that we ask ourselves a lot. If we thought we were worse than AMF and that wasn’t likely to change, we would close up shop. I am fairly confident that we produce more value than AMF, partly because our activities raise more for AMF than they take away. However, I think it’s right to be uncertain about this and Owen makes some good points.

In addition, I think most of the value of CEA’s activities comes from long term potential of our projects and EA as a whole—as Ben discusses here.

Our positive effect on AMF is clearest at Giving What We Can which has a return of roughly 100:1 in high-value donations (counterfactually adjusted and time-discounted, but not all to AMF). Even if you assume that not a single member of GWWC gives another penny ever, the ratio is still 5:1. It is unclear if the marginal return on a donation to GWWC is higher or lower than the average return. It would be higher if we thought that GWWC could still realise increasing economies of scale. It would be lower if we thought most of the value comes from the idea itself and not execution on it. I tend to think marginal funds are more effective than average funds, but I’m very uncertain. A fuller discussion is here (http://effective-altruism.com/ea/ql/giving_what_we_can_needs_your_help_this_christmas/)

At 80k, the metrics are less directly comparable. At the last review we estimated it cost £1,670 to achieve a significant plan change (and these costs have been coming down every review cycle, indicating we are getting more cost-effective). It’s unclear how much each plan change is worth—but it seems very likely that getting someone to earn-to-give or move to do valuable direct work will be worth far more than £1,670 to AMF even within one year.

GPP impact is extremely hard to estimate because idea change and policy-work are chaotic and complex. In order to get a lower bound, we can focus on just one policy that we advocated which was successfully implemented—increasing the research budget for treatment and vaccines for malaria, TB, and NTDs and pandemic prevention by £2.5bn over 5 years. If our calculations are correct this move was worth $1.5bn-$30bn in donations to AMF. Even if we are only responsible for a very small part of this, it isn’t hard to imagine our 2015 budget outperformed a donation to AMF. (See discussion here http://globalprioritiesproject.org/2015/12/new-uk-aid-strategy-prioritising-research-and-crisis-response/)

EA Outreach is probably hardest to compare directly against AMF-type charities because much of our estimate of its value depends on the fact that we think effective altruism and its ideas have huge upside potential. Any attempt to calculate the direct impact within the first year of its running in terms of money to AMF would short-change the value of the work.

Because most of the money that goes to CEA has a huge counterfactual positive impact on funding for AMF, I’m quite confident in recommending giving to CEA.

With respect to your question about growth in costs—I think Owen has some good thoughts here. It seems, however, that the unit costs of CEA outputs are stable or decreasing so the growth in costs represents expanding outputs rather than decreasing marginal returns.

Donations to Global Priorities Project matched for just two more weeks

Sebastian_Farquhar 18 Jan 2016 14:04 UTC

6 points

10 comments · 1 min read · EA link

Sebastian_Farquhar 25 Aug 2015 11:28 UTC
6 points
0 ∶ 0
in reply to: Tom_Ash’s comment on: Effective Altruism is a Big Tent
That’s right. It would seem extremely unlikely that one should have a multi-billion dollar industry with no-one thinking about what happens if it succeeds at its aim.

It’s very important for EAs to recognise that there probably isn’t a single best cause (and that even if there is, the uncertainties are too big to allow us to identify it). Even if there was an identifiable best cause, it is likely to change, so it’s bad for EAs to identify too strongly with any one cause.

There’s a broader risk in focusing on marginal cost-effectiveness—that it leads to local rather than global optimisation. It’s a good heuristic, but bad to rely on too much.

Sebastian_Farquhar 17 Mar 2015 20:31 UTC
6 points
0 ∶ 0
in reply to: RyanCarey’s comment on: We are Seb Farquhar and Owen Cotton-Barratt from the Global Priorities Project, AUsA!
There are a lot of different audiences. Political decision-makers, the public at large, and academics are three.

Decision-makers in government are often (at least in the UK) very well intentioned and keen to use the right models and assumptions. But they are also very busy and have little time to do research and learn. We believe the best way to influence them is to engage with their work, understand what they are struggling with, and then produce really concise and useable frameworks for them. It’s really important to physically get paper copies into their offices. This is the approach we used while engaging with the National Risk Assessment, and one we will continue to use. For example, a contact in central government has suggested that, despite extensive academic work on the topic, decision-makers still do not really understand discount rates and could use a very clear ‘how-to’ note that can be passed around.

Influencing the public at large is going to take interaction with journalists and branding experts. It is a regrettable accident that the EA movement so far has been light on these skills—we hope that will change and are reaching out to journalists, marketing experts and PR workers (I spoke yesterday with a worker at a PR agency for academic public impact).

EAs may want to influence academics. Potential routes for this include doing impressive direct work (publishing, attending conferences etc.) to encourage others to build on the work. But an alternative strategy is to ‘pull side-ways’ (by offering prizes, hosting conferences, persuading top researchers etc.).

Sebastian_Farquhar 17 Mar 2015 20:14 UTC
6 points
0 ∶ 0
in reply to: RyanCarey’s comment on: We are Seb Farquhar and Owen Cotton-Barratt from the Global Priorities Project, AUsA!
This is a really important topic that we aren’t discussing enough in the EA community. At the moment, Owen is working on a paper on modelling the marginal value of different research topics. It seems very likely that we will build on that paper by estimating the marginal value of a range of promising technology areas to compare against each other (a DCP for technology, as it were). This work wouldn’t address sequencing issues, and those are really important and something we should address as a society. Owen has some preliminary ideas in this direction and GPP may investigate this further. This work is, however, part of a very full pipeline of other work.

This highlights another important point: we aren’t the first to face these issues. People have been dealing with, and making predictions about, radical future-changing technologies for centuries. GPP has already applied for funding to hire a researcher to investigate the historical track record of such predictions, and predictions of mitigation strategies, to make us smarter about estimating which sorts of x-risks and future challenges we are best placed to act to mitigate. We’ve also had interest from some donors to part-fund such activities. If anyone is interested in matching that contribution we may be able to speed up that hire.

Sebastian_Farquhar 11 Mar 2014 14:38 UTC
6 points
0 ∶ 0
on: The history of the term ‘effective altruism’
I remember one of my favourites for the name of CEA was the Federation for Effective Altruism Research. Or the Society for the Progress of Empathetic Consequentialism Through Reasoned Evaluation. I think the first may have been yours, Will. ;)

We’re hiring an AI Senior Policy Fellow

Sebastian_Farquhar 2 Jul 2015 16:10 UTC

5 points

0 comments · 1 min read · EA link

Sebastian_Farquhar 9 Dec 2015 23:52 UTC
5 points
0 ∶ 0
in reply to: number42’s comment on: CEA is launching a winter fundraising round
Not quite, our total budget for 2016 is about £1.2m, about $1.8m (detailed breakdown on page 12 of the prospectus).

The sum of the funding targets is greater than our budget because at the moment many of our organisations have quite small reserves and need to raise more than they plan to spend this year in order to have healthy reserves at the end of the year. That would allow us to only fundraise once per year, which is a much more efficient use of staff time. General advice is for charities to have roughly 6-18 months of reserves at all times.

CEA is Hiring! Applications due by 18th October

Sebastian_Farquhar 24 Sep 2015 13:18 UTC

4 points

0 comments · 1 min read · EA link

Sebastian_Farquhar 11 Dec 2015 16:50 UTC
4 points
0 ∶ 0
in reply to: AGB’s comment on: CEA is launching a winter fundraising round
The EA community, broadly defined, donates a huge amount of money. GiveWell moved more than $20m (excluding GoodVentures) in 2015 and credits effective altruism as motivating a substantial part of this. Giving What We Can moved more than $3m. FLI committed grants worth about $7m. Leverhulme Foundation granted $15m for existential risk research. This is far from exhaustive, but we’re looking at something on the order of $50m fairly easily.

Relative to this, CEA’s $1.8m does not seem nearly as large. I think one of the sources of intuitive surprise is just that the EA movement as a whole seems to be roughly doubling or a bit more in size every year which means that the heuristics we have for thinking about size become out of date very quickly. Relative to EA as a whole, CEA may be shrinking slightly since we have been growing a little slower than doubling.
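The comparison above can be checked with a few lines of arithmetic. This is a rough sketch using only the figures quoted in this comment. It treats the "more than" and "about" amounts as point values, and it ignores that CEA’s $1.8m is a 2016 budget while the other figures are for 2015; both are simplifying assumptions.

```python
# Figures quoted in the comment, in millions of US dollars.
money_moved = {
    "GiveWell (excluding GoodVentures)": 20,
    "Giving What We Can": 3,
    "FLI grants": 7,
    "Leverhulme Foundation": 15,
}
cea_budget = 1.8

# Sum of the four listed sources only; the comment notes the list is not exhaustive.
listed_total = sum(money_moved.values())

# CEA's budget as a fraction of the listed total.
cea_share = cea_budget / listed_total

print(f"Listed total: ${listed_total}m; CEA share: {cea_share:.0%}")
```

The four listed sources alone come to $45m, which is consistent with the rough $50m figure once unlisted giving is included, and CEA’s budget is about 4% of the listed total.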

Most of the projects have significant non-EA funding, but this is something we’re trying to grow (for example by recruiting for a development manager who could expand our non-EA donor base). 80k got a lot of funding through YCombinator and associated leads. GWWC gets a substantial amount from people with a strong interest in development and giving but less in effective altruism itself. GPP gets significant funding from grant sources that wouldn’t otherwise fund EA work. Even EAO got at least $50k from people who have not typically given to EA charities, which is surprising for an organisation focused so heavily on the EA community itself. I’ve gone over our numbers and think 80k may have gotten more than half its budget outside the EA community recently. GPP gets around 40%. This is pretty loose stuff though, because it’s so hard to define what counts as EA money and we don’t have good access to the counterfactuals. Ben also makes a really important point about the donors who move from giving to us to supporting other EA projects.

Sebastian_Farquhar 17 Mar 2015 22:40 UTC
4 points
0 ∶ 0
in reply to: HaydnBelfield’s comment on: We are Seb Farquhar and Owen Cotton-Barratt from the Global Priorities Project, AUsA!

If a reader wants to help GPP, what should they do?

At the moment GPP is funding constrained. We have an enormous pipeline of work—at one end we have literally hundreds of ideas we would love to pursue, but we also have several person-years of work on the table which is simply adapting our existing research to a particular audience to have impact. Anyone who is either able to donate or knows someone who might be able to would be enormously helpful. Based on the experience of other EA organisations, it is possible that we will become talent-constrained within the next year or two.

Beyond that, we continue to value introductions to individuals in governments or foundations. We also have more of these introductions available than we can currently pursue, but this is something where variety and quality of the lead is important. Knowing we could access a particular type of individual is useful, even when we do not pursue the lead immediately. We have a good system for tracking these opportunities to pursue later. We would also love to be able to help academics focus their research directions with an eye to impact. Introductions to academics who may be receptive and are in a position to choose their research direction would therefore be great.

Lastly, we really value challenge to our ideas. This AMA has already thrown up some questions that will change how we plan and think about our work. Anyone is welcome to send me critiques either as a PM or emailing seb[at]prioritisation-dot-org. I have had some extremely productive follow-on conversations with EAs who sent me feedback like that.

What would you do with a) £2,000 b) £10,000 c) £20,000?

At the moment, additional funding goes towards making sure we have a sustainable foundation for the organisation. Best-practice is to have 12 months of reserves, which at this point means raising an additional £20-25k (this is a rough number and does not include some pledged donations not yet received). Once we have raised that level, we would like to hire an additional member of staff. We expect, counting overhead costs like office space, HR, finance etc. that an additional staff member would cost us £35-40k. In order to offer credible job-security to a new hire, we would like to have at least a full year of reserves set aside to fund that hire.

All this means that, in order to comfortably hire a new staff member in the next CEA recruitment cycle we are raising towards a target of £100,000.
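One way to see how the figures above could add up to roughly £100,000 is sketched below. The decomposition is an assumption, not something stated in the answer: it reads the target as the reserve top-up, plus the new hire’s first-year cost, plus one further year of that cost held in reserve.

```python
# All amounts in thousands of pounds, as (low, high) ranges taken from the answer.
reserve_top_up = (20, 25)        # to reach 12 months of reserves
staff_cost_per_year = (35, 40)   # new hire, including overheads


def add_ranges(a, b):
    """Add two (low, high) ranges element-wise."""
    return (a[0] + b[0], a[1] + b[1])


# Assumed reading: top-up + first year of the hire + one year of reserves for the hire.
hire_total = add_ranges(staff_cost_per_year, staff_cost_per_year)
target_range = add_ranges(reserve_top_up, hire_total)

print(f"Implied target: £{target_range[0]}k to £{target_range[1]}k")
```

Under that reading the implied range is £90k to £105k, which brackets the stated £100,000 target. Other readings of "a full year of reserves" would give a lower figure.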

A picture of the historical unit costs of some of our outputs (to be distinguished from outcomes) is available in our strategy document, although these are very rough estimates. You can also find more details of our funding needs.

What do you think your room-for-more-funding is?

I think we could comfortably absorb £150,000 (which would build 12 months of reserves and allow us to hire two researchers, and possibly an intern). Funds beyond that could be put to creative use (for example, hiring researchers qua the University is more expensive, but might let us get better talent) but might be better directed at other organisations.

You’re based in the UK—there’s about to be an election, then five years of a new government. How does that affect your plans?

At the moment, individuals in government are largely distracted by the upcoming elections, so we have deprioritised outreach to UK policy-makers. We plan to spend the time until the election (May 7th) preparing policy briefs and fundraising so that we can focus on policy outreach in the months following the election. Conventional wisdom is that this is the best time to pursue policy objectives.

We have probably not devoted enough resources to developing contacts in the Opposition. The election is too close to call, so this may not end up being a problem, but we are open to pursuing strong leads in this period despite the attention of politicians being elsewhere.

Who are the key decision-makers/stakeholders in your area? Have you mapped them out—how they relate, what their responsibilities are? What Government Departments are you mainly interested in? Which are you monitoring? Are there any consultations open at the moment that you are submitting to? Same question for Parliamentary Committees.

Because we are trying to appeal to such a broad range of communities and enable comparison between them, there are a very large number of stakeholders. Within the UK government, we have the most to say to similarly broad organisations (Cabinet Office and Treasury) as well as departments like DFID or DoH (similarly PHE) where we have specific interests that overlap. Similarly, within foundations, we see many existing metacharity organisations as stakeholders to engage with (including GiveWell, Copenhagen Consensus, DCP, WHO and others).

Consultations and parliamentary committees are an excellent point—this is something that I’ve been monitoring since I joined the team. In that period (just under two months) we have not seen any for which we felt we had sufficiently valuable things to contribute (which were also a priority for us). It is too early to say, though, whether that avenue will prove effective in the long run.

Sebastian_Farquhar 17 Mar 2015 21:12 UTC
4 points
0 ∶ 0
in reply to: HaydnBelfield’s comment on: We are Seb Farquhar and Owen Cotton-Barratt from the Global Priorities Project, AUsA!

How many people work full-time and part-time on GPP? What are sustainable growth predictions?

Owen and I effectively work full-time on GPP (Owen has some teaching commitments as well). Toby Ord, Rob Wiblin, and Niel Bowerman all contribute irregularly to GPP projects, averaging a couple of hours a week each. We aim to hire 1-2 new staff this year depending on fundraising.

Do you model yourself as a think-tank?

Somewhat, although think tanks have a wide variety of models and the type is not that well-defined (some have barely any staff while others have hundreds; some mostly lobby while others mostly do research). We are similar to many think tanks in that our goal is to influence policy and academic work without being a formal part of either system. Some of the future models of GPP look less like a think-tank.

What think-tanks have you looked at, spoken to, or modelled yourself upon?

We’ve spoken to people at a few think tanks, about specific issues like fundraising rather than their general approach, but have not modelled ourselves on any particular one. I think this is a good point though, and we may have underinvested in this area. Would be great to have a conversation with you about this some time.

Have you reached out to e.g. RUSI, BASIC, etc? Do you plan to?

We have not and do not currently have plans to, although it might make sense in the future. Our current focus has been less on topics related to defense (our current work in existential risk, for example, is focused on civilian biosafety risks).

What are your plans for the next a) 6 months b) year c) 5 years?

For the next 6 months we plan to test out models for impact. At around that point we aim to use what we’ve learned to focus our work onto the model which appears most effective, while continuing to evaluate and explore options. We plan to review that decision periodically with the possibility of future ‘pivots’ (drawing on the best-practice start-up literature). Some of our work has natural timescales which are shorter than other parts, so we will be able to reach conclusions earlier.

Models we are considering have strong commonalities and build off of our skills and current work, but might look different operationally. They include, for example, a focused policy think-tank, a policy evaluation think-tank, a policy evaluation consultancy, an academic organisation trying to seed ‘prioritisation’ as an academic discipline, or a cause comparison meta-charity organisation.

In what ways are you experimenting and iterating?

In our work-plan we divide activities around impact strategies. For example, one work-stream is to produce a really focused policy proposal worked through at a very detailed level and to get lobby groups in that field to push it forward. Another is to engage with an existing policy evaluation framework and suggest specific improvements. Once we do one, for example by producing a ‘topic primer’ on Unprecedented Technological Risks, we deprioritise similar activities to try to get more information about other routes to impact. By doing this, and evaluating the impact of each approach, we plan to focus down to a small number of effective and synergistic mechanisms for impact.

We are very aware that some of our approaches will have a high intrinsic variance, and are trying to correct for that in how we assess progress. Clearly, however, this will not be easy since we can never get a satisfactory sample size.

We are also ramping up the work we do to measure impact, both by getting better at tracking our inputs and by asking for more feedback on our outputs. Our recent push to increase engagement with our work is also partly in order to increase the quality of the feedback we get from producing it.