jsteinhardt

Karma: 770

jsteinhardt 1 Dec 2016 4:14 UTC
10 points
in reply to: Ben Pace’s comment on: Why I’m donating to MIRI this year

(including the bizarre-ness of OpenPhil’s analysis of number-of-papers-written, which is not how one measures progress of fundamentals research.)

What in the grant write-up makes you think the focus was on number-of-papers-written? I was one of the reviewers and that was definitely not our process.

(Disclaimer: I’m a scientific advisor for OpenPhil, all opinions here are my own.)
What links here?
- Why I’m donating to MIRI this year by Owen Cotton-Barratt (30 Nov 2016 22:21 UTC; 34 points)

jsteinhardt 2 Dec 2016 4:39 UTC
3 points
in reply to: Owen Cotton-Barratt’s comment on: Why I’m donating to MIRI this year
I feel like I care a lot about theory-building, and at least some of the other internal and external reviewers care a lot about it as well. As an example, consider External Review #1 of Paper #3 (particularly the section starting “How significant do you feel these results are for that?”). Here are some snippets (link to document here):

The first paragraph suggests that this problem is motivated by the concern of assigning probabilities to computations. This can be viewed as an instance of the more general problems of (a) modeling a resource-bounded decision maker computing probabilities and (b) finding techniques to help a resource-bounded decision maker compute probabilities. I find both of these problems very interesting. But I think that the model here is not that useful for either of these problems. Here are some reasons why:

It’s not clear why the properties of uniform coherence are the “right” ones to focus on. Uniform coherence does imply that, for any fixed formula, the probability converges to some number, which is certainly a requirement that we would want. This is implied by the second property of uniform coherence. But that property considers not just constant sequences of formulas, but sequence where the nth formula implies the (n+1)st. Why do we care about such sequences? [...]

The issue of computational complexity is not discussed in the paper, but it is clearly highly relevant. [...]

Several more points are raised, followed by (emphasis mine):

I see no obvious modification of uniformly coherent schemes that would address these concerns. Even worse, despite the initial motivation, the authors do not seem to be thinking about these motivational issues.

For another example, see External Review #1 of Paper #4 (I’m avoiding commenting on internal reviews because I want to be sensitive to breaking anonymity).

On the website, it is promised that this paper makes a step towards figuring out how to come up with “logically non-omniscient reasoners”. [...]

This surely sounds impressive, but there is the question whether this is a correct interpretation of Theorem 5. In particular, one could imagine two cases: a) we are predicting a single type of computation, and b) we are predicting several types of computations. In case (a), why would the delays matter in asymptotic convergence in the first place? [...] In case (b), the setting that is studied is not a good abstraction: in this case there should be some “contextual information” available to the learner, otherwise the only way to distinguish between two types of computations will be based on temporal relation, which is a very limiting assumption here.

To end with some thoughts of my own: in general, when theory-building, I think it is very important to consider both the relevance of the theoretical definitions to the original problem of interest, and the richness of what can actually be said. I don’t think that definitions can be assessed independently of the theory that can be built from them. At the risk of self-promotion, I think that my own work here, which makes both definitional and theoretical contributions relevant to ML + security, does a good job of putting forth definitions and justifying them (by showing that we can get unexpectedly strong results in the setting considered, via a nice and fairly general algorithm, and that these results have unexpected and important implications for initially unrelated-seeming problems). I also claim that this work is relevant to AI safety, but perhaps others will disagree.

jsteinhardt 2 Dec 2016 5:54 UTC
2 points
in reply to: jsteinhardt’s comment on: Why I’m donating to MIRI this year
Also, I realized it might not be clear why I thought the quotes above are relevant to whether the reviews addressed the “theory-building” aspect. The point is it seems to me that the quoted parts of the reviews are directly engaging with whether the definitions make sense / the results are meaningful, which is a question about the adequacy of the theory for addressing the claimed questions, and not of its technical impressiveness. (I could imagine you don’t feel this addresses what you meant by theory-building, but in that case you’ll have to be more specific for me to understand what you have in mind.)

jsteinhardt 11 Dec 2016 10:46 UTC
4 points
on: How many hits does hits-based giving get? A concrete study idea to find out (and a $1500 offer for implementation)
I like this idea. One danger (in both directions) with comparing to VC is that, as far as I can tell, venture capital is far more focused on prestige and connections than charity funding is. In particular, if you can successfully become a prestigious, well-connected VC firm, then all of the Stanford/MIT students (for instance) will want you to fund their start-up, and picking with only minimal due diligence from among that group is likely to already be fairly profitable. [Disclaimer: I’m only tangentially connected to the VC world, so this could be completely wrong; feel free to correct me.]

If this is true, what should we expect to see? We should expect that (1) VCs put in less research than OpenPhil (or similar organizations) when making investments, and (2) a hits-based approach is very successful for VC firms, conditional on their having a strong established reputation. I would guess that both of these are true, though I’m unsure of the implications.

jsteinhardt 28 Dec 2016 17:07 UTC
1 point
in reply to: benmusch’s comment on: We Must Reassess What Makes a Charity Effective

So jobs don’t go away, they are just created in other areas.

This isn’t really true. Yes, there is probably some job replacement, so the jobs don’t literally disappear 1-for-1. But there will probably be fewer jobs overall, and I don’t think it’s easy to say (without doing some research) whether the net loss is 0.1, 0.5, or 0.9 jobs for each malaria-net-making job that goes away.

jsteinhardt 30 Dec 2016 20:46 UTC
2 points
in reply to: Peter Wildeford’s comment on: My Donations for 2016
Thanks. I think my reasons are basically the same as those in this post: http://effective-altruism.com/ea/14d/donor_lotteries_demonstration_and_faq/.

jsteinhardt 12 Jan 2017 19:19 UTC
33 points
on: Building Cooperative Epistemology (Response to “EA has a Lying Problem”, among other things)
I strongly agree with the points Ben Hoffman has been making (mostly in the other threads) about the epistemic problems caused by holding criticism to a higher standard than praise. I also think that we should be fairly mindful that providing public criticism can have a high social cost to the person making the criticism, even though they are providing a public service.

There are definitely ways that Sarah could have improved her post. But that is basically always going to be true of any blog post unless one spends 20+ hours writing it.

I personally have a number of criticisms of EA (despite overall being a strong proponent of the movement) that I am fairly unlikely to share publicly, due to the following dynamic: anything I write that wouldn’t incur unacceptably high social costs would have to be a highly watered-down version of the original point, and/or involve so much of my time to write carefully that it wouldn’t be worthwhile.

While I’m sympathetic to the fact that there’s also a lot of low-quality / lazy criticism of EA, I don’t think responses that involve setting a high bar for high-quality criticism are the right way to go.

(Note that I don’t think that EA is worse than is typical in terms of accepting criticism, though I do think that there are other groups / organizations that substantially outperform EA, which provides an existence proof that one can do much better.)
What links here?
- Daniel_Dewey's comment on Building Cooperative Epistemology (Response to “EA has a Lying Problem”, among other things) by Raemon (13 Jan 2017 15:32 UTC; 5 points)
- Daniel_Dewey's comment on Contra the Giving What We Can pledge by AlyssaVance (13 Jan 2017 16:08 UTC; 0 points)

jsteinhardt 13 Jan 2017 8:30 UTC
5 points
in reply to: Benjamin_Todd’s comment on: Building Cooperative Epistemology (Response to “EA has a Lying Problem”, among other things)
I think parts of academia do this well (although other parts do it poorly, and I think it’s been getting worse over time). In particular, if you present ideas at a seminar, essentially arbitrarily harsh criticism is fair game. Of course, this is different from the public internet, but it’s still a group of people, many of whom do not know each other personally, where pretty strong criticism is the norm.

My impression is that criticism has traditionally been a strong part of Jewish culture, but I’m not culturally Jewish so can’t speak directly.

I heard that Bridgewater did a bunch of stuff related to feedback/criticism but again don’t know a ton about it.

Of course, none of these examples address the fact that much of the criticism of EA happens over the internet, but I do feel that some of the barriers to criticism online also carry over in person (though others don’t).

jsteinhardt 14 Jan 2017 19:24 UTC
4 points
in reply to: Brian_Tomasik’s comment on: Building Cooperative Epistemology (Response to “EA has a Lying Problem”, among other things)
In my post, I said

anything I write that wouldn’t incur unacceptably high social costs would have to be a highly watered-down version of the original point, and/or involve so much of my time to write carefully that it wouldn’t be worthwhile.

I would expect that conditioned on spending a large amount of time to write the criticism carefully, it would be met with significant praise. (This is backed up at least in upvotes by past examples of my own writing, e.g. Another Critique of Effective Altruism, The Power of Noise, and A Fervent Defense of Frequentist Statistics.)

jsteinhardt 29 Jan 2017 3:35 UTC
10 points
in reply to: Kerry_Vaughan’s comment on: 80,000 Hours: EA and Highly Political Causes

Instead of writing this like some kind of expose, it seems you could get the same results by emailing the 80K team, noting the political sensitivity of the topic, and suggesting that they provide some additional disclaimers about the nature of the recommendation.

I don’t agree with the_jaded_one’s conclusions or think his post is particularly well-thought-out, but I don’t think raising the bar on criticism like this is very productive if you care about getting good criticism. (If you think the_jaded_one’s criticism is bad criticism, then I think it makes sense to just argue for that rather than saying that they should have made it privately.)

My reasons are very similar to Benjamin Hoffman’s reasons here.

jsteinhardt 29 Jan 2017 19:18 UTC
6 points
in reply to: the_jaded_one’s comment on: 80,000 Hours: EA and Highly Political Causes
OpenPhil made an extensive write-up on their decision to hire Chloe here: http://blog.givewell.org/2015/09/03/the-process-of-hiring-our-first-cause-specific-program-officer/. Presumably after reading that you have enough information to decide whether to trust her recommendations (taking into account also whatever degree of trust you have in OpenPhil). If you decide you don’t trust it then that’s fine, but I don’t think that can function as an argument that the recommendation shouldn’t have been made in the first place (many people such as myself do trust it and got substantial value out of the recommendation and of reading what Chloe has to say in general).

I feel your overall engagement here hasn’t been very productive. You’re mostly repeating the same point, and to the extent you make other points it feels like you’re reaching for whatever counterarguments you can think of, without considering whether someone who disagreed with you would have an immediate response. The fact that you and Larks are responsible for 20 of the 32 comments on the thread is a further negative sign to me (you could probably condense the same or more information into fewer better-thought-out comments than you are currently making).

jsteinhardt 30 Jan 2017 6:08 UTC
1 point
in reply to: Kerry_Vaughan’s comment on: 80,000 Hours: EA and Highly Political Causes
Thanks for clarifying; your position seems reasonable to me.

jsteinhardt 28 Feb 2017 6:46 UTC
12 points
in reply to: kbog’s comment on: What Should the Average EA Do About AI Alignment?
In general I think this sort of activism has a high potential for being net negative—AI safety already has a reputation as something mainly being pushed by outsiders who don’t understand much about AI. Since I assume this advice is targeted at the “average EA” (who presumably doesn’t know much about AI), this would only exacerbate the issue.

jsteinhardt 28 Feb 2017 6:48 UTC
11 points
in reply to: JoshuaFox’s comment on: What Should the Average EA Do About AI Alignment?
I already mention this in my response to kbog above, but I think EAs should approach this cautiously; AI safety is already an area with a lot of noise, with a reputation for being dominated by outsiders who don’t understand much about AI. I think outreach by non-experts could end up being net-negative.

jsteinhardt 10 Jul 2017 3:44 UTC
9 points
in reply to: Wei Dai’s comment on: My current thoughts on MIRI’s “highly reliable agent design” work
Shouldn’t this cut both ways? Paul has also spent far fewer words justifying his approach to others, compared to MIRI.

Personally, I feel like I understand Paul’s approach better than I understand MIRI’s approach, despite having spent more time on the latter. I actually do have some objections to it, but I feel it is likely to be significantly useful even if (as I, obviously, expect) my objections end up having teeth.

jsteinhardt 11 Jul 2017 15:55 UTC
5 points
in reply to: Wei Dai’s comment on: My current thoughts on MIRI’s “highly reliable agent design” work
This doesn’t match my experience of why I find Paul’s justifications easier to understand. In particular, I’ve been following MIRI since 2011, and my experience has been that I didn’t find MIRI’s arguments (about specific research directions) convincing in 2011*, and since then have had a lot of people try to convince me from a lot of different angles. I think pretty much all of the objections I have are ones I generated myself, or would have generated myself. Although, the one major objection I didn’t generate myself is the one that I feel most applies to Paul’s agenda.

(* There was a brief period shortly after reading the sequences when I found them extremely convincing, but I think I was much more credulous then than I am now.)

jsteinhardt 11 Jul 2017 15:59 UTC
6 points
in reply to: jsteinhardt’s comment on: My current thoughts on MIRI’s “highly reliable agent design” work
I think the argument along these lines that I’m most sympathetic to is that Paul’s agenda fits more into the paradigm of typical ML research, and so is more likely to fail for reasons that are in many people’s collective blind spot (because we’re all blinded by the same paradigm).

jsteinhardt 13 Jul 2017 15:16 UTC
14 points
in reply to: Wei Dai’s comment on: My current thoughts on MIRI’s “highly reliable agent design” work
(Speaking for myself, not OpenPhil, who I wouldn’t be able to speak for anyways.)

For what it’s worth, I’m pretty critical of deep learning, which is the approach OpenAI wants to take, and still think the grant to OpenAI was a pretty good idea; and I can’t really think of anyone more familiar with MIRI’s work than Paul who isn’t already at MIRI (note that Paul started out pursuing MIRI’s approach and shifted in an ML direction over time).

That being said, I agree that the public write-up on the OpenAI grant doesn’t reflect that well on OpenPhil, and it seems correct for people like you to demand better moving forward (although I’m not sure that adding HRAD researchers as TAs is the solution; also note that OPP does consult regularly with MIRI staff, though I don’t know if they did for the OpenAI grant).

jsteinhardt 6 Nov 2017 21:23 UTC
5 points
in reply to: Joey’s comment on: Talent gaps from the perspective of a talent limited organization.
FWIW, 50k seems really low to me (but I live in the U.S. in a major city, so maybe it’s different elsewhere?). Specifically, I would be hesitant to take a job at that salary, if for no other reason than that I would think the organization was either dramatically undervaluing my skills or so cash-constrained that I would be pretty unsure whether it would exist in a couple of years.

A rough comparison: if I were doing a commissioned project for a non-profit that I felt was well-run and value-aligned, my rate would be in the vicinity of $50 USD/hour. I’d currently be willing to go down to $25 USD/hour for a project that is something I basically would have done anyways. Once I get my PhD I think my going rates would be higher, and for a senior-level position I would probably expect more than either of these numbers, unless it was a small start-up-y organization that I felt was one of the most promising organizations in existence.

EDIT: So that people don’t have to convert to per-year salaries in their heads, the above numbers, if annualized, would be $100k USD/year and $50k USD/year respectively.
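For anyone who wants to check the conversion, here is a minimal sketch of the arithmetic. It assumes a full-time year of 2,000 hours (40 hours/week for 50 weeks); that figure is an assumption of this sketch, chosen because it is the one under which $50/hour comes out to $100k/year. The names `HOURS_PER_WEEK`, `WEEKS_PER_YEAR`, and `annualize` are made up for illustration.

```python
# Assumption (not stated in the comment): a full-time year is
# 40 hours/week for 50 weeks, i.e. 2,000 working hours.
HOURS_PER_WEEK = 40
WEEKS_PER_YEAR = 50


def annualize(hourly_rate_usd: float) -> float:
    """Convert an hourly rate in USD to a full-time annual figure in USD."""
    return hourly_rate_usd * HOURS_PER_WEEK * WEEKS_PER_YEAR


# The two hourly rates mentioned in the comment.
for rate in (50, 25):
    print(f"${rate} USD/hour is about ${annualize(rate):,.0f} USD/year")
```

A different assumption about hours worked per year would shift both annual figures proportionally.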

jsteinhardt 13 Apr 2018 2:34 UTC
4 points
on: Comparative advantage in the talent market
I’m worried that you’re mis-applying the concept of comparative advantage here. In particular, if agents A and B both have the same values and are pursuing altruistic ends, comparative advantage should not play a role—both agents should just do whatever they have an absolute advantage at (taking into account marginal effects, but in a large population this should often not matter).

For example: suppose that EA has a “shortage of operations people” but person A determines that they would have higher impact doing direct research rather than doing ops. Then in fact the best thing is for person A to work on direct research, even if there are already many other people doing research and few people doing ops. (Of course, person A could be mistaken about which choice has higher impact, but that is different from the trade considerations that comparative advantage is based on.)

I agree with the heuristic “if a type of work seems to have few people working on it, all else equal you should update towards that work being more neglected and hence higher impact”, but the justification for that again doesn’t require any considerations of trading with other people. In general, if A and B can trade in a mutually beneficial way, then either A and B have different values or one of them was making a mistake.
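To make the research-versus-ops example concrete, here is a small sketch with made-up numbers. Everything in it is hypothetical: the impact figures, the names `IMPACT`, `total_impact`, `best_joint_assignment`, and `individual_best`, and above all the assumption that impacts simply add up with no diminishing returns (the "large population" case where marginal effects can be ignored). Under those assumptions, it checks by brute force that the assignment maximizing shared total impact is the one where each person picks their own highest-impact option, even though the usual comparative-advantage logic would send A to ops.

```python
from itertools import product

# Hypothetical impact per person per task (illustrative numbers only).
# A is better than B at both tasks; A's research/ops ratio (10/6) is lower
# than B's (8/3), so a trade-based comparative-advantage argument would
# put A on ops and B on research.
IMPACT = {
    "A": {"research": 10, "ops": 6},
    "B": {"research": 8, "ops": 3},
}


def total_impact(assignment: dict) -> int:
    """Shared objective: the sum of impacts (assumes additive impact)."""
    return sum(IMPACT[person][task] for person, task in assignment.items())


def best_joint_assignment() -> dict:
    """Brute-force search over every person-to-task assignment."""
    people = sorted(IMPACT)
    tasks = ["research", "ops"]
    candidates = [
        dict(zip(people, choice))
        for choice in product(tasks, repeat=len(people))
    ]
    return max(candidates, key=total_impact)


def individual_best() -> dict:
    """Each person independently picks their own highest-impact task."""
    return {person: max(tasks, key=tasks.get) for person, tasks in IMPACT.items()}


print(best_joint_assignment())  # both end up on research
print(total_impact({"A": "ops", "B": "research"}))  # the "trade" split
```

With these numbers, both people doing research gives a total of 18, while the comparative-advantage split (A on ops, B on research) gives 14. If impact per task fell as more people worked on it, the additive assumption would no longer hold and the result could differ; that is the "marginal effects" caveat above.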