I think the argument along these lines that I’m most sympathetic to is that Paul’s agenda fits more into the paradigm of typical ML research, and so is more likely to fail for reasons that are in many people’s collective blind spot (because we’re all blinded by the same paradigm).
That actually didn’t cross my mind before, so thanks for pointing it out. After reading your comment, I decided to look into Open Phil’s recent grants to MIRI and OpenAI, and noticed that of the 4 technical advisors Open Phil used for the MIRI grant investigation (Paul Christiano, Jacob Steinhardt, Christopher Olah, and Dario Amodei), all either have an ML background or currently advocate an ML-based approach to AI alignment. For the OpenAI grant, however, Open Phil didn’t seem to have similarly engaged technical advisors who might be predisposed to be critical of the potential grantee (e.g., HRAD researchers), and in fact two of the Open Phil technical advisors are also employees of OpenAI (Paul Christiano and Dario Amodei). I have to say this doesn’t look very good for Open Phil in terms of making an effort to avoid potential blind spots and bias.
(Speaking for myself, not OpenPhil, who I wouldn’t be able to speak for anyway.)
For what it’s worth, I’m pretty critical of deep learning, which is the approach OpenAI wants to take, and still think the grant to OpenAI was a pretty good idea; and I can’t really think of anyone more familiar with MIRI’s work than Paul who isn’t already at MIRI (note that Paul started out pursuing MIRI’s approach and shifted in an ML direction over time).
That being said, I agree that the public write-up on the OpenAI grant doesn’t reflect that well on OpenPhil, and it seems correct for people like you to demand better moving forward (although I’m not sure that adding HRAD researchers as technical advisors is the solution; also note that OPP does consult regularly with MIRI staff, though I don’t know whether they did for the OpenAI grant).
I can’t really think of anyone more familiar with MIRI’s work than Paul who isn’t already at MIRI (note that Paul started out pursuing MIRI’s approach and shifted in an ML direction over time).
The Agent Foundations Forum would have been a good place to look for more people familiar with MIRI’s work. Aside from Paul, I see Stuart Armstrong, Abram Demski, Vadim Kosoy, Tsvi Benson-Tilsen, Sam Eisenstat, Vladimir Slepnev, Janos Kramar, Alex Mennen, and many others. (Abram, Tsvi, and Sam have since joined MIRI, but weren’t employees of it at the time of the Open Phil grant.)
That being said, I agree that the public write-up on the OpenAI grant doesn’t reflect that well on OpenPhil, and it seems correct for people like you to demand better moving forward
I had previously seen some complaints about the way the OpenAI grant was made, but until your comment, hadn’t thought of a possible group blind spot due to a common ML perspective. If you have any further insights on this and related issues (like why you’re critical of deep learning but still think the grant to OpenAI was a pretty good idea, what are your objections to Paul’s AI alignment approach, how could Open Phil have done better), would you please write them down somewhere?