Could you say a bit more about your statement that “making recommendations such as . . . . ‘alignment people should not join Conjecture’ require an extremely high bar of evidence in my opinion”?
The poster stated that there are “more impactful places to work” and listed a number of them. Shouldn’t they say that if they believe it is more likely true than not? They have stated their reasons; the reader can decide whether those reasons are well supported. The claim that Conjecture seems “relatively weak for skill building” rests on reasonable grounds. And the author characterizes the likelihood that Conjecture is net-negative as merely “plausible.” That low bar seems hard to argue with; the base rate of for-profit companies without any known special governance safeguards acting the way for-profit companies usually do (i.e., in a profit-maximizing manner) is not low.
Maybe we’re getting too deep into semantics here, but I would have found a headline like “we believe there are better places to work” much more appropriate for the kind of statement they are making.
1. A blanket, unconditional statement like this seems unjustified. As I said before, if you believe in CoEm, Conjecture is probably the right place to work.
2. Where does “relatively weak for skill building” come from? A lot of their research isn’t public, many engineering skills are not visible from the outside, and so on. Why didn’t they just ask the many EA-aligned employees at Conjecture what they thought of the skills they learned? That seems like an easy way to correct a potential mischaracterization.
3. Almost all AI alignment organizations are “plausibly” net negative. What if ARC Evals underestimates the risks of its gain-of-function-style research? What if Redwood’s advances in interpretability lead to massive capability gains? What if CAIS’s efforts with the letter had backfired and rallied everyone against AI safety? This bar is basically meaningless without expected values, as the rough sketch after this list illustrates.
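To make the expected-value point concrete (with purely illustrative numbers, not estimates about any real organization): suppose an org turns out net negative with probability $p = 0.2$, causing harm of magnitude $H$, and otherwise produces benefit $B$. Its expected impact is

$$\mathbb{E}[\text{impact}] = (1 - p)\,B - p\,H = 0.8B - 0.2H,$$

which is positive whenever $B > H/4$. So “plausibly net negative” ($p > 0$) is entirely compatible with a strongly positive expected value; the bar only starts to bite once you actually put numbers on $p$, $B$, and $H$.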
Does that clarify where my skepticism comes from? Also, once again, my arguments should not be seen as a recommendation for Conjecture. I do agree with many of the criticisms made in the post.