I work primarily on AI Alignment. My main direction at the moment is to accelerate alignment work via language models and interpretability.
jacquesthibs
Pausing AI Developments Isn’t Enough. We Need to Shut it All Down by Eliezer Yudkowsky
Honestly, I’m happy with this compromise. I want to hear more about what ‘leadership’ is thinking, but I also understand the constraints you all have.
This obviously doesn’t answer the questions people have, but at least communicating this instead of radio silence is very much appreciated. For me at least, it helps reduce the feeling of disconnectedness and makes the situation a little less frustrating.
Conspiracy Theories, Left Futurism, and the Attack on TESCREAL
Quillette’s founder seems to be planning an article on EA’s impact on tech:
“If anyone with insider knowledge wants to write about the impact of Effective Altruism in the technology industry please get in touch with me claire@quillette.com. We pay our writers and can protect authors’ anonymity if desired.”
It would probably be impactful if someone in the know provided a counterbalance to whoever will undoubtedly email her to disparage EA with half-truths/lies.
Since I expect some people will be confused about what exactly the bad thing was after reading this post, I think it would be great if the community health team could write a post explaining exactly what was bad here and in other similar instances.
I think there is value in being crystal clear about what the bad things were, because I expect people will take away different things from this post.
Personally, I’ve mostly seen people who are confused and trying to demonstrate a willingness to re-evaluate what might have led to these bad outcomes. They may swing too far in one direction, but this only just happened and they are reassessing their worldview in real time. Some are just asking questions about how decisions were made in the past so that we have more information and can improve things going forward (which might mean doing nothing differently in some instances). My impression is that a lot of the criticism of EA leadership is overblown and most (if not all) were blindsided.
That said, I haven’t had the impression it’s as bad and widespread as this post makes it seem. Maybe I just haven’t read the same posts, comments, and tweets.
I do think that working together so we can land on our feet and continue to help those in need sounds right, and I hope you’ll stick around, since critical posts like this are clearly needed.
If you work at a social media website or YouTube (or know anyone who does), please read the text below:
Community Notes is one of the best features to come out on social media apps in a long time. The code is even open source. Why haven’t other social media websites picked it up yet? If they care about truth, it would be a considerable step forward. Notes like “this video is funded by x nation” or “this video talks about health info; go here to learn more” are simply not good enough.
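For anyone curious what the open-sourced scoring code actually does at its core: the key idea is a “bridging” matrix factorization, where each rating is modeled as a global mean plus user and note intercepts plus a dot product of learned user/note factors. Agreement explained by shared viewpoint gets absorbed into the factor term, so only notes rated helpful *across* viewpoints keep a high intercept. The sketch below is a toy illustration of that idea under assumed hyperparameters (dimension, learning rate, regularization), not the production algorithm or its thresholds.

```python
import numpy as np

def score_notes(ratings, n_users, n_notes, dim=1, lam=0.03, lr=0.1,
                steps=2000, seed=0):
    """ratings: iterable of (user_id, note_id, value), value 1.0 = helpful.

    Fits rating ~ mu + user_intercept + note_intercept + user_factor . note_factor
    by gradient descent on L2-regularized squared error. The note intercept is
    the "helpfulness after controlling for viewpoint" score.
    """
    rng = np.random.default_rng(seed)
    users = np.array([r[0] for r in ratings])
    notes = np.array([r[1] for r in ratings])
    vals = np.array([r[2] for r in ratings], dtype=float)
    mu = 0.0
    iu = np.zeros(n_users)              # user intercepts (rater leniency)
    inote = np.zeros(n_notes)           # note intercepts (cross-viewpoint helpfulness)
    fu = rng.normal(0, 0.1, (n_users, dim))   # user viewpoint factors
    fn = rng.normal(0, 0.1, (n_notes, dim))   # note viewpoint factors
    n = len(vals)
    for _ in range(steps):
        pred = mu + iu[users] + inote[notes] + (fu[users] * fn[notes]).sum(axis=1)
        err = pred - vals
        # accumulate per-parameter gradients over all ratings
        g_iu = np.zeros_like(iu); np.add.at(g_iu, users, err)
        g_in = np.zeros_like(inote); np.add.at(g_in, notes, err)
        g_fu = np.zeros_like(fu); np.add.at(g_fu, users, err[:, None] * fn[notes])
        g_fn = np.zeros_like(fn); np.add.at(g_fn, notes, err[:, None] * fu[users])
        mu -= lr * err.mean()
        iu -= lr * (g_iu / n + lam * iu)
        inote -= lr * (g_in / n + lam * inote)
        fu -= lr * (g_fu / n + lam * fu)
        fn -= lr * (g_fn / n + lam * fn)
    return inote  # higher intercept ~ helpful across viewpoints

# Hypothetical example: two camps of users. Note 0 is rated helpful by
# everyone; note 1 is rated helpful only by camp 1 (users 0-2).
ratings = ([(u, 0, 1.0) for u in range(6)]
           + [(u, 1, 1.0) for u in range(3)]
           + [(u, 1, 0.0) for u in range(3, 6)])
intercepts = score_notes(ratings, n_users=6, n_notes=2)
```

In this toy setup the broadly endorsed note ends up with the higher intercept, which is the mechanism that makes the feature “bridging” rather than majority-vote moderation.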
If you work at companies like YouTube or know someone who does, let’s figure out who we need to talk to, to make it happen. Naïvely, you could spend a weekend DMing a bunch of employees (PMs, engineers) at various social media websites to persuade them that this is worth their time and probably the biggest impact they could have in their entire career.
If you have any connections, let me know. We can also set up a shared doc of draft messages so we can converge on a persuasive DM.
I think the information you are sharing is useful (some parts less so, I agree with pseudonym), just don’t deadname/misgender them. EA is better than that.
One thing that may backfire with the slow rollout of talking to journalists is that the people who write about EA in bad faith will be the ones at the top of the search results. If you search something like “ea longtermism”, you might find the bad-faith articles many of us are familiar with. I’m concerned we are setting ourselves up to give people unaware of EA a very bad-faith introduction.
Note: when I say “bad faith” here, it may just be a matter of semantics in how some people read it. I may not have the vocabulary to articulate exactly what I mean by “bad faith.” I actually agree with pretty much everything David has said in response to this comment.
Results for a survey of tool use and workflows in alignment research
More information about the alleged manipulative behaviour of Sam Altman
From what I understand, Amazon does not get a board seat for this investment. Figured that should be highlighted. Seems like Amazon just gets to use Anthropic’s models and maybe make back their investment later on. Am I understanding this correctly?
As part of the investment, Amazon will take a minority stake in Anthropic. Our corporate governance structure remains unchanged, with the Long Term Benefit Trust continuing to guide Anthropic in accordance with our Responsible Scaling Policy. As outlined in this policy, we will conduct pre-deployment tests of new models to help us manage the risks of increasingly capable AI systems.
Here’s a comment I shared on my LessWrong shortform.
——
I’m still thinking this through, but I am deeply concerned about Eliezer’s new article for a combination of reasons:
- I don’t think it will work.
- Given that it won’t work, I expect we lose credibility, and it becomes much harder to work with people who were sympathetic to alignment but still wanted to use AI to improve the world.
- I am not as convinced as he is about doom, and I am not as cynical about the main orgs as he is.
- In the end, I expect this will just alienate people. And stuff like this concerns me.
I think it’s possible that the most memetically powerful approach will be to accelerate alignment rather than suggesting long-term bans or effectively antagonizing all AI use.
So, things have blown up way more than I expected and things are chaotic. I’m still not sure what will happen or whether a treaty is actually in the cards, but I’m beginning to see a path to potentially a lot more investment in alignment. One example: Jeff Bezos just followed Eliezer on Twitter, and I think this may catch the attention of powerful and rich people who want to see AI go well. We are so far off-distribution that this could go in any direction.
In case we have very different feeds, here’s a set of tweets critical of the article:
https://twitter.com/mattparlmer/status/1641230149663203330?s=61&t=ryK3X96D_TkGJtvu2rm0uw (lots of quote-tweets on this one)
https://twitter.com/jachiam0/status/1641271197316055041?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/finbarrtimbers/status/1641266526014803968?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/plinz/status/1641256720864530432?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/perrymetzger/status/1641280544007675904?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/post_alchemist/status/1641274166966996992?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/keerthanpg/status/1641268756071718913?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/levi7hart/status/1641261194903445504?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/luke_metro/status/1641232090036600832?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/gfodor/status/1641236230611562496?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/luke_metro/status/1641263301169680386?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/perrymetzger/status/1641259371568005120?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/elaifresh/status/1641252322230808577?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/markovmagnifico/status/1641249417088098304?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/interpretantion/status/1641274843692691463?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/lan_dao_/status/1641248437139300352?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/lan_dao_/status/1641249458053861377?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/growing_daniel/status/1641246902363766784?s=61&t=ryK3X96D_TkGJtvu2rm0uw
https://twitter.com/alexandrosm/status/1641259179955601408?s=61&t=ryK3X96D_TkGJtvu2rm0uw
[Question] Can independent researchers get a sponsored visa for the US or UK?
It’s a good project because, you know, doing good is important and we should want to do good better rather than worse. It’s utterly absurd because everyone who has ever wanted to do good has wanted to do good well, and acting as though you and your friends alone are the first to hit upon the idea of trying to do it is the kind of galactic hubris that only subcultures that have metastasized on the internet can really achieve.
This seems wrong to me. Just this week, I went on a date with someone who told me the only reason she volunteers is that it makes her feel good about herself, and she doesn’t particularly care about the impact. And you know what, props to her for admitting something that I expect is true of a lot of other people as well. I don’t think there’s anything wrong with that; I’m just saying that “everyone who has ever wanted to do good has wanted to do good well” seems wrong to me.
The following tweet is being shared now: https://twitter.com/autismcapital/status/1590551673721991168?s=46&t=q60fxwumlq0Mq8CpGV3bxQ
This is obviously just a random unverified source, but I think it will be worth reflecting on this deeply once this is all said and done. It feeds directly into how EA’s maximizing behaviour can lead to these outcomes. Whether the above is true or not, it will certainly be painted as such by those who have been critical of EA.
People have some strong opinions about things like polyamory, but I figured I’d still voice my concern as someone who has been in EA since 2015 yet has mostly interacted with the community online (aside from two months in the Bay and two in London):
I have nothing against polyamory, but polyamory within the community gives me bad vibes, and the mixing of work and fun seems to go much further than I think it should. There’s an element of “free love,” and I am a little concerned about cuddle puddles with career colleagues. I feel like these dynamics lead to weird behaviour people do not want to acknowledge.
I repeat, I am not against polyamory, but I personally do not expect some of this bad behaviour would happen as often in a monogamous setting, since I expect there would be less sliding into sexual situations.
I’ve avoided saying this because I did not want to criticize people for being polyamorous, and I expected many would disagree with me without it leading anywhere. But I do think the “free love” nature of polyamory with career colleagues opens the door to things we might not want.
Whatever it is (poly within the community might not be part of the issue at all!), I feel like there needs to be a conversation about work and play (that people seem to be avoiding).