Desensitizing Deepfakes
There is some discussion on the forum about using AI to detect whether something is a deepfake, and perhaps some trust that anti-deepfake bots will stay better resourced in this arms race. But could we give ourselves a bit of breathing room here?
Could it be incredibly valuable to accelerate desensitization to deepfakes? Or at least make people more aware of them by using humor?
It seems like a real risk that someone eventually creates a convincing and harmful deepfake, à la the current President saying, “America has launched nuclear weapons against Russia.” Or vice versa, or literally anything bad, with voice and video that to our eyes is Very Real and Convincingly Terrible.
Should we be subverting people’s expectations by familiarizing them with deepfakes, perhaps best done by example? If you have not seen them already, the memes of US Presidents playing computer games (warning, expletives: US Presidents play Minecraft) are actually a pretty good example of this. On the flip side, I also recall seeing a video shown to an older individual who, despite Biden saying the most heinous things in it, found it more believable that it was real than a deepfake.
So maybe spamming content of significant figures doing wacky things is effective for updating people’s models of the probability that something is a deepfake. I considered pushing the bounds further by walking the fine line of a ‘real fake’ nuclear assault, à la Biden saying, “My fellow Americans, we have launched nukes against… the Moon.” But that seems unnecessarily close to the mark.
There could be an info hazard here: by demonstrating the capabilities of deepfakes, you show people that this is a powerful manipulation tool and thereby increase the risk of malicious use.
Unsure of the call to action here; I thought I’d share, hope for steelmanning/meta-feedback on the post, and point to a potentially impactful angle that seems to be working pretty well in some ways: desensitizing people to deepfakes without risk, in a way that may make them more likely to question whether something they see or hear is real (could be worth more rigorous study).
Phib—I really like this idea.
I agree that deepfakes could be a potential amplifier of global catastrophic risks such as warfare, assassinations, political instability, civil war, religious outrage, terrorism, etc. Especially if people haven’t really caught up to how realistic and deceptive they can be.
I’m also not sure of the best way to ‘inoculate’ people against deepfakes. As you mention, older people might be especially susceptible to them, as might young people whose strong ideological confirmation biases lead them to accept that apparent misbehavior X by individual or group Y plausibly happened.
I expect that in the 2024 US election cycle, we’ll see an absolute deluge of partisan deepfakes that aim to discredit, mock, and satirize various candidates, analogous to the surge of social media memes in the 2016 election cycle. I don’t think voters are at all prepared for how vicious, surreal, and damaging these could get.
To most EAs, deepfakes might sound like a weird branch of the porn industry, or a minor political annoyance, compared to flashy AI developments like GPT. However, I think they really could have an utterly corrosive effect on public discourse, partisanship, and trust in news media. Overall, this seems like a moderate-scope, largely neglected, but somewhat tractable issue for EAs to give a bit of attention to.
Hi Geoffrey, thanks for taking the time to respond so thoughtfully.
I agree with the comment about the younger demographic; as someone who identifies as young, I’m super fallible, annoyingly. Interestingly, I do have the impression that much of the current ‘entertainment deepfake’ content might be targeted more toward younger people.
I hadn’t thought of the 2024 election, and I think you’re very right to bring it up (thanks again); it does seem like a first big inflection point for this emerging tech...
I’m tempted to say that the awful deepfakes of this political cycle could be the bitter pill that inoculates people against the worst-case scenarios involving x-risk… but it’s hard to imagine how bad these could get, and I have this idea that deepfakes lose quite a bit of their ‘oomph’ if they are ramped up over time or even overused (per ‘desensitization’ again).
In my mind, what might separate an x-risk deepfake from the political-cycle ones is that a deepfake may be most dangerous when it is not analyzed over time (as in a long political campaign*), but rather incorporated into, or even the cause of, an immediate decision… unsure.
*interesting to think of the new arena of (deep)fake news...
Thanks for bringing this to light. I think awareness around deepfakes really does need to be considered more.
I would be slightly concerned about spamming people with deepfakes. I don’t know if the average adult knows what a deepfake is. If spammed with deepfakes, people might think the world is even crazier than it actually is and I think that could get ugly. I think a more overtly educational or interactive exchange might have a better outcome.
I tend to mentally categorize deepfakes in a “discernment of truth” bucket alongside misinformation, disinformation. I think that bucket may already be impacting the average adult a fair amount. Social media bots have been fueling these for a while, and (personally, I feel that) there has been a fairly dramatic increase in quantity of scams, spam, phishing and clickbait augmented by ChatGPT and SEO changes. How do others think about this framing and what have others observed?
I agree that this issue is likely to get much worse if it’s not acted upon in a well-coordinated, scaled, and rapid manner.
Yeah, ‘discernment of truth’ makes sense to me. And fair, spam is probably not productive, but it got across my intention of ‘desensitizing’ people to this strategy of playing on our ‘discernment of truth’. I think Geoffrey’s comment on the next political cycle is really interesting for thinking about how that ‘spam’ may end up looking.
I agree. This seems like an important problem.
Several existing technologies can wreak havoc on epistemics / perception of the world and worsen polarization. Navigating truth vs. fiction will get more difficult. This will continue to be a problem for elections and may sow seeds for even bigger global problems.
Anyone know what efforts exist to combat this?
Is this a subcategory of AI safety?
Yeah, I have no idea, unfortunately. And yes it seems quite attached to AI capabilities.