Agreed, this is ridiculous. You should take down the contest.
Your chances of a successful attack are very low. It takes years for information to be scraped from the internet, trained into a model, and deployed to production. GPT-4 has a knowledge cutoff of September 2021. If future models have the same delay, you won’t see results for a year and a half.
The more likely outcome is press coverage about how AI safety folks are willing to hold society hostage in order to enforce their point of view. See this takedown piece on Eliezer Yudkowsky, and the 80,000 Hours advice on how to avoid accidentally harming a cause you want to help.
For what it’s worth, the contest host is an artist I know who has no connection to the EA movement. Also, there is no “holding society hostage” because the contest is designed to make it trivially easy to filter out all poison, just by looking for keywords. (wallabywinter & yallabywinter). Black hat hackers are already doing code example poisoning on stack overflow, and this contest simply seeks to raise awareness of that fact in the white-hat community.
Agreed, this is ridiculous. You should take down the contest.
Your chances of a successful attack are very low. It takes years for information to be scraped from the internet, trained into a model, and deployed to production. GPT-4 has a knowledge cutoff of September 2021. If future models have the same delay, you won’t see results for a year and a half.
The more likely outcome is press coverage about how AI safety folks are willing to hold society hostage in order to enforce their point of view. See this takedown piece on Eliezer Yudkowsky, and the 80,000 Hours advice on how to avoid accidentally harming a cause you want to help.
For what it’s worth, the contest host is an artist I know who has no connection to the EA movement. Also, there is no “holding society hostage” because the contest is designed to make it trivially easy to filter out all poison, just by looking for keywords. (wallabywinter & yallabywinter). Black hat hackers are already doing code example poisoning on stack overflow, and this contest simply seeks to raise awareness of that fact in the white-hat community.