It’s potentially also worth noting that the difference in scores was pretty enormous:
their jailbreaking expertise did not influence their performance; their outcome for biological feasibility appeared to be primarily the product of diligent reading and adept interpretation of the gain-of-function academic literature during the exercise rather than access to the model.
This is pretty interesting to me (although it’s basically an ~anecdote, given that it’s just one team); it reminds me of some of the literature around superforecasters.
(I probably should have added a note about the black cell, and the crimson cells, to the summary; thank you for adding this!)