Apparently there’s a preprint showing Gemini 2.5 gets 20% on the Olympiad questions, which would be in line with the o3 result.
Apparently there’s a preprint showing Gemini 2.5 gets 20% on the Olympiad questions, which would be in line with the o3 result.