Ben_West🔸 comments on OpenAI’s o3 model scores 3% on the ARC-AGI-2 benchmark, compared to 60% for the average human