A trend showing small, increasing numbers, just above 0, is very different (qualitatively) to a trend that is all flat 0s
Then it’s a good thing I didn’t claim there was “a trend that is all flat 0s” in the comment you called “disingenuous”. I said:
It’s only with the o3-low and o1-pro models we see scores above 0% — but still below 5%. Getting above 0% on ARC-AGI-2 is an interesting result and getting much higher scores on the previous version of the benchmark, ARC-AGI, is an interesting result. There’s a nuanced discussion to be had about that topic.
This feels like such a small detail to focus on. It feels ridiculous.
Then it’s a good thing I didn’t claim there was “a trend that is all flat 0s” in the comment you called “disingenuous”. I said:
This feels like such a small detail to focus on. It feels ridiculous.