JP Addison🔸 comments on Can we evaluate the “tool versus agent” AGI prediction?

JP Addison🔸 9 Apr 2023 13:41 UTC
4 points
I guess it depends on your priors. GPT-4 is agentic relative to a rock, but for an AI which can pass the LSAT, its agency is well below my expectations. It seems like ARC-Evals had to coax and prod GPT-4 to get it to do things it “should” have been doing with even rudimentary levels of agency.