Hey, cool toy model (:
I bet there's not enough data from METR on how messy the tasks are to include it here, but I would expect messiness to have real-world consequences and to tug in the direction of agents being less viable outside of well-defined domains.
Cool (:
I'm specifically interested in automating filtering EA-related opportunities and events to write our weekly announcements.
I think with a bit of tweaking that would be a public good for EA community building and might be reused by many groups.