Archive
About
Search
Log In
Home
All
Wiki
Shortform
Recent
Comments
Larks comments on
What predictions from theoretical AI Safety research have been confirmed by empirical work?
Larks
4 Jan 2025 5:46 UTC
2
points
0 ∶ 0
Conversations people have with un-RLHF’d models.
Back to top
Conversations people have with un-RLHF’d models.