I know of at least one potential counterexample: OpenAI’s RLHF was developed by AI safety people who joined OpenAI to promote safety. But it’s not clear that RLHF helps with x-risk.
I’d go further and say that it’s not actually a counterexample. RLHF allowed OpenAI to be hugely profitable: without it, they wouldn’t have been able to publicly release their models and build their massive userbase.